Command Palette
Search for a command to run...
Echo-4o-Image Synthetic Image Generation Dataset
Date
Size
Paper URL
License
MIT
Echo-4o-Image is a synthetic image dataset released in 2025 by the Shanghai Artificial Intelligence Laboratory, in collaboration with Sun Yat-sen University, the Multimedia Laboratory (MMLab) of the Chinese University of Hong Kong, and other institutions. The related paper results are "Echo-4o: Harnessing the Power of GPT-4o Synthetic Images for Improved Image Generation", which aims to improve the open source model's ability to generate text from images.
This dataset is generated by GPT-4o and contains approximately 179,000 samples, covering three different task types:
- Complex instruction execution (approximately 68,000), strengthening compliance with long/detailed texts;
- Surreal Fantasy Generation (approximately 38,000), focusing on imaginative content;
- Multi-reference image generation (about 73,000), suitable for scenes requiring multiple visual cues.
Each sample is a 2×2 image grid with a resolution of 1024×1024, containing the image path, features (attributes/subjects), and structured information of the generated prompt.

Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.