Date

8 months ago

Organization

Paper URL

2509.03516

License

Apache 2.0

Data composition

This dataset designs prompts and evaluation systems from two dimensions:

Composition dimension: Construct various composition structures around three types of scene graph elements: instance, attribute, and relation.
Reasoning dimension: Based on three types of reasoning: deductive, inductive, and abductive. To facilitate fine-grained evaluation, each prompt is accompanied by a yes/no checklist that notes whether each element implicitly or explicitly required by the prompt is correctly presented.
Data distribution chart

Citation

@inproceedings{ li2026easier, title={Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play?}, author={Ouxiang Li and Yuan Wang and Xinting Hu and Huijuan Huang and Rui Chen and Jiarong Ou and Xin Tao and Pengfei Wan and Xiaojuan Qi and Fuli Feng}, booktitle={The Fourteenth International Conference on Learning Representations}, year={2026}, url={https://openreview.net/forum?id=iqAFhWistW} }

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Related Datasets

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Use this Dataset Discuss on Discord

Date

8 months ago

Organization

Paper URL

2509.03516

License

Apache 2.0

Data composition

This dataset designs prompts and evaluation systems from two dimensions:

Composition dimension: Construct various composition structures around three types of scene graph elements: instance, attribute, and relation.
Reasoning dimension: Based on three types of reasoning: deductive, inductive, and abductive. To facilitate fine-grained evaluation, each prompt is accompanied by a yes/no checklist that notes whether each element implicitly or explicitly required by the prompt is correctly presented.
Data distribution chart

Citation

Related Datasets

MAKIEVAL Multilingual Cultural Knowledge Assessment Dataset

3 hours ago

SAM 3D Artist Objects 3D Object Reconstruction Dataset

in an hour

Nemotron-SFT-Math-v4 Mathematical Inference SFT Dataset

3 hours ago

FigureBench Scientific Illustration Generation Benchmark Dataset

in a minute

EAVSD E-commerce Advertising Video Storyboard Dataset

18 days ago

DeepCrack Infrastructure Crack Detection Dataset

18 days ago

World Air Pollution and AQI Dataset

18 days ago

SMOL Multilingual Translation Parallel Dataset

19 days ago

MathNet Multimodal Mathematical Benchmark Inference Dataset

a month ago

QCalEval Quantum Calibration Graph Understanding Dataset

2 months ago

RSRCC Remote Sensing Area Change Understanding Benchmark Dataset

a day ago

PanScale Remote Sensing Pancolor Sharpening Dataset

2 months ago

DRACO Cross-Disciplinary Deep Research Benchmark Dataset

2 months ago

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

T2I-CoReBench Multimodal Image Generation Benchmark Dataset

Data composition

Citation

Build AI with AI

HyperAI Newsletters

Command Palette

T2I-CoReBench Multimodal Image Generation Benchmark Dataset

Data composition

Citation

Related Datasets

MAKIEVAL Multilingual Cultural Knowledge Assessment Dataset

SAM 3D Artist Objects 3D Object Reconstruction Dataset

Nemotron-SFT-Math-v4 Mathematical Inference SFT Dataset

FigureBench Scientific Illustration Generation Benchmark Dataset

EAVSD E-commerce Advertising Video Storyboard Dataset

DeepCrack Infrastructure Crack Detection Dataset

World Air Pollution and AQI Dataset

SMOL Multilingual Translation Parallel Dataset

MathNet Multimodal Mathematical Benchmark Inference Dataset

QCalEval Quantum Calibration Graph Understanding Dataset

RSRCC Remote Sensing Area Change Understanding Benchmark Dataset

PanScale Remote Sensing Pancolor Sharpening Dataset

DRACO Cross-Disciplinary Deep Research Benchmark Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

T2I-CoReBench Multimodal Image Generation Benchmark Dataset

Data composition

Citation

Related Datasets

MAKIEVAL Multilingual Cultural Knowledge Assessment Dataset

SAM 3D Artist Objects 3D Object Reconstruction Dataset

Nemotron-SFT-Math-v4 Mathematical Inference SFT Dataset

FigureBench Scientific Illustration Generation Benchmark Dataset

EAVSD E-commerce Advertising Video Storyboard Dataset

DeepCrack Infrastructure Crack Detection Dataset

World Air Pollution and AQI Dataset

SMOL Multilingual Translation Parallel Dataset

MathNet Multimodal Mathematical Benchmark Inference Dataset

QCalEval Quantum Calibration Graph Understanding Dataset

RSRCC Remote Sensing Area Change Understanding Benchmark Dataset

PanScale Remote Sensing Pancolor Sharpening Dataset

DRACO Cross-Disciplinary Deep Research Benchmark Dataset

Build AI with AI

HyperAI Newsletters

Related Datasets

MAKIEVAL Multilingual Cultural Knowledge Assessment Dataset

SAM 3D Artist Objects 3D Object Reconstruction Dataset

Nemotron-SFT-Math-v4 Mathematical Inference SFT Dataset

FigureBench Scientific Illustration Generation Benchmark Dataset

EAVSD E-commerce Advertising Video Storyboard Dataset

DeepCrack Infrastructure Crack Detection Dataset

World Air Pollution and AQI Dataset

SMOL Multilingual Translation Parallel Dataset

MathNet Multimodal Mathematical Benchmark Inference Dataset

QCalEval Quantum Calibration Graph Understanding Dataset

RSRCC Remote Sensing Area Change Understanding Benchmark Dataset

PanScale Remote Sensing Pancolor Sharpening Dataset

DRACO Cross-Disciplinary Deep Research Benchmark Dataset

Related Datasets

MAKIEVAL Multilingual Cultural Knowledge Assessment Dataset

SAM 3D Artist Objects 3D Object Reconstruction Dataset

Nemotron-SFT-Math-v4 Mathematical Inference SFT Dataset

FigureBench Scientific Illustration Generation Benchmark Dataset

EAVSD E-commerce Advertising Video Storyboard Dataset

DeepCrack Infrastructure Crack Detection Dataset

World Air Pollution and AQI Dataset

SMOL Multilingual Translation Parallel Dataset

MathNet Multimodal Mathematical Benchmark Inference Dataset

QCalEval Quantum Calibration Graph Understanding Dataset

RSRCC Remote Sensing Area Change Understanding Benchmark Dataset

PanScale Remote Sensing Pancolor Sharpening Dataset

DRACO Cross-Disciplinary Deep Research Benchmark Dataset