Date

9 months ago

Size

228.19 MB

Organization

Paper URL

arxiv.org

Tags

Multimodal

Mathematics

EMMA (Enhanced MultiModal reAsoning) is a multimodal reasoning benchmark dataset released in 2025 by a research team from the University of Electronic Science and Technology of China, Sun Yat-sen University, University of Washington, and Microsoft. The relevant paper results are:Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark", which aims to provide a standardized testing platform for evaluating the complex reasoning capabilities of multimodal large models (MLLMs).

The dataset focuses on multimodal reasoning tasks in the fields of organic chemistry (42%), mathematics (32%), physics (6%), and programming (20%). It contains 2,788 questions, of which 1,796 are newly constructed samples. It supports fine-grained task division and aims to promote the joint understanding of images and texts. The data task types include chemical reaction simulation, mathematical graphic reasoning, physical path tracing, programming visualization, etc.

The proportion of different disciplines and their sub-tasks in the dataset

EMMA.torrent

Seeding 1Downloading 0Completed 63Total Downloads 190

EMMA/
- README.md
  1.6 KB
- README.txt
  3.21 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Related Datasets

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Download

Discuss on Discord

Date

9 months ago

Size

228.19 MB

Organization

Paper URL

arxiv.org

Related Datasets

IF-Bench Infrared Image Understanding Benchmark Dataset

2 months ago

GroundingME Complex Scene Understanding Evaluation Dataset

2 months ago

TxT360-3efforts Multi-Task Inference Dataset

2 months ago

MCIF Multimodal Cross-Language Instruction Following Dataset

2 months ago

PhysDriver Physiological Test Dataset

3 months ago

VOccl3D 3D Human Occlusion Video Dataset

3 months ago

VenusBench-GD Cross-Platform Interface Understanding Dataset

2 months ago

Nemotron-Math-v2 Mathematical Inference Dataset

a month ago

X-ray Contraband Detection Dataset

2 months ago

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

EMMA Multimodal Reasoning Benchmark Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

EMMA Multimodal Reasoning Benchmark Dataset

Related Datasets

IF-Bench Infrared Image Understanding Benchmark Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

TxT360-3efforts Multi-Task Inference Dataset

MCIF Multimodal Cross-Language Instruction Following Dataset

PhysDriver Physiological Test Dataset

VOccl3D 3D Human Occlusion Video Dataset

VenusBench-GD Cross-Platform Interface Understanding Dataset

Nemotron-Math-v2 Mathematical Inference Dataset

X-ray Contraband Detection Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

EMMA Multimodal Reasoning Benchmark Dataset

Related Datasets

IF-Bench Infrared Image Understanding Benchmark Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

TxT360-3efforts Multi-Task Inference Dataset

MCIF Multimodal Cross-Language Instruction Following Dataset

PhysDriver Physiological Test Dataset

VOccl3D 3D Human Occlusion Video Dataset

VenusBench-GD Cross-Platform Interface Understanding Dataset

Nemotron-Math-v2 Mathematical Inference Dataset

X-ray Contraband Detection Dataset

Build AI with AI

HyperAI Newsletters

Related Datasets

IF-Bench Infrared Image Understanding Benchmark Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

TxT360-3efforts Multi-Task Inference Dataset

MCIF Multimodal Cross-Language Instruction Following Dataset

PhysDriver Physiological Test Dataset

VOccl3D 3D Human Occlusion Video Dataset

VenusBench-GD Cross-Platform Interface Understanding Dataset

Nemotron-Math-v2 Mathematical Inference Dataset

X-ray Contraband Detection Dataset

Related Datasets

IF-Bench Infrared Image Understanding Benchmark Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

TxT360-3efforts Multi-Task Inference Dataset

MCIF Multimodal Cross-Language Instruction Following Dataset

PhysDriver Physiological Test Dataset

VOccl3D 3D Human Occlusion Video Dataset

VenusBench-GD Cross-Platform Interface Understanding Dataset

Nemotron-Math-v2 Mathematical Inference Dataset

X-ray Contraband Detection Dataset