FineReason Multimodal Visual Reasoning Dataset

Date

3 months ago

Size

65.85 GB

Organization

OpenDataArena

License

MIT

Tags

Image Understanding

FineReason is a dataset released by OpenDataArena in 2025 for training and evaluating the visual reasoning capabilities of large multimodal models (LMMs). It targets interpretable, verifiable long-chain reasoning in scenarios such as visual puzzles, games, complex graph reasoning, and STEM (science, technology, engineering, and mathematics) knowledge applications.

The dataset covers several task types, including geometry problems (geometry3k / geo170k), diagram and flowchart comprehension (AI2D), and visual reasoning and observation puzzles (VisualWebInstruct and others). All samples share a uniform data format: a unique ID, the question text, the corresponding image, and a reasoning-based answer. The dataset is compiled from multiple public subsets, and its reasoning chains are distilled with the Qwen3-VL-235B-A22B-Thinking model, so that each sample carries a clearly structured, verifiable step-by-step reasoning process and a final solution.
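The uniform sample format described above can be pictured with a small Python sketch. The key names (`id`, `question`, `image`, `answer`), the `Sample` class, and the `parse_sample` helper are all hypothetical, chosen only for illustration; the page does not state the actual column names or whether images are stored as paths or embedded bytes.

```python
from dataclasses import dataclass

# Hypothetical key names: the page lists the four fields but not their spelling.
REQUIRED_KEYS = ("id", "question", "image", "answer")


@dataclass(frozen=True)
class Sample:
    sample_id: str   # unique ID
    question: str    # question text
    image: str       # assumed here to be a path or reference to the image
    answer: str      # step-by-step reasoning followed by the final solution


def parse_sample(record: dict) -> Sample:
    """Check that a raw record has all four fields, then wrap it in a Sample."""
    missing = [key for key in REQUIRED_KEYS if not record.get(key)]
    if missing:
        raise ValueError(f"record is missing fields: {missing}")
    return Sample(
        sample_id=str(record["id"]),
        question=record["question"],
        image=record["image"],
        answer=record["answer"],
    )


if __name__ == "__main__":
    # Made-up record, for illustration only.
    demo = parse_sample({
        "id": "demo-0001",
        "question": "What is the area of the shaded triangle?",
        "image": "images/demo-0001.png",
        "answer": "Step 1: read base and height from the figure. Final answer: 6",
    })
    print(demo.sample_id, demo.image)
```

Rejecting incomplete records up front keeps downstream training code from silently handling samples that lack an image or a reasoning chain.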

Data composition (continuously expanding):

BMMR: 42,647 entries
Euclid30K: 27,111 entries
ai2d_merged: 2,446 entries
geo170k (Q&A): 12,101 entries
geometry3k / mathv360k: 9,724 entries
ScienceQA: 6,146 entries
TQA (TextbookQA): 12,565 entries
VisualWebInstruct (filtered): 261,436 entries
MMR1: 1,000 entries
VisualSphinx: 3,781 entries
MMOpenR1-8K: 7,428 entries
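To see how the subsets weigh against each other, the counts listed above can be summed and turned into shares. This is a minimal sketch based on the snapshot on this page; the `SUBSET_COUNTS` name and the `share` helper are illustrative, and because the composition is described as continuously expanding, the numbers may differ in later releases.

```python
# Entry counts copied from the composition list on this page (a snapshot).
SUBSET_COUNTS = {
    "BMMR": 42_647,
    "Euclid30K": 27_111,
    "ai2d_merged": 2_446,
    "geo170k (Q&A)": 12_101,
    "geometry3k / mathv360k": 9_724,
    "ScienceQA": 6_146,
    "TQA (TextbookQA)": 12_565,
    "VisualWebInstruct (filtered)": 261_436,
    "MMR1": 1_000,
    "VisualSphinx": 3_781,
    "MMOpenR1-8K": 7_428,
}

total = sum(SUBSET_COUNTS.values())  # 386,385 entries in this snapshot


def share(name: str) -> float:
    """Fraction of all listed entries that come from one subset."""
    return SUBSET_COUNTS[name] / total


if __name__ == "__main__":
    # Print subsets from largest to smallest with their share of the total.
    ranked = sorted(SUBSET_COUNTS.items(), key=lambda kv: kv[1], reverse=True)
    for name, count in ranked:
        print(f"{name:<32}{count:>9,}{share(name):>8.1%}")
    print(f"{'Total':<32}{total:>9,}")
```

In this snapshot VisualWebInstruct (filtered) accounts for roughly two thirds of all entries, which may be worth keeping in mind when sampling training batches across subsets.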

FineReason.torrent

Seeding: 1, Downloading: 0, Completed: 1, Total Downloads: 78

FineReason/
- README.md
  1.9 KB
- README.txt
  3.8 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Related Datasets

- TxT360-3efforts Multi-Task Inference Dataset (a month ago)
- PhysToolBench Physics Tool Task Dataset (2 months ago, 1.56 GB)
- Spatial-SSRL-81k Spatial Awareness Self-Supervised Dataset (2 months ago)
- IF-Bench Infrared Image Understanding Benchmark Dataset (2 months ago)
- VenusBench-GD Cross-Platform Interface Understanding Dataset (a month ago)
- Open Schematics: Understanding Circuit Schematics and Generating Datasets (a month ago)
