Date

8 months ago

Size

4.89 GB

Organization

Paper URL

arxiv.org

Dataset features:

High-resolution challenge: The average resolution of each map image in the dataset is as high as 5839 × 5449, which is much higher than existing visual reasoning tasks, and places higher requirements on the image encoding capabilities of the model.

Difficulty-aware design: Images are labeled with difficulty to ensure a balanced distribution of question-answer pairs at different difficulty levels, helping to more comprehensively evaluate model capabilities.

Multi-dimensional evaluation system: not only examines the accuracy of the model's answers, but also conducts a fine-grained evaluation of the quality of the model route, including path rationality and transfer strategies.

Close to real-world usage scenarios: The tasks are directly based on image reasoning, do not rely on structured middleware, and are closer to the way humans think when using maps.

ReasonMap.torrent

Seeding 1Downloading 0Completed 89Total Downloads 132

ReasonMap/
- README.md
  2.02 KB
- README.txt
  4.04 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Related Datasets

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Download

Discuss on Discord

Date

8 months ago

Size

4.89 GB

Organization

Paper URL

arxiv.org

Dataset features:

High-resolution challenge: The average resolution of each map image in the dataset is as high as 5839 × 5449, which is much higher than existing visual reasoning tasks, and places higher requirements on the image encoding capabilities of the model.

Difficulty-aware design: Images are labeled with difficulty to ensure a balanced distribution of question-answer pairs at different difficulty levels, helping to more comprehensively evaluate model capabilities.

Multi-dimensional evaluation system: not only examines the accuracy of the model's answers, but also conducts a fine-grained evaluation of the quality of the model route, including path rationality and transfer strategies.

Close to real-world usage scenarios: The tasks are directly based on image reasoning, do not rely on structured middleware, and are closer to the way humans think when using maps.

ReasonMap.torrent

Seeding 1Downloading 0Completed 89Total Downloads 132

ReasonMap/
- README.md
  2.02 KB
- README.txt
  4.04 KB

Related Datasets

IF-Bench Infrared Image Understanding Benchmark Dataset

2 months ago

GroundingME Complex Scene Understanding Evaluation Dataset

a month ago

HumanSense Benchmark Dataset

3 months ago

PhysToolBench Physics Tool Task Dataset

2 months ago

1.56 GB58

MUVR Multimodal Uncropped Video Retrieval Benchmark

2 months ago

NAMD_Benchmark Molecular Dynamics Performance Benchmark Dataset

3 months ago

UNO-Bench full-modal Evaluation Benchmark Dataset

3 months ago

9.71 GB69

VOccl3D 3D Human Occlusion Video Dataset

2 months ago

VenusBench-GD Cross-Platform Interface Understanding Dataset

a month ago

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

ReasonMap Traffic Graph Reasoning Benchmark Dataset

Dataset features:

Build AI with AI

HyperAI Newsletters

Command Palette

ReasonMap Traffic Graph Reasoning Benchmark Dataset

Dataset features:

Related Datasets

IF-Bench Infrared Image Understanding Benchmark Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

HumanSense Benchmark Dataset

PhysToolBench Physics Tool Task Dataset

MUVR Multimodal Uncropped Video Retrieval Benchmark

NAMD_Benchmark Molecular Dynamics Performance Benchmark Dataset

UNO-Bench full-modal Evaluation Benchmark Dataset

VOccl3D 3D Human Occlusion Video Dataset

VenusBench-GD Cross-Platform Interface Understanding Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

ReasonMap Traffic Graph Reasoning Benchmark Dataset

Dataset features:

Related Datasets

IF-Bench Infrared Image Understanding Benchmark Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

HumanSense Benchmark Dataset

PhysToolBench Physics Tool Task Dataset

MUVR Multimodal Uncropped Video Retrieval Benchmark

NAMD_Benchmark Molecular Dynamics Performance Benchmark Dataset

UNO-Bench full-modal Evaluation Benchmark Dataset

VOccl3D 3D Human Occlusion Video Dataset

VenusBench-GD Cross-Platform Interface Understanding Dataset

Build AI with AI

HyperAI Newsletters

Related Datasets

IF-Bench Infrared Image Understanding Benchmark Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

HumanSense Benchmark Dataset

PhysToolBench Physics Tool Task Dataset

MUVR Multimodal Uncropped Video Retrieval Benchmark

NAMD_Benchmark Molecular Dynamics Performance Benchmark Dataset

UNO-Bench full-modal Evaluation Benchmark Dataset

VOccl3D 3D Human Occlusion Video Dataset

VenusBench-GD Cross-Platform Interface Understanding Dataset

Related Datasets

IF-Bench Infrared Image Understanding Benchmark Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

HumanSense Benchmark Dataset

PhysToolBench Physics Tool Task Dataset

MUVR Multimodal Uncropped Video Retrieval Benchmark

NAMD_Benchmark Molecular Dynamics Performance Benchmark Dataset

UNO-Bench full-modal Evaluation Benchmark Dataset

VOccl3D 3D Human Occlusion Video Dataset

VenusBench-GD Cross-Platform Interface Understanding Dataset