@misc{qiao2025wemath20versatilemathbook, title={We-Math 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning}, author={Runqi Qiao and Qiuna Tan and Peiqing Yang and Yanzi Wang and Xiaowan Wang and Enhui Wan and Sitong Zhou and Guanting Dong and Yuchen Zeng and Yida Xu and Jie Wang and Chong Sun and Chen Li and Honggang Zhang}, year={2025}, eprint={2508.10433}, archivePrefix={arXiv}, primaryClass={cs.AI}, url={https://arxiv.org/abs/2508.10433}, }

Date

10 months ago

Size

369.86 MB

Organization

Paper URL

2508.10433

License

Non-Commercial

*This dataset supports online use.Click here to jump.

We-Math2.0-Standard is a standard dataset for visual mathematical reasoning released by Beijing University of Posts and Telecommunications, Tencent and Tsinghua University in 2025. The related paper results are "WE-MATH 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning", aims to provide a diagnosable, explainable and comparable evaluation basis. This dataset builds a unified label space around 1,819 precisely defined knowledge principles, explicitly annotating each question with the principle and rigorously curating it, thereby achieving broad and balanced coverage overall, particularly strengthening mathematical subfields and question types that were previously underrepresented. The dataset adopts a dual expansion design:

First, multiple images per question are used to test the integration and alignment of multi-source visual evidence;
Second, multi-questions per image are used to test multi-principle transfer and conceptual flexibility in the same visual context. Each example consists of an image and a text stem, and is accompanied by annotations of the knowledge principles and standard answers that the question relies on.
Dataset Overview

Citation

@misc{qiao2025wemath20versatilemathbook,
title={We-Math 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning},
author={Runqi Qiao and Qiuna Tan and Peiqing Yang and Yanzi Wang and Xiaowan Wang and Enhui Wan and Sitong Zhou and Guanting Dong and Yuchen Zeng and Yida Xu and Jie Wang and Chong Sun and Chen Li and Honggang Zhang},
year={2025},
eprint={2508.10433},
archivePrefix={arXiv},
primaryClass={cs.AI},
url={https://arxiv.org/abs/2508.10433},
}

We-Mathv2-Standard.torrent

Seeding 1Downloading 0Completed 70Total Downloads 193

We-Mathv2-Standard/
- README.md
  1.82 KB
- README.txt
  3.65 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Related Datasets

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Use this Dataset

Discuss on Discord

Date

10 months ago

Size

369.86 MB

Organization

Paper URL

2508.10433

License

Non-Commercial

*This dataset supports online use.Click here to jump.

First, multiple images per question are used to test the integration and alignment of multi-source visual evidence;
Second, multi-questions per image are used to test multi-principle transfer and conceptual flexibility in the same visual context. Each example consists of an image and a text stem, and is accompanied by annotations of the knowledge principles and standard answers that the question relies on.
Dataset Overview

Citation

@misc{qiao2025wemath20versatilemathbook,
title={We-Math 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning},
author={Runqi Qiao and Qiuna Tan and Peiqing Yang and Yanzi Wang and Xiaowan Wang and Enhui Wan and Sitong Zhou and Guanting Dong and Yuchen Zeng and Yida Xu and Jie Wang and Chong Sun and Chen Li and Honggang Zhang},
year={2025},
eprint={2508.10433},
archivePrefix={arXiv},
primaryClass={cs.AI},
url={https://arxiv.org/abs/2508.10433},
}

We-Mathv2-Standard.torrent

Seeding 1Downloading 0Completed 70Total Downloads 193

We-Mathv2-Standard/
- README.md
  1.82 KB
- README.txt
  3.65 KB

2 months ago

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

We-Math2.0-Standard Visual Mathematical Reasoning Benchmark Dataset

*This dataset supports online use.Click here to jump.

Citation

Build AI with AI

HyperAI Newsletters

Command Palette

We-Math2.0-Standard Visual Mathematical Reasoning Benchmark Dataset

*This dataset supports online use.Click here to jump.

Citation

Related Datasets

MAKIEVAL Multilingual Cultural Knowledge Assessment Dataset

SAM 3D Artist Objects 3D Object Reconstruction Dataset

Nemotron-SFT-Math-v4 Mathematical Inference SFT Dataset

FigureBench Scientific Illustration Generation Benchmark Dataset

ChartNet Chart Understanding Multimodal Dataset

EAVSD E-commerce Advertising Video Storyboard Dataset

chi-bench Medical Intelligent Agent Benchmark Evaluation Dataset

ViMU Video Metaphor Understanding Dataset

MemLens Multimodal Long Context Benchmark Dataset

VisCoR-55K Visual Inference Dataset

MathNet Multimodal Mathematical Benchmark Inference Dataset

Claw-Eval Real-World Benchmark Dataset

Breast Cancer: Multi-Modal Fusion Dataset

QCalEval Quantum Calibration Graph Understanding Dataset

RSRCC Remote Sensing Area Change Understanding Benchmark Dataset

OpenMementos Context Memory Compressed Dataset

BRIGHT Disaster Building Assessment Dataset

OmniParsingBench Multimodal Parsing Capability Evaluation Dataset

MDPBench Multilingual Document Parsing Benchmark Dataset

GPT-5.4-step-by-step-reasoning Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

We-Math2.0-Standard Visual Mathematical Reasoning Benchmark Dataset

*This dataset supports online use.Click here to jump.

Citation

Related Datasets

MAKIEVAL Multilingual Cultural Knowledge Assessment Dataset

SAM 3D Artist Objects 3D Object Reconstruction Dataset

Nemotron-SFT-Math-v4 Mathematical Inference SFT Dataset

FigureBench Scientific Illustration Generation Benchmark Dataset

ChartNet Chart Understanding Multimodal Dataset

EAVSD E-commerce Advertising Video Storyboard Dataset

chi-bench Medical Intelligent Agent Benchmark Evaluation Dataset

ViMU Video Metaphor Understanding Dataset

MemLens Multimodal Long Context Benchmark Dataset

VisCoR-55K Visual Inference Dataset

MathNet Multimodal Mathematical Benchmark Inference Dataset

Claw-Eval Real-World Benchmark Dataset

Breast Cancer: Multi-Modal Fusion Dataset

QCalEval Quantum Calibration Graph Understanding Dataset

RSRCC Remote Sensing Area Change Understanding Benchmark Dataset

OpenMementos Context Memory Compressed Dataset

BRIGHT Disaster Building Assessment Dataset

OmniParsingBench Multimodal Parsing Capability Evaluation Dataset

MDPBench Multilingual Document Parsing Benchmark Dataset

GPT-5.4-step-by-step-reasoning Dataset

Build AI with AI

HyperAI Newsletters

Related Datasets

MAKIEVAL Multilingual Cultural Knowledge Assessment Dataset

SAM 3D Artist Objects 3D Object Reconstruction Dataset

Nemotron-SFT-Math-v4 Mathematical Inference SFT Dataset

FigureBench Scientific Illustration Generation Benchmark Dataset

ChartNet Chart Understanding Multimodal Dataset

EAVSD E-commerce Advertising Video Storyboard Dataset

chi-bench Medical Intelligent Agent Benchmark Evaluation Dataset

ViMU Video Metaphor Understanding Dataset

MemLens Multimodal Long Context Benchmark Dataset

VisCoR-55K Visual Inference Dataset

MathNet Multimodal Mathematical Benchmark Inference Dataset

Claw-Eval Real-World Benchmark Dataset

Breast Cancer: Multi-Modal Fusion Dataset

QCalEval Quantum Calibration Graph Understanding Dataset

RSRCC Remote Sensing Area Change Understanding Benchmark Dataset

OpenMementos Context Memory Compressed Dataset

BRIGHT Disaster Building Assessment Dataset

OmniParsingBench Multimodal Parsing Capability Evaluation Dataset

MDPBench Multilingual Document Parsing Benchmark Dataset