HyperAIHyperAI

Command Palette

Search for a command to run...

MathNet Multimodal Mathematical Benchmark Inference Dataset

Date

in 4 hours

Organization

MIT

Paper URL

2604.18584

License

CC BY 4.0

MathNet is a large-scale, multilingual, multimodal mathematical reasoning dataset released in 2026 by a team from MIT in collaboration with King Abdullah University of Science and Technology and other institutions. The related research papers are as follows: MathNet: a Global Multimodal Benchmark for Mathematical Reasoning and RetrievalIt aims to evaluate and improve the capabilities of large models in Olympic-level mathematical reasoning and structured retrieval tasks, and is widely used in mathematical reasoning evaluation, RAG research, and multimodal AI training. This dataset, version v0, contains 27,817 expert-level math problems and their standard solutions. It covers official math competition problems from 58 countries and regions in 17 languages, including 5,148 illustrated problems with a total of 7,541 geometric and graphical illustrations. The dataset covers algebra, geometry, number theory, combinatorics, calculus, probability and statistics, and other Olympiad math knowledge systems. It supports three benchmark tasks: solving math problems, mathematical semantic retrieval (identifying structurally equivalent and similar problems), and retrieval enhancement problem solving.

Dataset Overview
Dataset Overview

Citation

@inproceedings{alshammari2026mathnet,
title = {MathNet: A Global Multimodal Benchmark for Mathematical
Reasoning and Retrieval},
author = {Alshammari, Shaden and Wen, Kevin and Zainal, Abrar and
Hamilton, Mark and Safaei, Navid and Albarakati, Sultan and
Freeman, William T. and Torralba, Antonio},
booktitle = {International Conference on Learning Representations},
year = {2026},
url = {https://mathnet.mit.edu}
}

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing

HyperAI Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp