Use this Dataset

Discuss on Discord

Date

a year ago

Size

5.43 MB

Organization

Paper URL

Tags

Image Understanding

The U-MATH dataset is a comprehensive benchmark test set specifically designed to evaluate the mathematical reasoning capabilities of large language models (LLMs). This dataset was created by Toloka AI and Gradarius in 2024. The relevant paper results are "U-MATH: A University-Level Benchmark for Evaluating Mathematical Skills in LLMsThis dataset contains 1,100 unpublished college-level math problems that are derived from real teaching materials and cover six core math topics: elementary mathematics, algebra, differential calculus, integral calculus, multivariable calculus, and sequences and series.

A notable feature of the U-MATH dataset is the multimodal questions it contains. About 20% of the questions involve visual elements such as graphs and charts, which increases the complexity of data processing and requires the model to be able to interpret and reason about graphical information. The features of the dataset include question ID, topic labels, whether it contains images, image data, question statements, and correct answers, which provide a comprehensive evaluation basis for the mathematical reasoning ability of the model.

U-MATH.torrent

Seeding 1Downloading 0Completed 154Total Downloads 274

U-MATH/
- README.md
  1.68 KB
- README.txt
  3.35 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Powered by MailChimp

Use this Dataset

Discuss on Discord

Date

a year ago

Size

5.43 MB

Organization

Paper URL

arxiv.org

Tags

Image Understanding

The U-MATH dataset is a comprehensive benchmark test set specifically designed to evaluate the mathematical reasoning capabilities of large language models (LLMs). This dataset was created by Toloka AI and Gradarius in 2024. The relevant paper results are "U-MATH: A University-Level Benchmark for Evaluating Mathematical Skills in LLMsThis dataset contains 1,100 unpublished college-level math problems that are derived from real teaching materials and cover six core math topics: elementary mathematics, algebra, differential calculus, integral calculus, multivariable calculus, and sequences and series.

A notable feature of the U-MATH dataset is the multimodal questions it contains. About 20% of the questions involve visual elements such as graphs and charts, which increases the complexity of data processing and requires the model to be able to interpret and reason about graphical information. The features of the dataset include question ID, topic labels, whether it contains images, image data, question statements, and correct answers, which provide a comprehensive evaluation basis for the mathematical reasoning ability of the model.

U-MATH.torrent

Seeding 1Downloading 0Completed 154Total Downloads 274

U-MATH/
- README.md
  1.68 KB
- README.txt
  3.35 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Powered by MailChimp