Date: a year ago
Size: 72.37 GB
License: Apache 2.0
Tags: LLM, Mathematics, Reasoning, Natural Language Processing

Reasoning-v1-20m is a large-scale reasoning dataset released by Glaiveai in 2025. It contains about 20 million reasoning traces covering complex problems in fields such as mathematics, programming, and science. By providing worked examples of the reasoning process, it aims to help models learn complex reasoning and improve their performance on multi-step reasoning tasks.

The dataset is characterized by its large volume and the diversity of its reasoning tasks. Beyond its broad subject coverage, it provides a detailed chain of thought (CoT) for each question, so a model can follow the step-by-step path from question to answer. This structured form gives model training rich material for learning and refining reasoning strategies.

The dataset is used in natural language processing and artificial intelligence, particularly for training and optimizing reasoning models. It can help models answer complex problems more accurately and coherently, for example when solving mathematical problems, solving programming problems, or reasoning about scientific questions. It can also be used to study the effectiveness of different reasoning strategies and to advance natural language processing techniques for reasoning tasks.
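To make the question, chain of thought, and answer structure concrete, here is a minimal Python sketch of how one record might be split into those three parts. The field names `prompt` and `response`, the `<think>...</think>` delimiters, and the sample record are all assumptions made for illustration; this page does not document the dataset's actual schema, so check the dataset's README before relying on any of them.

```python
import re
from dataclasses import dataclass

# ASSUMPTION: the reasoning trace is wrapped in <think>...</think> inside the
# response text. This delimiter is hypothetical and may not match the dataset.
THINK_PATTERN = re.compile(r"<think>(.*?)</think>", re.DOTALL)


@dataclass
class ReasoningExample:
    question: str
    reasoning: str  # the chain of thought, empty if none was found
    answer: str     # the text that follows the reasoning block


def split_trace(prompt: str, response: str) -> ReasoningExample:
    """Separate a response into its chain of thought and its final answer."""
    match = THINK_PATTERN.search(response)
    if match is None:
        # No reasoning block found: treat the whole response as the answer.
        return ReasoningExample(prompt.strip(), "", response.strip())
    reasoning = match.group(1).strip()
    # Everything after the closing tag is treated as the final answer.
    answer = response[match.end():].strip()
    return ReasoningExample(prompt.strip(), reasoning, answer)


# Hypothetical record, written by hand for this sketch (not taken from the dataset).
record = {
    "prompt": "What is 12 * 15?",
    "response": (
        "<think>12 * 15 = 12 * 10 + 12 * 5 = 120 + 60 = 180.</think>\n"
        "The answer is 180."
    ),
}

example = split_trace(record["prompt"], record["response"])
print(example.reasoning)
print(example.answer)
```

Keeping the reasoning and the answer in separate fields makes it straightforward to train on the full trace, on the answer alone, or to compare the two, which is one way the reasoning strategies mentioned above could be studied.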

reasoning-v1-20m.torrent

Seeding: 1 | Downloading: 0 | Completed: 85 | Total Downloads: 191

reasoning-v1-20m/
- README.md
  1.85 KB
- README.txt
  3.7 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Related Datasets

- GPT-5.4-step-by-step-reasoning Dataset (2 months ago)
- Nemotron Personas France (French Synthetic Personas Dataset) (3 months ago)
- Sutra 10B Pretraining Teaching and Training Dataset (3 months ago)
- CHIMERA General Inference Synthetic Dataset (2 days ago)
- Open-RL Inference Problem Dataset (4 months ago)
- RoVid-X Robot Video Generation Dataset (2 days ago)
