HyperAI

S1k Reasoning Problem Dataset

Date: 3 months ago
Size: 5.83 MB
Publish URL: github.com


The s1K dataset is a high-quality reasoning dataset released in 2025 by Fei-Fei Li's team. It contains 1,000 questions, each paired with a detailed reasoning trajectory and an answer. The trajectories and answers were distilled from Google's Gemini Thinking Experimental model. The questions span 50 different fields, including probability theory, quantitative interview questions, and Olympiad problems, so that a model trained on them is exposed to a wide range of reasoning tasks. The accompanying paper is "s1: Simple test-time scaling".

The dataset is designed to show that a model can be fine-tuned efficiently with minimal data engineering and still perform strongly on reasoning tasks.
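As a rough illustration of how a single question, reasoning trajectory, and answer might be combined into one fine-tuning example, here is a minimal sketch. The field names `question`, `reasoning`, and `answer`, the `ReasoningRecord` class, and the "Question/Reasoning/Answer" layout are all assumptions made for illustration; they are not the dataset's actual schema or the template used in the paper.

```python
from dataclasses import dataclass


@dataclass
class ReasoningRecord:
    # Hypothetical field names; the real s1K schema may differ.
    question: str
    reasoning: str
    answer: str


def to_training_text(rec: ReasoningRecord) -> str:
    """Join question, reasoning trace, and answer into one training string."""
    return (
        f"Question: {rec.question.strip()}\n"
        f"Reasoning: {rec.reasoning.strip()}\n"
        f"Answer: {rec.answer.strip()}"
    )


# A made-up record, not taken from the dataset.
example = ReasoningRecord(
    question="What is the probability of two heads in two fair coin flips?",
    reasoning="Each flip is heads with probability 1/2 and the flips are "
              "independent, so the probability is 1/2 * 1/2 = 1/4.",
    answer="1/4",
)

print(to_training_text(example))
```

Keeping the reasoning trace between the question and the answer means the model sees the intermediate steps before the final result, which is the general idea behind training on distilled trajectories.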

s1K.torrent
Seeding 1 | Downloading 1 | Completed 29 | Total Downloads 69
  • s1K/
    • README.md
      1.25 KB
    • README.txt
      2.51 KB
    • data/
      • s1k.zip
        5.83 MB
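The listing does not say what `s1k.zip` contains, so a reasonable first step after downloading is to inspect the archive before loading anything. The sketch below uses only Python's standard `zipfile` module. The local path `s1K/data/s1k.zip` is an assumption based on the file tree above, and `list_archive` is a hypothetical helper name.

```python
import zipfile
from pathlib import Path


def list_archive(path):
    """Return (name, size_in_bytes) for every file entry in a zip archive."""
    with zipfile.ZipFile(path) as zf:
        return [
            (info.filename, info.file_size)
            for info in zf.infolist()
            if not info.is_dir()  # skip directory entries
        ]


if __name__ == "__main__":
    # Assumed location after download; adjust to wherever the torrent was saved.
    archive = Path("s1K/data/s1k.zip")
    if archive.exists():
        for name, size in list_archive(archive):
            print(f"{size:>12,d}  {name}")
    else:
        print(f"{archive} not found; download the dataset first")
```

Once the file names and extensions are known, the appropriate reader (for example a JSON or Parquet loader) can be chosen accordingly.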