HyperAI

OpenThoughts-114k Reasoning Dataset

Date

3 months ago

Size

922.07 MB

Organization

Publish URL

github.com

License

Apache 2.0

*This dataset supports online use.Click here to jump.

OpenThoughts-114k is an open source reasoning dataset that focuses on areas such as mathematics, code, science, and puzzles, and contains 114,000 high-quality samples. The dataset was released by Open Thoughts in 2025 and aims to train small reasoning models to outperform existing large models (such as DeepSeek-R1-Distill-Qwen-32B and DeepSeek-R1-Distill-Qwen-7B) on mathematical and code reasoning tasks.

Dataset generation process
OpenThoughts-114k.torrent
Seeding 2Downloading 1Completed 64Total Downloads 122
  • OpenThoughts-114k/
    • README.md
      1.12 KB
    • README.txt
      2.25 KB
      • data/
        • OpenThoughts-114k.zip
          922.07 MB