OpenThoughts3-1.2M Reasoning Dataset
OpenThoughts3-1.2M is an open-source reasoning dataset released by Open Thoughts in 2025. It is the third iteration of the OpenThoughts dataset series. The accompanying paper is "OpenThoughts: Data Recipes for Reasoning Models".
The dataset contains 850,000 math problems, 250,000 coding problems, and 100,000 science problems, for a total of 1.2 million examples. The annotations were generated with the QwQ-32B model.
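
As a quick sanity check on the composition described above, the following minimal sketch sums the three domain counts and prints each domain's share of the total. The dictionary keys and the helper name `domain_shares` are illustrative choices and are not taken from the dataset itself. Only the three counts come from the description.

```python
# Domain counts as stated in the dataset description.
# The key names ("math", "code", "science") are illustrative labels,
# not necessarily the field values used in the released dataset.
DOMAIN_COUNTS = {
    "math": 850_000,
    "code": 250_000,
    "science": 100_000,
}


def domain_shares(counts):
    """Return each domain's fraction of the total number of examples."""
    total = sum(counts.values())
    return {name: n / total for name, n in counts.items()}


total = sum(DOMAIN_COUNTS.values())
shares = domain_shares(DOMAIN_COUNTS)

for name, share in shares.items():
    print(f"{name}: {DOMAIN_COUNTS[name]:,} examples ({share:.1%})")
print(f"total: {total:,} examples")
```

The counts sum to 1,200,000, which matches the "1.2M" in the dataset name. Math problems make up roughly 71% of the examples.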

Dataset Framework