Date

a year ago

Size

1.64 GB

Organization

Paper URL

Tags

II-Thought-RL-v0 is a large-scale, multi-task dataset designed for reinforcement learning and problem solving. It was released by Intelligent Internet in March 2025. The relevant blog is "II-Thought". It contains high-quality question-answer pairs that have been strictly filtered in multiple steps, covering multiple fields such as mathematics, programming, science, etc. The question pairs in the dataset are not only from public datasets, but also contain customized high-quality question pairs to ensure the diversity and practicality of the data. In terms of data processing, II-Thought-RL-v0 uses Gemini 2.0 Flash and Qwen 32B as quality assessment tools, and has gone through processes such as deduplication, quality assessment, and decontamination to ensure data integrity and training suitability. This high-quality data screening and processing method makes the dataset very suitable for training reinforcement learning models, helping the models to show higher accuracy and logic in solving complex problems. The application scenarios of this dataset are mainly concentrated in the fields of reinforcement learning and problem answering. By providing rich reasoning chains and complex problems in multiple fields, II-Thought-RL-v0 provides strong support for model training and can help the model better understand and generate complex reasoning processes.

II-Thought-RL-v0.torrent

Seeding 1Downloading 0Completed 140Total Downloads 306

II-Thought-RL-v0/
- README.md
  1.87 KB
- README.txt
  3.74 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.