Date

a year ago

Size

47.22 GB

Organization

Publish URL

github.com

Paper URL

arxiv.org

Citation

If you find our work helpful to your research, please give us a star :star: and cite our work :pencil:

BibTeX

@misc{tian2025correctanswersequaldistillation,
  title={Not All Correct Answers Are Equal: Why Your Distillation Source Matters},
  author={Xiaoyu Tian and Yunjie Ji and Haotian Wang and Shuaiting Chen and Sitong Zhao and Yiping Peng and Han Zhao and Xiangang Li},
  year={2025},
  eprint={2505.14464},
  archivePrefix={arXiv},
  primaryClass={cs.CL},
  url={https://arxiv.org/abs/2505.14464},
}

@misc{ji2025amthinkingv1advancingfrontierreasoning,
  title={AM-Thinking-v1: Advancing the Frontier of Reasoning at 32B Scale},
  author={Yunjie Ji and Xiaoyu Tian and Sitong Zhao and Haotian Wang and Shuaiting Chen and Yiping Peng and Han Zhao and Xiangang Li},
  year={2025},
  eprint={2505.08311},
  archivePrefix={arXiv},
  primaryClass={cs.CL},
  url={https://arxiv.org/abs/2505.08311},
}

@misc{tian2025exploringpotentialofflinerl,
  title={Exploring the Potential of Offline RL for Reasoning in LLMs: A Preliminary Study},
  author={Xiaoyu Tian and Sitong Zhao and Haotian Wang and Shuaiting Chen and Yiping Peng and Yunjie Ji and Han Zhao and Xiangang Li},
  year={2025},
  eprint={2505.02142},
  archivePrefix={arXiv},
  primaryClass={cs.CL},
  url={https://arxiv.org/abs/2505.02142},
}

@misc{tian2025deepdistillenhancingllmreasoning,
  title={DeepDistill: Enhancing LLM Reasoning Capabilities via Large-Scale Difficulty-Graded Data Training},
  author={Xiaoyu Tian and Sitong Zhao and Haotian Wang and Shuaiting Chen and Yiping Peng and Yunjie Ji and Han Zhao and Xiangang Li},
  year={2025},
  eprint={2504.17565},
  archivePrefix={arXiv},
  primaryClass={cs.CL},
  url={https://arxiv.org/abs/2504.17565},
}

@misc{wang2025leveragingreasoningmodelanswers,
  title={Leveraging Reasoning Model Answers to Enhance Non-Reasoning Model Capability},
  author={Haotian Wang and Han Zhao and Shuaiting Chen and Xiaoyu Tian and Sitong Zhao and Yunjie Ji and Yiping Peng and Xiangang Li},
  year={2025},
  eprint={2504.09639},
  archivePrefix={arXiv},
  primaryClass={cs.CL},
  url={https://arxiv.org/abs/2504.09639},
}

@misc{ji2025difficultyawarestagedreinforcementlearning,
  title={How Difficulty-Aware Staged Reinforcement Learning Enhances LLMs' Reasoning Capabilities: A Preliminary Experimental Study},
  author={Yunjie Ji and Sitong Zhao and Xiaoyu Tian and Haotian Wang and Shuaiting Chen and Yiping Peng and Han Zhao and Xiangang Li},
  year={2025},
  eprint={2504.00829},
  archivePrefix={arXiv},
  primaryClass={cs.CL},
  url={https://arxiv.org/abs/2504.00829},
}

@misc{tian2025thinktwiceenhancingllm,
  title={Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking},
  author={Xiaoyu Tian and Sitong Zhao and Haotian Wang and Shuaiting Chen and Yunjie Ji and Yiping Peng and Han Zhao and Xiangang Li},
  year={2025},
  eprint={2503.19855},
  archivePrefix={arXiv},
  primaryClass={cs.CL},
  url={https://arxiv.org/abs/2503.19855},
}

@misc{zhao202514millionopensourcedistilled,
  title={1.4 Million Open-Source Distilled Reasoning Dataset to Empower Large Language Model Training},
  author={Han Zhao and Haotian Wang and Yiping Peng and Sitong Zhao and Xiaoyu Tian and Shuaiting Chen and Yunjie Ji and Xiangang Li},
  year={2025},
  eprint={2503.19633},
  archivePrefix={arXiv},
  primaryClass={cs.CL},
  url={https://arxiv.org/abs/2503.19633},
}

AM-DeepSeek-R1-Distilled-1.4M.torrent

Seeding: 1, Downloading: 0, Completed: 122, Total Downloads: 272

AM-DeepSeek-R1-Distilled-1.4M/
- README.md
  1.8 KB
- README.txt
  3.6 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

AM-DeepSeek-R1-Distilled-1.4M Large-scale General Reasoning Task Dataset

Related Datasets

MAKIEVAL Multilingual Cultural Knowledge Assessment Dataset

Verbatim Spans Query Condition Evidence Extraction Dataset

SAM 3D Artist Objects 3D Object Reconstruction Dataset

Nemotron-SFT-Math-v4 Mathematical Inference SFT Dataset

TACK Targeted Chimera Knowledge Base Dataset

EAVSD E-commerce Advertising Video Storyboard Dataset

SMOL Multilingual Translation Parallel Dataset

chi-bench Medical Intelligent Agent Benchmark Evaluation Dataset

ViMU Video Metaphor Understanding Dataset

MemLens Multimodal Long Context Benchmark Dataset

VisCoR-55K Visual Inference Dataset

AgentTrove Intelligent Agent Interaction Trajectory Dataset

MathNet Multimodal Mathematical Benchmark Inference Dataset

Fundus Eye Disease Classification Dataset

QCalEval Quantum Calibration Graph Understanding Dataset

RSRCC Remote Sensing Area Change Understanding Benchmark Dataset

BRIGHT Disaster Building Assessment Dataset

OmniParsingBench Multimodal Parsing Capability Evaluation Dataset

MDPBench Multilingual Document Parsing Benchmark Dataset

GPT-5.4-step-by-step-reasoning Dataset
