HyperAI

Deploy OpenMath-Nemotron-1.5B Using vLLM+Open WebUI

1. Tutorial Introduction

The computing resources of this tutorial use a single RTX 4090 card. This model only supports calculating mathematical problems, and the answers are in English.

OpenMath-Nemotron-1.5B was released by NVIDIA team NemoSkils on April 23, 2025. The model was created by fine-tuning Qwen/Qwen2.5-Math-1.5B on the OpenMathReasoning dataset. The model achieved state-of-the-art results on popular math benchmarks and has been licensed for commercial use.AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset".

2. Project Examples

3. Operation steps

1. Start the container

If "Model" is not displayed, it means the model is being initialized. Since the model is large, please wait about 2-3 minutes and refresh the page.

2. After entering the webpage, you can start a conversation with the model

4. Discussion

🖌️ If you see a high-quality project, please leave a message in the background to recommend it! In addition, we have also established a tutorial exchange group. Welcome friends to scan the QR code and remark [SD Tutorial] to join the group to discuss various technical issues and share application effects↓

Citation Information

Thanks to Github user SuperYang  Deployment of this tutorial. The reference information of this project is as follows:

@article{moshkov2025aimo2,
  title   = {AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset},
  author  = {Ivan Moshkov and Darragh Hanley and Ivan Sorokin and Shubham Toshniwal and Christof Henkel and Benedikt Schifferer and Wei Du and Igor Gitman},
  year    = {2025},
  journal = {arXiv preprint arXiv:2504.16891}
}