HyperAIHyperAI

Command Palette

Search for a command to run...

Deploying VibeThinker-1.5B With vLLM+OpenWebUI

1. Tutorial Introduction

Build

VibeThinker-1.5B is the first open-source large-scale model released by Weibo AI in November 2025. VibeThinker-1.5B's powerful capabilities don't rely on simply piling on parameters; instead, they stem from the SSP training concept proposed by Weibo's developers. This concept encourages the model to explore all possible solution paths during the learning phase, rather than solely focusing on accuracy. Subsequently, reinforcement learning is used for efficient policy optimization, precisely locking in the correct path and maximizing model performance. The related paper is titled "Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B".

This tutorial uses a single RTX 5090 graphics card as the default resource, but a single RTX 4090 graphics card is also possible. Asking questions in English is recommended, as the model only supports English answers.

This model is recommended for solving competitive-style mathematical and algorithmic programming problems.

2. Effect display

3. Operation steps

1. Start the container

2. Usage steps

If "Model" is not displayed, it means the model is being initialized. Since the model is large, please wait about 2-3 minutes and refresh the page.

Citation Information

The citation information for this project is as follows:

@misc{xu2025tinymodelbiglogic,
      title={Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B}, 
      author={Sen Xu and Yi Zhou and Wei Wang and Jixin Min and Zhibin Yin and Yingwei Dai and Shixi Liu and Lianyu Pang and Yirong Chen and Junlin Zhang},
      year={2025},
      eprint={2511.06221},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2511.06221}, 
}

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Deploying VibeThinker-1.5B With vLLM+OpenWebUI | Tutorials | HyperAI