1. Tutorial Introduction

VibeThinker-1.5B is the first open-source large-scale model released by Weibo AI in November 2025. VibeThinker-1.5B's powerful capabilities don't rely on simply piling on parameters; instead, they stem from the SSP training concept proposed by Weibo's developers. This concept encourages the model to explore all possible solution paths during the learning phase, rather than solely focusing on accuracy. Subsequently, reinforcement learning is used for efficient policy optimization, precisely locking in the correct path and maximizing model performance. Related research papers are available. Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B .

This tutorial uses a single RTX 5090 graphics card as the default resource, but a single RTX 4090 graphics card is also possible. Asking questions in English is recommended, as the model only supports English answers.

This model is recommended for solving competitive-style mathematical and algorithmic programming problems.

Citation Information

The citation information for this project is as follows:

@misc{xu2025tinymodelbiglogic, title={Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B}, author={Sen Xu and Yi Zhou and Wei Wang and Jixin Min and Zhibin Yin and Yingwei Dai and Shixi Liu and Lianyu Pang and Yirong Chen and Junlin Zhang}, year={2025}, eprint={2511.06221}, archivePrefix={arXiv}, primaryClass={cs.AI}, url={https://arxiv.org/abs/2511.06221}, }

HyperAI

Run this Notebook

Date

3 months ago

Size

1.12 MB

1. Tutorial Introduction

This tutorial uses a single RTX 5090 graphics card as the default resource, but a single RTX 4090 graphics card is also possible. Asking questions in English is recommended, as the model only supports English answers.

This model is recommended for solving competitive-style mathematical and algorithmic programming problems.

2. Effect display

3. Operation steps

1. Start the container

2. Usage steps

If "Model" is not displayed, it means the model is being initialized. Since the model is large, please wait about 2-3 minutes and refresh the page.

Citation Information

The citation information for this project is as follows:

@misc{xu2025tinymodelbiglogic,
      title={Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B}, 
      author={Sen Xu and Yi Zhou and Wei Wang and Jixin Min and Zhibin Yin and Yingwei Dai and Shixi Liu and Lianyu Pang and Yirong Chen and Junlin Zhang},
      year={2025},
      eprint={2511.06221},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2511.06221}, 
}

This notebook is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Related Notebooks

SoulX-Podcast: Podcast-quality long-text Speech Generation for Multiple dialects.

2 months ago

Depth-Anything-3: Restoring Visual Space From Any Perspective

2 months ago

PixelReasoner-RL: Pixel-level Visual Inference Model

2 months ago

Z-Image-Turbo: A High-Efficiency 6B-Parameter Image Generation Model

2 months ago

Supertonic: A high-speed TTS Speech Synthesis Model Based on ONNX

2 months ago

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Run this Notebook

Date

3 months ago

Size

1.12 MB

1. Tutorial Introduction

This tutorial uses a single RTX 5090 graphics card as the default resource, but a single RTX 4090 graphics card is also possible. Asking questions in English is recommended, as the model only supports English answers.

This model is recommended for solving competitive-style mathematical and algorithmic programming problems.

2. Effect display

3. Operation steps

1. Start the container

2. Usage steps

If "Model" is not displayed, it means the model is being initialized. Since the model is large, please wait about 2-3 minutes and refresh the page.

Citation Information

The citation information for this project is as follows:

@misc{xu2025tinymodelbiglogic,
      title={Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B}, 
      author={Sen Xu and Yi Zhou and Wei Wang and Jixin Min and Zhibin Yin and Yingwei Dai and Shixi Liu and Lianyu Pang and Yirong Chen and Junlin Zhang},
      year={2025},
      eprint={2511.06221},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2511.06221}, 
}

Related Notebooks

llama.cpp+openwebui Deploy Qwen3-VL-8B-Instruct-GGUF

4 days ago

HunyuanOCR: Tencent Hunyuan End-to-End OCR

2 months ago

SAM3: Visual Segmentation Model

2 months ago

Fara-7B: A Highly Efficient Web-Based Intelligent Agent Model

20 days ago

SoulX-Podcast: Podcast-quality long-text Speech Generation for Multiple dialects.

2 months ago

Depth-Anything-3: Restoring Visual Space From Any Perspective

2 months ago

PixelReasoner-RL: Pixel-level Visual Inference Model

2 months ago

Z-Image-Turbo: A High-Efficiency 6B-Parameter Image Generation Model

2 months ago

Supertonic: A high-speed TTS Speech Synthesis Model Based on ONNX

2 months ago

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

Deploying VibeThinker-1.5B With vLLM+OpenWebUI

1. Tutorial Introduction

2. Effect display

3. Operation steps

1. Start the container

2. Usage steps

Citation Information

Build AI with AI

HyperAI Newsletters

Command Palette

Deploying VibeThinker-1.5B With vLLM+OpenWebUI

1. Tutorial Introduction

2. Effect display

3. Operation steps

1. Start the container

2. Usage steps

Citation Information

Related Notebooks

llama.cpp+openwebui Deploy Qwen3-VL-8B-Instruct-GGUF

HunyuanOCR: Tencent Hunyuan End-to-End OCR

SAM3: Visual Segmentation Model

Fara-7B: A Highly Efficient Web-Based Intelligent Agent Model

SoulX-Podcast: Podcast-quality long-text Speech Generation for Multiple dialects.

Depth-Anything-3: Restoring Visual Space From Any Perspective

PixelReasoner-RL: Pixel-level Visual Inference Model

Z-Image-Turbo: A High-Efficiency 6B-Parameter Image Generation Model

Supertonic: A high-speed TTS Speech Synthesis Model Based on ONNX

Build AI with AI

HyperAI Newsletters

Command Palette

Deploying VibeThinker-1.5B With vLLM+OpenWebUI

1. Tutorial Introduction

2. Effect display

3. Operation steps

1. Start the container

2. Usage steps

Citation Information

Related Notebooks

llama.cpp+openwebui Deploy Qwen3-VL-8B-Instruct-GGUF

HunyuanOCR: Tencent Hunyuan End-to-End OCR

SAM3: Visual Segmentation Model

Fara-7B: A Highly Efficient Web-Based Intelligent Agent Model

SoulX-Podcast: Podcast-quality long-text Speech Generation for Multiple dialects.

Depth-Anything-3: Restoring Visual Space From Any Perspective

PixelReasoner-RL: Pixel-level Visual Inference Model

Z-Image-Turbo: A High-Efficiency 6B-Parameter Image Generation Model

Supertonic: A high-speed TTS Speech Synthesis Model Based on ONNX

Build AI with AI

HyperAI Newsletters

Related Notebooks

llama.cpp+openwebui Deploy Qwen3-VL-8B-Instruct-GGUF

HunyuanOCR: Tencent Hunyuan End-to-End OCR

SAM3: Visual Segmentation Model

Fara-7B: A Highly Efficient Web-Based Intelligent Agent Model

SoulX-Podcast: Podcast-quality long-text Speech Generation for Multiple dialects.

Depth-Anything-3: Restoring Visual Space From Any Perspective

PixelReasoner-RL: Pixel-level Visual Inference Model

Z-Image-Turbo: A High-Efficiency 6B-Parameter Image Generation Model

Supertonic: A high-speed TTS Speech Synthesis Model Based on ONNX

Related Notebooks

llama.cpp+openwebui Deploy Qwen3-VL-8B-Instruct-GGUF

HunyuanOCR: Tencent Hunyuan End-to-End OCR

SAM3: Visual Segmentation Model

Fara-7B: A Highly Efficient Web-Based Intelligent Agent Model

SoulX-Podcast: Podcast-quality long-text Speech Generation for Multiple dialects.

Depth-Anything-3: Restoring Visual Space From Any Perspective

PixelReasoner-RL: Pixel-level Visual Inference Model

Z-Image-Turbo: A High-Efficiency 6B-Parameter Image Generation Model

Supertonic: A high-speed TTS Speech Synthesis Model Based on ONNX