1. Tutorial Introduction

Seed-OSS-36B-Instruct is an open-source large language model released by the ByteDance Seed team in August 2025. It was trained on 12 trillion (12T) tokens and performs strongly on several mainstream open-source benchmarks. The Seed-OSS-36B architecture combines several common design choices: causal language modeling, grouped-query attention (GQA), the SwiGLU activation function, RMSNorm, and RoPE positional encoding. Its most distinctive feature is native long-context support, with a maximum context length of 512k tokens, which lets it handle extremely long documents and reasoning chains without sacrificing performance. This length is twice that of OpenAI's GPT-5 model series and corresponds to approximately 1,600 pages of text.

This tutorial deploys the model with vLLM and Open WebUI on two NVIDIA RTX A6000 GPUs.
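As a rough illustration of why two cards are used, the sketch below estimates the memory needed for the model weights alone. It assumes 36 billion parameters (taken from the model name), 16-bit weights (2 bytes per parameter), and 48 GB of memory per RTX A6000. It ignores the KV cache, activations, and framework overhead, so treat the numbers as a lower bound rather than a sizing guide.

```python
def weight_memory_gb(num_params: float, bytes_per_param: int = 2) -> float:
    """Approximate memory for the model weights alone, in GB (10^9 bytes)."""
    return num_params * bytes_per_param / 1e9


# Assumptions for illustration, not values stated in the tutorial:
PARAMS = 36e9        # parameter count inferred from the "36B" in the model name
GPU_MEMORY_GB = 48   # memory of a single RTX A6000
NUM_GPUS = 2         # "dual-card" setup from the tutorial

weights_gb = weight_memory_gb(PARAMS)      # 16-bit weights
total_gb = GPU_MEMORY_GB * NUM_GPUS        # combined memory of both cards
headroom_gb = total_gb - weights_gb        # left for KV cache and overhead

print(f"weights: {weights_gb:.0f} GB")
print(f"one card: {GPU_MEMORY_GB} GB, two cards: {total_gb} GB")
print(f"headroom across both cards: {headroom_gb:.0f} GB")
```

Under these assumptions the weights do not fit on a single card, which is consistent with splitting the model across both GPUs. The remaining headroom also limits how much of the 512k-token context can be cached in practice.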

4. Discussion

🖌️ If you come across a high-quality project, please leave us a message to recommend it! We have also set up a tutorial discussion group. Scan the QR code and add the note [SD Tutorial] to join the group, discuss technical issues, and share your results.


2. Demo

3. Operation Steps

1. Start the container

2. Usage steps

If "Model" is not displayed, the model is still initializing. Because the model is large, wait about 4-5 minutes and then refresh the page.
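Instead of refreshing the page by hand, readiness can also be checked from a script. The sketch below polls the model list of an OpenAI-compatible server, which is the kind of API vLLM exposes. The base URL `http://localhost:8000` and the helper names `list_models` and `wait_for_model` are assumptions for illustration; use whatever address the container actually exposes.

```python
import json
import time
import urllib.error
import urllib.request


def list_models(base_url: str, timeout: float = 5.0) -> list:
    """Return model IDs from GET {base_url}/v1/models, or [] if the server is not ready."""
    try:
        with urllib.request.urlopen(f"{base_url}/v1/models", timeout=timeout) as resp:
            payload = json.load(resp)
    except (urllib.error.URLError, OSError, ValueError):
        # Connection refused, timeout, or a non-JSON reply: treat as "not ready yet".
        return []
    return [model["id"] for model in payload.get("data", [])]


def wait_for_model(base_url, max_wait_s=600.0, interval_s=15.0,
                   fetch=list_models, sleep=time.sleep):
    """Poll until at least one model is listed; return [] if max_wait_s elapses first."""
    waited = 0.0
    while True:
        models = fetch(base_url)
        if models:
            return models
        if waited >= max_wait_s:
            return []
        sleep(interval_s)
        waited += interval_s


# Example call (hypothetical address; adjust to your deployment):
# print(wait_for_model("http://localhost:8000"))
```

The default of 600 seconds leaves some margin over the 4-5 minutes mentioned above. The `fetch` and `sleep` parameters exist only so the loop can be exercised without a running server.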


Citation Information

The citation information for this project is as follows:

@misc{seed2025seed-oss,
  author={ByteDance Seed Team},
  title={Seed-OSS Open-Source Models},
  year={2025},
  howpublished={\url{https://github.com/ByteDance-Seed/seed-oss}}
}

This notebook is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us for prompt review and removal.

