HyperAIHyperAI

Command Palette

Search for a command to run...

kyutai-tts-1.6 b-en_fr Audio Generation

Date

a month ago

Size

543.77 MB

License

Apache 2.0

Paper URL

arxiv.org

1. Tutorial Introduction

Model License

Kyutai TTS 1.6B (en-fr) is a large-scale English-French bilingual speech model released by the Kyutai team on October 15, 2024. In streaming TTS benchmarks, this model outperforms traditional offline TTS by 751 TP3T and 421 TP3T in the "real-time output of long texts" and "bilingual prosodic naturalness" categories, respectively. It also achieved state-of-the-art performance in TTS benchmarks such as Moshi Benchmark and Audio-Language Alignment Dataset. Furthermore, the model demonstrates features rarely seen in previous systems, including input-output streaming generation, zero-shot switching between English and French, speech selection based on pre-computed embeddings, and fast inference with dynamically adjusted audio token counts. Related paper results are available. Streaming Sequence-to-Sequence Learning with Delayed Streams Modeling.

This tutorial uses a single RTX 4090 graphics card. Only English and French are supported.

2. Project Examples

standard-tts

streaming-tts

3. Operation steps

1. After starting the container, click the API address to enter the Web interface

2. Usage steps

If "Bad Gateway" is displayed, it means the model is initializing. Please wait approximately 2-3 minutes and then refresh the page. When using the Safari browser, audio may not play directly and needs to be downloaded first.

Citation Information

@techreport{kyutai2025streaming,
      title={Streaming Sequence-to-Sequence Learning with Delayed Streams Modeling}, 
      author={Neil Zeghidour and Eugene Kharitonov and Manu Orsini and Václav Volhejn and Gabriel de Marmiesse and Edouard Grave and Patrick Pérez and Laurent Mazaré and Alexandre Défossez},
      year={2025},
      eprint={2509.08753},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2509.08753}, 
}

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing

HyperAI Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp