HyperAI超神経

Text-to-Speech Synthesis on LJSpeech

Evaluation Metrics

Audio Quality MOS

Evaluation Results

Performance results of each model on this benchmark

Comparison Table
Model name                                     Audio Quality MOS
fastspeech-fast-robust-and-controllable-text   3.84
fastdiff-a-fast-conditional-diffusion-model    4.03
naturalspeech-end-to-end-text-to-speech        4.34
grad-tts-a-diffusion-probabilistic-model-for   4.37
flowtron-an-autoregressive-flow-based-
neural-speech-synthesis-with-transformer       3.88
naturalspeech-end-to-end-text-to-speech        4.43
matcha-tts-a-fast-tts-architecture-with-
flowtron-an-autoregressive-flow-based-
fastspeech-2-fast-and-high-quality-end-to-end  4.32
Model 111.25
fastdiff-a-fast-conditional-diffusion-model    4.28
overflow-putting-flows-on-top-of-neural        3.37
naturalspeech-end-to-end-text-to-speech        4.56
fastspeech-fast-robust-and-controllable-text   2.4
glow-tts-a-generative-flow-for-text-to-speech  4.34