Speech Recognition on GigaSpeech Dev
Evaluation Metric
Word Error Rate (WER)
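For reference, WER is conventionally defined as (S + D + I) / N, where S, D, and I are the numbers of word substitutions, deletions, and insertions needed to turn the reference transcript into the hypothesis, and N is the number of reference words. The sketch below computes it with a word-level edit distance and reports a percentage, on the assumption that the scores listed here are percentages. The function name `word_error_rate` is hypothetical, and the whitespace tokenization is a simplification: the scoring pipeline actually used for this benchmark (including any text normalization) is not described here and may differ.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER as a percentage: 100 * (S + D + I) / N.

    Assumption: words are separated by whitespace and compared exactly;
    no casing or punctuation normalization is applied.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion (reference word dropped)
                curr[j - 1] + 1,     # insertion (extra hypothesis word)
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr

    return 100.0 * prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution (sat -> sit) and one deletion (the) over 6 words.
    score = word_error_rate("the cat sat on the mat", "the cat sit on mat")
    print(round(score, 2))  # 33.33
```

Because insertions count as errors, WER computed this way can exceed 100 when the hypothesis is much longer than the reference.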
Evaluation Results
Performance of each model on this benchmark
Model | Word Error Rate (WER) | Paper Title | Repository |
---|---|---|---|
Zipformer+pruned transducer w/ CR-CTC (no external language model) | 9.95 | CR-CTC: Consistency regularization on CTC for improved speech recognition | |
Zipformer+CR-CTC (no external language model) | 10.15 | CR-CTC: Consistency regularization on CTC for improved speech recognition | |
Zipformer+pruned transducer (no external language model) | 10.09 | CR-CTC: Consistency regularization on CTC for improved speech recognition | |
SAMBA ASR | 9.12 | Samba-ASR: State-Of-The-Art Speech Recognition Leveraging Structured State-Space Models | - |
Conformer/Transformer-AED | 10.90 | GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio | |