HyperAI

Speech Recognition On Gigaspeech Dev

Metrics

Word Error Rate (WER)

Results

Performance results of various models on this benchmark

Model Name
Word Error Rate (WER)
Paper TitleRepository
Zipformer+pruned transducer w/ CR-CTC (no external language model)9.95CR-CTC: Consistency regularization on CTC for improved speech recognition
Zipformer+CR-CTC (no external language model)10.15CR-CTC: Consistency regularization on CTC for improved speech recognition
Zipformer+pruned transducer (no external language model)10.09CR-CTC: Consistency regularization on CTC for improved speech recognition
SAMBA ASR9.12Samba-ASR: State-Of-The-Art Speech Recognition Leveraging Structured State-Space Models-
Conformer/Transformer-AED10.90GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio
0 of 5 row(s) selected.