wav2vec 2.0 XLS-R 1B + TEVR (5-gram) | 3.64% | TEVR: Improving Speech Recognition by Token Entropy Variance Reduction | |
wav2vec 2.0 XLS-R 1B + TEVR (4-gram) | 3.70% | TEVR: Improving Speech Recognition by Token Entropy Variance Reduction | |
Conformer Transducer (no LM) | 6.28% | Automatic Speech Recognition in German: A Detailed Error Analysis | - |
QuartzNet15x5DE (CV-only, 5-gram) | 7.7% | Scribosermo: Fast Speech-to-Text models for German and other Languages | |
wav2vec 2.0 XLS-R 1B + TEVR (no LM) | 10.10% | TEVR: Improving Speech Recognition by Token Entropy Variance Reduction | |