HyperAI

Speech Recognition On Librispeech Test Other

المقاييس

Word Error Rate (WER)

النتائج

نتائج أداء النماذج المختلفة على هذا المعيار القياسي

جدول المقارنة
اسم النموذجWord Error Rate (WER)
espresso-a-fast-end-to-end-neural-speech8.7
jasper-an-end-to-end-convolutional-neural7.84
end-to-end-asr-from-supervised-to-semi5.18
contextnet-improving-convolutional-neural4.1
samba-asr-state-of-the-art-speech-recognition2.48
specaugment-a-simple-data-augmentation-method5.8
jasper-an-end-to-end-convolutional-neural8.79
fully-convolutional-speech-recognition10.47
fadam-adam-is-a-natural-gradient-optimizer2.49
iterative-pseudo-labeling-for-speech3.83
conformer-convolution-augmented-transformer4.3
state-of-the-art-speech-recognition-using5.80
contextnet-improving-convolutional-neural4.5
hubert-self-supervised-speech-representation2.9
cr-ctc-consistency-regularization-on-ctc-for3.95
semi-supervised-speech-recognition-via-local15.28
mt4ssl-boosting-self-supervised-speech9.6
conformer-convolution-augmented-transformer5.0
graph-convolutions-enrich-the-self-attention4.94
cr-ctc-consistency-regularization-on-ctc-for4.35
squeezeformer-an-efficient-transformer-for5.97
crf-based-single-stage-acoustic-modeling-with10.65
speechstew-simply-mix-all-available-speech3.3
improved-noisy-student-training-for-automatic3.4
quartznet-deep-automatic-speech-recognition7.25
self-training-and-pre-training-are3.1
qwen-audio-advancing-universal-audio4.2
transformer-based-acoustic-modeling-for4.85
speechstew-simply-mix-all-available-speech4.0
asapp-asr-multistream-cnn-and-self-attentive4.46
rwth-asr-systems-for-librispeech-hybrid-vs5.0
specaugment-a-simple-data-augmentation-method6.5
improving-rnn-transducer-based-asr-with4.20
wav2vec-2-0-a-framework-for-self-supervised3.0
e-branchformer-branchformer-with-enhanced3.65
wavlm-large-scale-self-supervised-pre3.2
fast-simpler-and-more-accurate-hybrid-asr4.20
semi-supervised-speech-recognition-via-local20.84
librispeech-transducer-model-with-internal5.6
a-comparative-study-on-transformer-vs-rnn-in5.7
pushing-the-limits-of-semi-supervised2.6
end-to-end-asr-from-supervised-to-semi4.11
snips-voice-platform-an-embedded-spoken16.5
zipformer-a-faster-and-better-encoder-for4.38
conformer-convolution-augmented-transformer3.9
w2v-bert-combining-contrastive-learning-and2.5
neural-network-language-modeling-with-letter7.63
deep-speech-2-end-to-end-speech-recognition13.25
data2vec-a-general-framework-for-self-13.7
النموذج 5012.5
contextnet-improving-convolutional-neural5.5
relaxed-attention-a-simple-method-to-boost6.85
wav2vec-2-0-a-framework-for-self-supervised4.1