HyperAI

Lipreading on LRS3-TED

Metrics

Word Error Rate (WER)
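
For readers unfamiliar with the metric, WER is conventionally defined as (S + D + I) / N, where S, D, and I are the word-level substitutions, deletions, and insertions needed to turn the reference transcript into the hypothesis, and N is the number of reference words. The following is a minimal sketch of that definition, not the scoring script used for this leaderboard; the function name `word_error_rate` is hypothetical, and whitespace tokenization without text normalization is an assumption. The leaderboard scores appear to be percentages, so the returned fraction would be multiplied by 100 for comparison.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER = (S + D + I) / N using word-level Levenshtein distance.

    Assumption: words are separated by whitespace and no normalization
    (casing, punctuation) is applied. Real evaluations usually normalize first.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the edit distance between the reference prefix seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(
                min(
                    prev[j] + 1,         # deletion of a reference word
                    curr[j - 1] + 1,     # insertion of a hypothesis word
                    prev[j - 1] + cost,  # substitution or exact match
                )
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One reference word ("the") is missing from the hypothesis.
    wer = word_error_rate("the cat sat on the mat", "the cat sat on mat")
    print(f"WER: {100 * wer:.1f}%")
```

Because insertions count as errors, WER can exceed 100% when the hypothesis is much longer than the reference.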

Results

Performance of different models on this benchmark.

Comparison table
Model name                                     Word Error Rate (WER)
syncvsr-data-efficient-visual-speech           31.2
syncvsr-data-efficient-visual-speech           21.5
sub-word-level-lip-reading-with-visual         30.7
auto-avsr-audio-visual-speech-recognition      19.1
where-visual-speech-meets-language-vsp-llm     25.4
visual-speech-recognition-for-multiple         31.5
learning-audio-visual-speech-representation-1  26.9
end-to-end-audio-visual-speech-recognition     43.3
audio-visual-representation-learning-via       26.2
large-scale-visual-speech-recognition          55.1
discriminative-multi-modality-speech           57.8
conformers-are-all-you-need-for-visual-speech  12.8
spatio-temporal-fusion-based-convolutional     60.1
asr-is-all-you-need-cross-modal-distillation   59.8
es3-evolving-self-supervised-learning-of       37.1
jointly-learning-visual-and-auditory-speech    23.4
relaxed-attention-for-transformer-models       25.51
unified-speech-recognition-a-single-model-for  21.5
recurrent-neural-network-transducer-for-audio  33.6
es3-evolving-self-supervised-learning-of       40.3
deep-audio-visual-speech-recognition           58.9
unified-speech-recognition-a-single-model-for  22.3
sub-word-level-lip-reading-with-visual         40.6