HyperAI

Lipreading on LRS2

Metrics

Word Error Rate (WER)
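
WER is conventionally defined as (S + D + I) / N, where S, D, and I are the word substitutions, deletions, and insertions needed to turn the reference transcript into the model's hypothesis, and N is the number of words in the reference. The sketch below illustrates that definition with a word-level edit distance. The function name `word_error_rate` is hypothetical, and it assumes plain whitespace tokenization with no text normalization; it is not the scoring script used by any of the listed papers.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER = (S + D + I) / N using word-level Levenshtein distance.

    Assumption: words are split on whitespace, with no case folding or
    punctuation stripping. Real evaluations usually normalize text first.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] = edit distance between the reference words consumed so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # i deletions turn i reference words into an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # substitution, or match when cost is 0
            ))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    ref = "the cat sat on the mat"
    hyp = "the cat sat on a mat"
    # Multiply by 100 to express the result as a percentage.
    print(f"WER: {100 * word_error_rate(ref, hyp):.1f}%")
```

Because insertions count as errors, WER can exceed 100% when the hypothesis is much longer than the reference. Corpus-level scores are usually computed from total errors over total reference words rather than by averaging per-utterance values.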

Results

Performance results of various models on this benchmark

Comparison Table
Model Name                                        Word Error Rate (WER, %)
es3-evolving-self-supervised-learning-of          26.7
es3-evolving-self-supervised-learning-of          31.4
asr-is-all-you-need-cross-modal-distillation      53.2
unified-speech-recognition-a-single-model-for     15.4
sub-word-level-lip-reading-with-visual            22.6
audio-visual-recognition-of-overlapped-speech     48.86
visual-speech-recognition-for-multiple            25.5
distinguishing-homophenes-using-multi-head-1      44.5
es3-evolving-self-supervised-learning-of          24.6
syncvsr-data-efficient-visual-speech              28.9
leveraging-uni-modal-self-supervised-learning-1   43.2
es3-evolving-self-supervised-learning-of          28.7
deep-audio-visual-speech-recognition              48.3
end-to-end-audio-visual-speech-recognition        39.1
spatio-temporal-fusion-based-convolutional        51.7
auto-avsr-audio-visual-speech-recognition         14.6
deep-audio-visual-speech-recognition              54.7
syncvsr-data-efficient-visual-speech              16.5
jointly-learning-visual-and-auditory-speech       18.6
es3-evolving-self-supervised-learning-of          29.3
hearing-lips-improving-lip-reading-by             65.29
audio-visual-speech-recognition-with-a-hybrid     50
es3-evolving-self-supervised-learning-of          30.7
sub-word-level-lip-reading-with-visual            28.9
visual-speech-recognition-for-multiple            32.9