Visual Speech Recognition On Lrs3 Ted
평가 지표
Word Error Rate (WER)
평가 결과
이 벤치마크에서 각 모델의 성능 결과
모델 이름 | Word Error Rate (WER) | Paper Title | Repository |
---|---|---|---|
CTC/Attention | 19.1 | Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels | |
VTP | 40.6 | Sub-word Level Lip Reading With Visual Attention | - |
VTP with more data | 30.7 | Sub-word Level Lip Reading With Visual Attention | - |
0 of 3 row(s) selected.