Visual Speech Recognition on LRS3-TED
Evaluation Metric
Word Error Rate (WER)
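As a rough illustration of the metric, WER is commonly defined as (S + D + I) / N: the number of word substitutions, deletions, and insertions needed to turn the hypothesis into the reference, divided by the number of reference words. The sketch below is not the benchmark's scoring script. The function name `word_error_rate` is hypothetical, and splitting on whitespace with no case or punctuation normalization is an assumption. Results are returned as a percentage.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return the word error rate, as a percentage, for one sentence pair."""
    # Assumption: words are separated by whitespace; no text normalization.
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words (Levenshtein distance over words).
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing
                curr[j - 1] + 1,     # insertion: extra hypothesis word
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr

    return 100.0 * prev[-1] / len(ref)


if __name__ == "__main__":
    # One word out of four is substituted.
    print(word_error_rate("a b c d", "a x c d"))
```

Corpus-level scores are usually computed by summing the edit counts and the reference word counts over all utterances before dividing, rather than by averaging per-sentence values.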
Results
Performance of each model on this benchmark
Model | Word Error Rate (WER) | Paper Title | Repository |
---|---|---|---|
CTC/Attention | 19.1 | Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels | |
VTP | 40.6 | Sub-word Level Lip Reading With Visual Attention | - |
VTP with more data | 30.7 | Sub-word Level Lip Reading With Visual Attention | - |