HyperAI
HyperAI超神经
首页
算力平台
文档
资讯
论文
教程
数据集
百科
SOTA
LLM 模型天梯
GPU 天梯
顶会
开源项目
全站搜索
关于
服务条款
隐私政策
中文
HyperAI
HyperAI超神经
Toggle Sidebar
全站搜索…
⌘
K
Command Palette
Search for a command to run...
算力平台
首页
SOTA
自动语音识别 (ASR)
Automatic Speech Recognition On Lrs2
Automatic Speech Recognition On Lrs2
评估指标
Test WER
评测结果
各个模型在此基准测试上的表现结果
Columns
模型名称
Test WER
Paper Title
TM-CTC
10.1
Deep Audio-Visual Speech Recognition
TM-seq2seq
9.7
Deep Audio-Visual Speech Recognition
CTC/attention
8.2
Audio-Visual Speech Recognition With A Hybrid CTC/Attention Architecture
LF-MMI TDNN
6.7
Audio-visual Recognition of Overlapped speech for the LRS2 dataset
Whisper-LLaMA
6.6
Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition
End2end Conformer
3.9
End-to-end Audio-visual Speech Recognition with Conformers
MoCo + wav2vec (w/o extLM)
2.7
Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition
CTC/Attention
1.5
Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels
Whisper
1.3
Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation
0 of 9 row(s) selected.
Previous
Next