HyperAIHyperAI초신경
홈뉴스연구 논문튜토리얼데이터셋백과사전SOTALLM 모델GPU 랭킹컨퍼런스
전체 검색
소개
한국어
HyperAIHyperAI초신경
  1. 홈
  2. SOTA
  3. 자동 음성 인식
  4. Automatic Speech Recognition On Lrs2

Automatic Speech Recognition On Lrs2

평가 지표

Test WER

평가 결과

이 벤치마크에서 각 모델의 성능 결과

모델 이름
Test WER
Paper TitleRepository
TM-CTC10.1Deep Audio-Visual Speech Recognition
LF-MMI TDNN6.7Audio-visual Recognition of Overlapped speech for the LRS2 dataset-
CTC/attention8.2Audio-Visual Speech Recognition With A Hybrid CTC/Attention Architecture-
MoCo + wav2vec (w/o extLM)2.7Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition
Whisper-LLaMA6.6Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition
CTC/Attention1.5Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels
TM-seq2seq9.7Deep Audio-Visual Speech Recognition
Whisper1.3Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation
End2end Conformer3.9End-to-end Audio-visual Speech Recognition with Conformers
0 of 9 row(s) selected.
HyperAI

학습, 이해, 실천, 커뮤니티와 함께 인공지능의 미래를 구축하다

한국어

소개

회사 소개데이터셋 도움말

제품

뉴스튜토리얼데이터셋백과사전

링크

TVM 한국어Apache TVMOpenBayes

© HyperAI초신경

TwitterBilibili