HyperAI

Speech Recognition on LRS3-TED

Metrics

Word Error Rate (WER)
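
For readers unfamiliar with the metric, WER is conventionally the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and a model's hypothesis, divided by the number of reference words. The sketch below is a minimal illustration of that standard definition, not the evaluation code behind this leaderboard. The function name `word_error_rate` and the whitespace tokenization are assumptions; real evaluations usually also normalize case and punctuation first.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER as a fraction: word-level edit distance / reference length.

    Hypothetical helper for illustration; tokenization is a plain
    whitespace split with no text normalization (an assumption).
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return prev[-1] / len(ref)
```

The function returns a fraction; multiply by 100 to express it as a percentage. Because insertions count as errors, the value can exceed 1.0 when the hypothesis is much longer than the reference.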

Results

Performance results of various models on this benchmark

Comparison table
Model name                                       Word Error Rate (WER)
whisper-flamingo-integrating-visual-features     0.68
jointly-learning-visual-and-auditory-speech      1.4
large-language-models-are-strong-audio-visual    0.81
learning-audio-visual-speech-representation-1    1.3