HyperAI

Speech Recognition On Lrs3 Ted

Metrics

Word Error Rate (WER)

Results

Performance results of various models on this benchmark

Comparison Table
Model NameWord Error Rate (WER)
whisper-flamingo-integrating-visual-features0.68
jointly-learning-visual-and-auditory-speech1.4
large-language-models-are-strong-audio-visual0.81
learning-audio-visual-speech-representation-11.3