HyperAI

Automatic Lyrics Transcription On Jam Alt 3

Metrics

Case-Sensitive Word Error Rate
Line break F-1
Punctuation F-1
Word Error Rate (WER)

Results

Performance results of various models on this benchmark

Model Name
Case-Sensitive Word Error Rate
Line break F-1
Punctuation F-1
Word Error Rate (WER)
Paper TitleRepository
Whisper v2 +lang26.071.748.419.9Lyrics Transcription for Humans: A Readability-Aware Benchmark
Whisper v2 +demucs70.467.349.165.2Lyrics Transcription for Humans: A Readability-Aware Benchmark
Whisper v3 +demucs47.471.945.443.5Lyrics Transcription for Humans: A Readability-Aware Benchmark
Whisper v344.671.147.340.7Lyrics Transcription for Humans: A Readability-Aware Benchmark
Whisper v2 +demucs +lang30.470.649.223.9Lyrics Transcription for Humans: A Readability-Aware Benchmark
Whisper v2-69.938.745.4Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark
OWSM v3.1 +lang71.840.728.663.3Lyrics Transcription for Humans: A Readability-Aware Benchmark
Whisper v3 +demucs +lang44.970.546.940.8Lyrics Transcription for Humans: A Readability-Aware Benchmark
Whisper v259.370.047.154.5Lyrics Transcription for Humans: A Readability-Aware Benchmark
Whisper v3 +lang40.471.147.435.9Lyrics Transcription for Humans: A Readability-Aware Benchmark
Whisper v2 +demucs-67.530.265.2Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark
AudioShake v1- 81.248.524.4Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark
Whisper v3 +demucs-72.034.043.5Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark
Whisper v3-71.241.240.7Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark
OWSM v3.1 +demucs +lang62.041.424.751.8Lyrics Transcription for Humans: A Readability-Aware Benchmark
AudioShake v317.583.7 57.112.6Lyrics Transcription for Humans: A Readability-Aware Benchmark
0 of 16 row(s) selected.