HyperAI초신경

Automatic Lyrics Transcription on Jam-ALT 4

Evaluation Metrics

Case Error Rate
Line break F-1
Punctuation F-1
Word Error Rate (WER)
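
For readers unfamiliar with the last metric, the following is a minimal sketch of how a word error rate is conventionally computed: the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The function name `word_error_rate`, the whitespace tokenization, and the case-sensitive comparison are assumptions made for illustration; this page does not describe the benchmark's own tokenization or normalization, which may differ.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Assumption: words are split on whitespace and compared case-sensitively.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words so far
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One word missing from a five-word reference line.
    print(word_error_rate("hello darkness my old friend",
                          "hello darkness my friend"))
```

The function returns a fraction; multiplying by 100 gives a percentage. Because insertions count as errors, the value can exceed 1.0 when the hypothesis is much longer than the reference.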

Evaluation Results

Performance results of each model on this benchmark

| Model | Case Error Rate | Line break F-1 | Punctuation F-1 | Word Error Rate (WER) | Paper Title |
|---|---|---|---|---|---|
| Whisper v2 +demucs | 3.2 | 66.1 | 34.9 | 43.3 | Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark |
| Whisper v2 | - | 73.4 | 45.9 | 27.7 | Lyrics Transcription for Humans: A Readability-Aware Benchmark |
| Whisper v2 +lang | - | 73.7 | 45.3 | 27.1 | Lyrics Transcription for Humans: A Readability-Aware Benchmark |
| AudioShake v1 | 2.0 | 84.9 | 45.8 | 34.9 | Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark |
| Whisper v3 +demucs | 3.2 | 69.4 | 30.9 | 44.9 | Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark |
| Whisper v2 | 3.2 | 73.4 | 45.8 | 27.7 | Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark |
| Whisper v2 +demucs +lang | - | 65.6 | 36.1 | 38.2 | Lyrics Transcription for Humans: A Readability-Aware Benchmark |
| Whisper v3 | 3.3 | 77.8 | 42.4 | 34.7 | Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark |
| Whisper v3 +lang | - | 77.9 | 42.3 | 34.7 | Lyrics Transcription for Humans: A Readability-Aware Benchmark |
| Whisper v3 +demucs | - | 69.3 | 32.0 | 44.9 | Lyrics Transcription for Humans: A Readability-Aware Benchmark |
| Whisper v3 | - | 77.9 | 42.5 | 34.7 | Lyrics Transcription for Humans: A Readability-Aware Benchmark |
| OWSM v3.1 +lang | - | 36.0 | 30.6 | 71.6 | Lyrics Transcription for Humans: A Readability-Aware Benchmark |
| OWSM v3.1 +demucs +lang | - | 40.9 | 22.3 | 78.5 | Lyrics Transcription for Humans: A Readability-Aware Benchmark |
| Whisper v3 +demucs +lang | - | 69.3 | 32.0 | 44.9 | Lyrics Transcription for Humans: A Readability-Aware Benchmark |
| Whisper v2 +demucs | - | 66.0 | 38.0 | 43.3 | Lyrics Transcription for Humans: A Readability-Aware Benchmark |
| AudioShake v3 | - | 88.6 | 46.1 | 20.8 | Lyrics Transcription for Humans: A Readability-Aware Benchmark |