HyperAI
HyperAI超神経
ホーム
プラットフォーム
ドキュメント
ニュース
論文
チュートリアル
データセット
百科事典
SOTA
LLMモデル
GPU ランキング
学会
検索
サイトについて
日本語
HyperAI
HyperAI超神経
Toggle sidebar
サイトを検索…
⌘
K
Command Palette
Search for a command to run...
ホーム
SOTA
音声認識
Speech Recognition On Common Voice German
Speech Recognition On Common Voice German
評価指標
Test WER
評価結果
このベンチマークにおける各モデルのパフォーマンス結果
Columns
モデル名
Test WER
Paper Title
Repository
wav2vec 2.0 XLS-R (no LM)
12.06%
TEVR: Improving Speech Recognition by Token Entropy Variance Reduction
wav2vec 2.0 XLS-R 1B + TEVR (no LM)
10.10%
TEVR: Improving Speech Recognition by Token Entropy Variance Reduction
VoxPopuli (n-gram)
7.8%
VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
QuartzNet15x5DE (CV-only, 5-gram)
7.7%
Scribosermo: Fast Speech-to-Text models for German and other Languages
ConformerCTC-L (no LM)
7.33%
Scribosermo: Fast Speech-to-Text models for German and other Languages
ConformerCTC-L (no LM)
6.68%
NeMo: a toolkit for building AI applications using Neural Modules
QuartzNet15x5DE (D37, 5-gram)
6.6%
Scribosermo: Fast Speech-to-Text models for German and other Languages
Whisper (Large v2)
6.4%
Robust Speech Recognition via Large-Scale Weak Supervision
Conformer Transducer (no LM)
6.28%
Automatic Speech Recognition in German: A Detailed Error Analysis
-
ConformerCTC-L (4-gram)
6.03%
NeMo: a toolkit for building AI applications using Neural Modules
wav2vec 2.0 XLS-R 1B (5-gram)
4.38%
TEVR: Improving Speech Recognition by Token Entropy Variance Reduction
ConformerCTC-L (5-gram)
4.05%
Scribosermo: Fast Speech-to-Text models for German and other Languages
wav2vec 2.0 XLS-R 1B + TEVR (4-gram)
3.70%
TEVR: Improving Speech Recognition by Token Entropy Variance Reduction
wav2vec 2.0 XLS-R 1B + TEVR (5-gram)
3.64%
TEVR: Improving Speech Recognition by Token Entropy Variance Reduction
0 of 14 row(s) selected.
Previous
Next
Speech Recognition On Common Voice German | SOTA | HyperAI超神経