HyperAI超神经

Speech Recognition on swb_hub_500 (WER)

Evaluation Metric

Percentage error
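The percentage error reported on this benchmark is the word error rate (WER): the word-level edit distance between the reference transcript and the hypothesis, divided by the number of reference words. A minimal sketch of the standard computation (this is illustrative, not HyperAI's scoring code):

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word error rate as a percentage, via word-level Levenshtein distance."""
    ref, hyp = reference.split(), hypothesis.split()
    # dp[i][j] = edit distance between ref[:i] and hyp[:j]
    dp = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        dp[i][0] = i  # deleting all i reference words
    for j in range(len(hyp) + 1):
        dp[0][j] = j  # inserting all j hypothesis words
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            dp[i][j] = min(dp[i - 1][j] + 1,         # deletion
                           dp[i][j - 1] + 1,         # insertion
                           dp[i - 1][j - 1] + cost)  # substitution
    return 100.0 * dp[len(ref)][len(hyp)] / len(ref)

# One deleted word out of six reference words -> WER of about 16.7%.
print(round(wer("the cat sat on the mat", "the cat sat on mat"), 1))
```

Scores below 6% on this test set are generally considered close to human transcription performance on conversational telephone speech.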

Evaluation Results

Performance of each model on this benchmark

| Model Name | Percentage error | Paper Title | Repository |
| --- | --- | --- | --- |
| DNN + Dropout | 19.1 | Building DNN Acoustic Models for Large Vocabulary Speech Recognition | - |
| CNN + Bi-RNN + CTC (speech to letters); 25.9% WER if trained only on SWB | 16 | Deep Speech: Scaling up end-to-end speech recognition | - |
| HMM-TDNN + iVectors | 17.1 | - | - |
| HMM-DNN + sMBR | 18.4 | - | - |
| IBM (LSTM + Conformer encoder-decoder) | 6.8 | On the limit of English conversational speech recognition | - |
| RNN + VGG + LSTM acoustic model trained on SWB+Fisher+CH, N-gram + "model M" + NNLM language model | 12.2 | The IBM 2016 English Conversational Telephone Speech Recognition System | - |
| ResNet + BiLSTMs acoustic model | 10.3 | English Conversational Telephone Speech Recognition by Humans and Machines | - |
| VGG/ResNet/LACE/BiLSTM acoustic model trained on SWB+Fisher+CH, N-gram + RNNLM language model trained on Switchboard+Fisher+Gigaword+Broadcast | 11.9 | The Microsoft 2016 Conversational Speech Recognition System | - |
| HMM-BLSTM trained with MMI + data augmentation (speed) + iVectors + 3 regularizations + Fisher | 13 | - | - |
| HMM-TDNN trained with MMI + data augmentation (speed) + iVectors + 3 regularizations + Fisher (10% / 15.1% respectively trained on SWBD only) | 13.3 | - | - |
| HMM-TDNN + pNorm + speed up/down speech | 19.3 | - | - |
| IBM (LSTM encoder-decoder) | 7.8 | Single headed attention based sequence-to-sequence model for state-of-the-art results on Switchboard | - |