CNN over raw speech (wav) | 5.6 | - | - |
test-set on open vocabulary (i.e. harder), model = HMM-DNN + pNorm* | 3.6 | - | - |
Convolutional Speech Recognition | 3.5 | Fully Convolutional Speech Recognition | - |
Task-activating prompting generative correction | 2.11 | Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting | - |