Speech To Text Translation
Speech-to-Text Translation是自然语言处理领域的一个重要子任务,旨在将一种语言的语音信号转换为另一种语言的文本形式,可通过端到端或级联方式实现。该任务的目标是提高跨语言交流的效率和准确性,广泛应用于多语言会议记录、国际电话通话转录、在线教育和远程医疗等场景,具有重要的应用价值。
CoVoST 2 eng-X
CoVoST 2 X-eng
FLEURS eng-X
FLEURS X-eng
libri-trans
Transformer + ASR Pretrain + SpecAug
MuST-C
Transformer with Adapters
MuST-C EN->DE
Task Modulation + Multitask Learning(ASR/MT) + Data Augmentation
MuST-C EN->ES
Transformer with Adapters
MuST-C EN->FR
Dual-decoder Transformer
MuST-C EN->NL
Speechformer