End-to-End Dialogue Modelling on MultiWOZ 2.1
Evaluation Metrics
BLEU
MultiWOZ (Inform)
MultiWOZ (Success)
Evaluation Results
Performance of each model on this benchmark
Comparison Table
Model | BLEU | MultiWOZ (Inform) | MultiWOZ (Success) |
---|---|---|---|
a-simple-language-model-for-task-oriented | 15.2 | 85.0 | 70.5 |
galaxy-a-generative-pre-trained-model-for | 20.01 | 95.30 | 86.20 |
augpt-dialogue-with-pre-trained-language | 17.2 | 91.4 | 72.9 |
a-probabilistic-end-to-end-task-oriented | 18.3 | 78.1 | 67.1 |
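
To compare the rows above programmatically, the table can be loaded into a small Python structure and sorted per metric. This is a minimal sketch, not part of the benchmark page: the names `RESULTS`, `rank_by`, and `combined_score` are hypothetical, and the combined score, (Inform + Success) / 2 + BLEU, is an assumption borrowed from common MultiWOZ reporting practice rather than a metric listed here.

```python
# Rows copied from the comparison table above.
# Model names are the page's own (truncated) identifiers, kept as-is.
RESULTS = [
    {"model": "a-simple-language-model-for-task-oriented",
     "bleu": 15.2, "inform": 85.0, "success": 70.5},
    {"model": "galaxy-a-generative-pre-trained-model-for",
     "bleu": 20.01, "inform": 95.30, "success": 86.20},
    {"model": "augpt-dialogue-with-pre-trained-language",
     "bleu": 17.2, "inform": 91.4, "success": 72.9},
    {"model": "a-probabilistic-end-to-end-task-oriented",
     "bleu": 18.3, "inform": 78.1, "success": 67.1},
]


def rank_by(metric, rows=RESULTS):
    """Return rows sorted from best to worst on `metric` (higher is better)."""
    return sorted(rows, key=lambda row: row[metric], reverse=True)


def combined_score(row):
    """Assumed combined score, not reported on this page:
    (Inform + Success) / 2 + BLEU."""
    return (row["inform"] + row["success"]) / 2 + row["bleu"]


if __name__ == "__main__":
    # Print the top model for each metric listed on the page.
    for metric in ("bleu", "inform", "success"):
        best = rank_by(metric)[0]
        print(f"best {metric}: {best['model']} ({best[metric]})")

    # Print the assumed combined score for every model, highest first.
    for row in sorted(RESULTS, key=combined_score, reverse=True):
        print(f"{row['model']}: combined = {combined_score(row):.2f}")
```

Sorting per metric keeps the three reported columns separate, so the ranking on one metric is not conflated with another; the combined score is offered only as an optional single-number summary.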