HyperAI超神经

Natural Language Understanding on DialoGLUE

Evaluation Metrics

Average
Banking77 (Acc)
CLINC150 (Acc)
DSTC8 (F-1)
HWU64 (Acc)
MultiWOZ (Joint Goal Acc)
Restaurant8k (F-1)
TOP (EM)

Evaluation Results

Performance of each model on this benchmark

Comparison Table
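| Model Name | Average | Banking77 (Acc) | CLINC150 (Acc) | DSTC8 (F-1) | HWU64 (Acc) | MultiWOZ (Joint Goal Acc) | Restaurant8k (F-1) | TOP (EM) |
|---|---|---|---|---|---|---|---|---|
| Model 1 | 85.83 | 91.17 | 95.8 | 88.33 | 91.36 | 58.22 | 94.85 | 81.1 |
| Model 2 | 86.89 | 93.44 | 92.38 | 91.2 | 97.11 | 56.56 | 95.44 | 82.08 |
| Model 3 | 85.34 | 92.99 | 91.82 | 86.49 | 97.11 | 58.29 | 94.34 | 76.36 |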
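
The Average column appears to be the unweighted mean of the seven per-task scores. The page does not state this; it is an assumption inferred from the numbers. The sketch below recomputes the mean for the three models listed in the table. The names `task_scores`, `reported_average`, and `mean_score` are illustrative only and do not come from the source.

```python
# Per-task scores copied from the comparison table, in column order:
# Banking77, CLINC150, DSTC8, HWU64, MultiWOZ, Restaurant8k, TOP.
task_scores = {
    "Model 1": [91.17, 95.8, 88.33, 91.36, 58.22, 94.85, 81.1],
    "Model 2": [93.44, 92.38, 91.2, 97.11, 56.56, 95.44, 82.08],
    "Model 3": [92.99, 91.82, 86.49, 97.11, 58.29, 94.34, 76.36],
}

# Values from the table's Average column.
reported_average = {"Model 1": 85.83, "Model 2": 86.89, "Model 3": 85.34}


def mean_score(scores):
    """Unweighted arithmetic mean, rounded to two decimals (assumed aggregation)."""
    return round(sum(scores) / len(scores), 2)


recomputed = {name: mean_score(scores) for name, scores in task_scores.items()}

for name, value in recomputed.items():
    print(f"{name}: recomputed {value}, reported {reported_average[name]}")
```

Under this assumption, the recomputed means match the reported Average values to two decimal places. Note that the average mixes different metric types (accuracy, F-1, joint goal accuracy, exact match), so it is best read as a rough summary rather than a single comparable quantity.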