Natural Language Understanding On Dialoglue 1
Metrics
Average
Banking77 (Acc)
CLINC150 (Acc)
DSTC8 (F-1)
HWU64 (Acc)
MultiWOZ (Joint Goal Acc)
Restaurant8k (F-1)
TOP (EM)
Results
Performance results of various models on this benchmark
Comparison Table
Model Name | Average | Banking77 (Acc) | CLINC150 (Acc) | DSTC8 (F-1) | HWU64 (Acc) | MultiWOZ (Joint Goal Acc) | Restaurant8k (F-1) | TOP (EM) |
---|---|---|---|---|---|---|---|---|
Model 1 | 68.22 | 83.99 | 84.52 | 48.4 | 92.75 | 6.87 | 86.17 | 78.84 |
Model 2 | 73.8 | 85.06 | 85.69 | 44.36 | 93.06 | 48.89 | 87.58 | 72.01 |
Model 3 | 39.16 | 88.99 | 95.64 | 0.0 | 89.5 | 0.0 | 0.0 | 0.0 |
Model 4 | 74.6 | 84.84 | 93.53 | 46.63 | 86.71 | 49.59 | 87.33 | 73.56 |
Model 5 | 73.49 | 78.47 | 88.98 | 56.88 | 82.51 | 49.46 | 85.31 | 72.84 |