HyperAI

Natural Language Understanding On Dialoglue 1

Metriken

Average
Banking77 (Acc)
CLINC150 (Acc)
DSTC8 (F-1)
HWU64 (Acc)
MultiWOZ (Joint Goal Acc)
Restaurant8k (F-1)
TOP (EM)

Ergebnisse

Leistungsergebnisse verschiedener Modelle zu diesem Benchmark

Vergleichstabelle
ModellnameAverageBanking77 (Acc)CLINC150 (Acc)DSTC8 (F-1)HWU64 (Acc)MultiWOZ (Joint Goal Acc)Restaurant8k (F-1)TOP (EM)
Modell 168.2283.9984.5248.492.756.8786.1778.84
Modell 273.885.0685.6944.3693.0648.8987.5872.01
Modell 339.1688.9995.640.089.50.00.00.0
Modell 474.684.8493.5346.6386.7149.5987.3373.56
Modell 573.4978.4788.9856.8882.5149.4685.3172.84