Dialogue State Tracking On Simmc2 0
평가 지표
Act F1
Slot F1
평가 결과
이 벤치마크에서 각 모델의 성능 결과
모델 이름 | Act F1 | Slot F1 | Paper Title | Repository |
---|---|---|---|---|
MTN | 93.4 | 76.7 | Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems | |
BART-base | 95.2 | 82.0 | Learning to Embed Multi-Modal Contexts for Situated Conversational Agents | - |
GPT-2 | 94.5 | 81.7 | Language Models are Unsupervised Multitask Learners | - |
BART-large | 96.3 | 88.3 | Learning to Embed Multi-Modal Contexts for Situated Conversational Agents | - |
PaCE | 97.1 | 87.0 | PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional Experts |
0 of 5 row(s) selected.