Multimodal Intent Recognition on PhotoChat
Evaluation Metrics
F1
Precision
Recall
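
For reference, the three metrics above are conventionally derived from counts of true positives, false positives, and false negatives. The following is a minimal sketch under that standard definition. The function name `precision_recall_f1` and the example counts are hypothetical, and the averaging scheme actually used by this benchmark is not stated on this page.

```python
from typing import Tuple


def precision_recall_f1(tp: int, fp: int, fn: int) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) as fractions in [0, 1].

    tp, fp, fn are counts of true positives, false positives, and
    false negatives for a single (binary) class. These are illustrative
    inputs, not values taken from the benchmark.
    """
    # Precision: share of predicted positives that are correct.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    # Recall: share of actual positives that were found.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1: harmonic mean of precision and recall.
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


if __name__ == "__main__":
    # Hypothetical counts, reported as percentages like the table's scores.
    p, r, f1 = precision_recall_f1(tp=60, fp=40, fn=20)
    print(f"Precision={p * 100:.1f}  Recall={r * 100:.1f}  F1={f1 * 100:.1f}")
```

Scores aggregated over several classes or runs (for example macro-averaged) need not satisfy this harmonic-mean relationship exactly, so the reported columns should be read as published rather than recomputed from one another.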
Evaluation Results
Performance of each model on this benchmark
Model Name | F1 | Precision | Recall | Paper Title | Repository |
---|---|---|---|---|---|
PaCE | 63.8 | 63.3 | 68.0 | PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional Experts | |
ViLT | 52.4 | 55.4 | 58.9 | ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision | |
ALBERT-base | 52.2 | 44.8 | 62.7 | ALBERT: A Lite BERT for Self-supervised Learning of Language Representations | |
T5-3B | 58.9 | 54.1 | 64.6 | Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer | |
T5-base | 58.1 | 58.2 | 57.9 | Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer | |
BERT | 53.2 | 56.1 | 50.6 | BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding | |