HyperAI초신경

홈 뉴스 연구 논문 튜토리얼 데이터셋 백과사전 SOTA LLM 모델 GPU 랭킹 컨퍼런스

한국어

HyperAI초신경

Multimodal Intent Recognition On Photochat

평가 지표

F1

Precision

Recall

평가 결과

이 벤치마크에서 각 모델의 성능 결과

모델 이름	F1	Precision	Recall	Paper Title	Repository
PaCE	63.8	63.3	68	PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional Experts
ViLT	52.4	55.4	58.9	ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision
ALBERT-base	52.2	44.8	62.7	ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
T5-3B	58.9	54.1	64.6	Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
T5-base	58.1	58.2	57.9	Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
BERT	53.2	56.1	50.6	BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

0 of 6 row(s) selected.