HyperAI초신경

Explanation Generation On Whoops

평가 지표

Human (%)

평가 결과

이 벤치마크에서 각 모델의 성능 결과

		Paper Title
Ground-truth Caption -> GPT3 (Oracle)	68	Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images
Predicted Caption -> GPT3	33	Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images
BLIP2 FlanT5-XXL (Fine-tuned)	27	Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images
BLIP2 FlanT5-XL (Fine-tuned)	15	Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images
BLIP2 FlanT5-XXL (Zero-shot)	0	Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images
VLIS (LLaVA)	-	VLIS: Unimodal Language Models Guide Multimodal Language Generation
VLIS (Lynx)	-	VLIS: Unimodal Language Models Guide Multimodal Language Generation

0 of 7 row(s) selected.

Explanation Generation On Whoops | SOTA | HyperAI초신경