Explanation Generation on WHOOPS!
Evaluation Metric
Human (%)
Evaluation Results
Performance of each model on this benchmark
| Model | Human (%) | Paper Title |
|---|---|---|
| Ground-truth Caption -> GPT3 (Oracle) | 68 | Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images |
| Predicted Caption -> GPT3 | 33 | Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images |
| BLIP2 FlanT5-XXL (Fine-tuned) | 27 | Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images |
| BLIP2 FlanT5-XL (Fine-tuned) | 15 | Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images |
| BLIP2 FlanT5-XXL (Zero-shot) | 0 | Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images |
| VLIS (LLaVA) | - | VLIS: Unimodal Language Models Guide Multimodal Language Generation |
| VLIS (Lynx) | - | VLIS: Unimodal Language Models Guide Multimodal Language Generation |