Temporal Casual Qa On Next Qa
평가 지표
WUPS
평가 결과
이 벤치마크에서 각 모델의 성능 결과
모델 이름 | WUPS | Paper Title | Repository |
---|---|---|---|
Flamingo(0-shot) | 26.7 | Flamingo: a Visual Language Model for Few-Shot Learning | |
PaLI-X | 38.3 | PaLI-X: On Scaling up a Multilingual Vision and Language Model | |
PaLI-3 | 37.7 | PaLI-3 Vision Language Models: Smaller, Faster, Stronger | |
R2A | 34.7 | Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models | - |
Flamingo(32-shot) | 33.5 | Flamingo: a Visual Language Model for Few-Shot Learning | |
Emu(0-shot) | 23.4 | Emu: Generative Pretraining in Multimodality | |
Gemini Ultra (zero-shot) | 29.9 | Gemini: A Family of Highly Capable Multimodal Models | |
Gemini Pro (zero-shot) | 28.0 | Gemini: A Family of Highly Capable Multimodal Models |
0 of 8 row(s) selected.