Temporal/Causal QA on NExT-QA
Evaluation Metric
WUPS
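WUPS (Wu-Palmer Similarity score) gives partial credit to free-form answers whose words are semantically close to the reference answer, and is reported on a 0-100 scale. The sketch below illustrates the commonly used thresholded set-based formulation. It is not the official NExT-QA evaluation script: real evaluations look up word similarity in WordNet, whereas the `PARENT` taxonomy here is a hypothetical toy stand-in, and the threshold used for the scores on this leaderboard is not stated on this page, so the `threshold=0.9` default is an assumption.

```python
import math

# Hypothetical toy taxonomy (child -> parent); real WUPS uses WordNet instead.
PARENT = {
    "entity": None,
    "animal": "entity",
    "dog": "animal",
    "cat": "animal",
    "object": "entity",
    "ball": "object",
}


def path_to_root(word):
    """Return [word, parent, ..., root] for a word in the toy taxonomy."""
    path = []
    while word is not None:
        path.append(word)
        word = PARENT[word]
    return path


def wup(a, b):
    """Wu-Palmer similarity: 2 * depth(lcs) / (depth(a) + depth(b)), root depth = 1."""
    if a not in PARENT or b not in PARENT:
        # Out-of-vocabulary words only match themselves (an assumption of this sketch).
        return 1.0 if a == b else 0.0
    path_a, path_b = path_to_root(a), path_to_root(b)
    ancestors_b = set(path_b)
    # First node on a's path that is also an ancestor of b: the deepest common subsumer.
    lcs = next(node for node in path_a if node in ancestors_b)
    depth_lcs = len(path_to_root(lcs))
    return 2.0 * depth_lcs / (len(path_a) + len(path_b))


def thresholded_wup(a, b, threshold):
    """Down-weight similarities below the threshold by a factor of 0.1."""
    score = wup(a, b)
    return score if score >= threshold else 0.1 * score


def wups(predictions, references, threshold=0.9):
    """Average set-based WUPS over answer pairs, scaled to 0-100."""
    if not predictions:
        return 0.0
    total = 0.0
    for pred, ref in zip(predictions, references):
        p_words, r_words = pred.split(), ref.split()
        if not p_words or not r_words:
            continue  # an empty answer scores 0
        forward = math.prod(
            max(thresholded_wup(p, r, threshold) for r in r_words) for p in p_words
        )
        backward = math.prod(
            max(thresholded_wup(p, r, threshold) for p in p_words) for r in r_words
        )
        total += min(forward, backward)
    return 100.0 * total / len(predictions)
```

With the toy taxonomy, an exact match scores 100, while answering "dog" for "cat" scores roughly 66.7 at threshold 0.0 and roughly 6.7 at threshold 0.9, which shows how strongly the threshold choice affects reported numbers.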
Results
Performance of each model on this benchmark
Model | WUPS | Paper Title | Repository |
---|---|---|---|
Flamingo(0-shot) | 26.7 | Flamingo: a Visual Language Model for Few-Shot Learning | |
PaLI-X | 38.3 | PaLI-X: On Scaling up a Multilingual Vision and Language Model | |
PaLI-3 | 37.7 | PaLI-3 Vision Language Models: Smaller, Faster, Stronger | |
R2A | 34.7 | Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models | - |
Flamingo(32-shot) | 33.5 | Flamingo: a Visual Language Model for Few-Shot Learning | |
Emu(0-shot) | 23.4 | Emu: Generative Pretraining in Multimodality | |
Gemini Ultra (zero-shot) | 29.9 | Gemini: A Family of Highly Capable Multimodal Models | |
Gemini Pro (zero-shot) | 28.0 | Gemini: A Family of Highly Capable Multimodal Models | |