Video Question Answering On Tgif Qa
Metriken
Accuracy
Ergebnisse
Leistungsergebnisse verschiedener Modelle zu diesem Benchmark
| Paper Title | ||
|---|---|---|
| LocVLM-Vid-B | 51.8 | Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs |
0 of 1 row(s) selected.