Video Question Answering on MSVD-QA
Metrics
Accuracy
Results
Performance results of various models on this benchmark
| Model | Accuracy | Paper Title |
|---|---|---|
| LocVLM-Vid-B | 66.1 | Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs |