Video Text Retrieval On Test Of Time
Metriken
2-Class Accuracy
Ergebnisse
Leistungsergebnisse verschiedener Modelle zu diesem Benchmark
Modellname | 2-Class Accuracy | Paper Title | Repository |
---|---|---|---|
TACT | 64.4 | Test of Time: Instilling Video-Language Models with a Sense of Time | |
Time-Chat | 76.67 | TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding | |
Video-LLAMA | 88.33 | Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding | |
VideoPrompter | 60.0 | Videoprompter: an ensemble of foundational models for zero-shot video understanding | - |
0 of 4 row(s) selected.