HyperAIHyperAI超神经
首页资讯论文教程数据集百科SOTALLM 模型天梯GPU 天梯顶会
全站搜索
关于
中文
HyperAIHyperAI超神经
  1. 首页
  2. SOTA
  3. 视频问答
  4. Video Question Answering On Agqa 2 0 Balanced

Video Question Answering On Agqa 2 0 Balanced

评估指标

Average Accuracy

评测结果

各个模型在此基准测试上的表现结果

模型名称
Average Accuracy
Paper TitleRepository
MIST - AIO50.96MIST: Multi-modal Iterative Spatial-Temporal Transformer for Long-form Video Question Answering
GF (uns) - S3D53.33Glance and Focus: Memory Prompting for Multi-Event Video Question Answering
MIST - CLIP54.39MIST: Multi-modal Iterative Spatial-Temporal Transformer for Long-form Video Question Answering
SHG-VQA (trained from scratch)49.2Learning Situation Hyper-Graphs for Video Question Answering
AIO - ViT48.59Glance and Focus: Memory Prompting for Multi-Event Video Question Answering
MMTF44.36MMTF: Multi-Modal Temporal Fusion for Commonsense Video Question Answering-
SViTT52.7SViTT: Temporal Learning of Sparse Video-Text Transformers
GF (sup) - Faster RCNN55.08Glance and Focus: Memory Prompting for Multi-Event Video Question Answering
0 of 8 row(s) selected.
HyperAI

学习、理解、实践,与社区一起构建人工智能的未来

中文

关于

关于我们数据集帮助

产品

资讯教程数据集百科

链接

TVM 中文Apache TVMOpenBayes

© HyperAI超神经

津ICP备17010941号-1京公网安备11010502038810号京公网安备11010502038810号
TwitterBilibili