Video Question Answering on AGQA 2.0 Balanced

Evaluation Metric

Average Accuracy

Evaluation Results

Performance of each model on this benchmark

| Model Name | Average Accuracy | Paper Title | Repository |
| --- | --- | --- | --- |
| MIST - AIO | 50.96 | MIST: Multi-modal Iterative Spatial-Temporal Transformer for Long-form Video Question Answering | |
| GF (uns) - S3D | 53.33 | Glance and Focus: Memory Prompting for Multi-Event Video Question Answering | |
| MIST - CLIP | 54.39 | MIST: Multi-modal Iterative Spatial-Temporal Transformer for Long-form Video Question Answering | |
| SHG-VQA (trained from scratch) | 49.2 | Learning Situation Hyper-Graphs for Video Question Answering | |
| AIO - ViT | 48.59 | Glance and Focus: Memory Prompting for Multi-Event Video Question Answering | |
| MMTF | 44.36 | MMTF: Multi-Modal Temporal Fusion for Commonsense Video Question Answering | - |
| SViTT | 52.7 | SViTT: Temporal Learning of Sparse Video-Text Transformers | |
| GF (sup) - Faster RCNN | 55.08 | Glance and Focus: Memory Prompting for Multi-Event Video Question Answering | |