Question Answering On Sqa3D
평가 지표
AnswerExactMatch (Question Answering)
평가 결과
이 벤치마크에서 각 모델의 성능 결과
모델 이름 | AnswerExactMatch (Question Answering) | Paper Title | Repository |
---|---|---|---|
Situation3D | 52.6 | Situational Awareness Matters in 3D Vision Language Reasoning | |
MCAN | 43.42 | Deep Modular Co-Attention Networks for Visual Question Answering | |
ScanQA | 46.58 | SQA3D: Situated Question Answering in 3D Scenes | |
LM4VisualEncoding | 48.09 | Frozen Transformers in Language Models Are Effective Visual Encoder Layers | |
ScanQA (w/ auxiliary loss) | 47.20 | SQA3D: Situated Question Answering in 3D Scenes | |
CREMA | 54.6 | CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion | |
Lexicon3D | 50.7 | Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding |
0 of 7 row(s) selected.