HyperAI
HyperAI
الرئيسية
المنصة
الوثائق
الأخبار
الأوراق البحثية
الدروس
مجموعات البيانات
الموسوعة
SOTA
نماذج LLM
لوحة الأداء GPU
الفعاليات
البحث
حول
شروط الخدمة
سياسة الخصوصية
العربية
HyperAI
HyperAI
Toggle Sidebar
البحث في الموقع...
⌘
K
Command Palette
Search for a command to run...
المنصة
الرئيسية
SOTA
الأسئلة المرئية والإجابة عليها (VQA)
Visual Question Answering On Coco Visual 4
Visual Question Answering On Coco Visual 4
المقاييس
Percentage correct
النتائج
نتائج أداء النماذج المختلفة على هذا المعيار القياسي
Columns
اسم النموذج
Percentage correct
Paper Title
MCB 7 att.
66.5
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Dual-MFA
66.09
Co-attending Free-form Regions and Detections with Multi-modal Multiplicative Feature Embedding for Visual Question Answering
QGHC+Att+Concat
65.90
Question-Guided Hybrid Convolution for Visual Question Answering
RelAtt
65.69
R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering
joint-loss
63.2
Training Recurrent Answering Units with Joint Loss Minimization for VQA
HQI+ResNet
62.1
Hierarchical Question-Image Co-Attention for Visual Question Answering
MRN + global features
61.8
Multimodal Residual Learning for Visual QA
DMN+ [xiong2016dynamic]
60.4
Dynamic Memory Networks for Visual and Textual Question Answering
CNN-RNN
59.5
Image Captioning and Visual Question Answering Based on Attributes and External Knowledge
FDA
59.5
A Focused Dynamic Attention Model for Visual Question Answering
SAN
58.9
Stacked Attention Networks for Image Question Answering
LSTM Q+I
58.2
VQA: Visual Question Answering
SMem-VQA
58.2
Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering
iBOWIMG baseline
55.9
Simple Baseline for Visual Question Answering
0 of 14 row(s) selected.
Previous
Next
Visual Question Answering On Coco Visual 4 | SOTA | HyperAI