HyperAI超神经

Visual Question Answering on VizWiz 2020 VQA

Evaluation Metrics

number
other
overall
unanswerable
yes/no

Evaluation Results

Performance of each model on this benchmark

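| Model Name | number | other | overall | unanswerable | yes/no | Paper Title | Repository |
|---|---|---|---|---|---|---|---|
| sudoku | 26.83 | 42.29 | 55.93 | 88.95 | 73.45 | - | - |
| Katya | 27.37 | 40.92 | 54.76 | 86.82 | 80.52 | - | - |
| e50 | 18.16 | 28.88 | 44.9 | 84.13 | 60.08 | - | - |
| Video-LaVIT | - | - | 56.0 | - | - | Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization | |
| VWTest1 | 14.09 | 17.57 | 34.13 | 78.2 | 25.31 | - | - |
| knight777 | 17.34 | 27.34 | 44.01 | 85.86 | 53.01 | - | - |
| PaLI | - | - | 73.3 | - | - | PaLI: A Jointly-Scaled Multilingual Language-Image Model | |
| pk | 18.7 | 26.13 | 41.92 | 81.54 | 49.86 | - | - |
| BERT-RG | 2.71 | 1.21 | 6.25 | 7.13 | 79.85 | - | - |
| CLIP-Ensemble | - | - | 61.64 | - | - | Less Is More: Linear Layers on CLIP Features as Powerful VizWiz Model | - |
| CLIP-Single | - | - | 60.66 | - | - | Less Is More: Linear Layers on CLIP Features as Powerful VizWiz Model | - |
| HSSLab | 27.1 | 42.3 | 56.33 | 89.49 | 78.89 | - | - |
| shaunakh | 22.22 | 34.21 | 48.39 | 83.43 | 60.65 | - | - |
| Modified Attention | 20.6 | 34.14 | 49.58 | 88.26 | 59.79 | - | - |
| Tartans | 23.04 | 19.05 | 34.96 | 71.45 | 60.08 | - | - |
| SKP | 18.97 | 28.12 | 44.62 | 84.32 | 63.8 | - | - |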