Command Palette

Search for a command to run...

Visual Question Answering On Vizwiz 2020 Vqa

평가 지표

number
other
overall
unanswerable
yes/no

평가 결과

이 벤치마크에서 각 모델의 성능 결과

Paper Title
PaLI--73.3--PaLI: A Jointly-Scaled Multilingual Language-Image Model
CLIP-Ensemble--61.64--Less Is More: Linear Layers on CLIP Features as Powerful VizWiz Model
CLIP-Single--60.66--Less Is More: Linear Layers on CLIP Features as Powerful VizWiz Model
HSSLab27.142.356.3389.4978.89-
Video-LaVIT--56.0--Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization
sudoku26.8342.2955.9388.9573.45-
Katya27.3740.9254.7686.8280.52-
Modified Attention20.634.1449.5888.2659.79-
shaunakh22.2234.2148.3983.4360.65-
e5018.1628.8844.984.1360.08-
SKP18.9728.1244.6284.3263.8-
knight77717.3427.3444.0185.8653.01-
pk18.726.1341.9281.5449.86-
Tartans23.0419.0534.9671.4560.08-
VWTest114.0917.5734.1378.225.31-
BERT-RG2.711.216.257.1379.85-
0 of 16 row(s) selected.
Visual Question Answering On Vizwiz 2020 Vqa | SOTA | HyperAI초신경