HyperAI

Visual Grounding On Refcoco Test B

Metrics

Accuracy (%)

Results

Performance results of various models on this benchmark

Comparison Table
Model NameAccuracy (%)
toward-building-general-foundation-models-for79.8
x-2-vlm-all-in-one-pre-trained-model-for78.4
mplug-2-a-modularized-multi-modal-foundation86.05
multi-grained-vision-language-pre-training76.91
florence-2-advancing-a-unified-representation92.0
x-2-vlm-all-in-one-pre-trained-model-for81.8