HyperAI
Startseite
Neuigkeiten
Neueste Forschungsarbeiten
Tutorials
Datensätze
Wiki
SOTA
LLM-Modelle
GPU-Rangliste
Veranstaltungen
Suche
Über
Deutsch
HyperAI
Toggle sidebar
Seite durchsuchen…
⌘
K
Startseite
SOTA
Visual Grounding
Visual Grounding On Refcoco Testa
Visual Grounding On Refcoco Testa
Metriken
IoU
Ergebnisse
Leistungsergebnisse verschiedener Modelle zu diesem Benchmark
Columns
Modellname
IoU
Paper Title
Repository
HYDRA
61.1
HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning
XFM (base)
-
Toward Building General Foundation Models for Language, Vision, and Vision-Language Understanding Tasks
X2-VLM (large)
-
X$^2$-VLM: All-In-One Pre-trained Model For Vision-Language Tasks
X2-VLM (base)
-
X$^2$-VLM: All-In-One Pre-trained Model For Vision-Language Tasks
X-VLM (base)
-
Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts
mPLUG-2
-
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video
Florence-2-large-ft
-
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
0 of 7 row(s) selected.
Previous
Next