HyperAI
Accueil
Actualités
Articles de recherche récents
Tutoriels
Ensembles de données
Wiki
SOTA
Modèles LLM
Classement GPU
Événements
Recherche
À propos
Français
HyperAI
Toggle sidebar
Rechercher sur le site...
⌘
K
Accueil
SOTA
Visual Grounding
Visual Grounding On Refcoco Testa
Visual Grounding On Refcoco Testa
Métriques
IoU
Résultats
Résultats de performance de divers modèles sur ce benchmark
Columns
Nom du modèle
IoU
Paper Title
Repository
HYDRA
61.1
HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning
XFM (base)
-
Toward Building General Foundation Models for Language, Vision, and Vision-Language Understanding Tasks
X2-VLM (large)
-
X$^2$-VLM: All-In-One Pre-trained Model For Vision-Language Tasks
X2-VLM (base)
-
X$^2$-VLM: All-In-One Pre-trained Model For Vision-Language Tasks
X-VLM (base)
-
Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts
mPLUG-2
-
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video
Florence-2-large-ft
-
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
0 of 7 row(s) selected.
Previous
Next