Home News Papers Tutorials Datasets Wiki SOTA LLM Models GPU Leaderboard Events

English

Described Object Detection On Description

Metrics

Intra-scenario ABS mAP

Intra-scenario FULL mAP

Intra-scenario PRES mAP

Results

Performance results of various models on this benchmark

Model Name	Intra-scenario ABS mAP	Intra-scenario FULL mAP	Intra-scenario PRES mAP	Paper Title	Repository
SPHINX-7B	7.9	10.6	11.4	SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models
CORA-R50	5.0	6.2	6.7	CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-Matching
OFA-DOD-base	15.4	21.6	23.7	Described Object Detection: Liberating Object Detection with Flexible Expressions
GLIP-T	21.5	19.1	18.3	Grounded Language-Image Pre-training
UNINEXT-large	15.9	17.9	18.6	Universal Instance Perception as Object Discovery and Retrieval
FIBER-B	26.0	22.7	21.5	Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
MM-Grounding-DINO	26.0	22.9	21.9	An Open and Comprehensive Pipeline for Unified Object Grounding and Detection
OWL-ViT-base	8.8	8.6	8.5	Simple Open-Vocabulary Object Detection with Vision Transformers

0 of 8 row(s) selected.