Described Object Detection On Description
Metrics
Intra-scenario ABS mAP
Intra-scenario FULL mAP
Intra-scenario PRES mAP
Results
Performance results of various models on this benchmark
Model Name | Intra-scenario ABS mAP | Intra-scenario FULL mAP | Intra-scenario PRES mAP | Paper Title | Repository |
---|---|---|---|---|---|
SPHINX-7B | 7.9 | 10.6 | 11.4 | SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models | |
CORA-R50 | 5.0 | 6.2 | 6.7 | CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-Matching | |
OFA-DOD-base | 15.4 | 21.6 | 23.7 | Described Object Detection: Liberating Object Detection with Flexible Expressions | |
GLIP-T | 21.5 | 19.1 | 18.3 | Grounded Language-Image Pre-training | |
UNINEXT-large | 15.9 | 17.9 | 18.6 | Universal Instance Perception as Object Discovery and Retrieval | |
FIBER-B | 26.0 | 22.7 | 21.5 | Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone | |
MM-Grounding-DINO | 26.0 | 22.9 | 21.9 | An Open and Comprehensive Pipeline for Unified Object Grounding and Detection | |
OWL-ViT-base | 8.8 | 8.6 | 8.5 | Simple Open-Vocabulary Object Detection with Vision Transformers | |