Open Vocabulary Object Detection On Mscoco
Metrics
AP 0.5
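The page does not define the metric. Assuming "AP 0.5" follows the usual COCO convention (average precision with an intersection-over-union threshold of 0.5), a predicted box counts as a correct detection only if its IoU with a ground-truth box of the same class is at least 0.5. The sketch below illustrates just that matching criterion; the function names and the (x_min, y_min, x_max, y_max) box layout are illustrative assumptions, not part of the benchmark's own tooling, and the full AP computation (ranking by confidence, precision-recall integration) is omitted.

```python
def iou(box_a, box_b):
    """Intersection over union of two boxes given as (x_min, y_min, x_max, y_max)."""
    # Corners of the overlapping rectangle, if any.
    ix_min = max(box_a[0], box_b[0])
    iy_min = max(box_a[1], box_b[1])
    ix_max = min(box_a[2], box_b[2])
    iy_max = min(box_a[3], box_b[3])

    # Clamp to zero so that disjoint boxes yield an empty intersection.
    inter = max(0.0, ix_max - ix_min) * max(0.0, iy_max - iy_min)

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def is_match(pred_box, gt_box, threshold=0.5):
    """Hypothetical helper: True if the prediction overlaps the ground truth enough."""
    return iou(pred_box, gt_box) >= threshold
```

For example, two 10x10 boxes offset horizontally by 2 pixels overlap with an IoU of 80/120 and would match, while an offset of 5 pixels gives 50/150 and would not.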
Results
Performance results of various models on this benchmark
| Model Name | AP 0.5 | Paper Title | Repository |
| --- | --- | --- | --- |
| VL-PLM (RN50) | 34.4 | Exploiting Unlabeled Data with Vision and Language Models for Object Detection | |
| CORA | 41.7 | CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-Matching | |
| LP-OVOD | 40.5 | LP-OVOD: Open-Vocabulary Object Detection by Linear Probing | |
| OV-DQUO(R50) | 39.2 | OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision | |
| RALF | 41.3 | Retrieval-Augmented Open-Vocabulary Object Detection | |
| OADP | 30.0 | Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection | |
| Cooperative Foundational Models | 50.3 | Enhancing Novel Object Detection via Cooperative Foundational Models | |
| CFM-ViT | 34.1 | Contrastive Feature Masking Open-Vocabulary Vision Transformer | - |
| Detic | 27.8 | Detecting Twenty-thousand Classes using Image-level Supervision | |
| OV-DQUO(RN50x4) | 45.6 | OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision | |
| DE-ViT | 50 | Detect Everything with Few Examples | |
| BARON | 42.7 | Aligning Bag of Regions for Open-Vocabulary Object Detection | |
| Region-CLIP (RN50-C4) | 31.4 | RegionCLIP: Region-based Language-Image Pretraining | |
| OADP (G-OVD) | 35.6 | Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection | |
| LocOv (RN50-C4) | 28.6 | Localized Vision-Language Matching for Open-vocabulary Object Detection | |
| SIA-OVD (RN50) | 35.5 | SIA-OVD: Shape-Invariant Adapter for Bridging the Image-Region Gap in Open-Vocabulary Detection | |
| Object-Centric-OVD | 36.9 | Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection | |
| DITO | 46.1 | Region-centric Image-Language Pretraining for Open-Vocabulary Detection | - |
| OV-DETR | 29.4 | Open-Vocabulary DETR with Conditional Matching | |
| ViLD | 27.6 | Open-vocabulary Object Detection via Vision and Language Knowledge Distillation | |