SSD (VGG-16) | 13.6 | 0.36 | SSD: Single Shot MultiBox Detector | |
UniverseNet (R2-101-DCN) | - | 1.86 | USB: Universal-Scale Object Detection Benchmark | |
VFNet
(RX-101-64x4d) | 28.0 | 5.27 | VarifocalNet: An IoU-aware Dense Object Detector | |
RetinaNet
(ResNet-50) | 16.6 | 0.18 | Focal Loss for Dense Object Detection | |
GLIP-L
(Swin-L) | 48.0 | 24.89 | Grounded Language-Image Pre-training | |
YOLOv3
(DarkNet-53) | 14.8 | -0.37 | YOLOv3: An Incremental Improvement | |
RepPointsV2
(RX-101-64x4d-DCN) | 24.9 | 2.7 | RepPoints V2: Verification Meets Regression for Object Detection | |