CenterNet(DLA34, Flip, 512x512) | 80.7% | Objects as Points | |
Perona Malik (Perona and Malik, 1990) | 74.37% | Learning Visual Representations for Transfer Learning by Suppressing Texture | |
ThunderNet SNet535 Backbone | 78.6% | ThunderNet: Towards Real-time Generic Object Detection | |
HSD (VGG16, 512x512, single-scale test) | 83.0% | Hierarchical Shot Detector | |
Deformable Parts Model (DeepPyramid) | 45.2% | Deformable Part Models are Convolutional Neural Networks | |
HSD (VGG16, 320x320, single-scale test) | 81.7% | Hierarchical Shot Detector | |
SSD512 (07+12+COCO) | 81.6% | SSD: Single Shot MultiBox Detector | |
VGG-16 + KL Loss + var voting + soft-NMS | 71.6% | Bounding Box Regression with Uncertainty for Accurate Object Detection | |