F2DNet (extra data) | 26.23 | 7.8 | 9.43 | 0.44s/img | F2DNet: Fast Focal Detection Network for Pedestrian Detection | |
FRCNN+FPN-Res50+refined feature map+Crowdhuman | - | 10.67 | - | - | CrowdHuman: A Benchmark for Detecting Human in a Crowd | |
CSP (with offset) + ResNet-50 | 49.3 | 11.0 | 16.0 | 0.33s/img | Center and Scale Prediction: Anchor-free Approach for Pedestrian and Face Detection | |