
3D Random Occlusion and Multi-Layer Projection for Deep Multi-Camera Pedestrian Localization

Qiu, Rui ; Xu, Ming ; Yan, Yuyao ; Smith, Jeremy S. ; Yang, Xi
Abstract

Although deep-learning based methods for monocular pedestrian detection have made great progress, they are still vulnerable to heavy occlusions. Using multi-view information fusion is a potential solution but has limited applications, due to the lack of annotated training samples in existing multi-view datasets, which increases the risk of overfitting. To address this problem, a data augmentation method is proposed to randomly generate 3D cylinder occlusions, on the ground plane, which are of the average size of pedestrians and projected to multiple views, to relieve the impact of overfitting in the training. Moreover, the feature map of each view is projected to multiple parallel planes at different heights, by using homographies, which allows the CNNs to fully utilize the features across the height of each pedestrian to infer the locations of pedestrians on the ground plane. The proposed 3DROM method has a greatly improved performance in comparison with the state-of-the-art deep-learning based methods for multi-view pedestrian detection.
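
The two geometric ideas in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a calibrated pinhole camera with a known 3x4 projection matrix P and a world frame whose z axis is height above the ground. Under those assumptions, the homography for the plane z = h is built from the columns of P as [p1, p2, h*p3 + p4]. The example camera matrix, the cylinder radius (0.3 m) and height (1.8 m), and all function names are hypothetical values chosen for illustration.

```python
import math

# Hypothetical overhead camera (assumption, not from the paper):
# focal length 1000 px, principal point (640, 360), 10 m above the ground.
EXAMPLE_P = [
    [1000.0, 0.0, -640.0, 6400.0],
    [0.0, -1000.0, -360.0, 3600.0],
    [0.0, 0.0, -1.0, 10.0],
]


def project_point(P, x, y, z):
    """Project a 3D world point to pixel coordinates with a 3x4 matrix P."""
    u, v, w = (r[0] * x + r[1] * y + r[2] * z + r[3] for r in P)
    return (u / w, v / w)


def plane_homography(P, height):
    """Homography from plane coordinates (x, y, 1) at z = height to pixels.

    Substituting z = height into P @ [x, y, z, 1] folds the third column
    into the fourth, leaving a 3x3 matrix.
    """
    return [[r[0], r[1], height * r[2] + r[3]] for r in P]


def apply_homography(H, x, y):
    """Map a point on the plane to pixel coordinates."""
    u, v, w = (r[0] * x + r[1] * y + r[2] for r in H)
    return (u / w, v / w)


def cylinder_bbox(P, cx, cy, radius=0.3, height=1.8, samples=16):
    """Approximate image bounding box of a cylinder standing at (cx, cy).

    Samples the bottom and top rims and projects them into the view.
    Radius and height are assumed pedestrian-sized defaults.
    """
    pts = []
    for z in (0.0, height):
        for k in range(samples):
            a = 2.0 * math.pi * k / samples
            pts.append(project_point(
                P, cx + radius * math.cos(a), cy + radius * math.sin(a), z))
    us = [p[0] for p in pts]
    vs = [p[1] for p in pts]
    return (min(us), min(vs), max(us), max(vs))


if __name__ == "__main__":
    # One homography per parallel plane (heights are illustrative).
    for h in (0.0, 0.6, 1.2, 1.8):
        print(h, apply_homography(plane_homography(EXAMPLE_P, h), 1.0, 2.0))
    # Image region that a random cylinder occlusion would cover in this view.
    print(cylinder_bbox(EXAMPLE_P, 1.0, 2.0))
```

In a multi-view setup, the same ground position would be projected with each camera's own matrix, so one sampled cylinder yields a consistent occluded region in every view. How the occluded region is filled, and how the projected feature layers are combined, is not specified in the abstract and is left out here.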
