
Generalized Focal Loss V2: Learning Reliable Localization Quality Estimation for Dense Object Detection

Li, Xiang ; Wang, Wenhai ; Hu, Xiaolin ; Li, Jun ; Tang, Jinhui ; Yang, Jian
Abstract

Localization Quality Estimation (LQE) is crucial and popular in the recent advancement of dense object detectors, since it can provide accurate ranking scores that benefit Non-Maximum Suppression processing and improve detection performance. As a common practice, most existing methods predict LQE scores through vanilla convolutional features shared with object classification or bounding box regression. In this paper, we explore a completely novel and different perspective to perform LQE, based on the learned distributions of the four parameters of the bounding box. The bounding box distributions were introduced as the "General Distribution" in GFLV1, which describes the uncertainty of the predicted bounding boxes well. Such a property makes the distribution statistics of a bounding box highly correlated with its real localization quality. Specifically, a bounding box distribution with a sharp peak usually corresponds to high localization quality, and vice versa. By leveraging the close correlation between distribution statistics and the real localization quality, we develop a considerably lightweight Distribution-Guided Quality Predictor (DGQP) for reliable LQE based on GFLV1, thus producing GFLV2. To the best of our knowledge, it is the first attempt in object detection to use a highly relevant, statistical representation to facilitate LQE. Extensive experiments demonstrate the effectiveness of our method. Notably, GFLV2 (ResNet-101) achieves 46.2 AP at 14.6 FPS, surpassing the previous state-of-the-art ATSS baseline (43.6 AP at 14.6 FPS) by an absolute 2.6 AP on COCO test-dev, without sacrificing efficiency in either training or inference. Code will be available at https://github.com/implus/GFocalV2.
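
The idea of predicting quality from distribution statistics can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. The abstract does not specify which statistics are used or how large the predictor is, so the choices here are assumptions made for illustration: the top-k probabilities of each side's discretized distribution plus their mean, 17 bins per side, k = 4, and a two-layer predictor with 64 hidden units. The function names and the random weights are hypothetical.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def distribution_statistics(side_logits, k=4):
    """Summarize the four per-side distributions of one box.

    side_logits: array of shape (4, n_bins), one row per box side.
    Returns a flat vector of length 4 * (k + 1): for each side, the k
    largest probabilities (descending) followed by their mean.
    The choice of statistics is an assumption of this sketch.
    """
    probs = softmax(side_logits, axis=-1)
    topk = np.sort(probs, axis=-1)[:, ::-1][:, :k]      # (4, k)
    topk_mean = topk.mean(axis=-1, keepdims=True)       # (4, 1)
    return np.concatenate([topk, topk_mean], axis=-1).reshape(-1)


def quality_score(stats, w1, b1, w2, b2):
    """Tiny two-layer predictor: linear -> ReLU -> linear -> sigmoid."""
    hidden = np.maximum(0.0, stats @ w1 + b1)
    logit = hidden @ w2 + b2
    return 1.0 / (1.0 + np.exp(-logit))


# Illustrative inputs (assumed sizes: 17 bins per side, k = 4).
n_bins, k, hidden_dim = 17, 4, 64

sharp_logits = np.zeros((4, n_bins))
sharp_logits[:, 5] = 8.0                 # one dominant bin per side
flat_logits = np.zeros((4, n_bins))      # uniform, i.e. highly uncertain

sharp_stats = distribution_statistics(sharp_logits, k)
flat_stats = distribution_statistics(flat_logits, k)

# Random, untrained weights: the score below only demonstrates shapes
# and data flow, not a meaningful quality estimate.
rng = np.random.default_rng(0)
w1 = rng.normal(scale=0.1, size=(4 * (k + 1), hidden_dim))
b1 = np.zeros(hidden_dim)
w2 = rng.normal(scale=0.1, size=hidden_dim)
b2 = 0.0

score = quality_score(sharp_stats, w1, b1, w2, b2)
print("top-1 prob (sharp):", sharp_stats[0])
print("top-1 prob (flat): ", flat_stats[0])
print("untrained quality score:", score)
```

The sharply peaked distribution yields a much larger top-1 probability than the flat one, which is the kind of signal a trained predictor could map to a localization quality score. The input has only 4 × (k + 1) values per box, so a predictor of this shape adds little computation, in line with the "considerably lightweight" description.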