Open Vocabulary Object Detection
Open Vocabulary Object Detection (OVD) is a cutting-edge task in the field of computer vision that aims to overcome the limitations of the finite number of annotated categories during the training phase, enabling the detection of unseen object categories. This task leverages an infinite vocabulary to automatically recognize and classify novel objects during inference, significantly enhancing the model's generalization and adaptability capabilities, and holds important application value.