Referring Expression
Referring Expression is a computer vision subtask that aims to locate the instances in an image that match a given natural-language description and to draw bounding boxes around them. The task requires a model both to understand the description and to recognize the specific objects it refers to, and it has wide applications in human-computer interaction, image annotation, and intelligent search.
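As a rough illustration of what the task takes in and produces, the sketch below pairs an image and a description with a target bounding box and checks a predicted box against it using intersection over union (IoU). The names `ReferringSample`, `iou`, and `is_correct`, the file name, and the coordinates are hypothetical and chosen only for this example. The 0.5 IoU threshold is a commonly used evaluation convention and is assumed here, not something stated in this description.

```python
from dataclasses import dataclass
from typing import Tuple

# Assumed box format: (x_min, y_min, x_max, y_max) in pixel coordinates.
Box = Tuple[float, float, float, float]


@dataclass
class ReferringSample:
    """One hypothetical example: an image, a description, and the box it refers to."""
    image_path: str
    expression: str
    target_box: Box


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix_min, iy_min = max(a[0], b[0]), max(a[1], b[1])
    ix_max, iy_max = min(a[2], b[2]), min(a[3], b[3])
    # Width and height of the overlap are clamped at zero for disjoint boxes.
    inter = max(0.0, ix_max - ix_min) * max(0.0, iy_max - iy_min)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def is_correct(predicted: Box, sample: ReferringSample, threshold: float = 0.5) -> bool:
    """Count a prediction as correct when its IoU with the target meets the threshold."""
    return iou(predicted, sample.target_box) >= threshold


sample = ReferringSample(
    image_path="street.jpg",  # hypothetical file name
    expression="the man in the red jacket on the left",
    target_box=(10.0, 20.0, 110.0, 220.0),
)
predicted_box = (10.0, 20.0, 110.0, 120.0)  # stand-in for a model's output
print(iou(predicted_box, sample.target_box), is_correct(predicted_box, sample))
```

A real system would replace `predicted_box` with the output of a model that takes the image and the expression as input; the scoring step stays the same.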