Video Adverb Retrieval
Video-Adverb Retrieval is a subtask in the field of computer vision that aims to retrieve adverbs matching specific actions through video content, as well as to search for corresponding video clips based on a given adverb. This task can enhance the understanding and description accuracy of video content, providing significant support for applications such as video annotation, search, and intelligent editing.