Video Description

The goal of automatic video captioning is to narrate the events in a video. Early methods generated captions for short, manually segmented clips, each describing a single event. More recent dense video captioning techniques jointly segment a video into events over time and describe those events coherently. The task generalizes dense image region captioning, and it has practical applications such as generating textual summaries for visually impaired users and detecting and describing important events in surveillance video.
