HyperAI

Long Video Activity Recognition

Long-video Activity Recognition (LAR) focuses on modeling the long-term relationships between all actions in a long video. It aims to identify all actions within each long video through weak supervision using a set of video-level action categories. This task utilizes mean Average Precision (mAP) as an evaluation metric and has significant application value, such as in intelligent surveillance, sports analysis, and film content understanding.