S3R (without audio imformation) | 80.26 | Self-supervised Sparse Representation for Video Anomaly Detection | |
Contrastive Attention for Video Anomaly Detection | 76.9 | - | - |
A Neural Network Containing Three Parallel Branches (holistic, localized, and score branch) | 78.64 | Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision | |