ActionMamba(InternVideo2-6B) | 29.04 | 45.44 | 28.82 | 6.79 | Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding | |
DyFADet (VideoMAE v2-g) | 23.8 | 37.1 | 23.7 | 5.9 | DyFADet: Dynamic Feature Aggregation for Temporal Action Detection | |
G-TAD (i3d feature) | 9.06 | 13.74 | 8.83 | 3.06 | G-TAD: Sub-Graph Localization for Temporal Action Detection | |