HyperAI超神经

Action Recognition in Videos on NTU RGB+D 120

Evaluation Metrics

Accuracy (Cross-Setup)
Accuracy (Cross-Subject)

Evaluation Results

Performance of each model on this benchmark

| Model | Accuracy (Cross-Setup) | Accuracy (Cross-Subject) | Paper Title | Repository |
|---|---|---|---|---|
| DSCNet (RGB + Pose) | 96.7 | 95.6 | A Dense-Sparse Complementary Network for Human Action Recognition based on RGB and Skeleton Modalities | |
| Body Pose Evolution Map | 64.6 | 66.9 | Recognizing Human Actions as the Evolution of Pose Estimation Maps | - |
| 3DA (RGB + Pose) | 91.4 | 90.5 | Cross-Modal Learning with 3D Deformable Attention for Action Recognition | - |
| Gimme Signals (AIS) | 70.8 | 71.59 | Gimme Signals: Discriminative signal encoding for multimodal activity recognition | |
| Skelemotion + Yang et al. (skeleton only) | 66.9 | 67.7 | SkeleMotion: A New Representation of Skeleton Joint Sequences Based on Motion Information for 3D Action Recognition | |
| π-ViT (RGB only) | 91.9 | 92.9 | Just Add π! Pose Induced Video Transformers for Understanding Activities of Daily Living | |
| VPN++ (RGB + Pose) | 90.7 | 92.5 | VPN++: Rethinking Video-Pose embeddings for understanding Activities of Daily Living | |
| π-ViT (RGB + Pose) | 96.1 | 95.1 | Just Add π! Pose Induced Video Transformers for Understanding Activities of Daily Living | |
| DVANet (RGB only) | 90.4 | 91.6 | DVANet: Disentangling View and Action Features for Multi-View Action Recognition | |
| TSRJI | 67.9 | 62.8 | Skeleton Image Representation for 3D Action Recognition based on Tree Structure and Reference Joints | |
| ST-GCN + AS-GCN w/DH-TCN | 78.3 | 79.2 | Vertex Feature Encoding and Hierarchical Temporal Modeling in a Spatial-Temporal Graph Convolutional Network for Action Recognition | - |
| VPN (RGB + Pose) | 86.3 | 87.8 | VPN: Learning Video-Pose Embedding for Activities of Daily Living | |
| EPP-Net (Parsing + Pose) | 92.8 | 91.1 | Explore Human Parsing Modality for Action Recognition | |
| ViewCon (RGB) | 87.5 | 85.6 | Multi-View Action Recognition Using Contrastive Learning | |
| IPP-Net (Parsing + Pose) | 91.7 | 90.0 | Integrating Human Parsing and Pose Network for Human Action Recognition | |
| STAR-Transformer (RGB + Pose) | 92.7 | 90.3 | STAR-Transformer: A Spatio-temporal Cross Attention Transformer for Human Action Recognition | - |
| MMNet (RGB + Pose) | 94.4 | 92.9 | MMNet: A Model-Based Multimodal Network for Human Action Recognition in RGB-D Videos | |
| PoseC3D (RGB + Pose) | 96.4 | 95.3 | Revisiting Skeleton-based Action Recognition | |