Video Clip Ordering (R3D) | false | UCF101 | 29.5 | Self-Supervised Spatiotemporal Learning via Video Clip Order Prediction | - |
3D Cubic Puzzles (3D ResNet-18) | false | Kinetics400 | 33.7 | Self-Supervised Video Representation Learning with Space-Time Cubic Puzzles | - |
3D RotNet (3D ResNet-18) | false | Kinetics400 | 33.7 | Self-Supervised Spatiotemporal Feature Learning via Video Rotation Prediction | - |
CVRL (R3D-152 2x; K600) | false | Kinetics600 | 69.9 | Spatiotemporal Contrastive Video Representation Learning | |
BraVe:V-FA (TSM-50x2) | false | - | 70.5 | Broaden Your Views for Self-Supervised Video Learning | |
Shuffle and Learn (AlexNet) | false | UCF101 | 19.8 | Shuffle and Learn: Unsupervised Learning using Temporal Order Verification | - |
DPC (Modified 3D ResNet-18) | false | Kinetics400 | 34.5 | Video Representation Learning by Dense Predictive Coding | |