SVG-LP (from Grid-keypoints) | 10 | 157.9 | 0.129 | 23.91 | 22.8 | 40 | 0.800 | 10 | Stochastic Video Generation with a Learned Prior | |
SAVP-VAE | 10 | - | - | 27.77 | - | 20 | 0.852 | - | Stochastic Adversarial Video Prediction | |
SLAMP | 10 | 228 ± 5 | 0.0795±0.0034 | 29.39±0.30 | - | 30 | 0.8646±0.0050 | 10 | SLAMP: Stochastic Latent Appearance and Motion Prediction | |
SV2P time-invariant (from Grid-keypoints) | 10 | 253.5 | 0.260 | 25.70 | 8.3 | 40 | 0.772 | 10 | Stochastic Variational Video Prediction | |
SRVP | 10 | 222 ± 3 | 0.0736±0.0029 | 29.69±0.32 | - | 30 | 0.8697±0.0046 | 10 | Stochastic Latent Residual Video Prediction | |
Struct-VRNN (from Grid-keypoints) | 10 | 395.0 | 0.124 | 24.29 | 2.3 | 40 | 0.766 | 10 | Unsupervised Learning of Object Structure and Dynamics from Videos | |
SV2P time-variant (from Grid-keypoints) | 10 | 209.5 | 0.232 | 25.87 | 8.3 | 40 | 0.782 | 10 | Stochastic Variational Video Prediction | |
SAVP-VAE (from Grid-keypoints) | 10 | 145.7 | 0.116 | 26.00 | 7.3 | 40 | 0.806 | 10 | Stochastic Adversarial Video Prediction | |
Grid-keypoints | 10 | 144.2 | 0.092 | 27.11 | 2.0 | 40 | 0.837 | 10 | Accurate Grid Keypoint Learning for Efficient Video Prediction | |