Video Transformer | 5 | 170±5 | 11 | Scaling Autoregressive Video Models | - |
MAGVIT (-L-FP) | 5 | 9.9±0.3 | 11 | MAGVIT: Masked Generative Video Transformer | - |
MAGVIT (-B-FP) | 5 | 24.5±0.9 | 11 | MAGVIT: Masked Generative Video Transformer | - |
Video VQ-VAE FVD | 4 | 64.30±2.04 | 12 | Predicting Video with VQVAE | - |