Video Generation on BAIR Robot Pushing

Metrics

Cond
FVD score
Pred
Train

Results

Performance results of various models on this benchmark

| Model Name | Cond | FVD score | Pred | Train | Paper Title | Repository |
|---|---|---|---|---|---|---|
| SVG-FP (from FVD) | 2 | 315.5 | 14 | 14 | Stochastic Video Generation with a Learned Prior | |
| Baseline (from LVT) | 1 | 320.9 | 15 | 15 | Latent Video Transformer | |
| WAM | 2 | 159.6 | 28 | 14 | Exploring Spatial-Temporal Multi-Frequency Analysis for High-Fidelity and Temporal-Consistency Video Prediction | |
| SVG-LP (from vRNN) | 2 | 256.62 | 28 | 10 | Stochastic Video Generation with a Learned Prior | |
| Video Transformer | 1 | 94 ± 2 | 15 | 15 | Scaling Autoregressive Video Models | |
| SAVP (from SRVP) | 2 | 152 ± 9 | 28 | 12 | Stochastic Adversarial Video Prediction | |
| NUWA | 1 | 86.9 | 15 | 15 | NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion | |
| SAVP (from vRNN) | 2 | 143.43 | 28 | 10 | Stochastic Adversarial Video Prediction | |
| VideoFlow | 3 | 131 ± 5 | 14 (total 16) | 10 | VideoFlow: A Conditional Flow-Based Model for Stochastic Video Generation | |
| SV2P (from FVD) | 2 | 262.5 | 14 | 14 | Stochastic Variational Video Prediction | |
| VRNN 1L | 2 | 149.22 | 28 | 10 | Improved Conditional VRNNs for Video Prediction | |
| TrIVD-GAN-FP | 1 | 103.3 | 15 | 15 | Transformation-based Adversarial Video Prediction on Large-Scale Data | - |
| SRVP | 2 | 162 ± 4 | 28 | 12 | Stochastic Latent Residual Video Prediction | |
| FitVid | 1 | 93.6 | 15 | 15 | FitVid: Overfitting in Pixel-Level Video Prediction | |
| MoCoGAN | 4 | 503 | 12 | 12 | MoCoGAN: Decomposing Motion and Content for Video Generation | |
| DVD-GAN-FP | 1 | 109.8 | 15 | 15 | Adversarial Video Generation on Complex Datasets | |
| SAVP (from FVD) | 2 | 116.4 | 14 | 14 | Stochastic Adversarial Video Prediction | |
| RaMViD | 1 | 84.20 | 15 | 20 | Diffusion Models for Video Prediction and Infilling | |
| MAGVIT | 1 | 62 | 15 | 15 | MAGVIT: Masked Generative Video Transformer | |
| MCVD : c2t5p14 | 2 | 87.9 | 14 | 5 | MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation | |