HyperAI

Video Prediction On Kinetics 600 12 Frames

Metriken

Cond
FVD
Pred

Ergebnisse

Leistungsergebnisse verschiedener Modelle zu diesem Benchmark

Vergleichstabelle
ModellnameCondFVDPred
scaling-autoregressive-video-models5170±511
omnitokenizer-a-joint-image-video-tokenizer-32.9-
larp-tokenizing-videos-with-a-learned-155.111
efficient-video-generation-on-complex569.15±0.7811
latent-video-transformer5224.7311
scalable-adaptive-computation-for-iterative-10.8-
transformation-based-adversarial-video525.74±0.6611
magvit-masked-generative-video-transformer59.9±0.311
language-model-beats-diffusion-tokenizer-is-4.3±0.1-
ccvs-context-aware-controllable-video555±111
magvit-masked-generative-video-transformer524.5±0.911
photorealistic-video-generation-with-3.3-
predicting-video-with-vqvae-1464.30±2.0412
scalable-adaptive-computation-for-iterative-11.5-
diffusion-models-for-video-prediction-and516.4611