HyperAI초신경

Video Generation On Ucf 101

평가 지표

FVD16
Inception Score
KVD16

평가 결과

이 벤치마크에서 각 모델의 성능 결과

비교 표
모델 이름FVD16Inception ScoreKVD16
generating-videos-with-dynamics-aware-146559.6839.6
masked-conditional-video-diffusion-for1143--
magvit-masked-generative-video-transformer265--
preserve-your-own-correlation-a-noise-prior355.1947.76-
latent-video-diffusion-models-for-high552-42
long-video-generation-with-time-agnostic33279.28-
make-pixels-dance-high-dynamic-video242.8242.10-
acdit-interpolating-autoregressive90--
lumiere-a-space-time-diffusion-model-for332.4937.54-
tell-me-what-happened-unifying-text-guided32873.7-
magvit-masked-generative-video-transformer76±289.27±0.15-
make-a-video-text-to-video-generation-without367.2333-
omnitokenizer-a-joint-image-video-tokenizer191--
make-a-video-text-to-video-generation-without81.2582.55-
grid-diffusion-models-for-text-to-video-1340.062.88-
decomposed-diffusion-models-for-high-quality22072.22-
latent-video-diffusion-models-for-high1396-116
videoassembler-identity-consistent-video346.8448.01-
video-lavit-unified-video-language-pre280.5744.26-
language-model-beats-diffusion-tokenizer-is58±3--
larp-tokenizing-videos-with-a-learned-157--
lavie-high-quality-video-generation-with526.30--
towards-end-to-end-generative-modeling-of43865.93-
preserve-your-own-correlation-a-noise-prior31060.01-
long-video-generation-with-time-agnostic635-55
decomposed-diffusion-models-for-high-quality17380.03-
generating-videos-with-dynamics-aware-157732.70-
align-your-latents-high-resolution-video550.6133.45-
regis-refining-generated-videos-via-iterative141--
cogvideo-large-scale-pretraining-for-text-to30551.11-
videopoet-a-large-language-model-for-zero35538.44-
magicvideo-efficient-video-generation-with699--
latent-video-diffusion-models-for-high2460-148
photorealistic-video-generation-with36±2--
vidm-video-implicit-diffusion-models294.7--
latent-video-diffusion-models-for-high1209--
latent-video-diffusion-models-for-high372-27
a-good-image-generator-is-what-you-need-for-170033.95-
long-context-autoregressive-video-modeling57--
photorealistic-video-generation-with258.135.1-
hierarchical-patch-diffusion-models-for-high-166.3287.68-
fifo-diffusion-generating-infinite-videos-74.44-
tell-me-what-happened-unifying-text-guided39558.3-
long-video-generation-with-time-agnostic42057.63-
language-model-beats-diffusion-tokenizer-is109--
magvit-masked-generative-video-transformer159±283.55±0.14-