Lumiere (Zero-shot, 1024x1024) | 332.49 | Lumiere: A Space-Time Diffusion Model for Video Generation | |
PixelDance (Zero-shot, 256x256) | 242.82 | Make Pixels Dance: High-Dynamic Video Generation | - |
Snap Video (Zero-shot, 288×288) | 260.1 | Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis | - |
MagicVideo (Zero-shot, 256x256) | 699 | MagicVideo: Efficient Video Generation With Latent Diffusion Models | - |
Snap Video (Zero-shot, 512x288) | 200.2 | Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis | - |