HyperAI超神经

ComfyUI Wan2.1-VACE-14B Image-to-Video Workflow Tutorial

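I. Introduction

Wan2.1-VACE-14B is an all-in-one model for video generation and editing, open-sourced by Alibaba's Tongyi Wanxiang (Wan) team on May 15, 2025. Trained on the Wan2.1 base model, it is described as the first video AI tool in the industry to support flexible combination of multiple tasks, covering the full workflow from video generation to fine-grained editing in a single model. Supported tasks include text-to-video, image-to-video, and first-and-last-frame-to-video. The associated paper is "Wan: Open and Advanced Large-Scale Video Generative Models".

This tutorial runs on a single A6000 GPU, on which generating a video takes about 30 minutes. More powerful compute is recommended.

This workflow uses the following model files: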

  • wan2.1_vace_14B_fp16.safetensors
  • wan_2.1_vae.safetensors
  • umt5_xxl_fp8_e4m3fn_scaled.safetensors
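
Before loading the workflow, it can help to confirm that all three files are in place. The following is a minimal sketch, not part of the original tutorial: it assumes the files sit in the conventional ComfyUI subfolders (`diffusion_models`, `vae`, `text_encoders`), and the helper name `find_missing_models` and the `ComfyUI/models` root path are hypothetical. Adjust both to match the container's actual layout.

```python
from pathlib import Path

# Assumption: the conventional ComfyUI model subfolders. The tutorial does not
# state where the files are stored, so adjust this mapping if the layout differs.
EXPECTED_FILES = {
    "diffusion_models": "wan2.1_vace_14B_fp16.safetensors",
    "vae": "wan_2.1_vae.safetensors",
    "text_encoders": "umt5_xxl_fp8_e4m3fn_scaled.safetensors",
}


def find_missing_models(models_root):
    """Return the expected model paths that do not exist under models_root."""
    root = Path(models_root)
    missing = []
    for subfolder, filename in EXPECTED_FILES.items():
        candidate = root / subfolder / filename
        if not candidate.is_file():
            missing.append(str(candidate))
    return missing


if __name__ == "__main__":
    # "ComfyUI/models" is a hypothetical root path used for illustration only.
    for path in find_missing_models("ComfyUI/models"):
        print("missing:", path)
```

An empty result means every file listed above was found at its assumed location.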

II. Project Examples

III. Steps to Run

1. After starting the container, click the API address to open the web interface

If "Bad Gateway" is displayed, the model is still initializing. Because the model is large, wait about 1-2 minutes and then refresh the page.
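
Instead of refreshing by hand, the wait can be scripted. The sketch below is illustrative and makes several assumptions: the function names `fetch_status` and `wait_until_ready` are hypothetical, the API address is whatever URL the container exposes, and a plain HTTP 200 on that address is taken to mean the interface is ready. The defaults (24 attempts, 5 seconds apart) roughly cover the 1-2 minute initialization window mentioned above.

```python
import time
import urllib.error
import urllib.request


def fetch_status(url, timeout_s=10.0):
    """Return the HTTP status code for url, or None if the server is unreachable."""
    try:
        with urllib.request.urlopen(url, timeout=timeout_s) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # Error responses such as 502 Bad Gateway arrive as HTTPError.
        return err.code
    except (urllib.error.URLError, OSError):
        # Connection refused, DNS failure, timeout, and similar.
        return None


def wait_until_ready(url, attempts=24, interval_s=5.0,
                     probe=fetch_status, sleep=time.sleep):
    """Poll url until it answers with HTTP 200.

    Returns the 1-based attempt number that succeeded, or None if every
    attempt failed. probe and sleep are injectable so the loop can be
    exercised without a network connection.
    """
    for attempt in range(1, attempts + 1):
        if probe(url) == 200:
            return attempt
        if attempt < attempts:
            sleep(interval_s)
    return None
```

Calling `wait_until_ready` with the API address shown for the container returns once the page responds normally, after which the Web interface can be opened in the browser.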

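2. Feature demonstration

Usage steps

After cloning the tutorial for the first time, you need to manually open the workflow file in the folder to load it.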

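IV. Discussion

🖌️ If you come across a high-quality project, feel free to leave us a message recommending it! We have also set up a tutorial discussion group. Scan the QR code and add the note [SD Tutorial] to join, discuss technical questions, and share your results.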

Citation

Thanks to GitHub user SuperYang for deploying this tutorial. The citation for this project is as follows:

@article{wan2025,
      title={Wan: Open and Advanced Large-Scale Video Generative Models}, 
      author={Team Wan and Ang Wang and Baole Ai and Bin Wen and Chaojie Mao and Chen-Wei Xie and Di Chen and Feiwu Yu and Haiming Zhao and Jianxiao Yang and Jianyuan Zeng and Jiayu Wang and Jingfeng Zhang and Jingren Zhou and Jinkai Wang and Jixuan Chen and Kai Zhu and Kang Zhao and Keyu Yan and Lianghua Huang and Mengyang Feng and Ningyi Zhang and Pandeng Li and Pingyu Wu and Ruihang Chu and Ruili Feng and Shiwei Zhang and Siyang Sun and Tao Fang and Tianxing Wang and Tianyi Gui and Tingyu Weng and Tong Shen and Wei Lin and Wei Wang and Wei Wang and Wenmeng Zhou and Wente Wang and Wenting Shen and Wenyuan Yu and Xianzhong Shi and Xiaoming Huang and Xin Xu and Yan Kou and Yangyu Lv and Yifei Li and Yijing Liu and Yiming Wang and Yingya Zhang and Yitong Huang and Yong Li and You Wu and Yu Liu and Yulin Pan and Yun Zheng and Yuntao Hong and Yupeng Shi and Yutong Feng and Zeyinzi Jiang and Zhen Han and Zhi-Fan Wu and Ziyu Liu},
      journal = {arXiv preprint arXiv:2503.20314},
      year={2025}
}