Soul-Bench Audio-Driven Human Animation Evaluation Dataset
Date
Paper URL
License
Non-Commercial
Soul-Bench is an evaluation benchmark for audio-driven human animation tasks, released by Tencent YouTu Lab in 2025. Related research papers include... Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal AnimationThe aim is to systematically evaluate the generation quality, consistency, and generalization ability of relevant methods in real-world application scenarios.
This dataset contains 226 video test samples, exhibiting a relatively rich distribution across multiple dimensions, as detailed below:
- Main body type distribution
- Upper body scenes: 107
- Full-body scenes: 72
- Portraits, animated characters, and animals: 47 items
- Audio type distribution
- Dialogue-based audio: 177 pieces
- Vocal performances: 49 items
- Video resolution distribution
- 1080P: 118 items
- 720P: 55 items
- 4K: 51 items
- 480P: 2 items
- Screen proportions
- 1 < r ≤ 2: 170 entries
- r = 1 (square): 44 lines
- 0.5 ≤ r < 1 (vertical): 12 lines
- Video duration distribution
- 27–30 second interval: 70 lines

Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.