AVSpeech – Audiovisual Speech Dataset
Date
6 years ago
Size
867.36 GB
Publish URL
AVSpeech is a new, large-scale audio-visual dataset consisting of video clips of speech without interfering background noise. The clips are 3-10 seconds long, and in each clip, the voice heard in the original soundtrack belongs to the only person visible speaking in the video.
The dataset contains approximately 4,700 hours of video clips from 290,000 YouTube videos, covering a wide variety of people, languages, and facial poses.
AVSpeech.torrent
Seeding 3Downloading 0Completed 2,818Total Downloads 4,253
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.
AI Co-coding
Ready-to-use GPUs
Best Pricing
Hyper Newsletters
Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp