AVSpeech is a new, large-scale audio-visual dataset comprising speech video clips with no interfering background noise. The segments are 3-10 seconds long, and in each clip the audible sound in the soundtrack belongs to a single speaking person who is visible in the video. In total, the dataset contains roughly 4700 hours of video segments from 290k YouTube videos, spanning a wide variety of people, languages and face poses. For more details on how we created the dataset, see our paper.
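The segment-length and total-duration figures above can be checked against a segment list such as `avspeech_train.csv`. The following is a minimal sketch, not the dataset authors' tooling. It assumes a column layout that this page does not state: one row per segment, no header row, with the fields video ID, start time in seconds, end time in seconds, and the speaker's face x and y coordinates. The field names, the `summarize` helper, and the sample rows are all hypothetical.

```python
import csv
import io

# ASSUMPTION: column layout is not documented on this page.
# One row per segment, no header: video id, start (s), end (s), face x, face y.
FIELDS = ["video_id", "start", "end", "face_x", "face_y"]


def segment_durations(csv_file):
    """Yield (video_id, duration_in_seconds) for every row of the file."""
    for row in csv.DictReader(csv_file, fieldnames=FIELDS):
        yield row["video_id"], float(row["end"]) - float(row["start"])


def summarize(csv_file, min_s=3.0, max_s=10.0):
    """Total hours, distinct video count, and segments outside [min_s, max_s]."""
    total_seconds = 0.0
    videos = set()
    out_of_range = 0
    for video_id, duration in segment_durations(csv_file):
        total_seconds += duration
        videos.add(video_id)
        if not (min_s <= duration <= max_s):
            out_of_range += 1
    return {
        "hours": total_seconds / 3600.0,
        "videos": len(videos),
        "out_of_range": out_of_range,
    }


# Made-up rows for illustration only; these are not real dataset entries.
sample = io.StringIO(
    "vidA,10.0,15.5,0.5,0.4\n"
    "vidA,30.0,33.0,0.6,0.5\n"
    "vidB,0.0,12.0,0.3,0.3\n"
)
stats = summarize(sample)
print(stats["videos"], stats["out_of_range"])
```

To run this on the real file, pass `open("avspeech_train.csv", newline="")` instead of the in-memory sample, and adjust `FIELDS` if the actual layout differs from the one assumed here.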






3 seeding, 0 downloading, 1258 completed
File name | Size |
---|---|
README.md | 1 KB |
README.txt | 1 KB |
avspeech_train.csv | 128 MB |
avspeech_train.part0.csv | 25 MB |
avspeech_train.part1.csv | 25 MB |
avspeech_train.part2.csv | 25 MB |
avspeech_train.part3.csv | 25 MB |
avspeech_train.part4.csv | 25 MB |
data.z01 | 97 GB |
data.z02 | 97 GB |
data.z03 | 97 GB |
data.z04 | 97 GB |
data.z05 | 97 GB |
data.z06 | 97 GB |
data.z07 | 97 GB |
data.z08 | 97 GB |
data.zip | 85 GB |
download.sh | 6 KB |
parallel-20190822.tar.bz2 | 1 MB |
parallel-20190822.tar.bz2.sig | 2 KB |
CITATION | 641 B |
COPYING | 34 KB |
CREDITS | 2 KB |
Makefile | 28 KB |
Makefile.am | 5 KB |
Makefile.in | 28 KB |
NEWS | 146 KB |
README | 4 KB |
aclocal.m4 | 27 KB |
cc-by-sa.txt | 27 KB |
config.h | 793 B |
config.h.in | 625 B |
config.log | 5 KB |
config.status | 28 KB |
configure | 107 KB |
configure.ac | 442 B |
fdl.txt | 22 KB |
install-sh | 14 KB |
missing | 6 KB |
Makefile | 31 KB |
Makefile.am | 13 KB |
Makefile.in | 31 KB |
env_parallel | 4 KB |
env_parallel.1 | 30 KB |
env_parallel.ash | 13 KB |
env_parallel.bash | 13 KB |
env_parallel.csh | 5 KB |
env_parallel.dash | 13 KB |
env_parallel.fish | 6 KB |
env_parallel.html | 30 KB |
env_parallel.ksh | 12 KB |
env_parallel.mksh | 12 KB |
env_parallel.pdf | 62 KB |
env_parallel.pdksh | 5 KB |
env_parallel.pod | 21 KB |
env_parallel.sh | 13 KB |
env_parallel.tcsh | 5 KB |
env_parallel.texi | 26 KB |
env_parallel.zsh | 12 KB |
niceload | 32 KB |
niceload.1 | 16 KB |
niceload.html | 14 KB |
niceload.pdf | 40 KB |
niceload.pod | 9 KB |
niceload.texi | 13 KB |
parallel | 374 KB |
parallel.1 | 187 KB |
parallel.html | 197 KB |
parallel.pdf | 428 KB |
parallel.pod | 151 KB |
parallel.texi | 186 KB |
parallel_alternatives.7 | 95 KB |
parallel_alternatives.html | 101 KB |
parallel_alternatives.pdf | 214 KB |
parallel_alternatives.pod | 75 KB |
parallel_alternatives.texi | 88 KB |
parallel_book.7 | 16 KB |
parallel_book.html | 15 KB |
parallel_book.pdf | 41 KB |
parallel_book.pod | 9 KB |
parallel_book.texi | 12 KB |
parallel_cheat.fodt | 87 KB |
parallel_cheat.pdf | 147 KB |
parallel_design.7 | 57 KB |
parallel_design.html | 58 KB |
parallel_design.pdf | 152 KB |
parallel_design.pod | 46 KB |
parallel_design.texi | 53 KB |
parallel_tutorial.7 | 93 KB |
parallel_tutorial.html | 98 KB |
parallel_tutorial.pdf | 209 KB |
parallel_tutorial.pod | 72 KB |
parallel_tutorial.texi | 90 KB |
parcat | 3 KB |
parcat.1 | 8 KB |
parcat.html | 5 KB |
parcat.pdf | 24 KB |
parcat.pod | 3 KB |
parcat.texi | 4 KB |
parset | 3 KB |
parset.1 | 11 KB |
parset.html | 10 KB |
parset.pdf | 31 KB |
parset.pod | 6 KB |
parset.texi | 8 KB |
sem | 374 KB |
sem.1 | 14 KB |
sem.html | 12 KB |
sem.pdf | 37 KB |
sem.pod | 8 KB |
sem.texi | 11 KB |
sql | 30 KB |
sql.1 | 16 KB |
sql.html | 15 KB |
sql.pdf | 40 KB |
sql.texi | 13 KB |
stamp-h1 | 23 B |