HyperAI초신경

Video To Sound Generation On Vgg Sound

평가 지표

FAD
FD

평가 결과

이 벤치마크에서 각 모델의 성능 결과

비교 표
모델 이름FADFD
read-watch-and-scream-sound-generation-from2.1615.24
frieren-efficient-video-to-audio-generation1.3212.26
taming-multimodal-joint-training-for-high0.795.22
masked-generative-video-to-audio-transformers2.04-
taming-multimodal-joint-training-for-high0.974.72
temporally-aligned-audio-for-video-with1.92-
v2a-mapper-a-lightweight-solution-for-vision0.84124.168
tell-what-you-hear-from-what-you-see-video-to2.38-