HyperAI

Audio Generation On Audiocaps

المقاييس

FAD
FD

النتائج

نتائج أداء النماذج المختلفة على هذا المعيار القياسي

جدول المقارنة
اسم النموذجFADFD
make-an-audio-2-temporal-enhanced-text-to1.8011.75
fast-timing-conditioned-latent-audio--
audiobox-unified-audio-generation-with0.778.30
taming-data-and-transformers-for-audio-11.2116.51
2406-154872.5417.19
audioldm-2-learning-holistic-audio-generation1.42-
retrieval-augmented-text-to-audio-generation1.37-
auffusion-leveraging-the-power-of-diffusion1.7623.08
etta-elucidating-the-design-space-of-text-to2.5113.12
audiogen-textually-guided-audio-generation3.13-
make-an-audio-text-to-audio-generation-with2.6618.32
tangoflux-super-fast-and-faithful-text-to--
etta-elucidating-the-design-space-of-text-to2.0310.10
diffsound-discrete-diffusion-model-for-text7.7547.68
audioldm-2-learning-holistic-audio-generation2.0226.18
long-form-music-generation-with-latent--
accelerating-diffusion-based-text-to-audio2.1820.44
auffusion-leveraging-the-power-of-diffusion1.6321.99
any-to-any-generation-via-composable1.8022.90
audioldm-text-to-audio-generation-with-latent1.9623.31
text-to-audio-generation-using-instruction1.5924.52
stable-audio-open--
tangoflux-super-fast-and-faithful-text-to--