HyperAI

Text To Music Generation On Musiccaps

Metrics

FAD

Results

Performance results of various models on this benchmark

Comparison Table
Model NameFAD
musiclm-generating-music-from-text9.6
stable-audio-open3.51
2406-154872.21
musiclm-generating-music-from-text4.0
audioldm-2-learning-holistic-audio-generation3.13
quality-aware-masked-diffusion-transformer1.65
fast-timing-conditioned-latent-audio-
flux-that-plays-music1.43
musiclm-generating-music-from-text13.4
jen-1-text-guided-universal-music-generation2.00
simple-and-controllable-music-generation5.0
melfusion-synthesizing-music-from-image-and1.12
noise2music-text-conditioned-music-generation2.134
audioldm-2-learning-holistic-audio-generation-
uniaudio-an-audio-foundation-model-toward-13.65
audioldm-2-learning-holistic-audio-generation2.93
noise2music-text-conditioned-music-generation3.840
etta-elucidating-the-design-space-of-text-to1.91
simple-and-controllable-music-generation3.8
efficient-neural-music-generation5.41
simple-and-controllable-music-generation3.4