HyperAI超神経

Audio Captioning on AudioCaps

Evaluation Metrics

CIDEr
METEOR
SPICE
SPIDEr
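Of these four metrics, SPIDEr is conventionally defined as the arithmetic mean of CIDEr and SPICE, so it can be recomputed from the other two columns. The snippet below is a minimal sketch of that relationship; the function name `spider` is a hypothetical helper, not part of any evaluation toolkit, and the input values are copied from the first ENCLAP entry in the results on this page.

```python
def spider(cider: float, spice: float) -> float:
    """Return SPIDEr, assuming it is the plain average of CIDEr and SPICE."""
    return (cider + spice) / 2


# CIDEr and SPICE copied from the first ENCLAP entry on this page.
score = spider(0.8029, 0.1879)
print(f"SPIDEr = {score:.4f}")
```

Small differences in the last digit against reported scores are expected, because the published CIDEr and SPICE values are themselves rounded.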

Evaluation Results

Performance results of each model on this benchmark

Comparison Table
| Model | CIDEr | METEOR | SPICE | SPIDEr |
| --- | --- | --- | --- | --- |
| enclap-combining-neural-audio-codec-and-audio | 0.8029 | 0.2554 | 0.1879 | 0.4954 |
| enclap-analyzing-the-enclap-framework-for | 0.823 | 0.269 | 0.197 | 0.510 |
| vast-a-vision-audio-subtitle-text-omni-1 | 0.781 | 0.247 | - | - |
| improving-audio-language-learning-with-mixgen | 0.755 | - | 0.177 | 0.466 |
| slam-aac-enhancing-audio-captioning-with | 0.841 | 0.268 | 0.194 | 0.518 |
| enhancing-automated-audio-captioning-via | 0.816 | 0.267 | 0.193 | 0.505 |
| audiocaps-generating-captions-for-audios-in | 0.593 | - | 0.144 | 0.369 |
| Model 8 | 0.769 | - | 0.181 | 0.475 |
| enclap-analyzing-the-enclap-framework-for | 0.815 | 0.257 | 0.188 | 0.501 |
| Model 10 | 0.8061 | 0.2527 | 0.1841 | 0.4951 |
| automated-audio-captioning-by-fine-tuning | 0.753 | - | 0.176 | 0.465 |
| enclap-combining-neural-audio-codec-and-audio | 0.7795 | 0.2473 | 0.1863 | 0.4829 |
| audio-captioning-transformer | 0.693 | - | 0.159 | 0.426 |
| valor-vision-audio-language-omni-perception | 0.741 | 0.231 | - | - |
| taming-data-and-transformers-for-audio-1 | 0.832 | 0.253 | 0.182 | 0.507 |
| rethinking-transfer-and-auxiliary-learning | 0.764 | 0.242 | 0.180 | 0.472 |