Zero-Shot Audio Captioning on AudioCaps
Metrics
BLEU-4
CIDEr
METEOR
ROUGE-L
SPICE
SPIDEr
Results
Performance results of various models on this benchmark
Model name | BLEU-4 | CIDEr | METEOR | ROUGE-L | SPICE | SPIDEr | Paper Title | Repository |
---|---|---|---|---|---|---|---|---|
ZerAuCap | 6.8 | 28.1 | 12.3 | 33.1 | 8.6 | 18.3 | Zero-shot audio captioning with audio-language model guidance and audio context keywords | |
No audio (baseline) | 0 | 0.1 | 4.1 | 17.8 | 0 | 0 | Zero-shot audio captioning with audio-language model guidance and audio context keywords | |
Shaharabany et al. | 9.8 | 9.2 | 8.6 | 8.2 | - | - | Zero-Shot Audio Captioning via Audibility Guidance | - |
Audio Flamingo | 14.3 | 50.2 | 20.5 | 40.8 | 15.1 | 32.6 | Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities | |
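
SPIDEr is conventionally defined as the arithmetic mean of CIDEr and SPICE, so the SPIDEr column can be sanity-checked against the other two. The sketch below is a minimal illustration, not part of any listed paper's code: the helper name `spider` and the 0.1 tolerance are assumptions made here, and the inputs are the rounded scores copied from the table, so small rounding differences are expected.

```python
def spider(cider: float, spice: float) -> float:
    """Return SPIDEr as the arithmetic mean of CIDEr and SPICE."""
    return (cider + spice) / 2.0


# (CIDEr, SPICE, reported SPIDEr) copied from the results table.
rows = {
    "ZerAuCap": (28.1, 8.6, 18.3),
    "Audio Flamingo": (50.2, 15.1, 32.6),
}

for name, (cider, spice, reported) in rows.items():
    recomputed = spider(cider, spice)
    # The table values are rounded to one decimal, so allow a small tolerance
    # (0.1 is an assumption chosen for this check).
    assert abs(recomputed - reported) <= 0.1, name
    print(f"{name}: recomputed {recomputed:.2f}, reported {reported}")
```

Rows that report "-" for SPICE or SPIDEr cannot be checked this way and are left out.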