HyperAIHyperAI超神经
首页资讯最新论文教程数据集百科SOTALLM 模型天梯GPU 天梯顶会
全站搜索
关于
中文
HyperAIHyperAI超神经
  1. 首页
  2. SOTA
  3. 音频字幕生成
  4. Audio Captioning On Clotho

Audio Captioning On Clotho

评估指标

BLEU-4
CIDEr
METEOR
ROUGE-L

评测结果

各个模型在此基准测试上的表现结果

模型名称
BLEU-4
CIDEr
METEOR
ROUGE-L
Paper TitleRepository
VALOR16.20.42317.438.2VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset-
RNN-GRU-EncDec + VGGish + Word2Vec-0.18--Audio Captioning using Gated Recurrent Units-
VAST190.51919.340.8VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset-
Ensemble-RL-0.468--THE SJTU SYSTEM FOR DCASE2021 CHALLENGE TASK 6: AUDIO CAPTIONING BASED ON ENCODER PRE-TRAINING AND REINFORCEMENT LEARNING
Ensemble-0.400--THE DCASE 2021 CHALLENGE TASK 6 SYSTEM: AUTOMATED AUDIO CAPTIONING WITH WEAKLY SUPERVISED PRE-TRAING AND WORD SELECTION METHODS-
Ensemble-0.319--The NTT DCASE2020 Challenge Task 6 system: Automated Audio Captioning with Keywords and Sentence Length Estimation-
Qwen-Audio-0.441--Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models-
SLAM-AAC-0.5150.197-SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs-
LOAE-0.5130.197-Enhancing Automated Audio Captioning via Large Language Models with Optimized Audio Encoding-
Audio Flamingo (Pengi trainset)17.40.48918.739.4Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities-
0 of 10 row(s) selected.
HyperAI

学习、理解、实践,与社区一起构建人工智能的未来

中文

关于

关于我们数据集帮助

产品

资讯教程数据集百科

链接

TVM 中文Apache TVMOpenBayes

© HyperAI超神经

津ICP备17010941号-1京公网安备11010502038810号京公网安备11010502038810号
TwitterBilibili