HyperAIHyperAI

Multi Modal Classification On Audioset

Metrics

Average mAP

Results

Performance results of various models on this benchmark

Model Name
Average mAP
Paper TitleRepository
UAVM0.504UAVM: Towards Unifying Audio and Visual Models-
CAV-MAE0.512Contrastive Audio-Visual Masked Autoencoder-
0 of 2 row(s) selected.
Multi Modal Classification On Audioset | SOTA | HyperAI