Multi Modal Classification On Audioset
Metrics
Average mAP
Results
Performance results of various models on this benchmark
| Paper Title | ||
|---|---|---|
| CAV-MAE | 0.512 | Contrastive Audio-Visual Masked Autoencoder |
| UAVM | 0.504 | UAVM: Towards Unifying Audio and Visual Models |
0 of 2 row(s) selected.