HyperAI

Multi-Modal Classification on VGG-Sound

Metrics

Top-1 Accuracy
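
The page does not say how the metric is computed. Under the usual definition, top-1 accuracy is the share of samples whose single highest-scoring class equals the ground-truth label, reported here as a percentage. The sketch below illustrates that definition only; the function name `top1_accuracy` and the toy scores are hypothetical and are not taken from any of the listed models.

```python
def top1_accuracy(scores, labels):
    """Percentage of samples whose highest-scoring class matches the label.

    scores: list of per-class score lists, one list per sample (assumed layout).
    labels: list of integer class indices, one per sample.
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")

    correct = 0
    for row, label in zip(scores, labels):
        # Index of the largest score is the model's top-1 prediction.
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return 100.0 * correct / len(labels)


# Toy example: four samples, three classes (made-up numbers).
example_scores = [
    [0.1, 0.7, 0.2],
    [0.6, 0.3, 0.1],
    [0.2, 0.2, 0.6],
    [0.5, 0.4, 0.1],
]
example_labels = [1, 0, 2, 1]
example_result = top1_accuracy(example_scores, example_labels)
```

In this toy case three of the four predictions match their labels, so the function returns 75.0.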

Results

Performance results of various models on this benchmark

Comparison Table
Model Name                                    Top-1 Accuracy (%)
uavm-a-unified-model-for-audio-visual         65.8
multiscale-multimodal-transformer-for         66.2
contrastive-audio-visual-masked-autoencoder   65.9
avt-audio-video-transformer-for-multimodal    63.9