HyperAI

Sound Prompted Semantic Segmentation On

Metrics

mAP
mIoU

Results

Performance results of various models on this benchmark

Comparison Table
Model NamemAPmIoU
separating-the-chirp-from-the-chat-self32.724.7
jointly-discovering-visual-objects-and-spoken16.818.1
imagebind-one-embedding-space-to-bind-them19.720.5
contrastive-audio-visual-masked-autoencoder26.017.0