HyperAI

Cross-Modal Retrieval on SoundingEarth

Metrics

Image-to-sound R@100
Median Rank
Sound-to-image R@100
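
For readers unfamiliar with these metrics, the following is a minimal sketch of how recall at k (R@k) and median rank are commonly computed from a query-by-candidate similarity matrix. It is an illustration, not the benchmark's evaluation code: the function names, the toy matrix, the assumption that query i's true match is candidate i, and the tie-handling rule are all assumptions made here. The page itself does not say which retrieval direction the reported Median Rank refers to.

```python
from statistics import median


def retrieval_ranks(similarity):
    """Return the 1-based rank of the true match for each query.

    similarity[i][j] is the score between query i and candidate j.
    Assumption: the true match for query i is candidate i.
    """
    ranks = []
    for i, row in enumerate(similarity):
        true_score = row[i]
        # Rank = 1 + number of candidates scoring strictly higher
        # than the true match (ties resolved in the query's favor).
        ranks.append(1 + sum(1 for score in row if score > true_score))
    return ranks


def recall_at_k(ranks, k):
    """Fraction of queries whose true match is ranked within the top k."""
    return sum(1 for r in ranks if r <= k) / len(ranks)


def transpose(matrix):
    """Swap queries and candidates to evaluate the reverse direction."""
    return [list(column) for column in zip(*matrix)]


# Hypothetical toy scores: rows are images, columns are sounds.
sim = [
    [0.9, 0.1, 0.3],
    [0.8, 0.2, 0.5],
    [0.1, 0.4, 0.7],
]

image_to_sound = retrieval_ranks(sim)
sound_to_image = retrieval_ranks(transpose(sim))

i2s_recall_at_1 = recall_at_k(image_to_sound, 1)
s2i_recall_at_2 = recall_at_k(sound_to_image, 2)
i2s_median_rank = median(image_to_sound)
```

With k = 100, the same `recall_at_k` call would give the R@100 figures listed here as fractions between 0 and 1. Higher R@100 is better, while a lower median rank is better.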

Results

Performance results of various models on this benchmark

Comparison Table
Model Name                                   Image-to-sound R@100  Median Rank  Sound-to-image R@100
learning-tri-modal-embeddings-for-zero-shot  0.434                 159          0.434
self-supervised-audiovisual-representation   0.291                 565          0.250