HyperAI

Cross-Modal Retrieval on SoundingEarth

Metrics

Image-to-sound R@100
Median Rank
Sound-to-image R@100
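
The metrics above can be illustrated with a minimal sketch. It is not the benchmark's evaluation script. It assumes each query has exactly one true match (query i pairs with gallery item i) and that ties are resolved in the query's favour; the page states neither. The function names and the toy similarity matrix are hypothetical. R@K is the fraction of queries whose true match ranks in the top K (higher is better), and Median Rank is the median position of the true match (lower is better).

```python
from statistics import median


def ground_truth_ranks(similarity):
    """Return the 1-based rank of the true match for each query.

    similarity[i][j] is the score between query i and gallery item j.
    Assumption: the true match for query i is gallery item i.
    """
    ranks = []
    for i, row in enumerate(similarity):
        true_score = row[i]
        # Rank = 1 + number of gallery items scoring strictly higher.
        ranks.append(1 + sum(1 for s in row if s > true_score))
    return ranks


def recall_at_k(ranks, k):
    """Fraction of queries whose true match is ranked within the top k."""
    return sum(1 for r in ranks if r <= k) / len(ranks)


def transpose(matrix):
    """Swap query and gallery roles (image-to-sound <-> sound-to-image)."""
    return [list(col) for col in zip(*matrix)]


# Toy image-to-sound similarity matrix (rows: images, columns: sounds).
sim = [
    [0.9, 0.1, 0.3],
    [0.2, 0.4, 0.8],
    [0.7, 0.6, 0.5],
]

i2s_ranks = ground_truth_ranks(sim)             # image-to-sound
s2i_ranks = ground_truth_ranks(transpose(sim))  # sound-to-image

print("image-to-sound ranks:", i2s_ranks)
print("image-to-sound R@2:", recall_at_k(i2s_ranks, 2))
print("median rank:", median(i2s_ranks))
print("sound-to-image R@2:", recall_at_k(s2i_ranks, 2))
```

On the benchmark itself, K would be 100 and the similarity matrix would cover the full test set rather than three items.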

Results

Performance results of various models on this benchmark

Model Name  Image-to-sound R@100  Median Rank  Sound-to-image R@100  Paper Title                                                                  Repository
GeoCLAP     0.434                 159          0.434                 Learning Tri-modal Embeddings for Zero-Shot Soundscape Mapping               -
ResNet-18   0.291                 565          0.250                 Self-supervised Audiovisual Representation Learning for Remote Sensing Data