Cross-Modal Retrieval on SoundingEarth
Metrics
Image-to-sound R@100
Median Rank
Sound-to-image R@100
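As a rough illustration of how metrics of this kind are typically computed, the sketch below derives Recall@K and median rank from a query-by-candidate similarity matrix. It assumes that query i's correct match is candidate i and that ties count in the query's favor. The function name `retrieval_metrics` and this evaluation protocol are illustrative assumptions, not the benchmark's official code.

```python
import numpy as np


def retrieval_metrics(similarity, k=100):
    """Return (recall_at_k, median_rank) for a square similarity matrix.

    Assumption: similarity[i, j] scores query i against candidate j,
    and the ground-truth match for query i is candidate i.
    """
    similarity = np.asarray(similarity, dtype=float)
    n = similarity.shape[0]
    # Score each query assigns to its own ground-truth candidate.
    true_scores = similarity[np.arange(n), np.arange(n)]
    # Rank = 1 + number of candidates scoring strictly higher than the match.
    ranks = 1 + (similarity > true_scores[:, None]).sum(axis=1)
    recall_at_k = float((ranks <= k).mean())
    median_rank = float(np.median(ranks))
    return recall_at_k, median_rank


# Toy example: rows are images, columns are sounds (hypothetical scores).
sim = np.array([
    [0.9, 0.1, 0.2],
    [0.8, 0.3, 0.5],
    [0.1, 0.2, 0.7],
])
image_to_sound = retrieval_metrics(sim, k=1)
# Transposing swaps the roles of query and candidate.
sound_to_image = retrieval_metrics(sim.T, k=1)
```

Under these assumptions, a higher R@100 and a lower median rank both indicate better retrieval.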
Results
Performance results of various models on this benchmark
Model name | Image-to-sound R@100 | Median Rank | Sound-to-image R@100 | Paper Title | Repository |
---|---|---|---|---|---|
GeoCLAP | 0.434 | 159 | 0.434 | Learning Tri-modal Embeddings for Zero-Shot Soundscape Mapping | - |
ResNet-18 | 0.291 | 565 | 0.250 | Self-supervised Audiovisual Representation Learning for Remote Sensing Data | |