Command Palette
Search for a command to run...
Composed Image Retrieval for Remote Sensing
Composed Image Retrieval for Remote Sensing
Bill Psomas Ioannis Kakogeorgiou Nikos Efthymiadis Giorgos Tolias Ondřej Chum Yannis Avrithis Konstantinos Karantzalos
Abstract
This work introduces composed image retrieval to remote sensing. It allows toquery a large image archive by image examples alternated by a textualdescription, enriching the descriptive power over unimodal queries, eithervisual or textual. Various attributes can be modified by the textual part, suchas shape, color, or context. A novel method fusing image-to-image andtext-to-image similarity is introduced. We demonstrate that a vision-languagemodel possesses sufficient descriptive power and no further learning step ortraining data are necessary. We present a new evaluation benchmark focused oncolor, context, density, existence, quantity, and shape modifications. Our worknot only sets the state-of-the-art for this task, but also serves as afoundational step in addressing a gap in the field of remote sensing imageretrieval. Code at: https://github.com/billpsomas/rscir