Multi Modal
Image Retrieval with Multi-Modal Query refers to the task of retrieving images from a database using both image and text as query inputs. This task involves making specific modifications to the query image based on textual prompts, aiming to accurately retrieve images that meet the modified requirements. This technology has significant application value in image search, content recommendation, and intelligent editing, and can greatly enhance user interaction experience and retrieval efficiency.