HyperAI
HyperAI
Startseite
Plattform
Dokumentation
Neuigkeiten
Forschungsarbeiten
Tutorials
Datensätze
Wiki
SOTA
LLM-Modelle
GPU-Rangliste
Veranstaltungen
Suche
Über
Nutzungsbedingungen
Datenschutzrichtlinie
Deutsch
HyperAI
HyperAI
Toggle Sidebar
Seite durchsuchen…
⌘
K
Command Palette
Search for a command to run...
Plattform
Startseite
SOTA
Nullschusskompositionsbild-Retrieval
Zero Shot Composed Image Retrieval Zs Cir On
Zero Shot Composed Image Retrieval Zs Cir On
Metriken
mAP@10
Ergebnisse
Leistungsergebnisse verschiedener Modelle zu diesem Benchmark
Columns
Modellname
mAP@10
Paper Title
MMRet-MLLM
43.4
MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval
MMRet-Large (CLIP L/14)
40.2
MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval
SEIZE (CLIP G/14 & GPT-4o)
37.23
Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval
MagicLens (CoCa L)
35.4
MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions
MMRet-Base (CLIP B/16)
35.0
MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval
IP-CIR + LDRE (CLIP G/14)
34.26
Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy
SEIZE (CLIP G/14)
33.77
Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval
LDRE (CLIP G/14)
32.24
LDRE: LLM-based Divergent Reasoning and Ensemble for Zero-Shot Composed Image Retrieval
MagicLens (CoCa B)
32.0
MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions
OSrCIR (CLIP G/14)
31.14
Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval
MagicLens (CLIP L)
30.8
MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions
CoVR-BLIP-2
29.55
CoVR-2: Automatic Data Construction for Composed Video Retrieval
ImageScope (CLIP-ViT-L/14)
28.36
ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning
CIReVL (CLIP G/14)
27.59
Vision-by-Language for Training-Free Compositional Image Retrieval
IP-CIR + LDRE (CLIP L/14)
27.41
Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy
SEIZE (CLIP L/14)
25.82
Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval
OSrCIR (CLIP L/14)
25.33
Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval
LDRE (CLIP L/14)
24.03
LDRE: LLM-based Divergent Reasoning and Ensemble for Zero-Shot Composed Image Retrieval
MagicLens (CLIP B)
23.8
MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions
RTD + LinCIR (CLIP G/14)
22.29
An Efficient Post-hoc Framework for Reducing Task Discrepancy of Text Encoders for Composed Image Retrieval
0 of 42 row(s) selected.
Previous
Next