HyperAI
Zero-Shot Composed Image Retrieval
Zero Shot Composed Image Retrieval Zs Cir On
Evaluation Metric
mAP@10
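For readers unfamiliar with the metric, the following is a minimal sketch of one common way to compute mAP@k for ranked retrieval. It is not taken from this page. The function names, the normalization by `min(k, number of relevant items)`, and the assumption that each ranked list contains no duplicates are illustrative choices, so the benchmark's official evaluation script may differ in detail. The leaderboard values appear to be percentages, so a result from this sketch would be multiplied by 100 to compare.

```python
from typing import Hashable, Sequence, Set


def average_precision_at_k(
    ranked: Sequence[Hashable], relevant: Set[Hashable], k: int = 10
) -> float:
    """AP@k for a single query (hypothetical helper, not the benchmark's code)."""
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    # Walk the top-k results; at each relevant hit, add precision at that rank.
    for rank, item in enumerate(ranked[:k], start=1):
        if item in relevant:
            hits += 1
            precision_sum += hits / rank
    # Assumption: normalize by min(k, number of relevant items).
    return precision_sum / min(k, len(relevant))


def mean_average_precision_at_k(
    rankings: Sequence[Sequence[Hashable]],
    ground_truths: Sequence[Set[Hashable]],
    k: int = 10,
) -> float:
    """Mean of AP@k over all queries; returns 0.0 when there are no queries."""
    scores = [
        average_precision_at_k(ranked, relevant, k)
        for ranked, relevant in zip(rankings, ground_truths)
    ]
    return sum(scores) / len(scores) if scores else 0.0
```

For example, calling `mean_average_precision_at_k(rankings, ground_truths, k=10)` with one ranked list of image IDs per query and one set of ground-truth IDs per query returns a value between 0 and 1.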
Evaluation Results
Performance results of each model on this benchmark
| Model | mAP@10 | Paper Title |
|---|---|---|
| MMRet-MLLM | 43.4 | MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval |
| MMRet-Large (CLIP L/14) | 40.2 | MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval |
| SEIZE (CLIP G/14 & GPT-4o) | 37.23 | Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval |
| MagicLens (CoCa L) | 35.4 | MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions |
| MMRet-Base (CLIP B/16) | 35.0 | MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval |
| IP-CIR + LDRE (CLIP G/14) | 34.26 | Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy |
| SEIZE (CLIP G/14) | 33.77 | Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval |
| LDRE (CLIP G/14) | 32.24 | LDRE: LLM-based Divergent Reasoning and Ensemble for Zero-Shot Composed Image Retrieval |
| MagicLens (CoCa B) | 32.0 | MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions |
| OSrCIR (CLIP G/14) | 31.14 | Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval |
| MagicLens (CLIP L) | 30.8 | MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions |
| CoVR-BLIP-2 | 29.55 | CoVR-2: Automatic Data Construction for Composed Video Retrieval |
| ImageScope (CLIP-ViT-L/14) | 28.36 | ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning |
| CIReVL (CLIP G/14) | 27.59 | Vision-by-Language for Training-Free Compositional Image Retrieval |
| IP-CIR + LDRE (CLIP L/14) | 27.41 | Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy |
| SEIZE (CLIP L/14) | 25.82 | Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval |
| OSrCIR (CLIP L/14) | 25.33 | Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval |
| LDRE (CLIP L/14) | 24.03 | LDRE: LLM-based Divergent Reasoning and Ensemble for Zero-Shot Composed Image Retrieval |
| MagicLens (CLIP B) | 23.8 | MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions |
| RTD + LinCIR (CLIP G/14) | 22.29 | An Efficient Post-hoc Framework for Reducing Task Discrepancy of Text Encoders for Composed Image Retrieval |