Image Retrieval On Localized Narratives
Metrics
Text-to-image R@1
Text-to-image R@10
Text-to-image R@5
Results
Performance results of various models on this benchmark
Model Name | Text-to-image R@1 | Text-to-image R@10 | Text-to-image R@5 | Paper Title | Repository |
---|---|---|---|---|---|
OPT | 0.4196 | 0.8126 | 0.72 | OPT: Omni-Perception Pre-Trainer for Cross-Modal Understanding and Generation |
0 of 1 row(s) selected.