Image-to-Text Retrieval on AIC-ICC
Metrics
Recall@1
Recall@5
Recall@10
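As a rough illustration of how these metrics are typically computed for image-to-text retrieval, the sketch below scores a query image as a hit when at least one of its ground-truth captions appears among the top K retrieved captions, and reports the hit rate as a percentage. This follows the common convention; the exact evaluation protocol behind the numbers on this page is not stated here, so treat it as an assumption. The function name `recall_at_k` and the toy caption ids are hypothetical and not taken from AIC-ICC.

```python
def recall_at_k(ranked_caption_ids, relevant_caption_ids, k):
    """Percentage of image queries with at least one relevant caption in the top k.

    ranked_caption_ids: one list per image query, caption ids sorted best-first.
    relevant_caption_ids: one set per image query holding its ground-truth captions.
    """
    if len(ranked_caption_ids) != len(relevant_caption_ids):
        raise ValueError("need one ranking and one ground-truth set per query")
    if not ranked_caption_ids:
        raise ValueError("no queries given")
    hits = sum(
        1
        for ranking, relevant in zip(ranked_caption_ids, relevant_caption_ids)
        if any(caption in relevant for caption in ranking[:k])
    )
    return 100.0 * hits / len(ranked_caption_ids)


# Toy data, made up for illustration only (not from AIC-ICC).
rankings = [
    ["c1", "c7", "c3"],  # image 0: correct caption ranked first
    ["c9", "c2", "c4"],  # image 1: correct caption ranked second
    ["c5", "c6", "c8"],  # image 2: correct caption ranked third
    ["c0", "c1", "c2"],  # image 3: correct caption never retrieved
]
ground_truth = [{"c1"}, {"c2"}, {"c8"}, {"c99"}]

r1 = recall_at_k(rankings, ground_truth, 1)
r2 = recall_at_k(rankings, ground_truth, 2)
r3 = recall_at_k(rankings, ground_truth, 3)
```

Because the top-K list only grows with K, the score can never decrease as K increases, which matches the pattern Recall@1 ≤ Recall@5 ≤ Recall@10 in the reported results.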
Results
Reported results of models evaluated on this benchmark
| Model Name | Recall@1 | Recall@5 | Recall@10 | Paper Title | Repository |
|---|---|---|---|---|---|
| CMCL | 20.3 | 37 | 45.6 | WenLan: Bridging Vision and Language by Large-Scale Multi-Modal Pre-Training | |
| ERNIE-ViL 2.0 | 33.7 | 52.1 | 60.0 | ERNIE-ViL 2.0: Multi-view Contrastive Learning for Image-Text Pre-training | |