Image Text Matching on CommercialAdsDataset
Metrics
ADD(S) AUC
Results
Performance results of various models on this benchmark
| Model Name | ADD(S) AUC | Paper Title | Repository |
|---|---|---|---|
| OSCAR | 87.45 | Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks | |
| Unicoder-VL | 83.16 | Unicoder-VL: A Universal Encoder for Vision and Language by Cross-modal Pre-training | - |
| VinVL | 88.56 | VinVL: Revisiting Visual Representations in Vision-Language Models | |
| BLIP | 83.51 | BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation | |
| AlignCMSS | 91.73 | Align before Search: Aligning Ads Image to Text for Accurate Cross-Modal Sponsored Search | |
| ALBEF | 82.74 | Align before Fuse: Vision and Language Representation Learning with Momentum Distillation | |
| VL-BERT | 86.27 | VL-BERT: Pre-training of Generic Visual-Linguistic Representations | |
| AdsCVLR | 87.90 | AdsCVLR: Commercial Visual-Linguistic Representation Modeling in Sponsored Search | - |