HyperAI
Startseite
Neuigkeiten
Neueste Forschungsarbeiten
Tutorials
Datensätze
Wiki
SOTA
LLM-Modelle
GPU-Rangliste
Veranstaltungen
Suche
Über
Deutsch
HyperAI
Toggle sidebar
Seite durchsuchen…
⌘
K
Startseite
SOTA
Image Text Matching
Image Text Matching On Commercialadsdataset
Image Text Matching On Commercialadsdataset
Metriken
ADD(S) AUC
Ergebnisse
Leistungsergebnisse verschiedener Modelle zu diesem Benchmark
Columns
Modellname
ADD(S) AUC
Paper Title
Repository
OSCAR
87.45
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
Unicoder-VL
83.16
Unicoder-VL: A Universal Encoder for Vision and Language by Cross-modal Pre-training
-
VinVL
88.56
VinVL: Revisiting Visual Representations in Vision-Language Models
BLIP
83.51
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
AlignCMSS
91.73
Align before Search: Aligning Ads Image to Text for Accurate Cross-Modal Sponsored Search
ALBEF
82.74
Align before Fuse: Vision and Language Representation Learning with Momentum Distillation
VL-BERT
86.27
VL-BERT: Pre-training of Generic Visual-Linguistic Representations
AdsCVLR
87.90
AdsCVLR: Commercial Visual-Linguistic Representation Modeling in Sponsored Search
-
0 of 8 row(s) selected.
Previous
Next