HyperAI
HyperAI
Home
Console
Docs
News
Papers
Tutorials
Datasets
Wiki
SOTA
LLM Models
GPU Leaderboard
Events
Search
About
Terms of Service
Privacy Policy
English
HyperAI
HyperAI
Toggle Sidebar
Search the site…
⌘
K
Command Palette
Search for a command to run...
Console
Home
SOTA
Meme Classification
Meme Classification On Hateful Memes
Meme Classification On Hateful Memes
Metrics
Accuracy
ROC-AUC
Results
Performance results of various models on this benchmark
Columns
Model Name
Accuracy
ROC-AUC
Paper Title
LMM-RGCL (Qwen2-VL-7B)
0.821
0.911
Robust Adaptation of Large Multimodal Models for Retrieval Augmented Hateful Meme Detection
LMM-RGCL (LLaVA-1.5-7B)
0.809
0.897
Robust Adaptation of Large Multimodal Models for Retrieval Augmented Hateful Meme Detection
PaLI-X-VPD
-
0.892
Visual Program Distillation: Distilling Tools and Programmatic Reasoning into Vision-Language Models
LMM-RGCL (Qwen2-VL-2B)
0.791
0.884
Robust Adaptation of Large Multimodal Models for Retrieval Augmented Hateful Meme Detection
RGCL (CLIP)
0.788
0.870
Improving Hateful Meme Detection through Retrieval-Guided Contrastive Learning
Flamingo (fine-tuned)
-
0.866
Flamingo: a Visual Language Model for Few-Shot Learning
Hate-CLIPper - Align
-
0.858
Hate-CLIPper: Multimodal Hateful Meme Classification based on Cross-modal Interaction of CLIP Features
ISSUES
-
0.855
Mapping Memes to Words for Multimodal Hateful Meme Classification
Ron Zhu
0.732
0.845
Enhance Multimodal Transformer With External Label And In-Domain Pretrain: Hateful Meme Challenge Winning Solution
Human
0.847
0.8265
The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes
Vilio
0.695
0.825
Vilio: State-of-the-art Visio-Linguistic Models applied to Hateful Memes
HateDetectron27
0.765
0.811
Detecting Hate Speech in Memes Using Multimodal Deep Learning Approaches: Prize-winning solution to Hateful Memes Challenge
Pro-Cap
0.723
0.809
Pro-Cap: Leveraging a Frozen Vision-Language Model for Hateful Meme Detection
Visual BERT COCO
0.695
0.754
The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes
SEER (RegNet10B)
-
0.734
Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision
Flamingo (few-shot:32)
-
0.700
Flamingo: a Visual Language Model for Few-Shot Learning
CLIP (zero-shot)
-
0.661
Learning Transferable Visual Models From Natural Language Supervision
0 of 17 row(s) selected.
Previous
Next
Meme Classification On Hateful Memes | SOTA | HyperAI