Open Vocabulary Semantic Segmentation On
Metrics
mIoU
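
For reference, mIoU (mean Intersection over Union) averages the per-class ratio of correctly labeled pixels to the union of predicted and ground-truth pixels for that class. The following is a minimal sketch, not the evaluation code of this benchmark or of any listed paper. The function name `mean_iou`, the flat label lists, and the `ignore_index=255` convention are assumptions made for illustration; actual benchmarks may accumulate counts over the whole dataset and handle absent classes differently.

```python
def mean_iou(pred, target, num_classes, ignore_index=255):
    """Mean IoU over classes that occur in pred or target.

    pred, target: flat sequences of integer class labels, one per pixel.
    ignore_index: ground-truth label to skip (assumed convention).
    """
    inter = [0] * num_classes  # pixels where prediction and ground truth agree
    union = [0] * num_classes  # pixels labeled with the class in either map
    for p, t in zip(pred, target):
        if t == ignore_index:
            continue
        if p == t:
            inter[t] += 1
            union[t] += 1
        else:
            union[p] += 1
            union[t] += 1
    # Skip classes that appear in neither map to avoid dividing by zero.
    ious = [i / u for i, u in zip(inter, union) if u > 0]
    return sum(ious) / len(ious) if ious else float("nan")


# Usage: class 0 has IoU 1/2, class 1 has IoU 2/3.
score = mean_iou([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2)
print(f"{100 * score:.1f}")
```

Leaderboards such as this one typically report the value as a percentage, which is why the usage example multiplies by 100.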
Results
Performance results of various models on this benchmark
Model name | mIoU | Paper Title | Repository |
---|---|---|---|
TTD (MaskCLIP) | 27.0 | TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias | |
TTD (TCL) | 32.0 | TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias | |
CLIP Surgery (CLIP without any fine-tuning) | 31.4 | A Closer Look at the Explainability of Contrastive Language-Image Pre-training | |
FC-CLIP | 56.2 | Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP | |
SimSeg | 34.5 | A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-language Model | |