HyperAI초신경

Open Vocabulary Semantic Segmentation On 2

평가 지표

mIoU

평가 결과

이 벤치마크에서 각 모델의 성능 결과

모델 이름
mIoU
Paper TitleRepository
TTD (MaskCLIP)12.7TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias
MAFT-ViTL32.0Learning Mask-aware CLIP Representations for Zero-Shot Segmentation-
FC-CLIP34.1Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP
MAFT+36.1Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation
CAT-Seg37.9CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation
OVSeg + OpenDAS35.8OpenDAS: Open-Vocabulary Domain Adaptation for 2D and 3D Segmentation-
SimSeg20.5A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-language Model
SCAN33.5Open-Vocabulary Segmentation with Semantic-Assisted Calibration
Mask-Adapter38.2Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation-
MaskCLIP23.7Open-Vocabulary Universal Image Segmentation with MaskCLIP
POMP20.7--
EBSeg-L32.8Open-Vocabulary Semantic Segmentation with Image Embedding Balancing
TTD (TCL)17.0TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias
ODISE29.9Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models
PACL31.4Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning
SILC37.7SILC: Improving Vision Language Pretraining with Self-Distillation-
OVSeg Swin-B29.6Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP
LaVG15.8In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation
MaskCLIP++38.2MaskCLIP++: A Mask-Based CLIP Fine-tuning Framework for Open-Vocabulary Image Segmentation
CLIPSelf34.5CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction-
0 of 21 row(s) selected.