HyperAIHyperAI

Referring Expression Segmentation On Refcocog 1

Metriken

Overall IoU

Ergebnisse

Leistungsergebnisse verschiedener Modelle zu diesem Benchmark

Modellname
Overall IoU
Paper TitleRepository
GROUNDHOG74.6GROUNDHOG: Grounding Large Language Models to Holistic Segmentation-
SafaRi-B71.06SafaRi:Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation-
UniLSeg-2079.47Universal Segmentation at Arbitrary Granularity with Language Instruction-
EVF-SAM77.4EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model-
MaskRIS (Swin-B)66.5MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation-
VATEX-Vision-Aware Text Features in Referring Image Segmentation: From Object Understanding to Context Understanding-
PolyFormer-L70.19PolyFormer: Referring Image Segmentation as Sequential Polygon Generation-
PolyFormer-B69.05PolyFormer: Referring Image Segmentation as Sequential Polygon Generation-
VLT (Darknet53)56.65Vision-Language Transformer and Query Generation for Referring Segmentation-
MagNet66.03Mask Grounding for Referring Image Segmentation-
DETRIS75.3Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation-
UniLSeg-10080.54Universal Segmentation at Arbitrary Granularity with Language Instruction-
MaskRIS (Swin-B, combined DB)71.09MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation-
HyperSeg78.9HyperSeg: Towards Universal Visual Segmentation with Large Language Model-
C3VG76.39Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints-
LAVT (Swin-B)62.09LAVT: Language-Aware Vision Transformer for Referring Image Segmentation-
MLCD-Seg-7B80.5Multi-label Cluster Discrimination for Visual Representation Learning-
0 of 17 row(s) selected.