HyperAI

Referring Expression Segmentation On Refcoco 5

Métriques

Overall IoU

Résultats

Résultats de performance de divers modèles sur ce benchmark

Nom du modèle
Overall IoU
Paper TitleRepository
MaIL56.06MaIL: A Unified Mask-Image-Language Trimodal Network for Referring Image Segmentation-
HyperSeg75.2HyperSeg: Towards Universal Visual Segmentation with Large Language Model
UNINEXT-H66.22Universal Instance Perception as Object Discovery and Retrieval
MaskRIS (Swin-B)59.39MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation
PolyFormer-L61.87PolyFormer: Referring Image Segmentation as Sequential Polygon Generation
MattNet40.08MAttNet: Modular Attention Network for Referring Expression Comprehension
VLT56.92VLT: Vision-Language Transformer and Query Generation for Referring Segmentation
CRIS53.68CRIS: CLIP-Driven Referring Image Segmentation
VATEX-Vision-Aware Text Features in Referring Image Segmentation: From Object Understanding to Context Understanding
SafaRi-B64.88SafaRi:Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation-
MagNet58.14Mask Grounding for Referring Image Segmentation
CMSA37.89Cross-Modal Self-Attention Network for Referring Image Segmentation
EVF-SAM70.1EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model
STEP (5-fold)40.41See-Through-Text Grouping for Referring Image Segmentation-
PolyFormer-B59.33PolyFormer: Referring Image Segmentation as Sequential Polygon Generation
DETRIS70.2Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation
LAVT55.1LAVT: Language-Aware Vision Transformer for Referring Image Segmentation
UniLSeg-2066.99Universal Segmentation at Arbitrary Granularity with Language Instruction
UniLSeg-10068.15Universal Segmentation at Arbitrary Granularity with Language Instruction
MaskRIS (Swin-B, combined DB)62.83MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation
0 of 29 row(s) selected.