HyperAI

Referring Expression Segmentation On Refcoco 3

Métriques

Overall IoU

Résultats

Résultats de performance de divers modèles sur ce benchmark

Nom du modèle
Overall IoU
Paper TitleRepository
VLT63.53VLT: Vision-Language Transformer and Query Generation for Referring Segmentation
CPMC49.56Referring Image Segmentation via Cross-Modal Progressive Comprehension
UniLSeg-2072.70Universal Segmentation at Arbitrary Granularity with Language Instruction
MagNet66.16Mask Grounding for Referring Image Segmentation
BRINet48.57Bi-Directional Relationship Inferring Network for Referring Image Segmentation-
SafaRi-B70.78SafaRi:Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation-
GLEE-Pro69.6General Object Foundation Model for Images and Videos at Scale
CMSA43.76Cross-Modal Self-Attention Network for Referring Image Segmentation
LAVT62.14LAVT: Language-Aware Vision Transformer for Referring Image Segmentation
MaskRIS (Swin-B, combined DB)70.26MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation
CRIS62.27CRIS: CLIP-Driven Referring Image Segmentation
UNINEXT-H72.47Universal Instance Perception as Object Discovery and Retrieval
VLT55.50Vision-Language Transformer and Query Generation for Referring Segmentation
UniLSeg-10073.18Universal Segmentation at Arbitrary Granularity with Language Instruction
PolyFormer-L69.33PolyFormer: Referring Image Segmentation as Sequential Polygon Generation
ReLA66.04GRES: Generalized Referring Expression Segmentation
HyperSeg79.0HyperSeg: Towards Universal Visual Segmentation with Large Language Model
GROUNDHOG70.5GROUNDHOG: Grounding Large Language Models to Holistic Segmentation-
DETRIS75.2Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation
MaIL62.23MaIL: A Unified Mask-Image-Language Trimodal Network for Referring Image Segmentation-
0 of 31 row(s) selected.