Referring Expression Segmentation On Refcoco 3

Evaluation Metric

Overall IoU
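
The page does not define the metric. In the referring segmentation literature, overall IoU usually means the total intersection divided by the total union, accumulated over every test sample, rather than a mean of per-sample IoUs. The sketch below assumes that definition. The function name `overall_iou` and the mask layout (one binary array per sample) are illustrative and are not taken from any benchmark toolkit.

```python
import numpy as np


def overall_iou(pred_masks, gt_masks):
    """Cumulative intersection over cumulative union across all samples.

    Assumes each mask is a binary array and that pred_masks[i] has the
    same shape as gt_masks[i]. Returns a fraction in [0, 1].
    """
    total_inter = 0
    total_union = 0
    for pred, gt in zip(pred_masks, gt_masks):
        pred = np.asarray(pred, dtype=bool)
        gt = np.asarray(gt, dtype=bool)
        total_inter += int(np.logical_and(pred, gt).sum())
        total_union += int(np.logical_or(pred, gt).sum())
    # Guard against an empty evaluation set or all-empty masks.
    if total_union == 0:
        return 0.0
    return total_inter / total_union
```

Because pixels are pooled before dividing, large objects weigh more heavily than small ones under this definition. Leaderboard scores such as those listed here appear to be this fraction expressed as a percentage.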

Evaluation Results

Performance of each model on this benchmark

| Model Name | Overall IoU | Paper Title | Repository |
| --- | --- | --- | --- |
| VLT | 63.53 | VLT: Vision-Language Transformer and Query Generation for Referring Segmentation | - |
| CPMC | 49.56 | Referring Image Segmentation via Cross-Modal Progressive Comprehension | - |
| UniLSeg-20 | 72.70 | Universal Segmentation at Arbitrary Granularity with Language Instruction | - |
| MagNet | 66.16 | Mask Grounding for Referring Image Segmentation | - |
| BRINet | 48.57 | Bi-Directional Relationship Inferring Network for Referring Image Segmentation | - |
| SafaRi-B | 70.78 | SafaRi: Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation | - |
| GLEE-Pro | 69.6 | General Object Foundation Model for Images and Videos at Scale | - |
| CMSA | 43.76 | Cross-Modal Self-Attention Network for Referring Image Segmentation | - |
| LAVT | 62.14 | LAVT: Language-Aware Vision Transformer for Referring Image Segmentation | - |
| MaskRIS (Swin-B, combined DB) | 70.26 | MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation | - |
| CRIS | 62.27 | CRIS: CLIP-Driven Referring Image Segmentation | - |
| UNINEXT-H | 72.47 | Universal Instance Perception as Object Discovery and Retrieval | - |
| VLT | 55.50 | Vision-Language Transformer and Query Generation for Referring Segmentation | - |
| UniLSeg-100 | 73.18 | Universal Segmentation at Arbitrary Granularity with Language Instruction | - |
| PolyFormer-L | 69.33 | PolyFormer: Referring Image Segmentation as Sequential Polygon Generation | - |
| ReLA | 66.04 | GRES: Generalized Referring Expression Segmentation | - |
| HyperSeg | 79.0 | HyperSeg: Towards Universal Visual Segmentation with Large Language Model | - |
| GROUNDHOG | 70.5 | GROUNDHOG: Grounding Large Language Models to Holistic Segmentation | - |
| DETRIS | 75.2 | Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation | - |
| MaIL | 62.23 | MaIL: A Unified Mask-Image-Language Trimodal Network for Referring Image Segmentation | - |