HyperAI超神经
Referring Expression Segmentation On Refcocog 1
Evaluation metric
Overall IoU
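The page does not define the metric. As commonly used in referring expression segmentation, overall IoU is the total intersection area divided by the total union area, accumulated over all test samples (rather than an average of per-sample IoUs). The following is a minimal sketch under that assumption. The function name `overall_iou` and the list-of-lists mask format are illustrative only and are not taken from any of the listed repositories.

```python
from typing import Iterable, Sequence, Tuple

# Assumed mask format: a binary mask given as rows of 0/1 values.
Mask = Sequence[Sequence[int]]


def overall_iou(pairs: Iterable[Tuple[Mask, Mask]]) -> float:
    """Cumulative intersection over cumulative union across all samples.

    `pairs` yields (predicted_mask, ground_truth_mask) tuples of equal shape.
    Returns a value in [0, 1]; multiply by 100 for a percentage.
    """
    total_intersection = 0
    total_union = 0
    for pred, gt in pairs:
        for pred_row, gt_row in zip(pred, gt):
            for p, g in zip(pred_row, gt_row):
                # A pixel counts toward the intersection if both masks mark it,
                # and toward the union if at least one mask marks it.
                total_intersection += 1 if (p and g) else 0
                total_union += 1 if (p or g) else 0
    # Guard against the degenerate case where every mask is empty.
    return total_intersection / total_union if total_union else 0.0


if __name__ == "__main__":
    samples = [
        ([[1, 1], [0, 0]], [[1, 0], [0, 0]]),  # intersection 1, union 2
        ([[1, 1], [1, 1]], [[1, 1], [1, 1]]),  # intersection 4, union 4
    ]
    print(f"overall IoU: {100 * overall_iou(samples):.2f}")
```

Because areas are pooled before dividing, large objects weigh more heavily in this metric than small ones. The scores in the table appear to be reported on a 0 to 100 scale.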
Evaluation results
Performance of each model on this benchmark
| Model | Overall IoU | Paper Title | Repository |
| --- | --- | --- | --- |
| GROUNDHOG | 74.6 | GROUNDHOG: Grounding Large Language Models to Holistic Segmentation | - |
| SafaRi-B | 71.06 | SafaRi: Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation | - |
| UniLSeg-20 | 79.47 | Universal Segmentation at Arbitrary Granularity with Language Instruction | |
| EVF-SAM | 77.4 | EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model | |
| MaskRIS (Swin-B) | 66.5 | MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation | |
| VATEX | - | Vision-Aware Text Features in Referring Image Segmentation: From Object Understanding to Context Understanding | |
| PolyFormer-L | 70.19 | PolyFormer: Referring Image Segmentation as Sequential Polygon Generation | |
| PolyFormer-B | 69.05 | PolyFormer: Referring Image Segmentation as Sequential Polygon Generation | |
| VLT (Darknet53) | 56.65 | Vision-Language Transformer and Query Generation for Referring Segmentation | |
| MagNet | 66.03 | Mask Grounding for Referring Image Segmentation | |
| DETRIS | 75.3 | Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation | |
| UniLSeg-100 | 80.54 | Universal Segmentation at Arbitrary Granularity with Language Instruction | |
| MaskRIS (Swin-B, combined DB) | 71.09 | MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation | |
| HyperSeg | 78.9 | HyperSeg: Towards Universal Visual Segmentation with Large Language Model | |
| C3VG | 76.39 | Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints | |
| LAVT (Swin-B) | 62.09 | LAVT: Language-Aware Vision Transformer for Referring Image Segmentation | |
| MLCD-Seg-7B | 80.5 | Multi-label Cluster Discrimination for Visual Representation Learning | |