HyperAI
Startseite
Neuigkeiten
Neueste Forschungsarbeiten
Tutorials
Datensätze
Wiki
SOTA
LLM-Modelle
GPU-Rangliste
Veranstaltungen
Suche
Über
Deutsch
HyperAI
Toggle sidebar
Seite durchsuchen…
⌘
K
Startseite
SOTA
Referring Expression Segmentation
Referring Expression Segmentation On Refcoco 4
Referring Expression Segmentation On Refcoco 4
Metriken
Overall IoU
Ergebnisse
Leistungsergebnisse verschiedener Modelle zu diesem Benchmark
Columns
Modellname
Overall IoU
Paper Title
Repository
C3VG
77.96
Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints
UniLSeg-100
78.29
Universal Segmentation at Arbitrary Granularity with Language Instruction
STEP (5-fold)
52.33
See-Through-Text Grouping for Referring Image Segmentation
-
ReLA
71.02
GRES: Generalized Referring Expression Segmentation
MaskRIS (Swin-B)
74.46
MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation
DETRIS
78.6
Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation
CRIS
68.08
CRIS: CLIP-Driven Referring Image Segmentation
MagNet
71.32
Mask Grounding for Referring Image Segmentation
VLT
68.43
VLT: Vision-Language Transformer and Query Generation for Referring Segmentation
SHNet
58.46
Comprehensive Multi-Modal Interactions for Referring Image Segmentation
GROUNDHOG
75.0
GROUNDHOG: Grounding Large Language Models to Holistic Segmentation
-
LAVT
68.38
LAVT: Language-Aware Vision Transformer for Referring Image Segmentation
MaIL
65.92
MaIL: A Unified Mask-Image-Language Trimodal Network for Referring Image Segmentation
-
HyperSeg
83.5
HyperSeg: Towards Universal Visual Segmentation with Large Language Model
SafaRi-B
74.53
SafaRi:Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation
-
MattNet
52.39
MAttNet: Modular Attention Network for Referring Expression Comprehension
CPMC
53.44
Referring Image Segmentation via Cross-Modal Progressive Comprehension
PolyFormer-B
72.89
PolyFormer: Referring Image Segmentation as Sequential Polygon Generation
VATEX
-
Vision-Aware Text Features in Referring Image Segmentation: From Object Understanding to Context Understanding
CMSA
47.60
Cross-Modal Self-Attention Network for Referring Image Segmentation
0 of 29 row(s) selected.
Previous
Next