HyperAI초신경

Referring Video Object Segmentation On Refer

평가 지표

F
J
Ju0026F

평가 결과

이 벤치마크에서 각 모델의 성능 결과

모델 이름
F
J
Ju0026F
Paper TitleRepository
HTML-Video-SwinT63.059.561.2HTML: Hybrid Temporal-scale Multimodal Learning Framework for Referring Video Object Segmentation-
HTR68.965.367.1Temporally Consistent Referring Video Object Segmentation with Hybrid Memory
VLT65.661.963.8VLT: Vision-Language Transformer and Query Generation for Referring Segmentation
GLEE-Plus69.765.667.7General Object Foundation Model for Images and Videos at Scale
HTML-SwinL65.361.563.4HTML: Hybrid Temporal-scale Multimodal Learning Framework for Referring Video Object Segmentation-
ReferFormer (Large)64.661.362.9Language as Queries for Referring Video Object Segmentation
SOC67.964.166.0SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation
HTML-Video-SwinB65.261.563.4HTML: Hybrid Temporal-scale Multimodal Learning Framework for Referring Video Object Segmentation-
HTML-ResNet10159.857.358.5HTML: Hybrid Temporal-scale Multimodal Learning Framework for Referring Video Object Segmentation-
HTML-ResNet5059.056.557.8HTML: Hybrid Temporal-scale Multimodal Learning Framework for Referring Video Object Segmentation-
VATEX67.563.365.4Vision-Aware Text Features in Referring Image Segmentation: From Object Understanding to Context Understanding
CMSA38.134.836.4Cross-Modal Self-Attention Network for Referring Image Segmentation
SgMg67.463.965.7Spectrum-guided Multi-granularity Referring Video Object Segmentation
GLEE-Pro72.968.270.6General Object Foundation Model for Images and Videos at Scale
FindTrack72.068.670.3Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation
HyperSeg--68.5HyperSeg: Towards Universal Visual Segmentation with Large Language Model
R2VOS (Swin-T)61.558.960.2Towards Robust Referring Video Object Segmentation with Cyclic Relational Consensus
HTML-Video-SwinS62.959.961.4HTML: Hybrid Temporal-scale Multimodal Learning Framework for Referring Video Object Segmentation-
0 of 18 row(s) selected.