HyperAI超神経

Referring Video Object Segmentation On Mevis

評価指標

F
J
Ju0026F

評価結果

このベンチマークにおける各モデルのパフォーマンス結果

モデル名
F
J
Ju0026F
Paper TitleRepository
DsHmp + MTCM51.144.147.6Multi-Context Temporal Consistent Modeling for Referring Video Object Segmentation
HTR45.539.942.7Temporally Consistent Referring Video Object Segmentation with Hybrid Memory
VRS-HQ (Chat-UniVi-13B)53.74850.9The Devil is in Temporal Token: High Quality Video Reasoning Segmentation
InternVideo2.5--32InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling-
MPG-SAM 256.750.753.7MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation-
ReferFormer32.229.831.0Language as Queries for Referring Video Object Segmentation
DsHmp49.84346.4Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation
SAMWISE51.245.448.3SAMWISE: Infusing wisdom in SAM2 for Text-Driven Video Segmentation
LBDT30.827.829.3Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation-
VLT+TC37.333.635.5VLT: Vision-Language Transformer and Query Generation for Referring Segmentation
LMPM40.234.237.2MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
MTTR31.228.830.0End-to-End Referring Video Object Segmentation with Multimodal Transformers
URVOS29.925.727.8URVOS: Unified Referring Video Object Segmentation Network with a Large-Scale Benchmark
ReferDINO (Swin-B)53.944.749.3ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations-
FindTrack50.745.648.2Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation
0 of 15 row(s) selected.