HyperAI超神経

Video Instance Segmentation On Ovis 1

評価指標

AP50
AP75
AR1
AR10
mask AP

評価結果

このベンチマークにおける各モデルのパフォーマンス結果

モデル名
AP50
AP75
AR1
AR10
mask AP
Paper TitleRepository
DVIS++(R50, Online)62.837.315.842.937.2DVIS++: Improved Decoupled Framework for Universal Video Segmentation
DVIS-DAQ(VIT-L, Offline)83.862.9--57.1DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries
DVIS++(R50, Offline)68.940.916.847.341.2DVIS++: Improved Decoupled Framework for Universal Video Segmentation
InstanceFormer (Swin-L)42.521.6112.929.322.8InstanceFormer: An Online Video Instance Segmentation Framework
UNINEXT (ViT-H, Online)72.552.2--49.0Universal Instance Perception as Object Discovery and Retrieval
ROVIS (Swin-L)64.742.618.449.142.6Robust Online Video Instance Segmentation with Track Queries
InstanceFormer(ResNet-50)40.718.11227.120.0InstanceFormer: An Online Video Instance Segmentation Framework
CrossVIS (ResNet-50)32.712.1--14.9Crossover Learning for Fast Online Video Instance Segmentation
BoxVIS(Swin-L & Box-sup)68.439.9--40.6BoxVIS: Video Instance Segmentation with Box Annotations
TarViS (ResNet-50)52.530.415.939.931.1TarViS: A Unified Approach for Target-based Video Segmentation
DVIS++(VIT-L,Offline)78.958.5--53.4DVIS++: Improved Decoupled Framework for Universal Video Segmentation
TeViT (ResNet-50)34.915.0--17.4Temporally Efficient Vision Transformer for Video Instance Segmentation
STC (ResNet-50)33.513.4--15.5STC: Spatio-Temporal Contrastive Learning for Video Instance Segmentation-
DeVIS (Swin-L)59.338.316.639.835.5DeVIS: Making Deformable Transformers Work for Video Instance Segmentation
MinVIS (Swin-L)61.541.318.143.339.4MinVIS: A Minimal Video Instance Segmentation Framework without Video-based Training
DVIS(Swin-L, Offline)75.953.019.455.349.9DVIS: Decoupled Video Instance Segmentation Framework
Tube-Link(ResNet-50)51.530.215.534.529.5Tube-Link: A Flexible Cross Tube Framework for Universal Video Segmentation
Mask2Former-VIS36.914.19.924.716.6Mask2Former for Video Instance Segmentation
DVIS++(VIT-L, Online)72.555.020.854.649.6DVIS++: Improved Decoupled Framework for Universal Video Segmentation
IDOL (ResNet-50)51.3301537.530.2In Defense of Online Models for Video Instance Segmentation
0 of 44 row(s) selected.