Video Instance Segmentation On Ovis 1

المقاييس

AP50

AP75

AR1

AR10

mask AP

النتائج

نتائج أداء النماذج المختلفة على هذا المعيار القياسي

اسم النموذج	AP50	AP75	AR1	AR10	mask AP	Paper Title	Repository
DVIS++(R50, Online)	62.8	37.3	15.8	42.9	37.2	DVIS++: Improved Decoupled Framework for Universal Video Segmentation
DVIS-DAQ(VIT-L, Offline)	83.8	62.9	-	-	57.1	DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries
DVIS++(R50, Offline)	68.9	40.9	16.8	47.3	41.2	DVIS++: Improved Decoupled Framework for Universal Video Segmentation
InstanceFormer (Swin-L)	42.5	21.61	12.9	29.3	22.8	InstanceFormer: An Online Video Instance Segmentation Framework
UNINEXT (ViT-H, Online)	72.5	52.2	-	-	49.0	Universal Instance Perception as Object Discovery and Retrieval
ROVIS (Swin-L)	64.7	42.6	18.4	49.1	42.6	Robust Online Video Instance Segmentation with Track Queries
InstanceFormer(ResNet-50)	40.7	18.1	12	27.1	20.0	InstanceFormer: An Online Video Instance Segmentation Framework
CrossVIS (ResNet-50)	32.7	12.1	-	-	14.9	Crossover Learning for Fast Online Video Instance Segmentation
BoxVIS(Swin-L & Box-sup)	68.4	39.9	-	-	40.6	BoxVIS: Video Instance Segmentation with Box Annotations
TarViS (ResNet-50)	52.5	30.4	15.9	39.9	31.1	TarViS: A Unified Approach for Target-based Video Segmentation
DVIS++(VIT-L,Offline)	78.9	58.5	-	-	53.4	DVIS++: Improved Decoupled Framework for Universal Video Segmentation
TeViT (ResNet-50)	34.9	15.0	-	-	17.4	Temporally Efficient Vision Transformer for Video Instance Segmentation
STC (ResNet-50)	33.5	13.4	-	-	15.5	STC: Spatio-Temporal Contrastive Learning for Video Instance Segmentation	-
DeVIS (Swin-L)	59.3	38.3	16.6	39.8	35.5	DeVIS: Making Deformable Transformers Work for Video Instance Segmentation
MinVIS (Swin-L)	61.5	41.3	18.1	43.3	39.4	MinVIS: A Minimal Video Instance Segmentation Framework without Video-based Training
DVIS(Swin-L, Offline)	75.9	53.0	19.4	55.3	49.9	DVIS: Decoupled Video Instance Segmentation Framework
Tube-Link(ResNet-50)	51.5	30.2	15.5	34.5	29.5	Tube-Link: A Flexible Cross Tube Framework for Universal Video Segmentation
Mask2Former-VIS	36.9	14.1	9.9	24.7	16.6	Mask2Former for Video Instance Segmentation
DVIS++(VIT-L, Online)	72.5	55.0	20.8	54.6	49.6	DVIS++: Improved Decoupled Framework for Universal Video Segmentation
IDOL (ResNet-50)	51.3	30	15	37.5	30.2	In Defense of Online Models for Video Instance Segmentation

0 of 44 row(s) selected.