Video Instance Segmentation on YouTube-VIS 1

Metrics

AP50

AP75

AR1

AR10

mask AP

Results

Performance results of various models on this benchmark

Model name	AP50	AP75	AR1	AR10	mask AP	Paper Title	Repository
CrossVIS (ResNet-101)	57.3	39.7	36	42	36.6	Crossover Learning for Fast Online Video Instance Segmentation
MDQE(Swin-L)	84.9	67.3	53.5	65.0	59.9	MDQE: Mining Discriminative Query Embeddings to Segment Occluded Instances on Challenging Videos
IDOL (ResNet-50)	74	52.9	47.7	58.7	49.5	In Defense of Online Models for Video Instance Segmentation
MSN	69.4	54.9	40.1	55.0	48.8	MSN: Efficient Online Mask Selection Network for Video Instance Segmentation
IFC (ResNet-50)	65.8	46.8	43.8	51.2	42.8	Video Instance Segmentation using Inter-Frame Communication Transformers
STMask(R101-DCN-FPN)	56.8	38.0	34.8	41.8	36.8	Spatial Feature Calibration and Temporal Fusion for Effective One-stage Video Instance Segmentation
VisTR(ResNet-50)	59.8	36.9	37.2	42.4	36.2	End-to-End Video Instance Segmentation with Transformers
OSMN	28.6	33.1	-	-	29.1	Efficient Video Object Segmentation via Network Modulation
DeepSORT	31.3	-	-	-	27.8	Simple Online and Realtime Tracking with a Deep Association Metric
VisTR(ResNet-101)	64.0	45.0	38.3	44.9	40.1	End-to-End Video Instance Segmentation with Transformers
SipMask (ResNet-50, single-scale test)	53	33.3	33.5	38.9	32.5	SipMask: Spatial Information Preservation for Fast Image and Video Instance Segmentation
VSTAM	-	-	-	-	39.0	Video Sparse Transformer With Attention-Guided Memory for Video Object Detection	-
ObjProp (ResNet-50)	59.4	39.2	39.1	47.7	36.0	Object Propagation via Inter-Frame Attentions for Temporally Stable Video Instance Segmentation
SeqFormer (ResNet-101)	71.1	55.7	46.8	56.9	49.0	SeqFormer: Sequential Transformer for Video Instance Segmentation
TraDeS	52.6	32.8	-	-	32.6	Track to Detect and Segment: An Online Multi-Object Tracker
STEm-Seg (ResNet-101)	55.8	37.9	34.4	41.6	34.6	STEm-Seg: Spatio-temporal Embeddings for Instance Segmentation in Videos
SeqFormer (ResNet-50)	69.8	51.8	45.5	54.8	47.4	SeqFormer: Sequential Transformer for Video Instance Segmentation
InstanceFormer(Swin-L)	78.0	64.2	50.9	61.6	56.3	InstanceFormer: An Online Video Instance Segmentation Framework
MaskTrack R-CNN (ResNet-50, single-scale training and test)	51.1	32.6	31	35.5	30.3	Video Instance Segmentation
NOVIS (ResNet-50)	75.7	56.9	50.3	60.6	52.8	NOVIS: A Case for End-to-End Near-Online Video Instance Segmentation	-
