HyperAI

Video Instance Segmentation On Youtube Vis 1

Metriken

AP50
AP75
AR1
AR10
mask AP

Ergebnisse

Leistungsergebnisse verschiedener Modelle zu diesem Benchmark

Modellname
AP50
AP75
AR1
AR10
mask AP
Paper TitleRepository
CrossVIS (ResNet-101)57.339.7364236.6Crossover Learning for Fast Online Video Instance Segmentation
MDQE(Swin-L)84.967.353.565.059.9MDQE: Mining Discriminative Query Embeddings to Segment Occluded Instances on Challenging Videos
IDOL (ResNet-50)7452.947.758.749.5In Defense of Online Models for Video Instance Segmentation
MSN69.454.940.155.048.8MSN: Efficient Online Mask Selection Network for Video Instance Segmentation
IFC (ResNet-50)65.846.843.851.242.8Video Instance Segmentation using Inter-Frame Communication Transformers
STMask(R101-DCN-FPN)56.838.034.841.836.8Spatial Feature Calibration and Temporal Fusion for Effective One-stage Video Instance Segmentation
VisTR(ResNet-50)59.836.937.242.436.2End-to-End Video Instance Segmentation with Transformers
OSMN28.633.1--29.1Efficient Video Object Segmentation via Network Modulation
DeepSORT31.3---27.8Simple Online and Realtime Tracking with a Deep Association Metric
VisTR(ResNet-101)64.045.038.344.940.1End-to-End Video Instance Segmentation with Transformers
SipMask (ResNet-50, single-scale test)5333.333.538.932.5SipMask: Spatial Information Preservation for Fast Image and Video Instance Segmentation
VSTAM----39.0Video Sparse Transformer With Attention-Guided Memory for Video Object Detection
ObjProp (ResNet-50)59.439.239.147.736.0Object Propagation via Inter-Frame Attentions for Temporally Stable Video Instance Segmentation
SeqFormer (ResNet-101)71.155.746.856.949.0SeqFormer: Sequential Transformer for Video Instance Segmentation
TraDeS52.632.8--32.6Track to Detect and Segment: An Online Multi-Object Tracker-
STEm-Seg (ResNet-101)55.837.934.441.634.6STEm-Seg: Spatio-temporal Embeddings for Instance Segmentation in Videos
SeqFormer (ResNet-50)69.851.845.554.847.4SeqFormer: Sequential Transformer for Video Instance Segmentation
InstanceFormer(Swin-L)78.064.250.961.656.3InstanceFormer: An Online Video Instance Segmentation Framework
MaskTrack R-CNN (ResNet-50, single-scale training and test)51.132.63135.530.3Video Instance Segmentation
NOVIS (ResNet-50)75.756.950.360.652.8NOVIS: A Case for End-to-End Near-Online Video Instance Segmentation-
0 of 43 row(s) selected.