HyperAI

Video Instance Segmentation On Ovis 1

Metrics

AP50
AP75
AR1
AR10
mask AP

Results

Performance results of various models on this benchmark

Comparison Table
Model NameAP50AP75AR1AR10mask AP
dvis-improved-decoupled-framework-for62.837.315.842.937.2
dvis-daq-improving-video-segmentation-via83.862.9--57.1
dvis-improved-decoupled-framework-for68.940.916.847.341.2
instanceformer-an-online-video-instance42.521.6112.929.322.8
universal-instance-perception-as-object72.552.2--49.0
robust-online-video-instance-segmentation64.742.618.449.142.6
instanceformer-an-online-video-instance40.718.11227.120.0
crossover-learning-for-fast-online-video32.712.1--14.9
boxvis-video-instance-segmentation-with-box68.439.9--40.6
tarvis-a-unified-approach-for-target-based52.530.415.939.931.1
dvis-improved-decoupled-framework-for78.958.5--53.4
temporally-efficient-vision-transformer-for34.915.0--17.4
stc-spatio-temporal-contrastive-learning-for33.513.4--15.5
devis-making-deformable-transformers-work-for59.338.316.639.835.5
minvis-a-minimal-video-instance-segmentation61.541.318.143.339.4
dvis-decoupled-video-instance-segmentation75.953.019.455.349.9
tube-link-a-flexible-cross-tube-baseline-for51.530.215.534.529.5
mask2former-for-video-instance-segmentation36.914.19.924.716.6
dvis-improved-decoupled-framework-for72.555.020.854.649.6
in-defense-of-online-models-for-video51.3301537.530.2
a-generalized-framework-for-video-instance69.247.818.949.045.4
vita-video-instance-segmentation-via-object51.924.914.933.027.7
novis-a-case-for-end-to-end-near-online-video68.343.819.446.943.5
univs-unified-and-universal-video----41.7
mdqe-mining-discriminative-query-embeddings67.844.318.346.542.6
crossover-learning-for-fast-online-video35.516.9--18.1
spatial-feature-calibration-and-temporal35.415.28.423.117.3
gratt-vis-gated-residual-attention-for-auto60.836.816.840.136.2
tarvis-a-unified-approach-for-target-based67.844.618.050.443.2
refinevis-video-instance-segmentation-with70.448.419.151.246
dvis-decoupled-video-instance-segmentation71.949.219.452.547.1
occluded-video-instance-segmentation29.912.5--14.3
occluded-video-instance-segmentation33.913.1--15.4
novis-a-case-for-end-to-end-near-online-video56.232.615.737.132.7
universal-instance-perception-as-object55.535.6--34.0
context-aware-video-instance-segmentation82.663.521.261.857.1
tarvis-a-unified-approach-for-target-based55.034.416.140.934.0
ctvis-consistent-training-for-online-video60.834.9--35.5
gratt-vis-gated-residual-attention-for-auto69.147.819.249.445.7
d2conv3d-dynamic-dilated-convolutions-for33.813.7--15.2
general-object-foundation-model-for-images-55.5--50.4
in-defense-of-online-models-for-video65.745.217.949.642.6
devis-making-deformable-transformers-work-for47.620.812.028.923.7
ctvis-consistent-training-for-online-video71.547.5--46.9