HyperAI

Video Object Detection On Imagenet Vid

Metrics

MAP

Results

Performance results of various models on this benchmark

Comparison Table
Model NameMAP
yolov-making-still-image-object-detectors87.5
sequence-level-semantics-aggregation-for82.69
robust-and-efficient-post-processing-for84.2
boxmask-revisiting-bounding-box-supervision80.7
sequence-level-semantics-aggregation-for84.3
practical-video-object-detection-via-feature93.2
objects-do-not-disappear-video-object87.2
objects-do-not-disappear-video-object91.3
diffusionvid-denoising-object-boxes-with87.1
spatio-temporal-learnable-proposals-for-end80.3
temporal-shift-module-for-efficient-video76.3
diffusionvid-denoising-object-boxes-with92.5
learning-motion-priors-for-efficient-video81.7
robust-and-efficient-post-processing-for75.1
integrated-object-detection-and-tracking-with83.5
transvod-end-to-end-video-object-detection90.1
identity-consistent-aggregation-for-video85.8
memory-enhanced-global-local-aggregation-for85.4
ptseformer-progressive-temporal-spatial88.1
looking-fast-and-slow-memory-guided-mobile63.9
flow-guided-feature-aggregation-for-video80.1
dafa-diversity-aware-feature-aggregation-for84.5
temporal-roi-align-for-video-object84.3
tgbformer-transformer-graphformer-blender90.3
objects-do-not-disappear-video-object87.9
robust-and-efficient-post-processing-for80.1
boxmask-revisiting-bounding-box-supervision84.8
mining-inter-video-proposal-relations-for85.5
short-term-anchor-linking-and-long-term-self82.4
video-sparse-transformer-with-attention91.1
dafa-diversity-aware-feature-aggregation-for85.9
robust-and-efficient-post-processing-for68.6
mining-inter-video-proposal-relations-for83.8