Towards Grand Unification of Object Tracking

We present a unified method, termed Unicorn, that can simultaneously solvefour tracking problems (SOT, MOT, VOS, MOTS) with a single network using thesame model parameters. Due to the fragmented definitions of the object trackingproblem itself, most existing trackers are developed to address a single orpart of tasks and overspecialize on the characteristics of specific tasks. Bycontrast, Unicorn provides a unified solution, adopting the same input,backbone, embedding, and head across all tracking tasks. For the first time, weaccomplish the great unification of the tracking network architecture andlearning paradigm. Unicorn performs on-par or better than its task-specificcounterparts in 8 tracking datasets, including LaSOT, TrackingNet, MOT17,BDD100K, DAVIS16-17, MOTS20, and BDD100K MOTS. We believe that Unicorn willserve as a solid step towards the general vision model. Code is available athttps://github.com/MasterBin-IIAU/Unicorn.