RTMPose: Real-Time Multi-Person Pose Estimation based on MMPose

Recent studies on 2D pose estimation have achieved excellent performance onpublic benchmarks, yet its application in the industrial community stillsuffers from heavy model parameters and high latency. In order to bridge thisgap, we empirically explore key factors in pose estimation including paradigm,model architecture, training strategy, and deployment, and present ahigh-performance real-time multi-person pose estimation framework, RTMPose,based on MMPose. Our RTMPose-m achieves 75.8% AP on COCO with 90+ FPS on anIntel i7-11700 CPU and 430+ FPS on an NVIDIA GTX 1660 Ti GPU, and RTMPose-lachieves 67.0% AP on COCO-WholeBody with 130+ FPS. To further evaluateRTMPose's capability in critical real-time applications, we also report theperformance after deploying on the mobile device. Our RTMPose-s achieves 72.2%AP on COCO with 70+ FPS on a Snapdragon 865 chip, outperforming existingopen-source libraries. Code and models are released athttps://github.com/open-mmlab/mmpose/tree/1.x/projects/rtmpose.