8 months ago

Object Detection

Video Understanding

Computer Vision

Qi Zhang¹, Yunfei Gong¹, Daijie Chen², Antoni B. Chan³, Hui Huang¹*

Abstract

Recent deep learning-based multi-view people detection (MVD) methods have shown promising results on existing datasets. However, current methods are mainly trained and evaluated on small, single scenes with a limited number of multi-view frames and fixed camera views. As a result, these methods may not be practical for detecting people in larger, more complex scenes with severe occlusions and camera calibration errors. This paper focuses on improving multi-view people detection by developing a supervised view-wise contribution weighting approach that better fuses multi-camera information under large scenes. Besides, a large synthetic dataset is adopted to enhance the model's generalization ability and enable more practical evaluation and comparison. The model's performance on new testing scenes is further improved with a simple domain adaptation technique. Experimental results demonstrate the effectiveness of our approach in achieving promising cross-scene multi-view people detection performance. See code here: https://vcc.tech/research/2024/MVD.
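
The abstract does not say how the view-wise contribution weights are computed or supervised, so the following is only a minimal sketch of the general idea of weighting each camera's contribution before fusion, not the authors' implementation. It assumes every view has already been projected to a common ground-plane grid, and that some upstream module produces one unnormalized score map per view. The function name `fuse_views`, the array shapes, and the softmax normalization across views are all illustrative assumptions.

```python
import numpy as np

def fuse_views(view_feats, view_scores):
    """Fuse per-view ground-plane features with per-location view weights.

    view_feats:  array of shape (V, C, H, W), one feature map per camera view
                 (assumed to be already projected to a shared ground plane).
    view_scores: array of shape (V, H, W), unnormalized contribution scores
                 (hypothetical output of a weight-prediction module).
    Returns the fused map of shape (C, H, W) and the weights of shape (V, H, W).
    """
    # Softmax over the view axis so the weights at each location sum to 1.
    shifted = view_scores - view_scores.max(axis=0, keepdims=True)
    weights = np.exp(shifted)
    weights /= weights.sum(axis=0, keepdims=True)

    # Broadcast the weights over the channel axis and sum across views.
    fused = (weights[:, None, :, :] * view_feats).sum(axis=0)
    return fused, weights

# Usage with random data: 3 views, 4 channels, an 8x8 ground-plane grid.
rng = np.random.default_rng(0)
feats = rng.normal(size=(3, 4, 8, 8))
scores = rng.normal(size=(3, 8, 8))
fused, weights = fuse_views(feats, scores)
```

With equal scores this reduces to plain averaging over views. A learned, supervised score map would instead let the fusion down-weight views that are occluded or poorly calibrated at a given location, which is the motivation the abstract describes.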

