8 months ago

Depth Estimation

Object Detection

3D Machine Vision

Computer Vision

Yilun Chen Shu Liu Xiaoyong Shen Jiaya Jia

Abstract

Most state-of-the-art 3D object detectors rely heavily on LiDAR sensors because there is a large performance gap between image-based and LiDAR-based methods. This gap is caused by the way the representation used for prediction in 3D scenarios is formed. Our method, called Deep Stereo Geometry Network (DSGN), significantly reduces this gap by detecting 3D objects on a differentiable volumetric representation, the 3D geometric volume, which effectively encodes 3D geometric structure in regular 3D space. With this representation, we learn depth information and semantic cues simultaneously. For the first time, we provide a simple and effective one-stage stereo-based 3D detection pipeline that jointly estimates depth and detects 3D objects in an end-to-end learning manner. Our approach outperforms previous stereo-based 3D detectors (by about 10 points in terms of AP) and even achieves performance comparable to several LiDAR-based methods on the KITTI 3D object detection leaderboard. Our code is publicly available at https://github.com/chenyilun95/DSGN.
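
As background for the stereo depth estimation mentioned in the abstract, the sketch below shows the standard relation for a rectified stereo pair, depth = focal length × baseline / disparity. It is not taken from the DSGN code; the function names and the camera values (`FOCAL_PX`, `BASELINE_M`) are illustrative assumptions.

```python
def disparity_to_depth(disparity_px, focal_px, baseline_m):
    """Depth in metres for a pixel disparity in a rectified stereo pair."""
    if disparity_px <= 0:
        raise ValueError("disparity must be positive to give a finite depth")
    return focal_px * baseline_m / disparity_px


def depth_to_disparity(depth_m, focal_px, baseline_m):
    """Inverse mapping: pixel disparity expected for a point at depth_m."""
    if depth_m <= 0:
        raise ValueError("depth must be positive")
    return focal_px * baseline_m / depth_m


# Hypothetical camera parameters, chosen only to keep the arithmetic simple.
FOCAL_PX = 720.0    # focal length in pixels (assumed)
BASELINE_M = 0.5    # distance between the two cameras in metres (assumed)

# Halving the disparity doubles the depth.
depths = [disparity_to_depth(d, FOCAL_PX, BASELINE_M) for d in (72.0, 36.0, 18.0)]
print(depths)  # [5.0, 10.0, 20.0]
```

Because depth is inversely proportional to disparity, evenly spaced disparity values correspond to unevenly spaced depths. This may be one reason a representation defined over regular 3D space, as the abstract describes, is attractive for detection, though the abstract itself does not spell out that argument.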

