Lei Ke, Shichao Li, Yanan Sun, Yu-Wing Tai, Chi-Keung Tang

Abstract

We present a novel end-to-end framework named GSNet (Geometric and Scene-aware Network), which jointly estimates 6DoF poses and reconstructs detailed 3D car shapes from a single urban street view. GSNet utilizes a unique four-way feature extraction and fusion scheme and directly regresses 6DoF poses and shapes in a single forward pass. Extensive experiments show that our diverse feature extraction and fusion scheme can greatly improve model performance. Based on a divide-and-conquer 3D shape representation strategy, GSNet reconstructs 3D vehicle shape with great detail (1352 vertices and 2700 faces). This dense mesh representation further leads us to consider geometrical consistency and scene context, and inspires a new multi-objective loss function to regularize network training, which in turn improves the accuracy of 6D pose estimation and validates the merit of jointly performing both tasks. We evaluate GSNet on the largest multi-task ApolloCar3D benchmark and achieve state-of-the-art performance both quantitatively and qualitatively. The project page is available at https://lkeab.github.io/gsnet/.
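For readers unfamiliar with the term, a 6DoF pose combines a 3D rotation and a 3D translation. The sketch below is not GSNet's code; it only illustrates, under stated assumptions, how such a pose places mesh vertices in camera space and how a pinhole camera projects them to pixels. The Euler-angle order, the function names, and the intrinsics `fx`, `fy`, `cx`, `cy` are all hypothetical choices made for this illustration.

```python
import math

def matmul3(a, b):
    """Multiply two 3x3 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def rotation_matrix(yaw, pitch, roll):
    """Build R = Ry(yaw) * Rx(pitch) * Rz(roll), angles in radians.

    The composition order is an assumption of this sketch, not a
    convention taken from the paper or the benchmark.
    """
    cy, sy = math.cos(yaw), math.sin(yaw)
    cp, sp = math.cos(pitch), math.sin(pitch)
    cr, sr = math.cos(roll), math.sin(roll)
    ry = [[cy, 0.0, sy], [0.0, 1.0, 0.0], [-sy, 0.0, cy]]
    rx = [[1.0, 0.0, 0.0], [0.0, cp, -sp], [0.0, sp, cp]]
    rz = [[cr, -sr, 0.0], [sr, cr, 0.0], [0.0, 0.0, 1.0]]
    return matmul3(ry, matmul3(rx, rz))

def apply_pose(vertices, rotation, translation):
    """Map object-space vertices to camera space: v' = R v + t."""
    return [
        tuple(sum(rotation[i][k] * v[k] for k in range(3)) + translation[i]
              for i in range(3))
        for v in vertices
    ]

def project(points, fx, fy, cx, cy):
    """Pinhole projection of camera-space points (z > 0) to pixel coordinates."""
    return [(fx * x / z + cx, fy * y / z + cy) for x, y, z in points]

# Usage: one vertex, identity rotation, 10 units in front of the camera.
R = rotation_matrix(0.0, 0.0, 0.0)
camera_points = apply_pose([(1.0, 2.0, 0.0)], R, (0.0, 0.0, 10.0))
pixels = project(camera_points, fx=100.0, fy=100.0, cx=50.0, cy=60.0)
```

The same two steps apply unchanged to a dense mesh such as the 1352-vertex shape mentioned above, since each vertex is transformed independently.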

