Command Palette
Search for a command to run...
3D visual grounding
3D Visual Grounding is a key technology in the field of computer vision, aiming to accurately locate and identify target objects in a three-dimensional environment through natural language descriptions. This technology combines image understanding and natural language processing, enabling the mapping from text to specific objects within a 3D scene, and has broad application value, such as in augmented reality, robot navigation, and intelligent interaction.