HyperAI超神経
Back to Headlines

Nvidia's AI Tool Turns 3D Scenes into Composable Images

11日前

Nvidia has introduced a groundbreaking new tool called Nvidia AI Blueprint for 3D-guided generative AI, designed to allow developers to generate AI images by first creating scenes in a 3D environment. This innovative tool was recently released and is supported on computers equipped with Nvidia RTX 4080 and higher GPUs. The core functionality of the tool is achieved through the integration of Blender, a popular 3D modeling software, with Black Forest Lab's FLUX.1 image generator. Users can build their desired scenes in Blender using 3D objects such as buildings, plants, animals, and vehicles. These 3D models serve as reference points for generating 2D images, offering users more precise control over details like the shape and height of structures, the number of trees and cars, and the viewing angles. This method significantly reduces the time needed to achieve ideal images and minimizes frustration from repeatedly adjusting textual descriptions. Nvidia positions this tool as a "predefined, customizable AI workflow" intended to streamline the development of generative AI applications. It provides detailed documentation, example assets, and pre-configured environments to guide users through the process. The tool's workflow involves generating depth maps from 3D scenes created in Blender and combining these maps with user prompts to create high-quality 2D images. Thanks to its reliance on a 3D environment, users have greater creative freedom, as they can easily manipulate objects and camera angles. Behind the scenes, the Nvidia AI Blueprint leverages a powerful tool called ComfyUI, which allows creators to link generative AI models in novel ways. For instance, a ComfyUI plugin enables seamless integration between Blender and ComfyUI. Additionally, Nvidia's NIM microservices facilitate the deployment of the FLUX.1-dev model on GeForce RTX GPUs, utilizing NVIDIA TensorRT for optimized performance. The FLUX.1-dev model runs in quantized formats like FP4 and FP8, providing up to twice the inference speed compared to native PyTorch FP16. For users with Ada Lovelace generation GPUs, an FP8 version is available, also accelerated by TensorRT, further reducing VRAM usage. The blueprint includes everything users need to get started, such as Blender, ComfyUI, the Blender-ComfyUI plugin, the FLUX.1-dev NIM microservice, and the necessary ComfyUI nodes. An intuitive installer and comprehensive deployment instructions make the setup straightforward. Step-by-step guides, sample resources, and pre-configured environments ensure that the creative process remains controllable and yields high-quality results. For developers, the blueprint serves as a robust foundation for building or extending similar workflows, complete with source code, sample data, documentation, and working examples, enabling rapid prototyping and iteration. In the current market, Nvidia's 3D-guided generative AI tool stands out as a pioneering solution, though competitors like Adobe are also making strides in the field. At the MAX conference in October, Adobe showcased an experimental tool called "Project Concept" that similarly uses 3D environments to guide image generation. However, Project Concept is still in the experimental phase, and its future availability to the public remains uncertain. Industry experts recognize the significant impact of Nvidia's new tool on the advancement of generative AI, particularly in creative and computational industries. As a global leader in graphics processing units (GPUs), Nvidia continues to innovate by developing cutting-edge technologies and tools. The AI Blueprint not only enhances Nvidia's product portfolio but also empowers creators and developers with more efficient and precise means of generating images. With the growing importance of creative control in AI art, this tool addresses a critical need by bridging the gap between detailed scene creation and automated image generation. Nvidia's AI Blueprint capitalizes on the performance advantages of its RTX AI PCs and workstations, especially the latest breakthroughs in the Blackwell architecture. The FLUX.1-dev NIM microservice, optimized with TensorRT, offers superior inference speed and reduced VRAM usage in both FP4 and FP8 formats. Currently, ten NIM microservices are available for RTX users, covering a range of applications from image and language generation to speech AI and computer vision. Nvidia regularly updates its AI ecosystem, promising more blueprint documents and services in the future. Through its RTX AI Garage blog series, Nvidia shares community-driven AI innovations weekly, helping users understand how to leverage NIM microservices and AI blueprint documents to build AI agents, creative workflows, digital humans, and other productivity applications. This initiative underscores Nvidia's commitment to fostering a vibrant AI community and supporting continuous learning and innovation. The launch of Nvidia AI Blueprint for 3D-guided generative AI marks a significant milestone in the evolution of AI image generation technology. By providing an accessible and powerful platform, Nvidia has lowered the barrier to entry for advanced AI workflows and expanded the toolkit available to creative professionals. This development is expected to have a profound effect on the AI art creation landscape, driving further advancements and applications in the field. Nvidia's ongoing leadership in AI technology demonstrates its dedication to pushing the boundaries of what is possible with AI, thereby maintaining its position as a frontrunner in the industry. Overall, Nvidia's introduction of AI Blueprint for 3D-guided generative AI is a game-changer, offering a practical and effective solution to the challenges of creative control in AI-generated art. The tool's combination of user-friendly design and high-performance capabilities sets a new standard for generative AI workflows and is likely to influence future developments in the field. Nvidia's focus on user-centric innovation and robust technical support is a testament to its commitment to driving the AI industry forward.

Related Links