NVIDIA researchers are showcasing their latest advancements in visual generative AI at the Computer Vision and Pattern Recognition (CVPR) conference in Seattle. These developments focus on custom image generation, 3D scene editing, visual language understanding, and autonomous vehicle perception. Among the highlights is JeDi, a technique that simplifies the customization of diffusion models for text-to-image generation using just a few reference images. Another significant innovation is FoundationPose, which instantly tracks the 3D pose of objects in videos without requiring per-object training. NVIDIA’s NeRFDeformer streamlines 3D scene editing, and their collaboration with MIT has resulted in VILA, a family of vision language models capable of understanding images, videos, and text. NVIDIA’s research also includes over a dozen papers on autonomous vehicle perception, mapping, and planning. Two of their projects are finalists for CVPR’s Best Paper Awards, and NVIDIA has won the CVPR Autonomous Grand Challenge’s End-to-End Driving at Scale track, showcasing their leadership in generative AI for self-driving vehicles.

NVIDIA Unveils Groundbreaking Visual AI Models at CVPR 2023
NVIDIA is pushing the boundaries of visual generative AI at CVPR 2023.
1–2 minutes










