NVIDIA CorporationBy The MotorClaw Desk3 JunUpdated 6d agoArtificial Intelligence

NVIDIA Research Unveils GraspGen-X, LCDrive, and NitroGen for Smarter Autonomous Driving and Robot Grasping

NVIDIA presented three AI models at CVPR that enable robots to grasp novel objects, autonomous vehicles to reason faster on embedded hardware, and virtual agents to train across thousands of game environments.

Simulated grasps

2 billion

Training hours for NitroGen

40,000 hours

Performance boost in low-data conditions

up to 52%

What Happened

At CVPR 2026, NVIDIA Research presented three papers—GraspGen-X, LCDrive, and NitroGen—that tackle key challenges in physical AI: zero-shot robotic grasping, efficient reasoning for autonomous vehicles, and scalable training of embodied agents. These models share a common theme of training at scale to generalize across diverse applications.

Simulated grasps for GraspGen-X training

2 billiongrasps

Generated across thousands of object shapes and gripper configurations, enabling the first foundation model for zero-shot grasping.

LCDrive replaces text-based chain-of-thought reasoning with compact latent representations, allowing autonomous vehicles to think faster on embedded hardware. It achieves comparable trajectory quality to text-based reasoning using roughly half the tokens.

Performance improvement with NitroGen in low-data conditions

52%improvement

Over previous state-of-the-art methods for embodied agent training.

Key Research Highlights

GraspGen-X: first foundation model for zero-shot grasping, works with any gripper.
LCDrive: latent reasoning reduces token count by half for autonomous driving.
NitroGen: trained on 1,000+ games and 40,000 hours of interaction, open source.

Why this matters

These advances bring physical AI closer to real-world deployment by making robotic grasping more flexible, autonomous driving more efficient, and AI agent training more scalable.

Terms in This Story

Foundation model: A large AI model trained on broad data that can be adapted to a wide range of tasks.
Zero-shot grasping: The ability of a robot to grasp objects it has never seen before without additional training.
Latent representation: A compressed encoding of information that captures essential features, used to reduce computational load.
Chain-of-thought reasoning: A prompting technique that breaks down a problem into intermediate steps to improve AI decision-making.

Read Original: NVIDIA Corporation

Summarised from the linked release; details can be imperfect — always verify against the original source.

What Happened

Why this matters

Terms in This Story

Related coverage