NVIDIA CorporationBy The MotorClaw Desk3 JunUpdated 6d agoArtificial Intelligence

NVIDIA Unveils Physical AI Agent Skills to Accelerate AV, Robot, and Vision AI Research

NVIDIA launched new physical AI agent skills and models at CVPR to help researchers streamline development workflows for autonomous vehicles, robotics, and vision AI.

Alpamayo 2 Super parameters

32 billion

Physical AI Dataset downloads

15 million+

GRAIL dataset size

50 hours

What Happened

At CVPR, NVIDIA announced new physical AI agent skills that automate workflows for autonomous vehicle, robotics, and vision AI research. The skills pair with NVIDIA Cosmos 3, a foundation model, and simulation frameworks to help researchers move from model capabilities to scalable workflows. Key models include the 32-billion-parameter Alpamayo 2 Super VLA model for autonomous driving and Cosmos 3, the first full omnimodel for physical AI.

NVIDIA Physical AI Dataset downloads

15 million+

on Hugging Face

New Agent Skills and Tools

Neural Reconstruction for creating editable 3D scenes from fleet data
Defect Image Generation for creating synthetic visual anomalies
Video Augmentation for fine-tuning video analysis models
Isaac mobility skills for autonomous navigation workflows

Why this matters

These tools automate fragmented steps in physical AI research—like scene reconstruction, synthetic data generation, and policy evaluation—making it faster to develop safer autonomous systems and robots.

Terms in This Story

Physical AI: AI systems that understand and interact with the physical world, such as robots and autonomous vehicles.
VLA model: Vision Language Action model that can reason about visual input, understand language, and take actions.
Omnimodel: A single AI model that unifies multiple capabilities, such as vision reasoning, world generation, and action generation.
Neural Reconstruction: A technique that uses AI to convert real-world images or video into editable 3D scenes for simulation.

Read Original: NVIDIA Corporation

Summarised from the linked release; details can be imperfect — always verify against the original source.

What Happened

Why this matters

Terms in This Story

Related coverage