NVIDIA Isaac GR00T N1: An Open Foundation Model for Humanoid Robots

At NVIDIA, we are developing AI solutions to enable general-purpose humanoid robots to understand the human world, follow language instructions, and perform diverse tasks. A robust Vision-Language-Action (VLA) model is crucial for such advanced capabilities. To this end, we developed GR00T N1, a generalist robot model trained on a diverse dataset that includes egocentric human videos, real and simulated robot trajectories, and synthetic data.

GR00T N1 outperforms state-of-the-art imitation learning models in simulation benchmarks across multiple robot embodiments. Additionally, it demonstrates effective language-conditioned bimanual manipulation on the Fourier GR-1 and 1X humanoids in household tasks.

To help Physical AI builders solve the most critical problems of our society, we make our models open-weight with permissive licenses available via NVIDIA Isaac GR00T.