Part II: Frontier Robot Manipulation

Chapter 4: GR00T and Humanoid VLAs — System 2 Sees, System 1 Acts

Written: 2026-06-08 Last updated: 2026-06-08

GR00T is NVIDIA's foundation-model family for humanoid robots. GR00T N1 introduced a dual-system architecture: System 2 uses a vision-language model for deliberate reasoning, while System 1 converts the plan into continuous robot motion [1]. This matches the S3 framing: robot intelligence needs both slow planning and fast control.

Figure 4.1: VLA-style language-vision-action policy for robot control. source: S3 reused figure
Figure 4.1: VLA-style language-vision-action policy for robot control. source: S3 reused figure

4.1 Reading GR00T for Manufacturing

NVIDIA explicitly mentions material handling, packaging, and inspection as GR00T-relevant tasks [1]. The conservative reading is not that humanoids will immediately automate all packaging lines. The practical reading is that natural-language task specification, human demonstrations, and synthetic trajectories are becoming one post-training workflow.

4.2 GR00T N1.7 and the Reference Humanoid

At GTC 2026, NVIDIA announced GR00T N1.7 early access with commercial licensing and advanced dexterous control [3]. At GTC Taipei on June 1, 2026, it announced an Isaac GR00T Reference Humanoid Robot combining Unitree H2 Plus, Sharpa tactile five-finger hands, Jetson AGX Thor T5000, and the Isaac GR00T software stack, with availability planned through Unitree in late 2026 [3].

Figure 4.2: High-level reasoning separated from continuous action generation. source: S3 reused figure
Figure 4.2: High-level reasoning separated from continuous action generation. source: S3 reused figure

For manufacturers, the reference architecture matters more than the humanoid form factor. Standardized tactile hands, wrist cameras, onboard edge compute, and simulation workflow can make results easier to compare across labs and factories.

References

  1. NVIDIA (2025). NVIDIA Announces Isaac GR00T N1 and Simulation Frameworks. NVIDIA Newsroom.
  2. NVIDIA (2026). NVIDIA and Global Robotics Leaders Take Physical AI to the Real World. NVIDIA Investor Relations.
  3. NVIDIA (2026). NVIDIA Announces Isaac GR00T Reference Humanoid Robot. NVIDIA Investor Relations.
  4. Moo Jin Kim et al. (2024). OpenVLA: An Open-Source Vision-Language-Action Model. arXiv.