Chapter 6: Hands and Touch — The Last Bottleneck in Manual-Work Automation
The final bottleneck in manual-work automation is the hand. Humans regulate grip through tactile events such as contact, slip, and release [1]. Robots still struggle with this. That is why tasks that look easy to workers remain difficult in cosmetics, food, apparel, and electronics manufacturing.
6.1 VLA Is Not Enough
RT-2, OpenVLA, Octo, and pi0 connect language, vision, and action, but contact-rich tasks need force and tactile feedback. A robot handling a soft tube must know not only where the tube is, but how hard it is squeezing.
6.2 The Rise of Tactile Data
GelSight, DIGIT, ReSkin, AnySkin, Sparsh, Tactile-VLA, and ForceVLA are pulling touch into robot learning [2] [3] [4]. NVIDIA's reference humanoid including tactile five-finger hands is therefore not cosmetic; it points to the real bottleneck [5].
References
- Roland Johansson and J. Randall Flanagan (2009). Coding and Use of Tactile Signals from the Fingertips in Object Manipulation Tasks. Nature Reviews Neuroscience.
- Wenzhen Yuan et al. (2017). GelSight: High-Resolution Robot Tactile Sensors. Sensors.
- Mike Lambeta et al. (2020). DIGIT. arXiv.
- Raunaq Bhirangi et al. (2021). ReSkin. arXiv.
- NVIDIA (2026). NVIDIA Announces Isaac GR00T Reference Humanoid Robot. NVIDIA Investor Relations.