Helix: A vision-language-action model for generalist humanoid control
Published on: 2025-07-12 23:30:54
Introducing Helix
We're introducing Helix, a generalist Vision-Language-Action (VLA) model that unifies perception, language understanding, and learned control to overcome multiple longstanding challenges in robotics. Helix is a series of firsts:
Full-upper-body control : Helix is the first VLA to output high-rate continuous control of the entire humanoid upper body, including wrists, torso, head, and individual fingers.
Multi-robot collaboration : Helix is the first VLA to operate simultaneously on two robots, enabling them to solve a shared, long-horizon manipulation task with items they have never seen before.
Pick up anything: Figure robots equipped with Helix can now pick up virtually any small household object, including thousands of items they have never encountered before, simply by following natural language prompts.
One neural network : Unlike prior approaches, Helix uses a single set of neural network weights to learn all behaviors—picking and placing items, using drawe
... Read full article.