Figure 02 is Figure AI's second-generation humanoid robot, one of the best-funded robotics startups in history ($2.6B raised from Microsoft, OpenAI, NVIDIA, and Jeff Bezos). Powered by Helix, Figure's proprietary vision-language-action AI model, Figure 02 understands natural language commands and executes complex manipulation tasks. Already deployed at BMW in Spartanburg, it contributed to the production of 30,000 vehicles. Figure AI secured $2.6 billion in funding from Microsoft, OpenAI, NVIDIA, Intel Capital, and Jeff Bezos, representing one of the largest pre-revenue fundraises in robotics history. The Helix vision-language-action model processes natural speech commands and translates them into real-time dexterous manipulation without hard-coded motion primitives. BMW's Spartanburg, South Carolina plant deployment announced in January 2024 made Figure 02 one of the first commercially operating general-purpose humanoids in automotive manufacturing.
Taken together, Figure 02 reads as a platform built around height of 1,68 m, weight of 60 kg, and dof of 40+, with Helix proprietary Vision-Language-Action AI model, OpenAI integration for natural language, and Demonstration and imitation learning supporting Automotive assembly at BMW, Warehouse order picking, and Home assistance. That makes the profile feel more grounded in how Figure AI, Inc. Sunnyvale, Californie, USA is positioning the robot for real operating environments rather than as a one-off demo.
In practical terms, these figures describe a robot optimized for Automotive assembly at BMW, Warehouse order picking, and Home assistance, while Helix proprietary Vision-Language-Action AI model, OpenAI integration for natural language, and Demonstration and imitation learning define the balance between mobility, perception, and manipulation. The specification set also helps explain the scale of tasks Figure 02 can realistically handle today.
Overall, the timeline shows how Figure 02 moved from research or early unveiling toward clearer operational intent, with each stage tightening the link between height of 1,68 m, weight of 60 kg, and dof of 40+ and the jobs it is expected to perform. It also shows how the project matured from concept validation into a more deployment-oriented platform.
Across these roles, Figure 02 is being framed less as a general-purpose android and more as a system that can repeatedly deliver value in Automotive assembly at BMW, Warehouse order picking, and Home assistance. Helix proprietary Vision-Language-Action AI model, OpenAI integration for natural language, and Demonstration and imitation learning are the pieces that make those scenarios believable, because they connect sensing, planning, and physical execution into one workflow.
The Figure humanoid robot features custom electromechanical actuators with improved torque density, a sensory suite including six RGB cameras with 60% wider field of view, palm cameras, and fingertip tactile sensors detecting forces as low as 3 grams.Figure AI announcement, Figure.ai/figure, Humanoid Press. It employs the proprietary Helix vision-language-action AI model and offers 30 degrees of freedom, enabling key capabilities such as 20kg payload lifting and
Taken together, this stack suggests a machine whose real advantage comes from how Helix proprietary Vision-Language-Action AI model, OpenAI integration for natural language, and Demonstration and imitation learning are coordinated around height of 1,68 m, weight of 60 kg, and dof of 40+. The result is a platform that can convert perception into stable motion and task execution with less operator intervention than a simpler scripted robot.
Universal instant learning mastery of any task from a single observation, onboard cognition rivaling human intelligence, bionic hands with cell-level tactile sensitivity, 72-hour autonomy without recharging, authentic emotional dialogue.
Figure 01 validated bipedal locomotion and basic manipulation. Figure 02 added onboard AI reasoning through the OpenAI partnership.
16 custom actuators, onboard vision-language AI, individual finger control, 20 kg payload, 5-hour battery, speech + visual processing.
Evolved into Figure 03 each generation dramatically improving autonomy, dexterity, and real-world task capability.
Together, these technologies show that Figure 02 depends on a layered architecture rather than one breakthrough component. Helix proprietary Vision-Language-Action AI model, OpenAI integration for natural language, and Demonstration and imitation learning provide the core capabilities, while the surrounding stack determines how well the robot can perceive context, stay stable, and complete tasks without fragile scripting.