MolmoAct
$ 0

Multimodal VLA from the Molmo family adapted for robot action prediction; open weights with strong zero-shot generalisation
Out of stock
Brain Score
4Specifications and details:
| Nationality | US |
|---|---|
| Website | https://allenai.org |
| Model type | Foundation Model |
| Manufacturer | Allen Institute for AI (Ai2) |
| Release date | 2025 |
Description
MolmoAct is a foundation model that combines vision, language, and action into a single unified system. Instead of relying on predefined routines, it interprets visual and contextual information to plan movements step by step. Consequently, robots adapt quickly to new situations and unfamiliar environments. Moreover, its strong zero-shot generalization allows it to perform tasks it has not explicitly trained for, making it applicable across a wide range of real-world scenarios.
2026 Humanoid Robot Market Report
160 pages of exclusive insight from global robotics experts – uncover funding trends, technology challenges, leading manufacturers, supply chain shifts, and surveys and forecasts on future humanoid applications.

Featuring insights from
Aaron Saunders, Former CTO of
Boston Dynamics,
now Google DeepMind

2026 Humanoid Robot Market Report
160 pages of exclusive insight from global robotics experts – uncover funding trends, technology challenges, leading manufacturers, supply chain shifts, and surveys and forecasts on future humanoid applications.
In addition, MolmoAct emphasizes real-time responsiveness and smooth coordination. Therefore, robots adjust actions dynamically as conditions change, without needing complex reprogramming. Its multimodal design enables understanding of both context and intent, improving interaction with objects and environments. Because the model provides open weights, developers can refine and expand its capabilities over time. As a result, MolmoAct supports ongoing progress toward general-purpose and adaptable robotic intelligence.
Contact Humanoid.guide
Website: https://allenai.org





