Introducing GEN-0, our latest 10B+ foundation model for robots ⏱️ built on Harmonic Reasoning, new architecture that can think & act seamlessly 📈 strong scaling laws: more pretraining & model size = better 🌍 unprecedented corpus of 270,000+ hrs of dexterous data Read more 👇

Nov 4, 2025 · 4:13 PM UTC

Surpassing the intelligence threshold 🧠 in the high-data regime for robotics, we observe a phase transition ⚡️ at 7B params where smaller models exhibit ossification under data overload, while larger ones continue to improve and can quickly adapt to new tasks.
2
9
7
158
Robotics at Generalist is no longer limited by data. GEN-0 is pretrained on our growing dataset of over 270,000 hrs of real, diverse manipulation data (10,000 hrs/week & accelerating). 1,000s of data collection devices/robots in homes, warehouses, and workplaces worldwide🏠🏭💼
We’ve developed a new approach to training models, Harmonic Reasoning, which creates a "harmonic" interplay between asynchronous, continuous-time streams of sensing and acting tokens. ⚙️🎵 Watch GEN-0 pack a camera. 🤖📸
GEN-0 models exhibit strong scaling laws, in which more pretraining data and compute consistently (and predictably) improve downstream post-training performance of the model across many tasks. 🔬📈🧠
2
13
9
105
📱Packing the phone into the box: here GEN-0 is tasked with gently packing a phone (including its accessory box) into a snug-fit retail box. Slotting in the phone, inserting the accessory box, flipping, and closing the lid, with all tight fits.
📦🧠🤖🔄 Packing unseen objects - this evaluates GEN-0's generalization and “physical commonsense": reasoning over the highly varied geometry and physics of new objects, nudging and reorienting, packing in differently depending on shape, pushing objects around to make space.
Cross-Embodiment – GEN-0 architecture works on different robots by design. We have tested our models on various 6DoF, 7DoF, and 16+DoF semi-humanoid robots.
1
1
59
0
To scale GEN-0 capabilities, we are constructing the largest and most diverse real-world manipulation dataset ever built 🌍 including every manipulation task humans can think of – from peeling potatoes, to threading bolts – spanning homes, bakeries, warehouses, factories, & more
Read more in our blog post, including early notes from large-scale ablations on our pretraining data. Blog: generalistai.com/blog/nov-04…
1
6
70
Replying to @GeneralistAI
Are y’all planning on incorporating hands for more complex assembly or just focusing on grippers?
4
Replying to @GeneralistAI
Who needs feet, anyway?
Replying to @GeneralistAI
Amazing work! Where'd you source 270k hours of dexterous data?
5
Replying to @GeneralistAI
Robots just got their Ivy League upgrade. GEN-0 sounds like the kid who aced both calculus and karate. With 270,000 hours of data, it probably folds laundry with perfect choreography. At this rate, the term “hands-on learning” will soon apply more to machines than humans.
Replying to @GeneralistAI
Huge leap forward for robotics 🤖
Replying to @GeneralistAI
only up from here
Replying to @GeneralistAI
impressive, can't wait for demos!
Replying to @GeneralistAI
wow, 270k hours is a lot, what is ratio sim/real?
1
4
Replying to @GeneralistAI
Whoa, GEN-0 sounds like the robot version of Tony Stark! 🤖✨ Can't wait to see it in action!
Replying to @GeneralistAI
The 7B phase transition is critical - it's showing us the minimum viable intelligence threshold for embodied learning. Below 7B: models exhibit ossification under diverse sensorimotor data. The weights can't absorb new patterns without catastrophic forgetting. Above 7B: continued scaling improvements. This is the first time we're seeing LLM-style scaling laws validated for robotics foundation models with this level of rigor. But Harmonic Reasoning is the real innovation here. It solves a fundamental problem: language models can "pause to think" because digital time is discrete. Robots can't - physics doesn't stop. Solution: asynchronous, continuous-time streams of sensing and acting tokens. The model doesn't do "perceive → reason → act" sequentially. It harmonizes all three in parallel, like how humans process visual input while simultaneously planning and executing motor commands. This is architecturally different from traditional VLA models that still operate in discrete time steps. GEN-0 treats sensorimotor control as a continuous process, enabling: - Sub-100ms reactive reflexes - Predictive control (act on expected state, not just current) - Multi-timescale reasoning (strategic + reactive simultaneously) 270K+ hours of dexterous data + strong scaling laws + continuous control architecture = the first deployment-ready foundation model for generalist robotics. Mann!! If this holds at scale, sequential action models will look as outdated as batch processing in a streaming world.
Replying to @GeneralistAI
10B parameters of motion — the moment reasoning meets embodiment, robotics stops being mechanical and starts being cognitive.
Replying to @GeneralistAI
impressive work! any plans to open source this, or at least provide an api/sdk to train and evaluate the policy? how can the community independently verify the scaling/ossification and power-law claims beyond the blog demos ; e.g. public checkpoints, eval suites, or third-party benchmarks?
1
Replying to @GeneralistAI
Even collecting this much data in China would cost between $2 million and $3 million. It's important to note that data costs in China are among the lowest in the world.
Replying to @GeneralistAI
These demos are, as always from Generalist, incredible. Would love to see more detail on Harmonic Reasoning since that seems like your alternative to PI's real time action chunking. That seems significant enough of an achievement that it warrants a writeup of its own!
6
Replying to @GeneralistAI
Congratulations on this exciting milestone!
1
1
Replying to @GeneralistAI
Impressive on paper, now prove it in a messy kitchen
7
Replying to @GeneralistAI
Amazing results and very promising scaling laws!
9
Replying to @GeneralistAI
Fascinating
Replying to @GeneralistAI
270k+ hours of dexterous data is wild. Getting that much high-quality, multimodal training data must’ve been a huge undertaking. Curious how it generalizes across different robot platforms.
Replying to @GeneralistAI
will models trained on this data scale to other embodiments? for example if you add more DOFs to the gripper or make it more anthropomorphic?
Replying to @GeneralistAI
Ginormous dex data. Quality > quantity. How well do the results translate from sim to real-world non-clinical environment?
Replying to @GeneralistAI
holy crap. huge if true is everyone else cooked?
Replying to @GeneralistAI
awesome results!
1
Replying to @GeneralistAI
Amazing advances for robotics- congrats to the team!
2
Made in the USA NDAA Compliant AI NVIDIA Flight Controllers. Ready to scale your autonomous drone production.
3
7
3
95