Meet BFM-Zero: A Promptable Humanoid Behavioral Foundation Model w/ Unsupervised RL👉 lecar-lab.github.io/BFM-Zero…
🧩ONE latent space for ALL tasks
⚡Zero-shot goal reaching, tracking, and reward optimization (any reward at test time), from ONE policy
🤖Natural recovery & transition
How it works 👉 lecar-lab.github.io/BFM-Zero…
🧠 Unsupervised RL — no task-specific rewards during training
🔁 Forward–Backward Representation — builds a dynamics-aware latent space (minimal sketch 👇)
🎯 Zero-shot Inference — reach goals, track motions, or optimize any reward function at test time without retraining
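For readers who want the mechanics: here is a minimal PyTorch sketch of the two pieces every prompt in this thread reuses — a backward map B and a single z-conditioned policy. Layer sizes, the sphere normalization, and the names `BackwardMap` / `LatentConditionedPolicy` are my assumptions in the spirit of the FB literature (e.g., Motivo), not BFM-Zero's exact architecture.

```python
import torch
import torch.nn as nn

STATE_DIM, ACTION_DIM, LATENT_DIM = 64, 19, 256  # hypothetical sizes, not the paper's

class BackwardMap(nn.Module):
    """B(s): embeds a state into the shared latent 'skill' space."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(STATE_DIM, 512), nn.ReLU(),
                                 nn.Linear(512, LATENT_DIM))

    def forward(self, s):
        z = self.net(s)
        # FB methods typically keep z on a sphere of radius sqrt(d); assumed here too
        return z * (LATENT_DIM ** 0.5) / z.norm(dim=-1, keepdim=True)

class LatentConditionedPolicy(nn.Module):
    """pi(a | s, z): the ONE policy reused for every downstream prompt."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(STATE_DIM + LATENT_DIM, 512), nn.ReLU(),
                                 nn.Linear(512, ACTION_DIM), nn.Tanh())

    def forward(self, s, z):
        return self.net(torch.cat([s, z], dim=-1))

backward_map, policy = BackwardMap(), LatentConditionedPolicy()
```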
Goal Reaching: z = B(s_g) (🤔B can be viewed as an "inverse dynamics" map from a desired state to the latent skill needed to reach it; sketch 👇) lecar-lab.github.io/BFM-Zero…
See how natural it can be 👇:
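In code, goal reaching is one forward pass through B followed by conditioning the policy — a sketch reusing the hypothetical modules above, with placeholder state values:

```python
s_g = torch.randn(1, STATE_DIM)  # desired goal state (placeholder)
s_t = torch.randn(1, STATE_DIM)  # current robot state (placeholder)

z = backward_map(s_g)            # z = B(s_g): the "inverse dynamics" read of the goal
action = policy(s_t, z)          # the same policy, now steering toward s_g
```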
Reward Optimization: z = \sum_i B(s_i)r(s_i) lecar-lab.github.io/BFM-Zero…
We do not use any labeled rewards in training!
At test time, users can prompt with ANY reward defined over the robot's states, and the policy outputs the optimized skill zero-shot, without retraining (sketch 👇).
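Concretely, the "prompt" can be any function of the state; the latent is a reward-weighted average of B over states from the unlabeled training buffer. A sketch under the same assumptions as above — the sum in the formula becomes a Monte-Carlo mean (only a scale change before re-normalization), and the torso-height reward is a made-up example:

```python
def infer_reward_latent(states, reward_fn, backward_map):
    """z = sum_i B(s_i) r(s_i), estimated on a batch of unlabeled states."""
    with torch.no_grad():
        B_s = backward_map(states)             # (N, d)
        r = reward_fn(states).unsqueeze(-1)    # (N, 1): any reward, defined at test time
        z = (B_s * r).mean(dim=0)              # Monte-Carlo estimate of E[B(s) r(s)]
    return z * (LATENT_DIM ** 0.5) / z.norm()  # back onto the latent sphere

buffer_states = torch.randn(4096, STATE_DIM)   # states from training, no reward labels
z = infer_reward_latent(buffer_states, lambda s: s[:, 2], backward_map)  # e.g. "keep torso high"
action = policy(s_t, z.unsqueeze(0))           # zero-shot, no retraining
```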
Motion Tracking: z = \sum_n \lambda_n B(s_{t+n}) (this can be viewed as a "moving horizon" (or MPC) version of goal reaching; sketch 👇) lecar-lab.github.io/BFM-Zero…
Diverse motion tracking + natural & smooth recovery even under unexpected falls🫣
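Under the same assumptions, the tracking latent is recomputed every step from the next few reference frames, MPC-style. The horizon length and a geometric decay for λ_n are my guesses for illustration; BFM-Zero's actual weighting may differ:

```python
def tracking_latent(ref_motion, t, backward_map, horizon=8, decay=0.9):
    """z_t = sum_n lambda_n B(s_{t+n}) over a receding window of the reference."""
    frames = ref_motion[t : t + horizon]              # upcoming reference states
    lam = decay ** torch.arange(len(frames)).float()  # assumed lambda_n schedule
    with torch.no_grad():
        z = (lam.unsqueeze(-1) * backward_map(frames)).sum(dim=0)
    return z * (LATENT_DIM ** 0.5) / z.norm()

ref_motion = torch.randn(500, STATE_DIM)      # a reference motion clip (placeholder)
for t in range(100):
    z_t = tracking_latent(ref_motion, t, backward_map)
    action = policy(s_t, z_t.unsqueeze(0))    # re-prompt the same policy every step
```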
Deeply grateful to @zhengyiluo — a dream collaboration😍; to the Motivo (metamotivo.metademolab.com) team at @AIatMeta for making this possible; to @TongheZhang01 @JimmyDai218776 @ElijahGalahad for the countless late nights; and to @GuanyaShi @teopir for the invaluable advice.