Jianlan Luo

136 posts

Jianlan Luo

@jianlanluo

@berkeley_ai @Theteamatx, PhD from @UCBerkeley

Berkeley, CA Katılım Ocak 2013

89 Takip Edilen1.6K Takipçiler

Jianlan Luo@jianlanluo·30 Nis

Website: finch.agibot.com/research/lwd Paper: finch-static.agibot.com/LWD/lwd-paper.…

English

1.4K

Jianlan Luo@jianlanluo·30 Nis

Technically, LWD combines distributional implicit value learning from heterogeneous fleet data and adjoint matching for policy extraction in flow-based VLAs. Across 8 real-world tasks, one generalist policy reaches 95% average success.

English

1.6K

Jianlan Luo@jianlanluo·30 Nis

Excited to share LWD: Learning While Deploying. Our robots learn while doing real tasks—restocking groceries, brewing Gongfu tea, making cocktails, making juice, and packing shoes. Deployment is no longer just evaluation; it becomes the training loop. 🧵

English

398

621.1K

Jianlan Luo@jianlanluo·6 Oca

The takeaway: scaling robots becomes a way to scale learning. SOP suggests a shift in how we build robot foundation models: not “pretrain → fine-tune → freeze,” but deploy → learn → redeploy—continuously. Project and paper: agibot.com/research/sop

English

1.6K

Jianlan Luo@jianlanluo·6 Oca

This enables continuous execution of tasks such as laundry folding and box assembly for over 36 hours without performance degradation.

English

2.8K

Jianlan Luo@jianlanluo·6 Oca

Generalist robots don’t fail due to a lack of generality. They fail due to a lack of proficiency where it matters. We introduce SOP, enabling generalist policies to improve from real-world experience across distributed robot fleets, without sacrificing generality. 🧵 agibot.com/research/sop

English

377

1.7M

Jianlan Luo@jianlanluo·1 Oca

Project website: geniereasoner.github.io/GenieReasoner/ ArXiv: arxiv.org/abs/2512.24125…

English

1.7K

Jianlan Luo@jianlanluo·1 Oca

The results: • Stronger embodied reasoning (via ERIQ benchmark) • Lower action reconstruction error than prior tokenizers • Better real-world manipulation than both discrete and continuous baselines

English

2.1K

Jianlan Luo@jianlanluo·1 Oca

One core bottleneck in VLA models is action representation. Discrete tokens scale beautifully with VLM pretraining—but lose precision. Continuous actions are precise—but often break VLM reasoning. In our new work, we resolve this tension at the representation level. 🧵

English

236

47.8K

Jianlan Luo@jianlanluo·30 Ara

Project page: act2goal.github.io Paper: arxiv.org/abs/2512.23541

English

470

Jianlan Luo@jianlanluo·30 Ara

On real robots, Act2Goal shows strong zero-shot generalization. With reward-free online adaptation (hindsight goal relabeling + lightweight LoRA finetuning), success rates on challenging OOD tasks improve from ~30% → ~90% within minutes of autonomous interaction.

English

532

Jianlan Luo@jianlanluo·30 Ara

Long-horizon visual goals remain surprisingly hard for robot manipulation. We introduce Act2Goal, a goal-conditioned policy that uses a visual world model to reason about progress toward a goal, and practice it autonomously in the real world.

English

95.1K

Keşfet

@elonmusk @BarackObama @taylorswift13 @cristiano @BillGates @NASA @nikifrancismediavine @katyperry