Jianlan Luo

136 posts

Jianlan Luo

Jianlan Luo

@jianlanluo

@berkeley_ai @Theteamatx, PhD from @UCBerkeley

Berkeley, CA Katılım Ocak 2013
89 Takip Edilen1.6K Takipçiler
Jianlan Luo
Jianlan Luo@jianlanluo·
Technically, LWD combines distributional implicit value learning from heterogeneous fleet data and adjoint matching for policy extraction in flow-based VLAs. Across 8 real-world tasks, one generalist policy reaches 95% average success.
Jianlan Luo tweet media
English
1
1
8
1.6K
Jianlan Luo
Jianlan Luo@jianlanluo·
Excited to share LWD: Learning While Deploying. Our robots learn while doing real tasks—restocking groceries, brewing Gongfu tea, making cocktails, making juice, and packing shoes. Deployment is no longer just evaluation; it becomes the training loop. 🧵
English
7
38
398
621.1K
Jianlan Luo
Jianlan Luo@jianlanluo·
The takeaway: scaling robots becomes a way to scale learning. SOP suggests a shift in how we build robot foundation models: not “pretrain → fine-tune → freeze,” but deploy → learn → redeploy—continuously. Project and paper: agibot.com/research/sop
English
0
0
2
1.6K
Jianlan Luo
Jianlan Luo@jianlanluo·
This enables continuous execution of tasks such as laundry folding and box assembly for over 36 hours without performance degradation.
English
2
1
5
2.8K
Jianlan Luo
Jianlan Luo@jianlanluo·
Generalist robots don’t fail due to a lack of generality. They fail due to a lack of proficiency where it matters. We introduce SOP, enabling generalist policies to improve from real-world experience across distributed robot fleets, without sacrificing generality. 🧵 agibot.com/research/sop
English
17
39
377
1.7M
Jianlan Luo
Jianlan Luo@jianlanluo·
The results: • Stronger embodied reasoning (via ERIQ benchmark) • Lower action reconstruction error than prior tokenizers • Better real-world manipulation than both discrete and continuous baselines
Jianlan Luo tweet mediaJianlan Luo tweet mediaJianlan Luo tweet media
English
1
3
6
2.1K
Jianlan Luo
Jianlan Luo@jianlanluo·
One core bottleneck in VLA models is action representation. Discrete tokens scale beautifully with VLM pretraining—but lose precision. Continuous actions are precise—but often break VLM reasoning. In our new work, we resolve this tension at the representation level. 🧵
English
2
23
236
47.8K
Jianlan Luo
Jianlan Luo@jianlanluo·
On real robots, Act2Goal shows strong zero-shot generalization. With reward-free online adaptation (hindsight goal relabeling + lightweight LoRA finetuning), success rates on challenging OOD tasks improve from ~30% → ~90% within minutes of autonomous interaction.
English
1
0
0
532
Jianlan Luo
Jianlan Luo@jianlanluo·
Long-horizon visual goals remain surprisingly hard for robot manipulation. We introduce Act2Goal, a goal-conditioned policy that uses a visual world model to reason about progress toward a goal, and practice it autonomously in the real world.
English
1
5
37
95.1K