
Philipp Wu
@philippswu
PhD @Berkeley_AI advised by @pabbeel. Previously @MetaAI @covariantai.

Really excited to release mjviser, a web-based MuJoCo viewer, powered by Viser. It has almost all the features of the native MuJoCo viewer, but runs in your browser. Load and simulate any MuJoCo model with a single uv command 👇
uvx mjviser

Humans can see in high-res, high-FPS in real time. Why can't VLMs? Introducing AutoGaze: ViTs/VLMs "gaze" only at key video regions! Up to 4-100x token savings, a 19x speedup, and scaling to 4K-res, 1K-frame videos.
📄 arxiv.org/abs/2603.12254
🌐 autogaze.github.io
🤗 huggingface.co/collections/bf…
(1/n)🧵
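AutoGaze's actual mechanism isn't shown in the post; the token-savings idea can still be illustrated with a generic top-k token-selection sketch. The function name, the saliency scores, and the keep ratio below are all illustrative assumptions, not the paper's method:

```python
import numpy as np

def select_tokens(tokens, scores, keep_ratio=0.25):
    # tokens: (N, D) patch embeddings; scores: (N,) saliency per token.
    # Keep only the highest-scoring fraction. Sorting the surviving
    # indices preserves the original token order, so positional
    # structure stays intact for the downstream transformer.
    n_keep = max(1, int(len(tokens) * keep_ratio))
    idx = np.sort(np.argsort(scores)[-n_keep:])
    return tokens[idx], idx

# Toy example: 16 patch tokens of dim 8, keep the top quarter (4 tokens),
# a 4x token saving before any attention is computed.
rng = np.random.default_rng(0)
tokens = rng.standard_normal((16, 8))
scores = rng.standard_normal(16)
kept, idx = select_tokens(tokens, scores)
```

Because self-attention cost is quadratic in token count, a 4x reduction here would cut attention FLOPs roughly 16x, which is the lever behind this family of speedups.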



Introducing G1 Moves! 60 open-source motion capture clips + trained RL policies for the Unitree G1 humanoid robot. Come see live robot mocap and interactive roasts at the Dell booth at #GTC this week! huggingface.co/spaces/exptech… #DellProPrecision #DellTech #NVIDIA #Robotics

Coming soon to mjlab: heterogeneous worlds, aka every world gets its own object 👀




FPO++! We got RL on flow policies working on real robot tasks. Sim2real on humanoids trained from scratch + manipulation finetuning in sim with action chunking. Excited about this direction because we can now use RL with expressive policies to discover new behaviors!

Just shipped a major domain randomization overhaul in mjlab and I'm super excited about it! The biggest highlight is physically consistent inertia randomization. Mass, center of mass, and the inertia tensor now vary together through a pseudo-inertia parameterization, so every sample corresponds to a real rigid body. If you randomize those fields independently, you can end up with models that look fine numerically but are physically impossible. This fixes that.

You can also randomize geom sizes at runtime. In C MuJoCo this breaks the collision tree, but MuJoCo Warp does not rely on a static BVH, so we recompute the collision bounds after each size change and keep things consistent. Link lengths, link angles, geom offsets, and site poses are safe to randomize now too. mujocolab.github.io/mjlab/main/sou…
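The post doesn't include code, but the idea can be sketched with NumPy: a rigid body's mass, first mass moment, and second moment pack into a 4x4 pseudo-inertia matrix that is positive definite exactly when the parameters are physically realizable, so perturbing its Cholesky factor and multiplying back always yields a valid body. This is a minimal sketch of that standard construction, not mjlab's API; all names and the noise scale are mine:

```python
import numpy as np

def pseudo_inertia(m, com, I):
    # Pack mass m, COM, and rotational inertia I (about the body origin)
    # into the 4x4 pseudo-inertia matrix J = [[Sigma, h], [h^T, m]].
    Sigma = 0.5 * np.trace(I) * np.eye(3) - I  # second moment matrix
    h = m * np.asarray(com, dtype=float)       # first mass moment
    J = np.zeros((4, 4))
    J[:3, :3] = Sigma
    J[:3, 3] = h
    J[3, :3] = h
    J[3, 3] = m
    return J

def extract_params(J):
    # Invert the packing: recover (m, com, I) from a pseudo-inertia matrix.
    m = J[3, 3]
    com = J[:3, 3] / m
    Sigma = J[:3, :3]
    I = np.trace(Sigma) * np.eye(3) - Sigma
    return m, com, I

def randomize(m, com, I, scale=0.05, rng=None):
    # Perturb in Cholesky-factor space: J' = L' L'^T is positive definite
    # by construction, so every sample is a physically realizable rigid
    # body -- mass, COM, and inertia vary together, never independently.
    rng = rng or np.random.default_rng()
    L = np.linalg.cholesky(pseudo_inertia(m, com, I))
    L_new = np.tril(L * (1.0 + scale * rng.standard_normal(L.shape)))
    return extract_params(L_new @ L_new.T)
```

Randomizing mass, COM, and inertia tensor as separate scalar ranges can violate the positive-definiteness constraint this matrix encodes, which is exactly the "numerically fine but physically impossible" failure mode the post describes.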

VLAs (from VLMs) ❌ => WAMs (from Video Models) ✅
Why WAMs?
1️⃣ World Physics: VLMs know the internet, but Video Models implicitly model the physical laws essential for manipulation.
2️⃣ The "GPT Direction": VLAs are like BERT (they rely heavily on task-specific post-training). WAMs are like GPT (pre-train & prompt), unlocking incredible zero-shot transfer!
What I want to see in 2026:
📈 Scaling Laws: We will see much clearer scaling laws for robotics compared to VLAs.
🤝 Human-to-Robot Transfer: Unlocking massive transfer capabilities using video as a shared representation space.
🤖 Zero-Shot Mastery: Moving from short-horizon tasks to long-horizon, dexterous manipulation without task-specific demonstrations.
We recently open-sourced the checkpoints, training, and inference code. Dive into the research! 👇
📄 Paper: arxiv.org/abs/2602.15922
💻 Code: github.com/dreamzero0/dre…
🤗 HF: huggingface.co/GEAR-Dreams/Dr…

Introducing Mesh

Some exciting Friday news 🙂 We just open-sourced our system identification toolbox in MuJoCo 3.5. Get started today:
pip install "mujoco[sysid]"
mjlab v1.1 is also out, featuring a brand-new RGB-D renderer, and is now fully available on PyPI. Install with:
pip install mjlab




[Accepted to ICRA 2026!] 🚀 Introducing EgoMI: An egocentric manipulation interface that captures synchronized 6-DoF head and hand trajectories from egocentric human demonstrations! Transfers to IL policies zero-shot w/o visual augmentation or on-embodiment data. 1/n
