Dandan Shan

121 posts

Dandan Shan banner
Dandan Shan

Dandan Shan

@DandanShan_

Founding Research Scientist at Assured Robot Intelligence (ARI), PhD @UMichCSE, Ex-intern @NVIDIAAI, @AdobeResearch

Menlo Park, CA Katılım Ocak 2019
1.5K Takip Edilen1.5K Takipçiler
Sabitlenmiş Tweet
Dandan Shan
Dandan Shan@DandanShan_·
Want a better model to reconstruct 3D hands🖐️ from RGB images? Don't miss our latest work and fresh release, HaMeR, on Hand Mesh Recovery!!! Code and models are released. More video demos are on the project page!
Georgios Pavlakos@geopavlakos

We just released HaMeR, our latest work for Hand Mesh Recovery! We reconstruct hands in 3D from a single image with big improvements in accuracy and robustness. Code and models are available. Also colab and Hugging Face demo! geopavlakos.github.io/hamer/

English
2
3
28
7.2K
Dandan Shan
Dandan Shan@DandanShan_·
It's been super fun working with the team on exciting problems. The journey ahead is even more thrilling @Meta 🤖🚀 Let's go!
Xiaolong Wang@xiaolonw

Excited to share that Assured Robot Intelligence (ARI) has joined @Meta to help build the future of humanoid intelligence! When we started ARI one year ago, our mission was clear: achieve physical AGI. Through deep customer engagements and real-world deployments, it became clear to us that serving the massive opportunity ahead requires training a truly general-purpose physical agent. We believe this agent will be humanoid — and that scaling will come from learning directly from human experience, not teleoperation alone. Meta’s ecosystem brings together the key components needed to make this vision possible. We will be joining Meta Superintelligence Labs (MSL) to help bring personal superintelligence into the physical world. We are incredibly grateful to the brilliant minds, robotics researchers, engineers, partners, and supporters who have worked with us on this journey. Thank you to our investors and angels, led by @aixventureshq , for believing in our mission. This is just the beginning.

English
2
1
59
4.5K
Dandan Shan retweetledi
Xuxin Cheng
Xuxin Cheng@xuxin_cheng·
Excited to share that ARI (Assured Robot Intelligence) is joining @Meta! When we co-founded ARI a year ago, the mission was clear: build humanoid intelligence for the real world. Joining Meta Superintelligence Labs (MSL), we'll continue advancing frontier robotics models toward physical superintelligence in the physical world. Huge thanks to my co-founders, the incredible ARI team, and our investors led by @aixventureshq for backing this from day one. This is just the beginning.
Bloomberg@business

Meta Platforms Inc. has acquired Assured Robot Intelligence, a startup developing artificial intelligence models for robots, as part of a major initiative to build humanoid technology. bloomberg.com/news/articles/…

English
26
13
256
19.8K
Dandan Shan retweetledi
Lerrel Pinto
Lerrel Pinto@LerrelPinto·
ARI is joining @Meta! Over the past year, we have been building ARI (Assured Robot Intelligence) with the mission to build industry-grade physical AI for humanoids. The ARI stack is built on human experience, condensed into actionable tokens that can be rapidly adapted to real-world hardware. But the most rewarding part of ARI has been the people. I feel truly blessed to have worked alongside some of the world's best roboticists, a top-notch investor pool led by @aixventureshq, and the many supporters pushing for us behind the scenes. Starting next week, ARI will join the Meta Superintelligence Labs (MSL) to continue advancing frontier robotics models that advance personal superintelligence in the physical world. We have the potential to transform AI that can think and talk to AI that can do, assisting humans safely and reliably in the physical world. To the many people behind the scenes who supported us: Thank you! This is just the beginning. More in the Bloomberg article:
Bloomberg@business

Meta Platforms Inc. has acquired Assured Robot Intelligence, a startup developing artificial intelligence models for robots, as part of a major initiative to build humanoid technology. bloomberg.com/news/articles/…

English
36
38
369
54K
Dandan Shan retweetledi
Yufei Ye
Yufei Ye@yufei_ye·
How can human data help train robots? Can we control digital avatars like robots? How to make robots behave more like humans? Our CVPR 2026 workshop Agents in Interactions: From Humans to Robots is back this summer. Contribute to this exciting direction, submit your work by May 8
Yufei Ye tweet media
English
1
8
40
3.6K
Dandan Shan retweetledi
Homanga Bharadhwaj
Homanga Bharadhwaj@mangahomanga·
We're thrilled to organize the 2nd Workshop on Agents in Interactions: From Humans to Robots! Submit your best work by May 8 and join us at CVPR in Denver to discuss research in this exciting space w/ @yufei_ye @DandanShan_ @jiaman01 @xiaolonw Alan Yuille
Homanga Bharadhwaj tweet media
English
0
6
27
2.5K
Dandan Shan
Dandan Shan@DandanShan_·
🚨 Call for Papers! Deadline: May 8! The 2nd Workshop on Agents in Interactions: From Humans to Robots at #CVPR2026. We invite submissions on understanding and learning interactions involving humans and robots. w/ @yufei_ye @mangahomanga @jiaman01 @xiaolonw Alan Yuille
Dandan Shan tweet media
English
0
1
8
721
Dandan Shan retweetledi
Irmak Guzey
Irmak Guzey@irmakkguzey·
Learning from human data requires human-like hardware. Humans use their wrists constantly, but table-top manipulators lack this flexibility. We build upon RUKA and introduce RUKA-v2: a tendon-driven hand with a 2-DOF wrist and finger abduction/adduction 👋✌️
English
7
29
116
8.5K
Dandan Shan retweetledi
Jeff Cui
Jeff Cui@jeffacce·
We don't need the name of an object to pick it up; we simply need to know where it is and what it looks like. Introducing Contact-Anchored Policies (CAPs): instead of language, we explicitly condition on contacts. Our policy learns object pickup with only 16 hours of data! 🧵
English
5
28
109
13K
Dandan Shan retweetledi
Mahi Shafiullah 🏠🤖
Mahi Shafiullah 🏠🤖@notmahi·
Best ideas are often the simplest in hindsight. Meet Contact-Anchored Policies (CAP)🧢: by conditioning policies on physical contact (vs language) we achieve env & embodiment generalization with super low resources. This policy ⬇️ learned to pick from scratch w/ 16 hrs of data 🧵
English
7
31
173
16.6K
Dandan Shan retweetledi
Boyi Li
Boyi Li@Boyiliee·
Introducing FoundationMotion. A large-scale, video-derived motion annotation dataset & auto-labeling pipeline + advanced models for motion understanding. Fully open-source: code, datasets, and models, free to use and build on. Understanding motion is core to physical reasoning, yet today’s leading models still struggle with simple spatial actions like “turn right” or “move up” or “flip the toast” - mainly due to the lack of large, fine-grained motion datasets. We present FoundationMotion, a fully automated pipeline that: • detects & tracks objects in videos • extracts trajectories • uses LLMs + frames to generate rich motion captions & QA pairs → creating large-scale, high-quality motion datasets at scale. After fine-tuning the open-source models Qwen and NVILA on our annotations, these models now outperform the closed-source Gemini-3-Flash and GPT-5.1 on spatial understanding tasks across autonomous driving, robotics, and everyday scenarios. 📜Paper: arxiv.org/abs/2512.10927 🌐Webpage: yulugan.com/projects/Found… 💻 Code: github.com/Wolfv0/Foundat… 🕸️Model: huggingface.co/WoWolf/models 📊 Dataset: huggingface.co/datasets/WoWol… 👉 Interactive Demo: huggingface.co/spaces/yulu2/F… Let’s move research forward together. FoundationMotion is also referred to as Wolf V2 🐺, the second chapter in the Wolf series: wolfv0.github.io.
English
3
75
444
66.1K
Dandan Shan retweetledi
Irmak Guzey
Irmak Guzey@irmakkguzey·
Dexterous manipulation by directly observing humans - a dream in AI for decades - is hard due to visual and embodiment gaps. With simple yet powerful hardware - Aria 2 glasses 👓 - and our new work AINA 🪞, we are now one significant step closer to achieving this dream.
English
7
33
145
34.3K
Dandan Shan retweetledi
Raunaq Bhirangi
Raunaq Bhirangi@Raunaqmb·
When @anyazorin and @irmakkguzey open-sourced the RUKA Hand (a low-cost robotic hand) earlier this year, people kept asking us how to get one. Open hardware isn’t as easy to share as code. So we’re releasing an off-the-shelf RUKA, in collaboration with @WowRobo and @zhazhali01.
English
14
42
252
48.2K
Dandan Shan retweetledi
Homanga Bharadhwaj
Homanga Bharadhwaj@mangahomanga·
I'll be joining the faculty @JohnsHopkins late next year as a tenure-track assistant professor in @JHUCompSci Looking for PhD students to join me tackling fun problems in robot manipulation, learning from human data, understanding+predicting physical interactions, and beyond!
Homanga Bharadhwaj tweet mediaHomanga Bharadhwaj tweet mediaHomanga Bharadhwaj tweet media
English
87
114
852
131.4K
Dandan Shan retweetledi
NVIDIA
NVIDIA@nvidia·
✨ We were honored to deliver some of the very first NVIDIA DGX Sparks to AI Pioneer @ylecun and AI Researcher, @soumithchintala, from @Meta and @NYUniversity in NYC. “Every PhD student in AI should have one of these,” said Yann. We couldn’t agree more. We are anticipating great things from their visionary AI research. Learn more: nvda.ws/42EGKOM #SparkSomethingBig 💫
NVIDIA tweet mediaNVIDIA tweet mediaNVIDIA tweet mediaNVIDIA tweet media
English
70
167
1.1K
313.6K
Dandan Shan retweetledi
Lerrel Pinto
Lerrel Pinto@LerrelPinto·
I gave a Early Career talk at CoRL 2025 in Seoul last week, where I talked about my observations from the past decade in robot learning along with where the field is headed for the next decade. In summary, the future of robot learning needs: (1) Data beyond teleop: We are never going to reach the scale of LLM / VLM data by tele-operating robots. Need to leverage consumer hardware already in people's hands (e.g. iPhones) and emerging devices (e.g. Smartglasses). (2) Observations beyond vision: The hard problem in robotics is dexterity. Dexterity is all about moving objects intricately through contact. The sense of touch is critical for this. Vision can help you acquire objects, but anything more complex will need touch. (3) Reasoning beyond reactivity: The biggest wins in robot learning have been in reactive policies (both manipulation and locomotion). But the class of models that got us here are generally feed-forward nets. Long-horizon reasoning needs the ability to predict future outcomes and manipulate them. Currently unclear what the right scalable architectures are here, but we are working on it. (thanks @zacinaction for the pic!)
Lerrel Pinto tweet media
English
7
19
201
16.3K
Dandan Shan retweetledi
Linyi Jin
Linyi Jin@jin_linyi·
Hello! If you are interested in dynamic 3D or 4D, don't miss the oral session 3A at 9 am on Saturday: @zhengqi_li will be presenting "MegaSaM" I'll be presenting "Stereo4D" and @QianqianWang5 will be presenting "CUT3R"
English
1
6
37
1.6K