Xuanlin Li (Simon)

88 posts

Xuanlin Li (Simon)

Xuanlin Li (Simon)

@XuanlinLi2

Robotics, Vision Language, Embodied AI @sudo_robotics | Prev. @HaoSuLabUCSD @berkeley_ai

Beigetreten Mayıs 2021
434 Folgt1.2K Follower
Angehefteter Tweet
Xuanlin Li (Simon)
Xuanlin Li (Simon)@XuanlinLi2·
Scalable, reproducible, and reliable robotic evaluation remains an open challenge, especially in the age of generalist robot foundation models. Can *simulation* effectively predict *real-world* robot policy performance & behavior? Presenting SIMPLER!👇 simpler-env.github.io
English
4
24
135
86.3K
Yunzhu Li
Yunzhu Li@YunzhuLiYZ·
I want to call out one of our most important references: SIMPLER (simpler-env.github.io), by @XuanlinLi2, @kylehkhsu, @Jiayuan_Gu, @jiajunwu_cs, @haosu_twitr, @QuanVng, @xiao_ted, and colleagues, which laid the foundation for using simulation for policy evaluation through a systematic study of appearance and dynamics alignment and metrics for measuring sim–real correlation. (It took many nights of @kaiwynd and @shashuo0104's grinding for that correlation to appear — and once it did, extending to new tasks worked like a charm!)
Yunzhu Li@YunzhuLiYZ

📢 Announcing one of the most exciting works from us this year on **scalable robot policy evaluation through real-to-sim transfer**, moving toward a scalable evaluation engine with structured world models that capture the appearance, geometry, and dynamics of environments involving deformable objects. 🤖 Evaluation remains one of the biggest bottlenecks in building general-purpose robots. Today, robots are still evaluated only in the real world, which is **orders of magnitude slower** than the development of language agents. We propose a new framework where simulation performance **strongly correlates** with the real world (r > 0.9), even for deformable objects. The key difference from existing work lies in the correlation between simulation and reality: if a robot model performs better in the digital world, does it also perform better in the real world? This question has long made people hesitant about simulation-based evaluation — especially for deformable objects. We are changing that. Our pipeline achieves effective real-to-sim transfer, establishing **state-of-the-art correlation** between simulation and reality for deformable object manipulation. It provides a **scalable and reproducible evaluation engine** for robot learning. 🌐 real2sim-eval.github.io

English
5
11
96
11.8K
Xuanlin Li (Simon) retweetet
Shresth Grover
Shresth Grover@shroglc·
VLA models often forget their pretrained knowledge during action training, hurting generalization. 🤖Our framework unifies action & VLM training to preserve strong pretrained representations & maintain versatility, boosting generalization & robustness. gen-vla.github.io
English
1
7
38
4.9K
Xuanlin Li (Simon) retweetet
Apurva Badithela
Apurva Badithela@ApurvaBadithela·
Robotic manipulation has seen tremendous progress in recent years but rigorous evaluation of robot policies remains a challenge! We present our work: "Reliable and Scalable Robot Policy Evaluation with Imperfect Simulators"! 🧵
English
2
14
71
14.9K
Xuanlin Li (Simon) retweetet
Chris Rockwell
Chris Rockwell@_crockwell·
Ever wish YouTube had 3D labels? 🚀Introducing🎥DynPose-100K🎥, an Internet-scale collection of diverse videos annotated with camera pose! Applications include camera-controlled video generation🤩and learned dynamic pose estimation😯 Download: huggingface.co/datasets/nvidi…
English
2
38
177
43K
Xuanlin Li (Simon) retweetet
Allen Ren
Allen Ren@allenzren·
HNY! Lately I took a crack at implementing the pi0 model from @physical_int PaliGemma VLM (2.3B fine-tuned) + 0.3B "action expert" MoE + block attention Flow matching w/ action chunking Strong eval on Simpler w/ 75ms inference github.com/allenzren/open… ckpts available! 👇(1/6)
English
17
54
390
39.8K
Xuanlin Li (Simon) retweetet
Kuan Fang
Kuan Fang@KuanFang·
How can robots learn bimanual, contact-rich manipulation tasks when scaling up expert demonstrations is so challenging? With GLIDE, we synthesize diverse, high-quality demonstrations via efficient motion planning in simulation and train a task-conditioned diffusion policy to imitate these motions from observed point clouds. GLIDE enables two 7-DoF robot arms to manipulate objects of varying appearances and properties through rich arm-object contact. w/ @XuanlinLi2, Tong Zhao, @zhu_xinghao, Jiuguang Wang, Tao Pang
Xuanlin Li (Simon)@XuanlinLi2

Learning bimanual, contact-rich robot manipulation policies that generalize over diverse objects has long been a challenge. Excited to share our work: Planning-Guided Diffusion Policy Learning for Generalizable Contact-Rich Bimanual Manipulation! glide-manip.github.io 🧵1/n

English
0
5
49
5.4K
Xuanlin Li (Simon)
Xuanlin Li (Simon)@XuanlinLi2·
@init_malachi Policy output actions are diffused with Gaussian noise; policy latent serves as conditioning for the denoising process.
English
0
0
0
32
M
M@init_malachi·
@XuanlinLi2 so the policy latents are defused with gaussian noise? or the policy outputs
English
1
0
0
5
Xuanlin Li (Simon)
Xuanlin Li (Simon)@XuanlinLi2·
Learning bimanual, contact-rich robot manipulation policies that generalize over diverse objects has long been a challenge. Excited to share our work: Planning-Guided Diffusion Policy Learning for Generalizable Contact-Rich Bimanual Manipulation! glide-manip.github.io 🧵1/n
English
1
19
74
20.9K
Leroy Wang
Leroy Wang@LiruiWang1·
Some personal update: I defended my thesis last month after several wonderful years at MIT. Here is my defense talk: youtube.com/watch?v=i5BRVn…. Just joined OpenAI in SF and enjoyed the weather so far. Happy to grab coffee with folks in the Bay Area!
YouTube video
YouTube
English
17
3
142
14.4K
Xuanlin Li (Simon) retweetet
Hao Su
Hao Su@haosu_twitr·
Check out the full demo of Hillbot Alpha's latest progress! Our robot successfully stocks shelves over a **long horizon** in the real-world, while adapting smoothly to diverse objects, positions, and human interference.
Hillbot@Hillbot_AI

Check out the full demonstration of Hillbot Alpha, our autonomous mobile manipulation robot performing long-horizon shelf stocking, trained using #sim2real technology. #AGI #EmbodiedAI #AI #ArtificialIntelligence #Simulation #robots #autonomousrobots #computervision

English
2
4
57
6K
Xuanlin Li (Simon) retweetet
Stone Tao
Stone Tao@Stone_Tao·
just made it possible to evaluate generalist robotics models like Octo at 60-100x real world evaluation speeds via gpu simulation and rendering (~10x faster than original cpu sim code). All videos below are from our open source ManiSkill GPU sim!
English
6
12
168
13.5K