Utkarsh Mishra

101 posts


@utkarshm0410

Intern@Amazon FAR, Robotics PhD Student @GeorgiaTech || TRI Summer’24 || Robot Learning || IITR'21 || 🎸🎢🤖|| He/Him || views are mine.

Atlanta, GA · Joined July 2015
681 Following · 422 Followers
Pinned Tweet
Utkarsh Mishra
Utkarsh Mishra@utkarshm0410·
Our paper "Compositional Diffusion with Guided Search (CDGS)" is an Oral at #ICLR2026! Short-horizon Foundation Models + Compositional Generative Planning + Inference-time Search = CDGS for goal-conditioned long-horizon planning! More details: cdgsearch.github.io 🧵 below
2 replies · 25 reposts · 188 likes · 29K views
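The recipe in the tweet (short-horizon generative models composed into a long-horizon plan, with search spent at inference time) can be illustrated with a toy 1-D sketch. Everything below is invented for illustration: `propose_segment` stands in for a short-horizon generative model, and the greedy best-of-N selection stands in for guided search; this is not the actual CDGS algorithm.

```python
import random

def propose_segment(state, horizon=5):
    """Toy stand-in for a short-horizon generative model: proposes a
    short segment of states drifting from `state` in a random direction."""
    step = random.uniform(-1.0, 1.0)
    return [state + step * (i + 1) for i in range(horizon)]

def plan_to_goal(start, goal, n_segments=6, n_candidates=32, seed=0):
    """Greedy inference-time search: for each segment, sample several
    short-horizon proposals and keep the one ending closest to the goal."""
    random.seed(seed)
    plan, state = [start], start
    for _ in range(n_segments):
        candidates = [propose_segment(state) for _ in range(n_candidates)]
        best = min(candidates, key=lambda seg: abs(seg[-1] - goal))
        plan.extend(best)
        state = best[-1]
    return plan

plan = plan_to_goal(start=0.0, goal=10.0)
print(round(plan[-1], 2))  # ends close to the goal
```

No single segment can reach the goal, but chaining searched segments does: the long-horizon behavior comes from composition plus search, not from a long-horizon model.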
Utkarsh Mishra retweeted
Junfan Zhu 朱俊帆 ✈️ CVPR
🤖 Key takeaway from @danfei_xu & @TairanHe99 on #WhynotTV Podcast 5: Human Data, Robotics Interface, UMI, Teleop, EgoMimic, Structure v. Bitter Lesson

Robotics fails when interfaces across 5 coupled axes break: structure ↔ search, data ↔ policy, human ↔ embodiment, perception ↔ action, intent ↔ control. Scaling models alone won’t fix misaligned interfaces.

🧩 Structure vs. Bitter Lesson
Structure (e.g., TAMP) gives compositional generalization but creates ceilings. The real issue isn’t structure vs. scaling—it’s which inductive bias constrains search.
→ Shift: Generative TAMP turns planning into sampling (video models over trajectories), aligning more with scaling laws.

🧠 BC Is Not a Baseline — It’s the Regime
GAIL assumes interaction is cheap. Reality: interaction is scarce, demos are abundant.
→ RL advantage collapses.
→ Behavior Cloning (BC) wins in practice (e.g., spatial softmax + RNN for long context).
Inflection: DAgger → ALOHA (2023).
Hot take: RL may not be necessary for many robotics problems.

🧍‍♂️ Human Data Is a Different Interface, Not “Noisy Teleop”
Teleop is unstable (tiny controller changes → large behavior shifts).
If interfaces align:
→ human = another robot
→ human data becomes scalable supervision.
3-layer decomposition:
1️⃣ desired world change
2️⃣ embodiment → world interaction (learnable from video)
3️⃣ motor control (NOT learnable from video)
→ Layer 3 (control) is the real bottleneck.

👁️ Egocentric Beats Third-Person
3rd-person (YouTube): scale without alignment.
Egocentric: lower scale, high fidelity + aligned.
→ Robotics is fidelity-limited, not data-limited.
To treat humans as robots (BC-style): IMU + VIO + SLAM + hand pose → recover actions.
Open question: fundamental need, or patch for weak video models?
✋ UMI: Human–Robot Boundary Collapse
Human hand directly controls a gripper with state estimation
→ near-zero sim2real at end-effector
→ performance = fidelity × scalability
Future: human data ≈ robot data

⚙️ Hardware > Algorithms (Today)
Dexterity (5-finger vs. gripper) is NOT a learning problem.
→ bottleneck = actuator speed, bandwidth, control fidelity
Embodiment determines transfer ceiling.

📊 Data Hierarchy (non-obvious)
Ego video > hand pose > language > whole-body pose/force
Tactile = proxy (force is ground truth)
Audio/smell ≈ negligible
→ robots are force-exertion engines

📈 Scaling Law Is Real — but Misunderstood
~1M → ~100M hours needed
Includes incidental data (feet, elbows, daily noise)
→ video-heavy models + data commoditization

🧠 Inductive Bias = Long Context
Actions require history → BC = long-horizon modeling
System 1/2 likely emerge in latent action space, not language.

💥 Field-Level Corrections
“Language as robotics foundation” → wrong interface
Autonomous driving → collapsed into vision (not full-stack)
Robotics requires full-stack integration, not silos
Algorithms are overestimated; systems + hardware are underestimated

🐦 Still Far from General Intelligence
Current robots can overfit (RL), but cannot recombine skills (the “Betty the Crow” gap).

🚀 Robotics won’t be solved by bigger models. It emerges when we align:
→ human data + embodiment + control + long-context learning
Break any interface → system fails.
Intelligence is not trained — it emerges when interfaces finally align.

📺👉🏻 youtu.be/__P5yygfRRQ?si…

x.com/i/article/2050…

0 replies · 4 reposts · 24 likes · 7.5K views
Utkarsh Mishra retweeted
Bowen Li
Bowen Li@Bw_Li1024·
Building autonomous robots that learn to reason and plan in the physical world is a long-standing problem. We are excited to release KinDER, a large task suite and benchmark that brings different communities (TAMP, VLA, RL, etc.) together for this challenge! We welcome everyone to try it out!
Yixuan Huang@YixuanHuang13

Meet KinDER — a stress test for robot physical reasoning. All 13 methods failed 😈
🌎 25 environments
♾️ Infinite tasks
🏋️ Gymnasium API
⚒️ Over 20 parameterized skills
🪧 Human demonstrations
📊 13 baselines (planning and learning)
From @Princeton @CMU_Robotics @ICatGT @CambridgeMLG @nvidia @MIT_CSAIL 🧵 1/n

0 replies · 6 reposts · 24 likes · 3.5K views
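For readers unfamiliar with the Gymnasium API the benchmark advertises, here is a minimal sketch of the reset/step loop. The environment below is a toy stand-in written from scratch, not an actual KinDER environment; only the five-tuple step protocol reflects the real Gymnasium convention.

```python
import random

class ToyPushEnv:
    """Toy environment following the Gymnasium reset/step protocol:
    step() returns (obs, reward, terminated, truncated, info).
    The 1-D dynamics here are invented purely for illustration."""

    def __init__(self, goal=5, max_steps=50):
        self.goal, self.max_steps = goal, max_steps

    def reset(self, seed=None):
        random.seed(seed)
        self.pos, self.t = 0, 0
        return self.pos, {}          # (observation, info)

    def step(self, action):          # action: -1 or +1
        self.pos += action
        self.t += 1
        terminated = self.pos == self.goal
        truncated = self.t >= self.max_steps
        reward = 1.0 if terminated else 0.0
        return self.pos, reward, terminated, truncated, {}

env = ToyPushEnv()
obs, info = env.reset(seed=0)
done = False
while not done:
    action = 1 if obs < env.goal else -1   # trivial scripted "skill"
    obs, reward, terminated, truncated, info = env.step(action)
    done = terminated or truncated
print(obs, reward)  # 5 1.0
```

Any policy that speaks this interface, whether a TAMP planner, a VLA model, or an RL agent, can be dropped into the same loop, which is what makes a horizontal comparison across paradigms possible.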
Utkarsh Mishra retweeted
Danfei Xu
Danfei Xu@danfei_xu·
Long-horizon physical reasoning used to be the specialty of TAMP. KinDER is a new sim benchmark that horizontally compares across paradigms, from VLA to PDDL bilevel planning to RL. If you care about hard physical reasoning tasks, give KinDER a try! To appear at RSS 2026.
Yixuan Huang@YixuanHuang13


0 replies · 8 reposts · 69 likes · 8.9K views
Utkarsh Mishra retweeted
Vaibhav Saxena
Vaibhav Saxena@saxenavaibhav11·
Physical reasoning is core to robot learning, and what KinDER offers is a clean and robust way of testing it - whether you have a large pre-trained manipulation model, or you just want to see if your model can finetune to such data. More details 👇
Yixuan Huang@YixuanHuang13


0 replies · 2 reposts · 5 likes · 986 views
Utkarsh Mishra retweeted
Tom Silver
Tom Silver@tomssilver·
As a planning+learning researcher, I’m really excited about KinDER. It clarifies planning (especially TAMP) for outsiders, defines key open challenges for the field, and creates a common ground to compare & combine planning+learning approaches. (1/n)
Yixuan Huang@YixuanHuang13


2 replies · 11 reposts · 76 likes · 6.8K views
Utkarsh Mishra
Utkarsh Mishra@utkarshm0410·
KinDER is finally released 🎉 A step forward in defining what physical reasoning is and how to evaluate it, through many kinematic and dynamic manipulation challenges. See @YixuanHuang13 post to learn more! See you at RSS!
Yixuan Huang@YixuanHuang13


1 reply · 7 reposts · 13 likes · 1.7K views
Utkarsh Mishra retweeted
Danfei Xu
Danfei Xu@danfei_xu·
Gave a talk on Robot Learning from Human Data at Stanford. It was great to be back! Some opinionated points: 1. Human data collection capacity is outpacing the research. 2. We still don't have the "science" for scaling robot capability with human data. 3. We are far from being able to model naturalistic human behaviors. youtube.com/watch?v=NUtaN1…
1 reply · 21 reposts · 179 likes · 13.6K views
Utkarsh Mishra retweeted
Wei Guo
Wei Guo@WeiGuo01·
I’ll present two papers at ICLR and I’m happy to chat! (1) Proximal Diffusion Neural Sampler (Apr 23 morning, P3-#411) (2) Complexity Analysis of Normalizing Constant Estimation: from Jarzynski Equality to Annealed Importance Sampling and beyond (Apr 23 afternoon, P4-#4509)
Wei Guo@WeiGuo01

How does annealing help overcome multimodality? In our ICLR 2025 paper openreview.net/forum?id=P6IVI… and preprint arxiv.org/abs/2502.04575, we established the first complexity bounds for annealed sampling and normalizing constant (⇔ free energy) estimation under weak assumptions on the target!

0 replies · 8 reposts · 36 likes · 3.8K views
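The annealed importance sampling idea these papers analyze (Jarzynski-style log weights accumulated along a temperature path from an easy base to the target) can be sketched in a few lines. The target below is a toy unnormalized Gaussian chosen so the true constant is known; all tuning values are illustrative, not from the papers.

```python
import math
import random

def log_base(x):
    """Standard normal base: exact log density, and exact sampling."""
    return -0.5 * x * x - 0.5 * math.log(2.0 * math.pi)

def log_target(x):
    """Unnormalized target: Gaussian centered at 3, so the true
    normalizing constant is sqrt(2*pi), i.e. log Z ~= 0.919."""
    return -0.5 * (x - 3.0) ** 2

def ais_log_z(n_chains=500, n_temps=100, seed=0):
    """Annealed importance sampling along the geometric path
    pi_b ∝ base^(1-b) * target^b, accumulating log weights, with one
    random-walk Metropolis move per temperature."""
    random.seed(seed)
    betas = [k / n_temps for k in range(n_temps + 1)]
    total = 0.0
    for _ in range(n_chains):
        x = random.gauss(0.0, 1.0)              # draw from the base
        log_w = 0.0
        for b0, b1 in zip(betas, betas[1:]):
            # weight increment: density ratio along the geometric path
            log_w += (b1 - b0) * (log_target(x) - log_base(x))
            def log_pi(y, b=b1):                # intermediate density
                return (1.0 - b) * log_base(y) + b * log_target(y)
            prop = x + random.gauss(0.0, 0.5)   # Metropolis proposal
            if math.log(random.random()) < log_pi(prop) - log_pi(x):
                x = prop
        total += math.exp(log_w)
    return math.log(total / n_chains)

print(round(ais_log_z(), 2))  # close to log(sqrt(2*pi)) ~= 0.92
```

With annealing the chains cross between modes gradually instead of jumping from base to target in one importance-sampling step, which is exactly the regime the complexity bounds address.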
Utkarsh Mishra retweeted
Yongxin Chen
Yongxin Chen@YongxinChen1·
Check out our ICLR oral paper led by @utkarshm0410 and @davidhe137. We demonstrate the power of inference-time scaling in long-horizon planning tasks using only short-horizon generative models.
Utkarsh Mishra@utkarshm0410


5 replies · 3 reposts · 26 likes · 3.5K views
Utkarsh Mishra retweeted
Shun Iwase
Shun Iwase@s1wase·
VLA Foundry, the last project I was involved in at TRI, has finally been released! Beyond making it easy to try out different language and vision models, it also makes it simple to evaluate multiple tasks in a simulation environment built on Drake + Blender. Please give it a try!
Jean Mercat@MercatJean

Releasing VLA Foundry: an open-source framework that unifies LLM, VLM, and VLA training in a single codebase. End-to-end control from language pretraining to action-expert fine-tuning — no more stitching together incompatible repos.

0 replies · 17 reposts · 116 likes · 14.8K views
Utkarsh Mishra
Utkarsh Mishra@utkarshm0410·
@AnthonyZhang123 Wow, amazing find. Exactly! Our approach comes at this from the compositional angle. We believe that iterative resampling pushes the intermediate states to satisfy the data distributions of both overlapping modes in a more informed manner.
0 replies · 0 reposts · 3 likes · 133 views
Anthony Zhang
Anthony Zhang@AnthonyZhang123·
@utkarshm0410 super cool paper Utkarsh! the iterative resampling reminds me of this paper: arxiv.org/abs/2601.18577, it seems like CDGS’s constant noising and denoising also pushes the output closer to the data distribution
1 reply · 1 repost · 6 likes · 189 views
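The intuition in this exchange (iterative noising and denoising pulling samples toward high-density regions of the data distribution) can be seen in a toy 1-D sketch. The two-mode "denoiser" below is invented for illustration and stands in for a learned diffusion model; it is not CDGS itself.

```python
import math
import random

MODES = [-2.0, 2.0]  # two modes of a toy 1-D "data distribution"

def denoise(x):
    """Toy denoiser: one step toward the posterior mean under a two-mode
    Gaussian mixture (a stand-in for a learned diffusion model)."""
    weights = [math.exp(-0.5 * (x - m) ** 2) for m in MODES]
    mean = sum(w * m for w, m in zip(weights, MODES)) / sum(weights)
    return x + 0.5 * (mean - x)

def iterative_resample(x, rounds=30, noise=0.3, seed=0):
    """Alternate noising and denoising; the iterate is pushed into a
    high-density region (a mode) of the data distribution."""
    random.seed(seed)
    for _ in range(rounds):
        x = x + random.gauss(0.0, noise)  # noising step
        x = denoise(x)                    # denoising step
    return x

print(round(iterative_resample(1.0), 2))  # settles near the mode at +2
```

A starting point between the modes drifts into the nearest mode and stays there: repeated noising explores locally, while each denoising step contracts toward the data distribution, which is the mechanism the replies are pointing at.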
Utkarsh Mishra
Utkarsh Mishra@utkarshm0410·
Super excited for the Oral talk and visiting Brazil for the first time! Both @davidhe137 and I are attending #ICLR2026. We will be presenting on April 24:
Poster: 10 AM, Pavilion 3, P3-#1309
Oral: 4:27 PM, 201 A/B
Looking forward to catching up with everyone!
Utkarsh Mishra@utkarshm0410


1 reply · 2 reposts · 20 likes · 1.4K views
Utkarsh Mishra
Utkarsh Mishra@utkarshm0410·
9/10 We show that CDGS scales with more inference-time compute by expanding the search over wider sets of denoising paths and strengthening message passing between consecutive atomic segments. This leads to improved performance without any retraining.
1 reply · 1 repost · 4 likes · 276 views
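The claim that performance improves with more inference-time compute, without retraining, is at its core a best-of-N search effect. A toy sketch, with a random score standing in for the quality of one sampled denoising path (the scoring function is invented for illustration):

```python
import random

def sample_candidate_score():
    """Stand-in for scoring one sampled denoising path / candidate plan."""
    return random.uniform(0.0, 1.0)

def best_of_n(n, seed=0):
    """More inference-time compute = a wider search over candidates;
    the model itself is unchanged."""
    random.seed(seed)
    return max(sample_candidate_score() for _ in range(n))

for n in (1, 4, 16, 64):
    print(n, round(best_of_n(n), 3))
```

Because each call reseeds, the candidate stream for a larger n extends the smaller one, so the best score is non-decreasing as the search widens; no model parameters change, only the compute spent searching.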