Tom Silver

8

92

6.7K

Tom Silver@tomssilver·17 May

I've noticed a feature of LLM writing; curious if it has a name. On advanced topics, LLMs connect two true statements with a "because" / "but" / "therefore" etc. -- but the two statements are not actually related. Hallucinonsequiturs?

English

4

1

19

2.2K

Tom Silver@tomssilver·10 May

This week's #PaperILike is "Human-Guided Complexity-Controlled Abstractions" (Peng et al., NeurIPS 2023). Selecting the right levels and kinds of abstractions remains important and open for many forms of human-AI / human-robot interaction. PDF: arxiv.org/abs/2310.17550

English

3

25

1.9K

Tom Silver retweetledi

Nishanth Kumar@nishanthkumar23·8 May

x.com/i/article/2052…

ZXX

3

14

66

12.3K

Tom Silver retweetledi

Anirudha Majumdar@Majumdar_Ani·4 May

Last July, I started my role as founding co-director of the Princeton Robotics Initiative (w/ Aimy Wissa)! We kicked things off with an inaugural symposium on our vision of humanity-driven robotics. Amazing to see the excitement, with 300+ attendees and 80+ posters! 🐯🤖

English

4

71

3.5K

Tom Silver@tomssilver·3 May

This week's #PaperILike is "HG-DAgger: Interactive Imitation Learning with Human Experts" (Kelly et al., 2019). This would definitely be high on my list of "papers to read if you want to understand what robot foundation model startups are doing." PDF: arxiv.org/abs/1810.02890

English

3

8

74

4.7K

Tom Silver@tomssilver·1 May

@vnhartmann @saxenavaibhav11 Ah, multiple robots in the same environment, understood. That also is something we have discussed. I can't promise we'll have it soon, but it is on the roadmap :)

English

0

1

27

Valentin Hartmann@vnhartmann·1 May

@tomssilver @saxenavaibhav11 Ah nice! I was thinking more like this, to force some people a bit on how we could move on from demos/how we use demos for more than two arms ;)

English

0

13

Tom Silver@tomssilver·30 Nis

As a planning+learning researcher, I’m really excited about KinDER. It clarifies planning (especially TAMP) for outsiders, defines key open challenges for the field, and creates a common ground to compare & combined planning+learning approaches. (1/n)

Meet KinDER — a stress test for robot physical reasoning. All 13 methods failed 😈 🌎 25 environments ♾️ Infinite tasks 🏋️ Gymnasium API ⚒️ Over 20 parameterized skills 🪧 Human demonstrations 📊 13 baselines (planning and learning) From @Princeton @CMU_Robotics @ICatGT @CambridgeMLG @nvidia @MIT_CSAIL 🧵 1/n

English

2

11

76

7.1K

Tom Silver retweetledi

Bowen Li@Bw_Li1024·1 May

Building autonomous robots that learn to reason and plan in the physical world is a long-standing problem. We are excited to release KinDER, a large task suite and benchmark to bring different communities (TAMP, VLA, RL, etc.) together for this challenge! Welcome to try it out!

Meet KinDER — a stress test for robot physical reasoning. All 13 methods failed 😈 🌎 25 environments ♾️ Infinite tasks 🏋️ Gymnasium API ⚒️ Over 20 parameterized skills 🪧 Human demonstrations 📊 13 baselines (planning and learning) From @Princeton @CMU_Robotics @ICatGT @CambridgeMLG @nvidia @MIT_CSAIL 🧵 1/n

English

6

25

3.7K

Tom Silver retweetledi

Utkarsh Mishra@utkarshm0410·30 Nis

KinDER is finally released 🎉 A step forward in defining what physical reasoning is and how to evaluate it, through many kinematic and dynamic manipulation challenges. See @YixuanHuang13 post to learn more! See you at RSS!

Meet KinDER — a stress test for robot physical reasoning. All 13 methods failed 😈 🌎 25 environments ♾️ Infinite tasks 🏋️ Gymnasium API ⚒️ Over 20 parameterized skills 🪧 Human demonstrations 📊 13 baselines (planning and learning) From @Princeton @CMU_Robotics @ICatGT @CambridgeMLG @nvidia @MIT_CSAIL 🧵 1/n

English

7

13

1.7K

Tom Silver retweetledi

Vaibhav Saxena@saxenavaibhav11·30 Nis

There's so much infra that went into this project - tasks in MuJoCo/PyBullet/Pymunk, Parameterized Skills, LLM/VLM Planners, MPC, PPO, SAC, DP, VLAs, and more. Kudos to the entire team for bringing this together! See you all at RSS :)

@Princeton @CMU_Robotics @ICatGT KinDER introduces five core physical reasoning challenges in dynamic & kinematic environments, both in 2D and 3D. These are bottlenecks for state-of-the-art robot planning and manipulation. Progress on KinDER will mean real progress for robotics. 🧵 2/n

English

4

19

1.9K

Tom Silver retweetledi

Danfei Xu@danfei_xu·30 Nis

Long-horizon physical reasoning used to be the specialty of TAMP. KinDER is a new sim benchmark that horizontally compares across paradigms, from VLA to PDDL bilevel planning to RL. If you care about hard physical reasoning tasks, give KinDER a try! To appear at RSS 2026.

Meet KinDER — a stress test for robot physical reasoning. All 13 methods failed 😈 🌎 25 environments ♾️ Infinite tasks 🏋️ Gymnasium API ⚒️ Over 20 parameterized skills 🪧 Human demonstrations 📊 13 baselines (planning and learning) From @Princeton @CMU_Robotics @ICatGT @CambridgeMLG @nvidia @MIT_CSAIL 🧵 1/n

English

8

70

9.9K

Tom Silver@tomssilver·1 May

@vnhartmann Thanks very much! Yes, in fact @saxenavaibhav11 has already added a Rainbow RB-Y1 behind the scenes :) Good to know that this is of interest.

English

0

2

36

Valentin Hartmann@vnhartmann·1 May

@tomssilver Extremely cool work! Want to add multiple robots to make it harder?

English

0

1

15

Tom Silver retweetledi

Nishanth Kumar@nishanthkumar23·30 Nis

As robot learning starts to mature, it’s important that we develop rigorous benchmarks to provide a North Star for progress and also compare a very wide variety of approaches. I think KinDER accomplishes both of these aims and more, and I’m excited to hopefully see fast progress on these tasks! 🤖

Meet KinDER — a stress test for robot physical reasoning. All 13 methods failed 😈 🌎 25 environments ♾️ Infinite tasks 🏋️ Gymnasium API ⚒️ Over 20 parameterized skills 🪧 Human demonstrations 📊 13 baselines (planning and learning) From @Princeton @CMU_Robotics @ICatGT @CambridgeMLG @nvidia @MIT_CSAIL 🧵 1/n

English

4

26

2.2K

Tom Silver@tomssilver·30 Nis

I also want to give a massive shout out to all the authors who put in an extraordinary engineering effort, especially the leaders @YixuanHuang13, @Bw_Li1024 , and @saxenavaibhav11 👏 (n/n)

English

4

183

Tom Silver@tomssilver·30 Nis

Check out these environments and others (25 total) in the “KinDERGarden”: prpl-group.com/kinder-site And look out for KinDER at RSS 2026! arxiv.org/abs/2604.25788

English