Tom Silver

440 posts

@tomssilver

Assistant Professor @Princeton. Developing robots that plan and learn to help people.

Princeton, NJ · Joined October 2011
375 Following · 3.3K Followers
Tom Silver @tomssilver
This week's #PaperILike is "QuestBench: Can LLMs ask the right question to acquire information in reasoning tasks?" (Li, Kim, & Wang, NeurIPS 2025). Beautiful paper & increasingly important as agents start to ask more Qs. Curious how SOTA models do. PDF: arxiv.org/abs/2503.22674
Tom Silver @tomssilver
This week's #PaperILike is "Classical Planning in Deep Latent Space: Bridging the Subsymbolic-Symbolic Boundary" (Asai & Fukunaga, AAAI 2018). One of the key papers that got me hooked on learning + planning before I started grad school. PDF: arxiv.org/abs/1705.00154
Tom Silver @tomssilver
I've noticed a feature of LLM writing; curious if it has a name. On advanced topics, LLMs connect two true statements with a "because" / "but" / "therefore" etc. -- but the two statements are not actually related. Hallucinonsequiturs?
Tom Silver @tomssilver
This week's #PaperILike is "Human-Guided Complexity-Controlled Abstractions" (Peng et al., NeurIPS 2023). Selecting the right levels and kinds of abstractions remains important and open for many forms of human-AI / human-robot interaction. PDF: arxiv.org/abs/2310.17550
Tom Silver retweeted
Anirudha Majumdar @Majumdar_Ani
Last July, I started my role as founding co-director of the Princeton Robotics Initiative (w/ Aimy Wissa)! We kicked things off with an inaugural symposium on our vision of humanity-driven robotics. Amazing to see the excitement, with 300+ attendees and 80+ posters! 🐯🤖
Tom Silver @tomssilver
This week's #PaperILike is "HG-DAgger: Interactive Imitation Learning with Human Experts" (Kelly et al., 2019). This would definitely be high on my list of "papers to read if you want to understand what robot foundation model startups are doing." PDF: arxiv.org/abs/1810.02890
Tom Silver @tomssilver
@vnhartmann @saxenavaibhav11 Ah, multiple robots in the same environment, understood. That also is something we have discussed. I can't promise we'll have it soon, but it is on the roadmap :)
Valentin Hartmann @vnhartmann
@tomssilver @saxenavaibhav11 Ah nice! I was thinking more like this, to force some people a bit on how we could move on from demos/how we use demos for more than two arms ;)
Tom Silver @tomssilver
As a planning+learning researcher, I’m really excited about KinDER. It clarifies planning (especially TAMP) for outsiders, defines key open challenges for the field, and creates a common ground to compare & combine planning+learning approaches. (1/n)
Yixuan Huang @YixuanHuang13

Meet KinDER, a stress test for robot physical reasoning. All 13 methods failed 😈
🌎 25 environments
♾️ Infinite tasks
🏋️ Gymnasium API
⚒️ Over 20 parameterized skills
🪧 Human demonstrations
📊 13 baselines (planning and learning)
From @Princeton @CMU_Robotics @ICatGT @CambridgeMLG @nvidia @MIT_CSAIL 🧵 1/n

Tom Silver retweeted
Bowen Li @Bw_Li1024
Building autonomous robots that learn to reason and plan in the physical world is a long-standing problem. We are excited to release KinDER, a large task suite and benchmark to bring different communities (TAMP, VLA, RL, etc.) together for this challenge! Welcome to try it out!
Tom Silver retweeted
Utkarsh Mishra @utkarshm0410
KinDER is finally released 🎉 A step forward in defining what physical reasoning is and how to evaluate it, through many kinematic and dynamic manipulation challenges. See @YixuanHuang13's post to learn more! See you at RSS!
Tom Silver retweeted
Vaibhav Saxena @saxenavaibhav11
There's so much infra that went into this project - tasks in MuJoCo/PyBullet/Pymunk, Parameterized Skills, LLM/VLM Planners, MPC, PPO, SAC, DP, VLAs, and more. Kudos to the entire team for bringing this together! See you all at RSS :)
Yixuan Huang @YixuanHuang13

@Princeton @CMU_Robotics @ICatGT KinDER introduces five core physical reasoning challenges in dynamic & kinematic environments, both in 2D and 3D. These are bottlenecks for state-of-the-art robot planning and manipulation. Progress on KinDER will mean real progress for robotics. 🧵 2/n

Tom Silver retweeted
Danfei Xu @danfei_xu
Long-horizon physical reasoning used to be the specialty of TAMP. KinDER is a new sim benchmark that horizontally compares across paradigms, from VLA to PDDL bilevel planning to RL. If you care about hard physical reasoning tasks, give KinDER a try! To appear at RSS 2026.
Tom Silver @tomssilver
@vnhartmann Thanks very much! Yes, in fact @saxenavaibhav11 has already added a Rainbow RB-Y1 behind the scenes :) Good to know that this is of interest.
Tom Silver retweeted
Nishanth Kumar @nishanthkumar23
As robot learning starts to mature, it’s important that we develop rigorous benchmarks to provide a North Star for progress and also compare a very wide variety of approaches. I think KinDER accomplishes both of these aims and more, and I’m excited to hopefully see fast progress on these tasks! 🤖