antonia bronars (@BronarsToni) - Twitter Profili

Sabitlenmiş Tweet

Introducing TEXterity ~ our method for simultaneous tactile estimation and control for extrinsic dexterity 👉TEXterity tackles precise in-hand manipulation with simple grippers, for tasks like assembly and tool-use 🍎website (videos + paper) : sites.google.com/view/texterity 🧵👇

English

2

13

68

15.6K

antonia bronars retweetledi

Siddharth Ancha@siddancha·3d

A very thoughtful study on selecting the right controller gains for robot learning (BC, RL, Sim2Real). And an AMAZING website: younghyopark.me/tune-to-learn

Younghyo Park@younghyo_park

What's different between these two BC policies? It's the same architecture, training budget, and data collection setup — the only difference is the controller gains! Controller gains are an understudied design parameter in robot learning. In our new work (w/ @BronarsToni*, @pulkitology), we show how they act as an inductive bias across BC, RL, and Sim2Real transfer, with real consequences on performance. Here's what we found 🧵 * Equal Contribution 📄arxiv: arxiv.org/abs/2604.02523 🔗website: younghyopark.me/tune-to-learn/

English

0

9

35

5.4K

antonia bronars retweetledi

Chonghyuk (ND) Song@ndsong95·3d

Relevant for anyone working on "videos as policies". Also the interactive visualizations on their project website are absolutely fire.

Younghyo Park@younghyo_park

[10/n] Broader implication 2 Learning from human videos or wearables is a promising direction. These paradigms, however, often treat "future state" as its pseudo action label — implicitly assuming perfect tracking, which is effectively a stiff-controller assumption. If our findings generalize, rethinking this assumption could unlock even more of their potential.

English

0

2

9

1.3K

antonia bronars retweetledi

Pulkit Agrawal@pulkitology·3d

PD Controller gains matter, and they behave much differently in robot learning compared with classical control. Learn more 👇

Younghyo Park@younghyo_park

What's different between these two BC policies? It's the same architecture, training budget, and data collection setup — the only difference is the controller gains! Controller gains are an understudied design parameter in robot learning. In our new work (w/ @BronarsToni*, @pulkitology), we show how they act as an inductive bias across BC, RL, and Sim2Real transfer, with real consequences on performance. Here's what we found 🧵 * Equal Contribution 📄arxiv: arxiv.org/abs/2604.02523 🔗website: younghyopark.me/tune-to-learn/

English

0

7

81

9.6K

antonia bronars retweetledi

Abhishek Gupta@abhishekunique7·3d

So good :) Great to see a deep dive into the often understudied parts of the robot learning stack from @younghyo_park @BronarsToni and @pulkitology!

Younghyo Park@younghyo_park

What's different between these two BC policies? It's the same architecture, training budget, and data collection setup — the only difference is the controller gains! Controller gains are an understudied design parameter in robot learning. In our new work (w/ @BronarsToni*, @pulkitology), we show how they act as an inductive bias across BC, RL, and Sim2Real transfer, with real consequences on performance. Here's what we found 🧵 * Equal Contribution 📄arxiv: arxiv.org/abs/2604.02523 🔗website: younghyopark.me/tune-to-learn/

English

0

4

21

3.7K

antonia bronars retweetledi

Cheng Chi@chichengcc·3d

The spatial segregation of learning vs classical robotics expertise often sorts people into camps by where they went to school. But the growing intersection of the two is where the real-world performance bottleneck lives. The people who globally optimize the entire system will win in the end. Kudos to @younghyo_park and @BronarsToni for pushing past the false dichotomy that dominates academic framing.

Younghyo Park@younghyo_park

What's different between these two BC policies? It's the same architecture, training budget, and data collection setup — the only difference is the controller gains! Controller gains are an understudied design parameter in robot learning. In our new work (w/ @BronarsToni*, @pulkitology), we show how they act as an inductive bias across BC, RL, and Sim2Real transfer, with real consequences on performance. Here's what we found 🧵 * Equal Contribution 📄arxiv: arxiv.org/abs/2604.02523 🔗website: younghyopark.me/tune-to-learn/

English

1

11

133

14.2K

antonia bronars@BronarsToni·3d

Check out our new work exploring the impact of controller gains on robot learning — this widely used but often overlooked design parameter can have real consequences on performance!

Younghyo Park@younghyo_park

What's different between these two BC policies? It's the same architecture, training budget, and data collection setup — the only difference is the controller gains! Controller gains are an understudied design parameter in robot learning. In our new work (w/ @BronarsToni*, @pulkitology), we show how they act as an inductive bias across BC, RL, and Sim2Real transfer, with real consequences on performance. Here's what we found 🧵 * Equal Contribution 📄arxiv: arxiv.org/abs/2604.02523 🔗website: younghyopark.me/tune-to-learn/

English

0

4

21

1.4K

antonia bronars retweetledi

Jyo Pari@jyo_pari·5 Eyl

For agents to improve over time, they can’t afford to forget what they’ve already mastered. We found that supervised fine-tuning forgets more than RL when training on a new task! Want to find out why? 👇

English

19

157

938

168.4K

antonia bronars retweetledi

Jyo Pari@jyo_pari·13 Haz

What if an LLM could update its own weights? Meet SEAL🦭: a framework where LLMs generate their own training data (self-edits) to update their weights in response to new inputs. Self-editing is learned via RL, using the updated model’s downstream performance as reward.

English

132

510

3.2K

664.2K

antonia bronars retweetledi

Jyo Pari@jyo_pari·6 Nis

Llama 4 (@Meta) shows too much SFT limits RL exploration — something we also found in our recent work! A new and superior pretraining paradigm is around the corner to unleash a new era of reasoning. Check out our paper: arxiv.org/abs/2502.19402 Thread: x.com/pulkitology/st…

Pulkit Agrawal@pulkitology

Overturning the next-token prediction is required for achieving general reasoning! We predict that RPT (Reward Pre-Training) will overtake GPT in the future -- similar to how AlphaZero overtook AlphaGo. Learn more: arxiv.org/pdf/2502.19402 🚨Our whitepaper, “General Reasoning Requires Learning to Reason from the Get-Go” challenges the idea that simply making models bigger and feeding them more data is enough for robust, adaptable reasoning. ⚡️We argue that models should be trained for iterative reasoning from scratch while separating knowledge (i.e., facts) from reasoning!

English

1

22

105

18.1K

antonia bronars@BronarsToni·3 Nis

@ohtake_i She is so beautiful! Congrats Kei!

English

1

0

70

Kei Ota@ohtake_i·1 Nis

Officially promoted to “Dad.” Starting my biggest adventure yet. 父になりました！

日本語

10

1

83

4.2K

antonia bronars retweetledi

Younghyo Park@younghyo_park·5 Kas

Collect robot demos from anywhere through AR! Excited to introduce 🎯DART, Dexterous AR Teleoperation interface enabling anyone to teleoperate robots in cloud-hosted simulation. With DART, anyone can collect robot demos anywhere, anytime, for multiple robots and tasks in one sitting. Every data is automatically logged on our open-sourced cloud database DexHub for public use. dexhub.ai/project 🧵[1/n]

English

4

45

228

27.2K

antonia bronars retweetledi

Kei Ota@ohtake_i·30 Haz

Excited to share that our paper on Autonomous Gear Assembly has been accepted to #IROS2024! 🥳 This work was done during my stay at @merl_news with the amazing team: @devesh_dkj, @RomeresDiego, Siddarth Jain, Bill Yerazunis, Radu Corcodel, Yash Shukla, and @BronarsToni!

Kei Ota@ohtake_i

Our demo video showcasing autonomous robotic assembly has been published! We have created a closed-loop system that exhibits robustness to failure and empowers the robot to autonomously assemble a gearbox from any initial condition. youtube.com/watch?v=cZ9M1D…

English

1

3

19

2.9K

antonia bronars retweetledi

Pulkit Agrawal@pulkitology·14 Mar

This is going to be the year of hands and dexterous manipulation :) A sneak-peak into what's been cooking in our lab:

English

3

18

247

22.9K

antonia bronars retweetledi

Younghyo Park@younghyo_park·9 Mar

🥽 Want to use your new Apple Vision Pro to control your robot? Want to record how you navigate / manipulate the world to train a policy? I developed an app for VisionOS that can stream your head / wrist / finger movements over WiFi, which you can subscribe on any machines using a simple python library. 🧵[1/6]

English

32

208

1.1K

214K

antonia bronars retweetledi

Marcel Torné@marceltornev·7 Mar

How can we train robust policies with minimal human effort?🤖 We propose RialTo, a system that robustifies imitation learning policies from 15 real-world demonstrations using on-the-fly reconstructed simulations of the real world. (1/9)🧵 Project website: real-to-sim-to-real.github.io/RialTo/

English

2

41

154

35.5K

antonia bronars retweetledi

Devesh Jha@devesh_dkj·7 Mar

Imagine that the next time you receive a Christmas gift, you have a robot to assemble it together. We have been working towards creating dexterous robots which can do this without use of complex hands or expensive fixtures. youtu.be/cZ9M1DQ23OI?si… via @YouTube

YouTube

English

1

3

11

4.6K

antonia bronars@BronarsToni·1 Şub

TEXterity is accepted to ICRA 24! Looking forward to learning about everyone's exciting work, and sharing ours, soon :) see you in Japan!

antonia bronars@BronarsToni

Introducing TEXterity ~ our method for simultaneous tactile estimation and control for extrinsic dexterity 👉TEXterity tackles precise in-hand manipulation with simple grippers, for tasks like assembly and tool-use 🍎website (videos + paper) : sites.google.com/view/texterity 🧵👇

English

1

2

10

1.7K

antonia bronars@BronarsToni·22 Oca

[7/7] This work builds off several of our previous contributions, give them a look as well! Tac2Pose: arxiv.org/abs/2204.11701 (IJRR version), arxiv.org/abs/2012.05205 (CoRL version) Simultaneous Tactile Estimation and Control: arxiv.org/abs/2303.03385

English

0

2

221

antonia bronars@BronarsToni·22 Oca

[6/7] We also demonstrate that it is possible to the achieve high-tolerance insertion after in-hand manipulation! After a reorientation step, we insert objects into cavities with 0.5mm and 1mm of clearance.

English

1

0

2

360

antonia bronars@BronarsToni·22 Oca

Introducing TEXterity ~ our method for simultaneous tactile estimation and control for extrinsic dexterity 👉TEXterity tackles precise in-hand manipulation with simple grippers, for tasks like assembly and tool-use 🍎website (videos + paper) : sites.google.com/view/texterity 🧵👇

English

2

13

68

15.6K

antonia bronars

Keşfet