Tobias Springenberg (@jtspringenberg) - Twitter Profili

The “on the job improvement” plot thickens!

We developed an RL method for fine-tuning our models for precise tasks in just a few hours or even minutes. Instead of training the whole model, we add an “RL token” output to π-0.6, our latest model, which is used by a tiny actor and critic to learn quickly with RL.

English

0

7

198

Tobias Springenberg retweetledi

Marcel Torné@marceltornev·4 Mar

We equipped PI policies with memory! And taught our robots to do long-horizon real world tasks such as preparing the items for a recipe, cooking a grilled cheese and cleaning the kitchen!

Physical Intelligence@physical_int

We’ve developed a memory system for our models that provides both short-term visual memory and long-term semantic memory. Our approach allows us to train robots to perform long and complex tasks, like cleaning up a kitchen or preparing a grilled cheese sandwich from scratch 👇

English

7

15

83

8.3K

Tobias Springenberg retweetledi

Physical Intelligence@physical_int·17 Ara

We discovered an emergent property of VLAs like π0/π0.5/π0.6: as we scale up pre-training, the model learns to align human videos and robot data! This gives us a simple way to leverage human videos. Once π0.5 knows how to control robots, it can naturally learn from human video.

English

80

345

2.7K

1.2M

Tobias Springenberg retweetledi

Kay - Liyiming Ke@xkelym·18 Kas

Need to make RL great again 😉

Physical Intelligence@physical_int

Our model can now learn from its own experience with RL! Our new π*0.6 model can more than double throughput over a base model trained without RL, and can perform real-world tasks: making espresso drinks, folding diverse laundry, and assembling boxes. More in the thread below.

English

6

8

111

17.4K

Tobias Springenberg retweetledi

Paul Zhou@zhiyuan_zhou_·18 Kas

Very excited to finally share what I’ve been up to @physical_int for the past 6 months: developing advantage-conditioned VLAs! We are finally moving beyond imitating teleop data, and towards improving models with suboptimal deployment data using scalable real-world RL. 👇🧵

English

8

28

319

41K

Tobias Springenberg retweetledi

Suraj Nair@SurajNair_1·18 Kas

Robots 🤝 their own experience

Physical Intelligence@physical_int

Our model can now learn from its own experience with RL! Our new π*0.6 model can more than double throughput over a base model trained without RL, and can perform real-world tasks: making espresso drinks, folding diverse laundry, and assembling boxes. More in the thread below.

English

6

89

17K

Tobias Springenberg retweetledi

Laura Smith@smithlaura1028·18 Kas

Excited to share what we've been brewing at PI! We’re working on making robots more helpful by making them faster and more reliable through real-world practice, even on delicate behaviors like carrying this very full latte cup

English

92

22

217

27.7K

Tobias Springenberg@jtspringenberg·18 Kas

This was a long standing dream of mine. We trained a robot to master tasks with real relevance all from tasty real experience!

Physical Intelligence@physical_int

Our model can now learn from its own experience with RL! Our new π*0.6 model can more than double throughput over a base model trained without RL, and can perform real-world tasks: making espresso drinks, folding diverse laundry, and assembling boxes. More in the thread below.

English

2

9

40

10.7K

Tobias Springenberg

Keşfet