Sabitlenmiş Tweet
Jenny Pan
15 posts

Jenny Pan retweetledi

Pushing online RL to the next level -- exposing RL Token from the π-0.6 model for online RL achieves superhuman performance with as little as 15 minutes of data.
Physical Intelligence@physical_int
We developed an RL method for fine-tuning our models for precise tasks in just a few hours or even minutes. Instead of training the whole model, we add an “RL token” output to π-0.6, our latest model, which is used by a tiny actor and critic to learn quickly with RL.
English
Jenny Pan retweetledi

We equipped PI policies with memory!
And taught our robots to do long-horizon real world tasks such as preparing the items for a recipe, cooking a grilled cheese and cleaning the kitchen!
Physical Intelligence@physical_int
We’ve developed a memory system for our models that provides both short-term visual memory and long-term semantic memory. Our approach allows us to train robots to perform long and complex tasks, like cleaning up a kitchen or preparing a grilled cheese sandwich from scratch 👇
English
Jenny Pan retweetledi

Most robot policies today still largely lack memory: they make all their decisions based on what they can see right now. MemER aims to change that by learning which frames are important; this lets it deal with tasks like object search. @ajaysridhar0, @jenpan_,
and @satviks107Sharma tell us about how to achieve this fundamental capability for long-horizon task execution.
Watch Episode #54 of RoboPapers with @micoolcho and @chris_j_paxton to learn more!
English
Jenny Pan retweetledi
Jenny Pan retweetledi

Work w/ @ajaysridhar0, @satviks107, and @chelseabfinn!
Paper: arxiv.org/pdf/2510.20328
Website: jen-pan.github.io/memer/
(5/5)
English



