Roberto

174 posts

Roberto banner
Roberto

Roberto

@RobobertoMM

Assistant CS Professor at UT Austin. Former Stanford and TUBerlin. Researching at the intersection of vision, learning and robotics 🏳️‍🌈

Katılım Ağustos 2019
279 Takip Edilen2.4K Takipçiler
Roberto
Roberto@RobobertoMM·
How do we turn a decent generalist VLA into a great multitask specialist? RL can turn experience into one fine-tuned policy. The challenge is doing it continuously as new tasks arrive: without forgetting old ones or limiting future learning. Surprisingly simple. Check this out.
Jiaheng Hu@JiahengHu1

VLA models are capable generalists. But can they continually self-improve? Such Continual Reinforcement Learning (CRL) problems are traditionally considered very challenging. Surprisingly, we found that with the right setup, the simplest CRL recipe can work really well! arxiv.org/abs/2603.11653

English
0
0
8
661
Roberto retweetledi
Roberto retweetledi
Chengshu Li
Chengshu Li@ChengshuEricLi·
We are excited to release MoMaGen, a data generation method for multi-step bimanual mobile manipulation. MoMaGen turns 1 human-teleoped robot trajectory into 1000s of generated trajectories automatically.🚀 Website: momagen.github.io arXiv: arxiv.org/abs/2510.18316
English
1
36
166
47.9K
Roberto retweetledi
Roberto retweetledi
naveen manwani
naveen manwani@NaveenManwani17·
🚨CoRL 2025 Best Poster Award 🏆 Paper Alert 🚨 ➡️Paper Title: Mash, Spread, Slice! Learning to Manipulate Object States via Visual Spatial Progress 🌟Few pointers from the paper 🎯Most robot manipulation focuses on changing the kinematic state of objects: picking, placing, opening, or rotating them. 🎯However, a wide range of real-world manipulation tasks involve a different class of object state change—such as mashing, spreading, or slicing—where the object’s physical and visual state evolve progressively without necessarily changing its position. 🎯Authors of this paper presented “SPARTA”, the first unified framework for the family of object state change manipulation tasks. 🎯Their key insight is that these tasks share a common structural pattern:they involve spatially-progressing, object-centric changes that can be represented as regions transitioning from an actionable to a transformed state. 🎯Building on this insight, SPARTA integrates spatially progressing object change segmentation maps, a visual skill to perceive actionable vs. transformed regions for specific object state change tasks, to generate a) structured policy observations that strip away appearance variability, and b) dense rewards that capture incremental progress over time. 🎯These are leveraged in two SPARTA policy variants:reinforcement learning for fine-grained control without demonstrations or simulation; and greedy control for fast, lightweight deployment. 🎯They validated “SPARTA” on a real robot for three challenging tasks across 10 diverse real-world objects, achieving significant improvements in training time and accuracy over sparse rewards and visual goal-conditioned baselines. 🎯Their results highlight progress-aware visual representations as a versatile foundation for the broader family of object state manipulation tasks. 🏢Organization: @UTAustin 🧙Paper Authors: Priyanka Mandikal, @JiahengHu1 , @ShivinDass , @sagnikmjr , @RobobertoMM , Kristen Grauman 📝 Read the Full Paper here: vision.cs.utexas.edu/projects/spart… 🗂️ Project Page: vision.cs.utexas.edu/projects/spart… 🎥 Be sure to watch the attached Technical Summary Video - Sound on 🔊🔊 Find this Valuable 💎 ? ♻️QT and teach your network something new Follow me 👣, @NaveenManwani17 , for the latest updates on Tech and AI-related news, insightful research papers, and exciting announcements. #CoRL2025
English
0
4
8
691
Roberto retweetledi
Gautam Kamath
Gautam Kamath@thegautamkamath·
📢 Call for Community Activities #AAAI2026 We invite submissions of proposals for including and open activities that help broaden community participation in the AI field. October 4: Submission Deadline October 18: Acceptance Notifications @RobobertoMM @marucabrera27 @RealAAAI
Gautam Kamath tweet media
English
1
5
10
3.8K
Roberto
Roberto@RobobertoMM·
Simple but *so effective idea*! And it can be used with any feature data selector. Great work led by @sateeshk21 . Do not miss it at #CoRL2025 (Spotlight 4 & Poster 2 on Sept 29)!
Sateesh Kumar@sateeshk21

Which data is best for training few-shot imitation policies for robot manipulation? Some think it’s the data that looks similar, or has similar motion, or comes with related language labels. They are all right AND wrong: depending on the task, sometimes this similarity helps but sometimes it is detrimental. Presenting Our #CoRL2025 work, COLLAGE 🎨, that adaptively combine data subsets efficiently for learning effective policies on target tasks.  🧵

English
0
1
7
1.4K
Roberto retweetledi
Rutav
Rutav@rutavms·
Intelligent humanoids should have the ability to quickly adapt to new tasks by observing humans Why is such adaptability important? 🌍 Real-world diversity is hard to fully capture in advance 🧠 Adaptability is central to natural intelligence We present MimicDroid 👇 🌐 ut-austin-rpl.github.io/MimicDroid
English
7
36
122
39.6K
Roberto
Roberto@RobobertoMM·
Honored to give an Early Career Invited Talk at #IJCAI today. See you at 11:30am in room 520C!
English
1
3
21
1.4K
Roberto
Roberto@RobobertoMM·
It was time to improve our evaluations in robot learning! We introduce a methodology based on anonymous A/B testing: fairer, stronger, community-driven. Awesome work by @KarlPertsch @pranav_atreya @tonyh_lee and an incredible crowdsourcing team. Upload and test your model! 🚀
Karl Pertsch@KarlPertsch

We’re releasing the RoboArena today!🤖🦾 Fair & scalable evaluation is a major bottleneck for research on generalist policies. We’re hoping that RoboArena can help! We provide data, model code & sim evals for debugging! Submit your policies today and join the leaderboard! :) 🧵

English
0
2
21
2.3K
Roberto retweetledi
Arpit Bahety
Arpit Bahety@ArpitBahety·
Imagine a future where robots are part of our daily lives — How can end users teach robots new tasks by directly showing them, just like teaching another person? 🧵👇
English
3
14
43
3.9K
Roberto retweetledi
Huihan Liu
Huihan Liu@huihan_liu·
Meet Casper👻, a friendly robot sidekick who shadows your day, decodes your intents on the fly, and lends a hand while you stay in control! Instead of passively receiving commands, what if a robot actively sense what you need in the background, and step in when confident? (1/n)
English
8
35
164
24.3K
Roberto retweetledi
Jiaheng Hu
Jiaheng Hu@JiahengHu1·
Excited to be in ATL for #ICRA2025 to present 🔥FLaRe: fine-tuning large transformer policies with #RL, 15:25 Tuesday @ room 410! I will also be attending the 📷Doctoral Consortium on Monday to talk about my research on self-improving robots. Happy to meet old and new friends!
Jiaheng Hu@JiahengHu1

🚀 Despite efforts to scale up Behavior Cloning for Robots, large-scale BC has yet to live up to its promise. How can we break through the performance plateau? Introducing 🔥FLaRe: fine-tuning large-scale robot policies with Reinforcement Learning. robot-flare.github.io 🧵

English
2
6
35
4.9K
Roberto
Roberto@RobobertoMM·
Loved working on this with our MIT/Stanford/OpenAI collaborators. It brings "The Bitter Lesson" to data curation: skip the hand-tuned heuristics (visual similarity, motion...) and let the data speak for itself! Datamodels is a fascinating framework 🤯
Shivin Dass@ShivinDass

Ever wondered which data from large datasets (like OXE) actually helps when training/tuning a policy for specific tasks? We present DataMIL, a framework for measuring how each training sample influences policy performance, hence enabling effective data selection 🧵

English
0
3
38
3K
Roberto retweetledi
Elias Stengel-Eskin
Elias Stengel-Eskin@EliasEskin·
Extremely excited to announce that I will be joining @UTAustin @UTCompSci in August 2025 as an Assistant Professor! 🎉 I’m looking forward to continuing to develop AI agents that interact/communicate with people, each other, and the multimodal world. I’ll be recruiting PhD students for Fall 2026 across a range of connected topics (details: esteng.github.io) and plan on recruiting interns for Fall 2025 as well. A huge shoutout to my mentors who have supported and shaped my research! Especially grateful to my postdoc advisor @mohitban47 for helping me grow along the whole spectrum of PI skills, and my PhD advisor @ben_vandurme for shaping my trajectory as a researcher, and of course the amazing students/collaborators from @uncnlp and @jhuclsp 🙏
Elias Stengel-Eskin tweet media
English
92
63
462
56.8K
Roberto
Roberto@RobobertoMM·
So happy for @JiahengHu1 ! He has been rocking it, with outstanding work that pushes the limits of what robot learning can achieve in mobile manipulation and other domains. And one of my first Ph.D. students! Congratulations! 🦾🦾🦾🦾
Jiaheng Hu@JiahengHu1

I'm honored to be awarded the 2025 Two Sigma PhD fellowship, and extremely grateful to my two amazing advisors @RobobertoMM @PeterStone_TX ! Looking forward to continuing to advance the field of RL and Robotics.

English
0
0
5
493
Roberto retweetledi
Fei Xia
Fei Xia@xf1280·
✨Super excited to share what the team has been working on! ♊️🤖 Gemini Robotics is a family of frontier models that are dexterous, interactive, and general. It builds on top of Gemini's world understanding, enhancing it's spatial/embodied reasoning, and producing robot actions.
Google DeepMind@GoogleDeepMind

Meet Gemini Robotics: our latest AI models designed for a new generation of helpful robots. 🤖 Based on Gemini 2.0, they bring capabilities such as better reasoning, interactivity, dexterity and generalization into the physical world. 🧵 goo.gle/gemini2-roboti…

English
8
14
125
10K