Roberto

174 posts

Roberto

@RobobertoMM

Assistant CS Professor at UT Austin. Former Stanford and TUBerlin. Researching at the intersection of vision, learning and robotics 🏳️‍🌈

Katılım Ağustos 2019

279 Takip Edilen2.4K Takipçiler

Roberto@RobobertoMM·13 Mar

How do we turn a decent generalist VLA into a great multitask specialist? RL can turn experience into one fine-tuned policy. The challenge is doing it continuously as new tasks arrive: without forgetting old ones or limiting future learning. Surprisingly simple. Check this out.

Jiaheng Hu@JiahengHu1

VLA models are capable generalists. But can they continually self-improve? Such Continual Reinforcement Learning (CRL) problems are traditionally considered very challenging. Surprisingly, we found that with the right setup, the simplest CRL recipe can work really well! arxiv.org/abs/2603.11653

English

661

Roberto retweetledi

Shivin Dass@ShivinDass·26 Şub

Excited to share that DataMIL is accepted at ICLR. To celebrate we are also releasing the codebase: github.com/UT-Austin-RobI… You can now curate data for your robot within a couple of hours!

Shivin Dass@ShivinDass

Ever wondered which data from large datasets (like OXE) actually helps when training/tuning a policy for specific tasks? We present DataMIL, a framework for measuring how each training sample influences policy performance, hence enabling effective data selection 🧵

English

3.4K

Roberto retweetledi

Chengshu Li@ChengshuEricLi·23 Eki

We are excited to release MoMaGen, a data generation method for multi-step bimanual mobile manipulation. MoMaGen turns 1 human-teleoped robot trajectory into 1000s of generated trajectories automatically.🚀 Website: momagen.github.io arXiv: arxiv.org/abs/2510.18316

English

166

47.9K

Roberto retweetledi

Shivin Dass@ShivinDass·9 Eki

A little late to this but excited to share that DataMIL won the best paper at the Data workshop at #CoRL! If you haven't already, check it out! 👇

Shivin Dass@ShivinDass

English

2.5K

Roberto retweetledi

naveen manwani@NaveenManwani17·29 Eyl

🚨CoRL 2025 Best Poster Award 🏆 Paper Alert 🚨 ➡️Paper Title: Mash, Spread, Slice! Learning to Manipulate Object States via Visual Spatial Progress 🌟Few pointers from the paper 🎯Most robot manipulation focuses on changing the kinematic state of objects: picking, placing, opening, or rotating them. 🎯However, a wide range of real-world manipulation tasks involve a different class of object state change—such as mashing, spreading, or slicing—where the object’s physical and visual state evolve progressively without necessarily changing its position. 🎯Authors of this paper presented “SPARTA”, the first unified framework for the family of object state change manipulation tasks. 🎯Their key insight is that these tasks share a common structural pattern:they involve spatially-progressing, object-centric changes that can be represented as regions transitioning from an actionable to a transformed state. 🎯Building on this insight, SPARTA integrates spatially progressing object change segmentation maps, a visual skill to perceive actionable vs. transformed regions for specific object state change tasks, to generate a) structured policy observations that strip away appearance variability, and b) dense rewards that capture incremental progress over time. 🎯These are leveraged in two SPARTA policy variants:reinforcement learning for fine-grained control without demonstrations or simulation; and greedy control for fast, lightweight deployment. 🎯They validated “SPARTA” on a real robot for three challenging tasks across 10 diverse real-world objects, achieving significant improvements in training time and accuracy over sparse rewards and visual goal-conditioned baselines. 🎯Their results highlight progress-aware visual representations as a versatile foundation for the broader family of object state manipulation tasks. 🏢Organization: @UTAustin 🧙Paper Authors: Priyanka Mandikal, @JiahengHu1 , @ShivinDass , @sagnikmjr , @RobobertoMM , Kristen Grauman 📝 Read the Full Paper here: vision.cs.utexas.edu/projects/spart… 🗂️ Project Page: vision.cs.utexas.edu/projects/spart… 🎥 Be sure to watch the attached Technical Summary Video - Sound on 🔊🔊 Find this Valuable 💎 ? ♻️QT and teach your network something new Follow me 👣, @NaveenManwani17 , for the latest updates on Tech and AI-related news, insightful research papers, and exciting announcements. #CoRL2025

English

691

Roberto retweetledi

Jiaheng Hu@JiahengHu1·28 Eyl

Excited that SPARTA (vision.cs.utexas.edu/projects/spart…) won the best poster award at the CoRL RINO workshop! Big congrats to the project lead Priyanka, who worked so hard on this project, as well as to the rest of the co-authors @ShivinDass @sagnikmjr @RobobertoMM and Kristen!

English

1.8K

Roberto retweetledi

Gautam Kamath@thegautamkamath·23 Eyl

📢 Call for Community Activities #AAAI2026 We invite submissions of proposals for including and open activities that help broaden community participation in the AI field. October 4: Submission Deadline October 18: Acceptance Notifications @RobobertoMM @marucabrera27 @RealAAAI

English

3.8K

Roberto@RobobertoMM·26 Eyl

Simple but *so effective idea*! And it can be used with any feature data selector. Great work led by @sateeshk21 . Do not miss it at #CoRL2025 (Spotlight 4 & Poster 2 on Sept 29)!

Sateesh Kumar@sateeshk21

Which data is best for training few-shot imitation policies for robot manipulation? Some think it’s the data that looks similar, or has similar motion, or comes with related language labels. They are all right AND wrong: depending on the task, sometimes this similarity helps but sometimes it is detrimental. Presenting Our #CoRL2025 work, COLLAGE 🎨, that adaptively combine data subsets efficiently for learning effective policies on target tasks. 🧵

English

1.4K

Roberto retweetledi

Rutav@rutavms·16 Eyl

Intelligent humanoids should have the ability to quickly adapt to new tasks by observing humans Why is such adaptability important? 🌍 Real-world diversity is hard to fully capture in advance 🧠 Adaptability is central to natural intelligence We present MimicDroid 👇 🌐 ut-austin-rpl.github.io/MimicDroid

English

122

39.6K

Roberto@RobobertoMM·20 Ağu

Honored to give an Early Career Invited Talk at #IJCAI today. See you at 11:30am in room 520C!

English

1.4K

Roberto@RobobertoMM·21 Haz

It was time to improve our evaluations in robot learning! We introduce a methodology based on anonymous A/B testing: fairer, stronger, community-driven. Awesome work by @KarlPertsch @pranav_atreya @tonyh_lee and an incredible crowdsourcing team. Upload and test your model! 🚀

Karl Pertsch@KarlPertsch

We’re releasing the RoboArena today!🤖🦾 Fair & scalable evaluation is a major bottleneck for research on generalist policies. We’re hoping that RoboArena can help! We provide data, model code & sim evals for debugging! Submit your policies today and join the leaderboard! :) 🧵

English

2.3K

Roberto retweetledi

Arpit Bahety@ArpitBahety·21 Haz

Imagine a future where robots are part of our daily lives — How can end users teach robots new tasks by directly showing them, just like teaching another person? 🧵👇

English

3.9K

Roberto retweetledi

Huihan Liu@huihan_liu·18 Haz

Meet Casper👻, a friendly robot sidekick who shadows your day, decodes your intents on the fly, and lends a hand while you stay in control! Instead of passively receiving commands, what if a robot actively sense what you need in the background, and step in when confident? (1/n)

English

164

24.3K

Roberto@RobobertoMM·5 Haz

@abhishekunique7 @JiahengHu1 Thanks, Abhishek!

English

141

Abhishek Gupta@abhishekunique7·5 Haz

Excellent work as usual from @JiahengHu1 and collaborators :)

English

845

Abhishek Gupta@abhishekunique7·5 Haz

Hell yea real world RL :)

Jiaheng Hu@JiahengHu1

Real-world RL, where robots learn directly from physical interactions, is extremely challenging — especially for high-DoF systems like mobile manipulators. 1⃣ Long-horizon tasks and large action spaces lead to difficult policy optimization. 2⃣ Real-world exploration with whole-body contact raises serious safety concerns. 🚀 Introducing SLAC, a framework that brings safety and efficiency to whole-body real-world RL. Paper: arxiv.org/abs/2506.04147 Video: youtube.com/watch?v=bj5Ghj… 🧵

English

2.6K

Roberto@RobobertoMM·5 Haz

🚨RL training for contact-rich tasks with a mobile manipulator IN THE REAL WORLD?!🤯 We're not crazy—just equipped with the right action space! SLAC learns a safe, effective action space via unsupervised RL in sim, enabling real-world RL training in minutes. Check it out!🚀

Jiaheng Hu@JiahengHu1

English

1.2K

Roberto retweetledi

Jiaheng Hu@JiahengHu1·19 May

Excited to be in ATL for #ICRA2025 to present 🔥FLaRe: fine-tuning large transformer policies with #RL, 15:25 Tuesday @ room 410! I will also be attending the 📷Doctoral Consortium on Monday to talk about my research on self-improving robots. Happy to meet old and new friends!

Jiaheng Hu@JiahengHu1

🚀 Despite efforts to scale up Behavior Cloning for Robots, large-scale BC has yet to live up to its promise. How can we break through the performance plateau? Introducing 🔥FLaRe: fine-tuning large-scale robot policies with Reinforcement Learning. robot-flare.github.io 🧵

English

4.9K

Roberto@RobobertoMM·17 May

Loved working on this with our MIT/Stanford/OpenAI collaborators. It brings "The Bitter Lesson" to data curation: skip the hand-tuned heuristics (visual similarity, motion...) and let the data speak for itself! Datamodels is a fascinating framework 🤯

Shivin Dass@ShivinDass

English

Roberto retweetledi

Elias Stengel-Eskin@EliasEskin·5 May

Extremely excited to announce that I will be joining @UTAustin @UTCompSci in August 2025 as an Assistant Professor! 🎉 I’m looking forward to continuing to develop AI agents that interact/communicate with people, each other, and the multimodal world. I’ll be recruiting PhD students for Fall 2026 across a range of connected topics (details: esteng.github.io) and plan on recruiting interns for Fall 2025 as well. A huge shoutout to my mentors who have supported and shaped my research! Especially grateful to my postdoc advisor @mohitban47 for helping me grow along the whole spectrum of PI skills, and my PhD advisor @ben_vandurme for shaping my trajectory as a researcher, and of course the amazing students/collaborators from @uncnlp and @jhuclsp 🙏

English

462

56.8K

Roberto@RobobertoMM·24 Nis

So happy for @JiahengHu1 ! He has been rocking it, with outstanding work that pushes the limits of what robot learning can achieve in mobile manipulation and other domains. And one of my first Ph.D. students! Congratulations! 🦾🦾🦾🦾

Jiaheng Hu@JiahengHu1

I'm honored to be awarded the 2025 Two Sigma PhD fellowship, and extremely grateful to my two amazing advisors @RobobertoMM @PeterStone_TX ! Looking forward to continuing to advance the field of RL and Robotics.

English

493

Roberto retweetledi

Fei Xia@xf1280·12 Mar

✨Super excited to share what the team has been working on! ♊️🤖 Gemini Robotics is a family of frontier models that are dexterous, interactive, and general. It builds on top of Gemini's world understanding, enhancing it's spatial/embodied reasoning, and producing robot actions.

Google DeepMind@GoogleDeepMind

Meet Gemini Robotics: our latest AI models designed for a new generation of helpful robots. 🤖 Based on Gemini 2.0, they bring capabilities such as better reasoning, interactivity, dexterity and generalization into the physical world. 🧵 goo.gle/gemini2-roboti…

English

125

10K

Keşfet

@UTAustin @JiahengHu1 @ShivinDass @sagnikmjr @NaveenManwani17 @marucabrera27 @RealAAAI @sateeshk21