Carlo D'Eramo

247 posts

Carlo D'Eramo

@CarloDeramo

Professor of Reinforcement Learning & Decision-Making and group leader of the LiteRL group | @Uni_WUE | Creator of @Mushroom_RL

Würzburg, Germany Katılım Aralık 2021

261 Takip Edilen903 Takipçiler

Carlo D'Eramo retweetledi

Jürgen Schmidhuber@SchmidhuberAI·24 Nis

Using only box-forwarding speed as the reward, our Stackelberg PPO automatically evolves robots with arms for pushing and legs for moving. The key idea is a novel game-theoretic view of structure–control co-design, yielding more effective optimization and dramatically better designs. Come see our poster at ICLR 2026 on Apr 25, 10:30 AM, at P4-#4810. With @YuhuiWangAI, @YanningD_AI, @oneDylanAshley. Paper: arxiv.org/abs/2603.15388 Project Page: yanningdai.github.io/stackelberg-pp…

English

537

50.1K

Carlo D'Eramo retweetledi

Samuele Tosatto@tosatto_samuele·24 Mar

🧠 What we are looking for: • Rigorous ML/RL background & math fluency • Excellent programming skills • Analytic problem-solving & communication If you want to push the boundaries of RL, join us! Please RT to help spread the word! #ML #RL #AI #PhDPosition

English

170

Carlo D'Eramo retweetledi

Samuele Tosatto@tosatto_samuele·24 Mar

📢 We are seeking a highly motivated student for a 3-year fully funded PhD position in Reinforcement Learning at the University of Innsbruck! 🇦🇹 Help us advance the theory & algorithms of off-policy RL. Details & Apply: samueletosatto.online/new-phd-positi…

English

1.6K

Carlo D'Eramo retweetledi

Ahmed Hendawy | أحمد هنداوى@AHendawy19·11 Şub

🧵 Accepted at @iclr_conf ! Target networks stabilize bootstrapping in RL 🛡️ But induce slow-moving targets 🐢 Online networks adapt fast ⚡ But can diverge with function approximation 💥 𝗠𝗜𝗡𝗧𝗢🌿 uses the online network 𝗼𝗻𝗹𝘆 𝗶𝗳 𝗶𝘁 𝗰𝗮𝗻 — yielding faster and more stable RL. Here’s how 👇

English

2.2K

Carlo D'Eramo retweetledi

Théo Vincent@Theo_Vincent_·5 Şub

It was a pleasure to collaborate with @YogeshTrip7354, Tim Faust, @aportekilaa, Yaniv Oren, Melih Kandemir, @Jan_R_Peters, and @CarloDeramo Thanks to @ias_tudarmstadt, @DFKI @Hessian_AI, @infsys_uniwue for supporting this research!

English

338

Carlo D'Eramo retweetledi

Ahmed Hendawy | أحمد هنداوى@AHendawy19·26 Oca

🌿 MINTO has been accepted at #ICLR2026! 📌 MINTO is a simple, yet effective target bootstrapping method for off-policy RL that enables faster, more stable learning and consistently improves performance across algorithms and benchmarks. 📄 Preprint: arxiv.org/abs/2510.02590

English

2.7K

Carlo D'Eramo retweetledi

Ahmed Hendawy | أحمد هنداوى@AHendawy19·3 Eki

I had a fantastic time discussing my research @AmiiThinks and @UAlberta last August. If you are interested in Multi-Task Reinforcement Learning (MTRL) and Mixture of Experts (MoE), then this talk is for you. ➡️ Full talk: youtu.be/yqRajcJLl2I #reinforcementlearning #AI

YouTube

English

1.1K

Carlo D'Eramo retweetledi

Francesco Bertolotti@f14bertolotti·16 Eyl

In this paper, the authors compute the gradient update of the policy of one agent by accounting also for the update of all other agents. I feel this is a fairly general idea that could be applied to most multi-agent RL algorithms. 🔗arxiv.org/abs/2509.12117

English

227

16.2K

Carlo D'Eramo retweetledi

Ahmed Hendawy | أحمد هنداوى@AHendawy19·4 Ağu

Attending @RL_Conference this year at @UAlberta, Canada 🇨🇦 ! I'm excited to be organizing a workshop on Inductive Biases in Reinforcement Learning @ibrlworkshop. Let's meet to discuss multi-task RL, Mixture of Experts, and other cool topics. Hope to see you there!

English

730

Carlo D'Eramo retweetledi

Marlos C. Machado@MarlosCMachado·4 Ağu

RLC starts tomorrow here in Edmonton. I couldn't be more excited! It has a fantastic roll of speakers, great papers, and workshops. And this time, it is in Edmonton 😁 @RL_Conference is my favourite conference, and no, it is not because I am one of its organizers this year.

English

11.6K

Carlo D'Eramo retweetledi

Théo Vincent@Theo_Vincent_·4 Ağu

Looking forward to @RL_Conference! I will be presenting 4 posters, feel free to come and exchange with me during the conference, @RLFrameWorkshop, or @ibrlworkshop🙂

English

804

Carlo D'Eramo retweetledi

Inductive Biases in RL@ibrlworkshop·4 Ağu

🗓️ The IBRL Workshop kicks off tomorrow! 🎉 Join us at @RL_Conference @UAlberta to explore how Inductive Biases can boost 🚀 the performance of RL agents. 📄 Accepted papers: sites.google.com/view/ibrl-work… 📅 Full schedule: sites.google.com/view/ibrl-work… #ReinforcementLearning #RLC2025

English

10.1K

Carlo D'Eramo retweetledi

Théo Vincent@Theo_Vincent_·3 Ağu

To increase the reward propagation in value-based RL algorithms, it is tempting to reduce the target update period🤔 But, this makes the training unstable💔 @RL_Conference, I will present i-QN, a new method that allows faster reward propagation, while keeping stability⚡️ 👉🧵

English

8.7K

Carlo D'Eramo retweetledi

Théo Vincent@Theo_Vincent_·11 Tem

🎤 Very excited to give a talk @Cohere_Labs next week 🎤 I will be presenting the research I have been working on for the last 2 years with @CarloDeramo, @Jan_R_Peters, and many more collaborators! x.com/Cohere_Labs/st…

Cohere Labs@Cohere_Labs

Join our Reinforcement Learning Group next week on Friday, July 18th as they welcome @Theo_Vincent_ for a session on "Optimizing the Learning Trajectory of Reinforcement Learning Agents." Thanks to @rahul_narava and @gustiwinata_ for organizing this event ✨

English

Carlo D'Eramo retweetledi

Théo Vincent@Theo_Vincent_·8 Tem

Thanks to my amazing co-authors: Tim Faust, @YogeshTrip7354, @Jan_R_Peters, and @CarloDeramo 🎬Video👉youtu.be/xUpvOTyetKY 📄Paper👉arxiv.org/pdf/2503.01437 👨‍💻Code👉github.com/theovincent/Ea… 🏋️‍♂️Weights👉huggingface.co/TheoVincent/At…

YouTube

English

538

Carlo D'Eramo retweetledi

Inductive Biases in RL@ibrlworkshop·28 May

📢 Submission deadline extension: the new deadline is June 6th AoE 🔗 Portal: openreview.net/group?id=rl-co… 🌐 More information at: sites.google.com/view/ibrl-work… 🚀 Looking forward to seeing you at @RL_Conference !

English

Carlo D'Eramo retweetledi

Kristian Kersting@kerstingAIML·23 May

🚀 TU Darmstadt leads the new Cluster of Excellence "Reasonable AI" – advancing trustworthy, efficient & adaptive AI grounded in common sense. A big thank you to the entire team, the university, and the state of Hesse for their tremendous work & support! buff.ly/N7xLKfW

English

8.3K

Carlo D'Eramo retweetledi

Davide Tateo@davide_tateo·26 May

I'm pleased to announce that, starting October 1st, I will be joining the Computer Science department at Lund University (in Sweden) as a Senior Lecturer. I will join the Robotic and Semantic Systems group and collaborate in the RobotLab LTH, where I'll bring my RL expertise.

English

4.8K

Carlo D'Eramo retweetledi

Davide Tateo@davide_tateo·26 May

I'm very excited for this new adventure and very grateful to @Jan_R_Peters for his massive support during my time as postdoc/group leader at @ias_tudarmstadt Also, many thanks to my amazing friends @GeorgiaChal and @CarloDeramo for their help and support.

English

301

Carlo D'Eramo retweetledi

Théo Vincent@Theo_Vincent_·23 Nis

How do you tune the hyperparameters of your RL agent? 🤔 Come and chat with me about it tomorrow afternoon at poster 397 of @iclr_conf ! I will be presenting⚡️ Adaptive Q-Network⚡️a method that adaptively selects the hyperparameters of your RL agent during training 🔥

GIF

English

4.6K

Keşfet

@YuhuiWangAI @YanningD_AI @oneDylanAshley @iclr_conf @YogeshTrip7354 @aportekilaa @Jan_R_Peters @ias_tudarmstadt