Carlo D'Eramo

247 posts

@CarloDeramo

Professor of Reinforcement Learning & Decision-Making and group leader of the LiteRL group | @Uni_WUE | Creator of @Mushroom_RL

Würzburg, Germany | Joined December 2021
261 Following | 903 Followers
Carlo D'Eramo retweeted
Jürgen Schmidhuber @SchmidhuberAI
Using only box-forwarding speed as the reward, our Stackelberg PPO automatically evolves robots with arms for pushing and legs for moving. The key idea is a novel game-theoretic view of structure–control co-design, yielding more effective optimization and dramatically better designs. Come see our poster at ICLR 2026 on Apr 25, 10:30 AM, at P4-#4810. With @YuhuiWangAI, @YanningD_AI, @oneDylanAshley. Paper: arxiv.org/abs/2603.15388 Project Page: yanningdai.github.io/stackelberg-pp…
Carlo D'Eramo retweeted
Samuele Tosatto @tosatto_samuele
🧠 What we are looking for: • Rigorous ML/RL background & math fluency • Excellent programming skills • Analytic problem-solving & communication If you want to push the boundaries of RL, join us! Please RT to help spread the word! #ML #RL #AI #PhDPosition
Carlo D'Eramo retweeted
Samuele Tosatto @tosatto_samuele
📢 We are seeking a highly motivated student for a 3-year fully funded PhD position in Reinforcement Learning at the University of Innsbruck! 🇦🇹 Help us advance the theory & algorithms of off-policy RL. Details & Apply: samueletosatto.online/new-phd-positi…
Carlo D'Eramo retweeted
Ahmed Hendawy | أحمد هنداوى
🧵 Accepted at @iclr_conf ! Target networks stabilize bootstrapping in RL 🛡️ But induce slow-moving targets 🐢 Online networks adapt fast ⚡ But can diverge with function approximation 💥 𝗠𝗜𝗡𝗧𝗢🌿 uses the online network 𝗼𝗻𝗹𝘆 𝗶𝗳 𝗶𝘁 𝗰𝗮𝗻 — yielding faster and more stable RL. Here’s how 👇
Carlo D'Eramo retweeted
Ahmed Hendawy | أحمد هنداوى
🌿 MINTO has been accepted at #ICLR2026! 📌 MINTO is a simple, yet effective target bootstrapping method for off-policy RL that enables faster, more stable learning and consistently improves performance across algorithms and benchmarks. 📄 Preprint: arxiv.org/abs/2510.02590
Carlo D'Eramo retweeted
Francesco Bertolotti @f14bertolotti
In this paper, the authors compute the gradient update of one agent's policy by also accounting for the updates of all other agents. I feel this is a fairly general idea that could be applied to most multi-agent RL algorithms. 🔗arxiv.org/abs/2509.12117
Carlo D'Eramo retweeted
Ahmed Hendawy | أحمد هنداوى
Attending @RL_Conference this year at @UAlberta, Canada 🇨🇦 ! I'm excited to be organizing a workshop on Inductive Biases in Reinforcement Learning @ibrlworkshop. Let's meet to discuss multi-task RL, Mixture of Experts, and other cool topics. Hope to see you there!
Carlo D'Eramo retweeted
Marlos C. Machado @MarlosCMachado
RLC starts tomorrow here in Edmonton. I couldn't be more excited! It has a fantastic roll of speakers, great papers, and workshops. And this time, it is in Edmonton 😁 @RL_Conference is my favourite conference, and no, it is not because I am one of its organizers this year.
Carlo D'Eramo retweeted
Théo Vincent @Theo_Vincent_
To speed up reward propagation in value-based RL algorithms, it is tempting to reduce the target update period 🤔 But this makes training unstable 💔 At @RL_Conference, I will present i-QN, a new method that allows faster reward propagation while keeping stability ⚡️ 👉🧵
Carlo D'Eramo retweeted
Théo Vincent @Theo_Vincent_
🎤 Very excited to give a talk @Cohere_Labs next week 🎤 I will be presenting the research I have been working on for the last 2 years with @CarloDeramo, @Jan_R_Peters, and many more collaborators! x.com/Cohere_Labs/st…
Cohere Labs @Cohere_Labs

Join our Reinforcement Learning Group next week on Friday, July 18th as they welcome @Theo_Vincent_ for a session on "Optimizing the Learning Trajectory of Reinforcement Learning Agents." Thanks to @rahul_narava and @gustiwinata_ for organizing this event ✨

Carlo D'Eramo retweeted
Kristian Kersting @kerstingAIML
🚀 TU Darmstadt leads the new Cluster of Excellence "Reasonable AI" – advancing trustworthy, efficient & adaptive AI grounded in common sense. A big thank you to the entire team, the university, and the state of Hesse for their tremendous work & support! buff.ly/N7xLKfW
Carlo D'Eramo retweeted
Davide Tateo @davide_tateo
I'm pleased to announce that, starting October 1st, I will be joining the Computer Science department at Lund University (in Sweden) as a Senior Lecturer. I will join the Robotic and Semantic Systems group and collaborate in the RobotLab LTH, where I'll bring my RL expertise.
Carlo D'Eramo retweeted
Davide Tateo @davide_tateo
I'm very excited for this new adventure and very grateful to @Jan_R_Peters for his massive support during my time as a postdoc/group leader at @ias_tudarmstadt. Also, many thanks to my amazing friends @GeorgiaChal and @CarloDeramo for their help and support.
Carlo D'Eramo retweeted
Théo Vincent @Theo_Vincent_
How do you tune the hyperparameters of your RL agent? 🤔 Come and chat with me about it tomorrow afternoon at poster 397 of @iclr_conf! I will be presenting ⚡️ Adaptive Q-Network ⚡️, a method that adaptively selects the hyperparameters of your RL agent during training 🔥