Daniel Furelos-Blanco

19 posts

Daniel Furelos-Blanco

Daniel Furelos-Blanco

@_danielfb

@imperialcollege

Beigetreten Ocak 2016
334 Folgt87 Follower
Angehefteter Tweet
Daniel Furelos-Blanco
Daniel Furelos-Blanco@_danielfb·
🚀 Last December we presented ATLAS at NeurIPS @SEAWorkshop. Now thrilled to share at #AAAI2026 @RealAAAI! ⚠️ RL policy generalization across tasks & levels is hard, especially when some tasks aren't realizable. 🗺️ ATLAS tackles this via autocurricula over tasks AND levels 🧵
GIF
English
1
2
17
1.5K
Daniel Furelos-Blanco
Daniel Furelos-Blanco@_danielfb·
🗺️ ATLAS creates autocurricula over BOTH tasks (reward machines) AND levels (Minigrid), enabling it to 🔹learn good policies even when solvable tasks are rarely sampled, 🔹perform mutations over tasks and levels jointly to bootstrap complex curricula from simple pairs.
Daniel Furelos-Blanco tweet media
English
1
0
1
98
Daniel Furelos-Blanco
Daniel Furelos-Blanco@_danielfb·
🚀 Last December we presented ATLAS at NeurIPS @SEAWorkshop. Now thrilled to share at #AAAI2026 @RealAAAI! ⚠️ RL policy generalization across tasks & levels is hard, especially when some tasks aren't realizable. 🗺️ ATLAS tackles this via autocurricula over tasks AND levels 🧵
GIF
English
1
2
17
1.5K
Daniel Furelos-Blanco retweetet
Michael Ivanitskiy
Michael Ivanitskiy@MishaIvanitskiy·
Wouldn't it be nice if there was a way to organize attention heads in LLMs by the patterns they produce? Poster @ NeurIPS mechint workshop, more in 🧵
English
1
4
14
570
Daniel Furelos-Blanco
Daniel Furelos-Blanco@_danielfb·
ATLAS creates autocurricula over BOTH tasks (reward machines) AND levels (Minigrid), enabling it to 🔹learn good policies even when solvable tasks are rarely sampled, 🔹perform mutations over levels and tasks jointly to bootstrap complex curricula from simple task-level pairs.
Daniel Furelos-Blanco tweet media
English
1
0
2
98
Daniel Furelos-Blanco
Daniel Furelos-Blanco@_danielfb·
We build upon existing work: 🔹UED → Generate auto-curricula, but only over levels. Task is fixed. 🔹Formal language conditioning (temporal logic, state machines) → generalize over tasks & levels, but just sample randomly - no autocurriculum. We propose ATLAS to extend these.
Daniel Furelos-Blanco tweet media
English
1
0
2
118
Daniel Furelos-Blanco
Daniel Furelos-Blanco@_danielfb·
🚀 Thrilled to be presenting ATLAS at the @SEAWorkshop at #NeurIPS2025 this Sunday! Training RL agents to generalize across diverse levels and tasks is hard, especially when some tasks are not always realizable. ATLAS tackles this by creating curricula over tasks AND levels 🧵
GIF
English
1
2
14
2K
Daniel Furelos-Blanco
Daniel Furelos-Blanco@_danielfb·
It was a pleasure to have @dabelcs present this work last October at @SPIKE_ICL! 📽️ The recording of the talk is now available: youtube.com/watch?v=XWxP_q… Don't miss their poster at #NeurIPS2025 🚀
YouTube video
YouTube
David Abel@dabelcs

Thrilled to share our new #NeurIPS2025 paper done at @GoogleDeepMind, Plasticity as the Mirror of Empowerment We prove every agent faces a trade-off between its capacity to adapt (plasticity) and its capacity to steer (empowerment) Paper: david-abel.github.io/plasticity.pdf 🧵🧵🧵👇

English
0
1
6
870
Daniel Furelos-Blanco retweetet
Clem Bonnet
Clem Bonnet@ClementBonnet16·
Excited to announce Jumanji v1.0, now featuring 22 fast, flexible, and scalable environments! Fully written in JAX, Jumanji offers on-device fully-jitted simulations and training. Jumanji got published at ICLR 2024! Paper: arxiv.org/abs/2306.09884 GitHub: github.com/instadeepai/ju…
GIF
English
1
28
126
31.2K
Daniel Furelos-Blanco retweetet
Nathan Grinsztajn @ Neurips
Nathan Grinsztajn @ Neurips@NGrinsztajn·
Excited to announce 2 papers on performance-driven diversity for RL and combinatorial optimization to be presented at Neurips today 📢 Some problems are simply too difficult to be solved with a single shot. But if you have several shots, how to make the most of them? 🧵1/7
GIF
English
1
11
43
4.4K
yobibyte
yobibyte@y0b1byte·
From the inside of the headphones, I definitely sounded much better xD Guess the song
English
2
0
9
2.6K
Daniel Furelos-Blanco
Daniel Furelos-Blanco@_danielfb·
Aloha! 🌴 This week I'm at #ICML2023 presenting "Hierarchies of Reward Machines". We increase the abstraction power of reward machines (RMs) by enabling them to call each other, composing a hierarchy of RMs (HRM). We describe methods for both exploiting and learning HRMs.
Daniel Furelos-Blanco tweet media
English
1
4
23
3K
Daniel Furelos-Blanco retweetet
InstaDeep
InstaDeep@instadeepai·
1/ We are delighted to share our recent work, Poppy 🌺 “Population-Based Reinforcement Learning for Combinatorial Optimization”, which achieves SOTA RL performance on canonical NP-hard combinatorial problems 🚀. 📚 Paper: arxiv.org/abs/2210.03475 📖 Blog: bit.ly/3stDenZ
InstaDeep tweet media
English
2
13
50
0
Daniel Furelos-Blanco retweetet
J. AI Research-JAIR
J. AI Research-JAIR@JAIR_Editor·
New Article: "Induction and Exploitation of Subgoal Automata for Reinforcement Learning" by Furelos-Blanco, Law, Jonsson, Broda and Russo bit.ly/30wuyz1
English
0
3
5
0