Michael Beukman

61 posts

Michael Beukman

@mcbeukman

PhD Student at @FLAIR_Ox, previously at @raillabwits. Ex SR @GoogleDeepMind Interested in Open ended learning, scaling RL and continual learning.

Oxford, England Katılım Temmuz 2022

190 Takip Edilen534 Takipçiler

Sabitlenmiş Tweet

Michael Beukman@mcbeukman·2d

1/ As compute continues to grow and simulators continue to improve, it is becoming feasible to train RL agents for billions or trillions of timesteps. However, this is only useful if agents can continue learning over such long training horizons, which is far from given 👇

English

325

85.2K

Michael Beukman retweetledi

Alex Goldie@AlexDGoldie·25 Mar

1/ 🪩 Automating the discovery of new algorithms could unlock significant breakthroughs in ML research. But optimising agents for this research has been limited by too few tasks to learn from! Introducing DiscoGen, a procedural generator of algorithm discovery tasks 🧵

English

146

35.6K

Michael Beukman@mcbeukman·2d

11/ Thanks to my great co-authors @khimya, Zeyu Zheng, @wwdabney, @j_foerst, @MichaelD1729 and @clarelyle! Check out the paper here for more details: arxiv.org/abs/2603.06009.

English

Michael Beukman@mcbeukman·2d

10/ Armed with these insights, we turn to the Kinetix benchmark—an open-ended universe of 2D physics based tasks. We scale to 1 million parallel environments, and show that this leads to monotonic performance improvement for more than 1 trillion timesteps without stagnating 🚀

English

3.1K

Michael Beukman@mcbeukman·2d

English

325

85.2K

Michael Beukman retweetledi

nathan monette@nathanrmonette·26 Eyl

I've compiled some notes on Unsupervised Environment Design (UED): nmonette.github.io/assets/ued_not… Please don't hesitate to reach out if interested in talking more about UED :)

English

2.1K

Michael Beukman retweetledi

nathan monette@nathanrmonette·10 Ağu

Was amazing to present my first paper at @RL_Conference !! Really awesome to meet new folks from the community :)

English

Michael Beukman retweetledi

Clarisse Wibault@ClarisseWibault·1 Haz

How can we bypass the need for online hyper-parameter tuning in offline RL? @FLAIR_Ox is introducing two fully offline algorithms: SOReL, for accurate offline regret approximation, and TOReL, for offline hyper-parameter tuning! arxiv.org/html/2505.2244…

English

4.6K

Michael Beukman retweetledi

nathan monette@nathanrmonette·28 May

Excited to announce my first paper, with @j_foerst and @FLAIR_Ox, was accepted into @rl_conference 2025! We establish a new UED method called NCC that obtains strong performance based on principles of optimisation theory.

English

12.6K

Michael Beukman retweetledi

Michael Matthews@mitrma·26 Nis

We are presenting Kinetix today! Oral - 11:30am Peridot Room 5F Poster - 3pm Hall 3+2B 377

Michael Matthews@mitrma

We are very excited to announce Kinetix: an open-ended universe of physics-based tasks for RL! We use Kinetix to train a general agent on millions of randomly generated physics problems and show that this agent generalises to unseen handmade environments. 1/🧵

English

1.6K

Michael Beukman@mcbeukman·24 Nis

Come and see the sessions or reach out to chat :)

Foerster Lab for AI Research@FLAIR_Ox

FLAIR is at ICLR 🇸🇬 Find out our schedule for the week 👇

English

464

Michael Beukman retweetledi

Matthew Jackson@JacksonMattT·18 Nis

🌹 Today we're releasing Unifloral, our new library for Offline Reinforcement Learning! We make research easy: ⚛️ Single-file 🤏 Minimal ⚡️ End-to-end Jax Best of all, we unify prior methods into one algorithm - a single hyperparameter space for research! ⤵️

English

172

94.5K

Michael Beukman@mcbeukman·17 Nis

I'll be attending ICLR next week to present Kinetix with @mitrma. Would love to chat about anything UED / Open-Ended RL / QD related, or interesting research in general :)

Michael Matthews@mitrma

English

1.5K

Keşfet

@khimya @wwdabney @j_foerst @MichaelD1729 @clarelyle @RL_Conference @FLAIR_Ox @mitrma