WhiRL

595 posts

@whi_rl

Whiteson Research Lab @CompSciOxford. Reinforcement learning and deep learning with a focus on multi-agent learning and meta-learning

Oxford, England · Joined May 2017
205 Following · 4.7K Followers
WhiRL retweeted
Roberta Raileanu @robertarail
How can agents get better at algorithm discovery? Meta-meta-learning is one answer, aka improving the agents themselves at inventing generalizable algorithms. DiscoBench provides a way to procedurally generate algorithm discovery tasks at scale, which can be used for meta-meta-learning. Kudos to @AlexDGoldie and team for the release!
Alex Goldie@AlexDGoldie

1/ 🪩 Automating the discovery of new algorithms could unlock significant breakthroughs in ML research. But optimising agents for this research has been limited by too few tasks to learn from! Introducing DiscoGen, a procedural generator of algorithm discovery tasks 🧵

WhiRL retweeted
Alex Goldie @AlexDGoldie
1/ 🪩 Automating the discovery of new algorithms could unlock significant breakthroughs in ML research. But optimising agents for this research has been limited by too few tasks to learn from! Introducing DiscoGen, a procedural generator of algorithm discovery tasks 🧵
WhiRL @whi_rl
👀 MARL can unlock self-driving in unseen cities without needing any prior human demonstrations! 🤯 Performance improvements in terms of driving success rate *and* human likeness! Check out our new work, led by @zilinwang4ai and @saeedrmd 👇
Zilin Wang@zilinwang4ai

1/ 🚗 🌏 What if an autonomous vehicle could move to a new city without collecting a single human demonstration in that city? I am so excited to introduce our new work: Learning to Drive in New Cities Without Human Demonstrations.

WhiRL retweeted
Matthew Jackson @JacksonMattT
Unifloral has been accepted as an Oral at NeurIPS 2025! Immensely grateful to my @FLAIR_Ox co-authors @uljadb99 and @JarekLiesen for pouring months of effort into this project. There’s a ton of low-hanging fruit in offline RL… If you’re looking for a project, check it out!
Matthew Jackson@JacksonMattT

🌹 Today we're releasing Unifloral, our new library for Offline Reinforcement Learning! We make research easy: ⚛️ Single-file 🤏 Minimal ⚡️ End-to-end Jax Best of all, we unify prior methods into one algorithm - a single hyperparameter space for research! ⤵️

WhiRL retweeted
Alex Goldie @AlexDGoldie
🥳 It’s an honour to have been awarded the Outstanding Paper for Scientific Understanding in RL at RLC for our work, ‘How Should We Meta-Learn RL Algorithms?’ Thank you to the organisers @RL_Conference for putting on a great conference, and congratulations to the other winners!
WhiRL @whi_rl
The best of RL research, brought to Offline RL! 🚀 TL;DR 1. CleanRL-style implementations ⚡️ 2. Rainbow-style algorithm unification 🦾 3. Rliable-style evaluation protocol 🔬 Check out our paper + library!
Matthew Jackson@JacksonMattT

🌹 Today we're releasing Unifloral, our new library for Offline Reinforcement Learning! We make research easy: ⚛️ Single-file 🤏 Minimal ⚡️ End-to-end Jax Best of all, we unify prior methods into one algorithm - a single hyperparameter space for research! ⤵️

WhiRL retweeted
Matthew Jackson @JacksonMattT
🌹 Today we're releasing Unifloral, our new library for Offline Reinforcement Learning! We make research easy: ⚛️ Single-file 🤏 Minimal ⚡️ End-to-end Jax Best of all, we unify prior methods into one algorithm - a single hyperparameter space for research! ⤵️
WhiRL retweeted
Luisa Zintgraf @luisa_zintgraf
🎉 Our Meta-RL survey is now published in Foundations and Trends in Machine Learning! A deep dive into how agents can learn to learn 🤖🧠 Huge kudos to @jakeABeck & @ristovuorio for leading the charge, and to co-authors Evan Liu, Zheng Xiong, @chelseabfinn & @shimon8282!
WhiRL retweeted
Jacob Beck @jakeABeck
Big news—our survey paper “A Tutorial on Meta-Reinforcement Learning” is officially published! Meta-RL = learning how to adapt through interaction. It embraces The Bitter Lesson: don’t hardcode agents—train them to adapt on their own arxiv.org/abs/2301.08028 🧵⬇️
WhiRL retweeted
Jacob Beck @jakeABeck
🎉🚨 Big news! Our research, Metalic: Meta-Learning In-Context with Protein Language Models, 🧬 won a competition! #NeurIPS2024🤖📚 We advance in-context learning and protein fitness prediction with this paradigm: ✨ Pre-training 🔥 Learning to in-context learn🔥 ✨ Fine-tuning
WhiRL @whi_rl
Check out this fantastic showing by @jakeABeck and @AlexDGoldie at #ICML2024! 🎙️🔥 They dove deep into the future of automated RL, meta-learning, and LLMs 🤖🔮
Jacob Beck@jakeABeck

Missed this provocative panel? I was honored to share the stage at #ICML2024 with @pcastr, @XingyouSong, and my colleague @AlexDGoldie! We discussed future perspectives on automated RL, meta-learning, and LLMs 🤖 Catch the discussion here: icml.cc/virtual/2024/w… at 7:16:00 🎙️

WhiRL retweeted
Alex Goldie @AlexDGoldie
Attending #ICML2024 was amazing and full of firsts: My first time presenting a poster, first time giving a talk at a conference and first time sitting on a panel! Many thanks to the @AutoRL_Workshop organisers for preparing a great workshop about AutoRL!
WhiRL retweeted
Zheng Xiong @xiongzheng0316
How to make a generalist robot more efficient? We propose knowledge decoupling as a key principle, and learn a universal morphology controller with 10x smaller size and 100x less FLOPs at inference time. Come to our #ICML2024 poster #217 at 11:30 on July 25 to chat more!
WhiRL retweeted
Alex Goldie @AlexDGoldie
1/ 🤖 Learned optimization offers huge potential to automate machine learning! So why doesn't it work well in RL (and how did we fix it)?! I'm excited to share OPEN, our @AutoRL_Workshop spotlight paper exploring this question! 🧵
WhiRL retweeted
Matthew Jackson @JacksonMattT
Exciting updates to Policy-Guided Diffusion! 🎉 PGD was accepted at @RL_Conference - see you in Amherst! 📈 For those building on PGD, we just released WandB logs with agent and diffusion model training: api.wandb.ai/links/flair/jo…
Matthew Jackson@JacksonMattT

🎮 Introducing the new and improved Policy-Guided Diffusion! Vastly more accurate trajectory generation than autoregressive models, with strong gains in offline RL performance! Plus a ton of new theory and results since our NeurIPS workshop paper... Check it out ⤵️

WhiRL @whi_rl
Excited to share that our work Bayesian Exploration Networks (BEN) has been accepted at ICML 🍾! BEN is the first model-free Bayesian RL approach that can learn Bayes-optimal policies 🙀 Congrats to @mattiefoxcs and collaborators! arxiv.org/pdf/2308.13049