Raul Steleac

20 posts

@steleac

PhD Student @EdinburghUni; studying temporally extended behaviours in both single and multi-agent RL

Joined August 2019
94 Following · 22 Followers
Pinned Tweet
Raul Steleac @steleac
Really excited to present our recent work at #ICLR2026 this week! We discover highly coordinated joint behaviours and integrate them into the skill sets of MARL agents, accelerating the search for effective joint strategies in downstream tasks. 🧵 Paper: raulsteleac.github.io/iaro
1
4
16
1.5K
Raul Steleac @steleac
Work done under the supervision of Mohan Sridharan and @dabelcs (many thanks)! If you’re at #ICLR2026 in Rio and want to chat, come find me during Poster Session 2! 🔥
0
1
2
99
Raul Steleac @steleac
Finally, we use this multi-dimensional n-distance as a state representation for eigenoption discovery, leading to coordinated alignment patterns that are effective in aiding teams of agents in multiple downstream tasks. Also works with heterogeneous agent state spaces.
1
1
2
70
Raul Steleac retweeted
yobibyte @y0b1byte
A lot of people complain that RL doesn't work and RL researchers are still playing games. While this criticism is true to some extent, there's been a new trend of applying RL for real-life problems. This is a thread of notable papers split by the topic. 1/n
24
140
746
0
Raul Steleac retweeted
Scott Bryan @scottygb
23 and 24 year olds able to book their NHS vaccine appointments from tomorrow.
12
54
241
0
Raul Steleac retweeted
Google DeepMind @GoogleDeepMind
Discover how WaveNet has evolved from research concept to advanced real-world system that creates more natural-sounding speech and helps @Google unblock communication barriers for millions of people around the world: dpmd.ai/wavenet
4
62
208
0
Raul Steleac retweeted
Bwipo @Bwipo
Chongus
17
24
2.9K
0
Raul Steleac retweeted
Aran Komatsuzaki @arankomatsuzaki
Diffusion Models Beat GANs on Image Synthesis
Achieves 3.85 FID on ImageNet 512×512 and matches BigGAN-deep even with as few as 25 forward passes per sample, all while maintaining better coverage of the distribution. arxiv.org/abs/2105.05233
8
106
581
0
Raul Steleac retweeted
Karol Hausman @hausman_k
In addition to MT-Opt, we are releasing Actionable Models, which addresses the problem of defining tasks (which becomes quite cumbersome at scale). This work uses the dataset collected by MT-Opt but uses goal-conditioned offline Q-learning to learn a general goal-reaching policy.
Yevgen Chebotar @YevgenChebotar

Excited to present our new work on Actionable Models, an approach for learning functional understanding of the world via goal-conditioned Q-functions in a fully-offline setting! paper: arxiv.org/abs/2104.07749 website: actionable-models.github.io youtube.com/watch?v=S3SCR7…

1
14
30
0
Raul Steleac retweeted
Google DeepMind @GoogleDeepMind
Most RL agents assume that rewards are caused by recent actions, and learn slowly when this isn't true. This new method speeds up learning in tasks with delayed reward by learning to link related events - regardless of how much time separates them. dpmd.ai/12425
10
142
679
0
Raul Steleac retweeted
Demis Hassabis @demishassabis
Thrilled to announce our first major breakthrough in applying AI to a grand challenge in science. #AlphaFold has been validated as a solution to the ‘protein folding problem’ & we hope it will have a big impact on disease understanding and drug discovery: deepmind.com/alphafold-blog
149
1.8K
7.6K
0
Raul Steleac retweeted
sanny @sannykimchi
Everyone has heard about fast.ai or CS231n (for a good reason), but did you know you can access Stanford’s CS224w ML with Graphs or download the book Elements of Causal Inference for free? Thread on underappreciated ML resources 📚🎥 that deserve more love 👇 /1
29
905
3.6K
0