Raul Steleac

20 posts

Raul Steleac

@steleac

PhD Student @EdinburghUni; studying temporally extended behaviours in both single and multi-agent RL

Beigetreten Ağustos 2019

94 Folgt22 Follower

Angehefteter Tweet

Raul Steleac@steleac·20 Nis

Really excited to present our recent work at #ICLR2026 this week! We discover highly coordinated joint behaviours and integrate them into the skill sets of MARL agents, accelerating the search for effective joint strategies in downstream tasks.🧵 Paper: raulsteleac.github.io/iaro

English

1.5K

Raul Steleac@steleac·20 Nis

Work done under the supervision of Mohan Sridharan and @dabelcs (many thanks)! If you’re at #ICLR2026 in Rio and want to chat, come find me during Poster Session 2! 🔥

English

Raul Steleac@steleac·20 Nis

Finally, we use this multi-dimensional n-distance as a state representation for eigenoption discovery, leading to coordinated alignment patterns that are effective in aiding teams of agents in multiple downstream tasks. Also works with heterogeneous agent state spaces.

English

Raul Steleac@steleac·20 Nis

English

1.5K

Raul Steleac retweetet

Frank Dellaert@fdellaert·22 Nis

Nice! (could not resist)

Songyou Peng@songyoupeng

🚨 Are neural implicit representations applicable for larger-scale SLAM? Check out our NICE-SLAM👍! #CVPR2022 NICE website: pengsongyou.github.io/nice-slam NICE code: github.com/cvg/nice-slam NICE collaborations w/ Zihan Zhu (undergrad) @visionviktor @Martin_R_Oswald @mapo1 et al. 1/6

Atlanta, GA 🇺🇸 English

Raul Steleac retweetet

yobibyte@y0b1byte·20 Nis

A lot of people complain that RL doesn't work and RL researchers are still playing games. While this criticism is true to some extent, there's been a new trend of applying RL for real-life problems. This is a thread of notable papers split by the topic. 1/n

GIF

English

140

746

Raul Steleac retweetet

Scott Bryan@scottygb·14 Haz

23 and 24 year olds able to book their NHS vaccine appointments from tomorrow.

English

241

Raul Steleac retweetet

Google DeepMind@GoogleDeepMind·26 May

Discover how WaveNet has evolved from research concept to advanced real-world system that creates more natural-sounding speech and helps @Google unblock communication barriers for millions of people around the world: dpmd.ai/wavenet

English

208

Raul Steleac retweetet

Bwipo@Bwipo·14 May

Chongus

Filipino

2.9K

Raul Steleac retweetet

Aran Komatsuzaki@arankomatsuzaki·12 May

Diffusion Models Beat GANs on Image Synthesis Achieves 3.85 FID on ImageNet 512×512 and matches BigGAN-deep even with as few as 25 forward passes per sample, all while maintaining better coverage of the distribution. arxiv.org/abs/2105.05233

English

106

581

Raul Steleac retweetet

Google DeepMind@GoogleDeepMind·3 May

Today @iclr_conf - Women in Machine Learning (@WIML) at 2PM - Philosophy and AGI at 5PM with @dabelcs, @clarelyle and @jakeABeck (@UniOfOxford) There are also various poster sessions happening today from 5PM - see the full schedule here: dpmd.ai/ICLR21 #ICLR2021

English

120

Raul Steleac retweetet

Karol Hausman@hausman_k·20 Nis

In addition to MT-Opt, we are releasing Actionable Models, which addresses the problem of defining tasks (which becomes quite cumbersome at scale). This work uses the dataset collected by MT-Opt but uses goal-conditioned offline Q-learning to learn a general goal-reaching policy.

GIF

Yevgen Chebotar@YevgenChebotar

Excited to present our new work on Actionable Models, an approach for learning functional understanding of the world via goal-conditioned Q-functions in a fully-offline setting! paper: arxiv.org/abs/2104.07749 website: actionable-models.github.io youtube.com/watch?v=S3SCR7…

English

Raul Steleac retweetet

Google DeepMind@GoogleDeepMind·23 Mar

Most RL agents assume that rewards are caused by recent actions, and learn slowly when this isn't true. This new method speeds up learning in tasks with delayed reward by learning to link related events - regardless of how much time separates them. dpmd.ai/12425

English

142

679

Raul Steleac retweetet

Demis Hassabis@demishassabis·30 Kas

Thrilled to announce our first major breakthrough in applying AI to a grand challenge in science. #AlphaFold has been validated as a solution to the ‘protein folding problem’ & we hope it will have a big impact on disease understanding and drug discovery: deepmind.com/alphafold-blog

English

149

1.8K

7.6K

Raul Steleac retweetet

Paula Gherghinescu@paulag_astro·29 Kas

The break conundrum

dinosaur@dinosaurcouch

English

Raul Steleac retweetet

sanny@sannykimchi·14 Ağu

Everyone has heard about fast.ai or CS231n (for a good reason), but did you know you can access Stanford’s CS224w ML with Graphs or download the book Elements of Causal Inference for free? Thread on underappreciated ML resources 📚🎥 that deserve more love 👇 /1

English

905

3.6K

Raul Steleac@steleac·16 Ağu

❤️

ART

Entdecken

@dabelcs @Google @iclr_conf @WIML @clarelyle @jakeABeck @UniOfOxford @elonmusk