Dylan R. Ashley

25 posts

Dylan R. Ashley

@oneDylanAshley

PhD student studying reinforcement learning with @SchmidhuberAI. MSc with Rich Sutton of @rlai_lab. Sometimes amateur photographer. Opinions are my own.

Lugano, Switzerland Katılım Ağustos 2018

213 Takip Edilen202 Takipçiler

Dylan R. Ashley@oneDylanAshley·25 Nis

I enjoyed being part of this #RL #robotics work with @YanningD_AI, @YuhuiWangAI, and @SchmidhuberAI. Interestingly, treating morphology-control co-design as a Stackelberg game cleanly captures the bi-level coupling that many existing methods sidestep.

Jürgen Schmidhuber@SchmidhuberAI

Using only box-forwarding speed as the reward, our Stackelberg PPO automatically evolves robots with arms for pushing and legs for moving. The key idea is a novel game-theoretic view of structure–control co-design, yielding more effective optimization and dramatically better designs. Come see our poster at ICLR 2026 on Apr 25, 10:30 AM, at P4-#4810. With @YuhuiWangAI, @YanningD_AI, @oneDylanAshley. Paper: arxiv.org/abs/2603.15388 Project Page: yanningdai.github.io/stackelberg-pp…

English

144

Dylan R. Ashley@oneDylanAshley·6 Mar

So proud of my MSc supervisor for winning the #TuringAward (the #NobelPrize in CS). If there’s one thing that’s always set @RichardSSutton a peg above the average researcher I’ve met, it’s undoubtedly the absurd degree of his dedication to advancing human knowledge.

English

191

Dylan R. Ashley@oneDylanAshley·23 Şub

Definitely the longest paper I've been a part of so far. Some pretty intense theory that gives one of the first peeks into the depths of these algorithms.

Jürgen Schmidhuber@SchmidhuberAI

Do you like RL and math? Our collaboration, IDSIA-KAUST-NNAISENSE, has the most detailed exploration of the convergence and stability of modern RL frameworks like Upside-Down RL, Online Decision Transformers, and Goal-Conditioned Supervised Learning arxiv.org/abs/2502.05672

English

190

Dylan R. Ashley@oneDylanAshley·2 Ara

Can confirm that it’s pretty swish. There’s some intense people that win this.

Francesco Faccio@FaccioAI

Are you a rising star in AI? 🌟 Join us as a speaker for the 4th edition of the KAUST Rising Stars in AI Symposium. In the past 2 years co-organizing this event, I've met incredible researchers now in top industrial and academic positions worldwide. More info: 📅 Event date: April 7-10, 2025 ⏳ Application deadline: December 18 🔗 Apply here: kaust.edu.sa/en/news/rising…

English

217

Dylan R. Ashley retweetledi

Scholarship for PhD@ScholarshipfPhd·25 Kas

ZXX

161

121.5K

Dylan R. Ashley@oneDylanAshley·14 Kas

We’ll shortly be presenting at ISMIR a small follow-up to our narrative essence work, where we looked at how well a transformer can accomplish automatic album sequencing. The answer is not as well as a narrative essence approach. You can read more here: arxiv.org/abs/2411.07772

English

189

Dylan R. Ashley@oneDylanAshley·14 Kas

Check out @idivinci’s post for even more details: x.com/idivinci/statu… (9/n)

Vincent Herrmann@idivinci

Excited to share new work done with @oneDylanAshley, Zachary Friggstad and @SchmidhuberAI on stories, music, movies and information theory, published in IEEE TPAMI📚🎶 🎬🤖 Below a quick summary and a web tool you can use to sort your playlists:) 1/n doi.org/10.1109/TPAMI.…

English

465

Dylan R. Ashley@oneDylanAshley·14 Kas

We also have a neat web app demo based on these ideas that gives you good orderings for your music 🎵 playlists: story-distiller.streamlit.app (8/n)

English

408

Dylan R. Ashley@oneDylanAshley·14 Kas

Have you ever wondered what happens when you reduce a story to a low-dimensional latent representation? Well my new IEEE TPAMI work with @idivinci, Zachary Friggstad, and @SchmidhuberAI shows that it has some pretty cool applications 🎉 Check it out: doi.org/10.1109/TPAMI.… (1/n)

English

21.7K

Dylan R. Ashley@oneDylanAshley·30 Eki

Our poster session starts in a few minutes. Come by and check out our work!

Dylan R. Ashley@oneDylanAshley

Some very cool #AI #DeepLearning #RL work to have been involved with. Interestingly, there's no clear bound on the depth of the network or general scalability here, so a lot of potential.

English

197

Dylan R. Ashley@oneDylanAshley·25 Eki

Some quite groundbreaking #LLM #deeplearning #AI work to have been a part of. All good frontier benchmarks should include an a non-trivial automatic evaluator.

Mingchen Zhuge@MingchenZhuge

🔔 new 𝗔𝗴𝗲𝗻𝘁-𝗮𝘀-𝗮-𝗝𝘂𝗱𝗴𝗲 paper: 𝗖𝗮𝗻 𝗔𝗜 𝗮𝗴𝗲𝗻𝘁𝘀 𝗲𝘃𝗮𝗹𝘂𝗮𝘁𝗲 𝗔𝗜 𝗮𝗴𝗲𝗻𝘁𝘀 𝗮𝘀 𝗲𝗳𝗳𝗲𝗰𝘁𝗶𝘃𝗲𝗹𝘆 𝗮𝘀 𝗵𝘂𝗺𝗮𝗻𝘀? 𝗬𝗲𝘀, 𝘁𝗵𝗲𝘆 𝗰𝗮𝗻! 📄 arxiv.org/abs/2410.10934… 👨‍💻 github.com/metauto-ai/age… Introducing 𝗔𝗴𝗲𝗻𝘁-𝗮𝘀-𝗮-𝗝𝘂𝗱𝗴𝗲, a groundbreaking proof-of-concept that reduces costs and time by 97%, while providing rich, intermediate feedback. It precisely captures the natural step-by-step processes of agentic systems. We also developed 𝗗𝗲𝘃𝗔𝗜, a new benchmark featuring 55 automated AI development tasks and 365 requirements. Agent-as-a-Judge not only outperforms LLM-as-a-Judge but also closely mirrors human evaluations with greater efficiency and precision. The real game-changer? It provides reliable reward signals, paving the way for scalable, self-improving agentic systems. Thanks my Meta/KAUST mentors/peers/collaborators @SchmidhuberAI @tydsh @zechunliu @vikasc @YoungXiong1 @vikasc @Obs01ete @erniecyc @oneDylanAshley ...

English

220

Dylan R. Ashley retweetledi

Kishan@jst_kishan·12 Eyl

Why pay for Claude, when I can get my code written by amazon.

English

202

1.1K

21.4K

1.1M

Dylan R. Ashley@oneDylanAshley·6 Ağu

Some very cool #AI #DeepLearning #RL work to have been involved with. Interestingly, there's no clear bound on the depth of the network or general scalability here, so a lot of potential.

Francesco Faccio@FaccioAI

Can neural networks with 5000 layers improve long-term planning? 🤖 Check out our latest research with @SchmidhuberAI, @oneDylanAshley, and team: arxiv.org/abs/2406.08404 #AI #DeepLearning #RL

English

352

Dylan R. Ashley@oneDylanAshley·12 Mar

The old city of #jeddah: #albalad instagram.com/p/CpsWgLfDXdk/…

English

183

Dylan R. Ashley retweetledi

Kai Arulkumaran@kaixhin·25 Şub

Finally had some time to write down some thoughts. All You Need Is Supervised Learning: From Imitation Learning to Meta-RL With Upside Down RL w/ @rupspace @oneDylanAshley @SchmidhuberAI arxiv.org/abs/2202.11960

English

171

Dylan R. Ashley retweetledi

Kai Arulkumaran@kaixhin·1 May

Decidated to everyone else who's had to live through the horror 💀

English

157

Keşfet

@YanningD_AI @YuhuiWangAI @SchmidhuberAI @RichardSSutton @idivinci @rupspace @elonmusk @BarackObama