Dylan R. Ashley

25 posts

@oneDylanAshley

PhD student studying reinforcement learning with @SchmidhuberAI. MSc with Rich Sutton of @rlai_lab. Sometimes amateur photographer. Opinions are my own.

Lugano, Switzerland · Joined August 2018
213 Following · 202 Followers
Dylan R. Ashley (@oneDylanAshley)
So proud of my MSc supervisor for winning the #TuringAward (the #NobelPrize in CS). If there’s one thing that’s always set @RichardSSutton a peg above the average researcher I’ve met, it’s undoubtedly the absurd degree of his dedication to advancing human knowledge.
Replies: 1 · Reposts: 0 · Likes: 7 · Views: 191
Dylan R. Ashley (@oneDylanAshley)
We’ll shortly be presenting at ISMIR a small follow-up to our narrative essence work, where we looked at how well a transformer can accomplish automatic album sequencing. The answer is not as well as a narrative essence approach. You can read more here: arxiv.org/abs/2411.07772
Replies: 0 · Reposts: 0 · Likes: 0 · Views: 189
Dylan R. Ashley (@oneDylanAshley)
Have you ever wondered what happens when you reduce a story to a low-dimensional latent representation? Well, my new IEEE TPAMI work with @idivinci, Zachary Friggstad, and @SchmidhuberAI shows that it has some pretty cool applications 🎉 Check it out: doi.org/10.1109/TPAMI.… (1/n)
Replies: 1 · Reposts: 13 · Likes: 43 · Views: 21.7K
Dylan R. Ashley (@oneDylanAshley)
Some quite groundbreaking #LLM #deeplearning #AI work to have been a part of. All good frontier benchmarks should include a non-trivial automatic evaluator.
Mingchen Zhuge (@MingchenZhuge)

🔔 New Agent-as-a-Judge paper: Can AI agents evaluate AI agents as effectively as humans? Yes, they can! 📄 arxiv.org/abs/2410.10934… 👨‍💻 github.com/metauto-ai/age… Introducing Agent-as-a-Judge, a groundbreaking proof-of-concept that reduces costs and time by 97%, while providing rich, intermediate feedback. It precisely captures the natural step-by-step processes of agentic systems. We also developed DevAI, a new benchmark featuring 55 automated AI development tasks and 365 requirements. Agent-as-a-Judge not only outperforms LLM-as-a-Judge but also closely mirrors human evaluations with greater efficiency and precision. The real game-changer? It provides reliable reward signals, paving the way for scalable, self-improving agentic systems. Thanks to my Meta/KAUST mentors, peers, and collaborators @SchmidhuberAI @tydsh @zechunliu @vikasc @YoungXiong1 @vikasc @Obs01ete @erniecyc @oneDylanAshley ...

Replies: 0 · Reposts: 0 · Likes: 2 · Views: 220
Dylan R. Ashley reposted
Kishan (@jst_kishan)
Why pay for Claude when I can get my code written by Amazon?
Replies: 202 · Reposts: 1.1K · Likes: 21.4K · Views: 1.1M
Dylan R. Ashley reposted
Kai Arulkumaran (@kaixhin)
Dedicated to everyone else who's had to live through the horror 💀
Replies: 5 · Reposts: 18 · Likes: 157 · Views: 0