
👽 PsiPhi-learning 👽 (long talk #ICML) sites.google.com/view/psiphi-le… shows how an agent can use data from the behavior of other agents with diverse goals: to infer their intentions and fulfill its own! 🧵
Greg Farquhar
24 posts


👽 PsiPhi-learning 👽 (long talk #ICML) sites.google.com/view/psiphi-le… shows how an agent can use data from the behavior of other agents with diverse goals: to infer their intentions and fulfill its own! 🧵

Excited to announce that our work on “Discovering state-of-the-art RL algorithms” is finally published in @Nature! In this work, we meta-learned RL algorithms at scale. Paper: nature.com/articles/s4158… Blog: google-deepmind.github.io/disco_rl/ See thread 👇



👽 PsiPhi-learning 👽 (long talk #ICML) sites.google.com/view/psiphi-le… shows how an agent can use data from the behavior of other agents with diverse goals: to infer their intentions and fulfill its own! 🧵




Really excited about our new work: In deep RL, we typically collect new data using a non-stationary policy that gets updated as we learn and improve. We show this can impact the learning dynamics of our deep policy and lead to worse generalization arxiv.org/abs/2006.05826 (1/7)

I am proud to announce the release of the NetHack Learning Environment (NLE)! NetHack is an extremely difficult procedurally-generated grid-world dungeon-crawl game that strikes a great balance between complexity and speed for single-agent reinforcement learning research. 1/


Happy to share the extended version of our #QMIX paper “Monotonic Value Function Factorisation for Deep Multi-Agent RL” We include further analysis and ablation studies that investigate how monotonic factorisation of joint Q-val helps QMIX outperform VDN arxiv.org/abs/2003.08839














