Greg Farquhar (@greg_far) - Twitterプロフィール

固定されたツイート

There’s huge potential in using ‘demonstrations’ from other agents with different goals: to understand which features & dynamics of the environment *might* be important to you; and to borrow from others' behaviours only where they are useful for you.

Angelos Filos@filangelos

👽 PsiPhi-learning 👽 (long talk #ICML) sites.google.com/view/psiphi-le… shows how an agent can use data from the behavior of other agents with diverse goals: to infer their intentions and fulfill its own! 🧵

English

1

8

0

Greg Farquhar@greg_far·3 Ara

This was a great project to work on. Happy to have it published now in @Nature! Meta-learning is important.

Junhyuk Oh@junh_oh

Excited to announce that our work on “Discovering state-of-the-art RL algorithms” is finally published in @Nature! In this work, we meta-learned RL algorithms at scale. Paper: nature.com/articles/s4158… Blog: google-deepmind.github.io/disco_rl/ See thread 👇

English

0

1

146

Greg Farquhar@greg_far·8 Kas

@j_foerst In case it's not where you got this, fyi en.wikipedia.org/wiki/China_bra…

English

1

0

1

117

Jakob Foerster@j_foerst·8 Kas

If we had 10x more humans on the planet we could have a _social_ network that mirrors the human brain, but each "neuron" is a person. 🤯

English

7

1

25

3.6K

Greg Farquhar@greg_far·15 Tem

There are a bunch of ideas in this paper, but it all fits together really neatly! Great work from @filangelos and team 👏

English

0

4

0

Greg Farquhar@greg_far·15 Tem

There’s huge potential in using ‘demonstrations’ from other agents with different goals: to understand which features & dynamics of the environment *might* be important to you; and to borrow from others' behaviours only where they are useful for you.

Angelos Filos@filangelos

👽 PsiPhi-learning 👽 (long talk #ICML) sites.google.com/view/psiphi-le… shows how an agent can use data from the behavior of other agents with diverse goals: to infer their intentions and fulfill its own! 🧵

English

1

8

0

Greg Farquhar@greg_far·5 Eki

@risi1979 Combining Deep Reinforcement Learning and Search for Imperfect-Information Games arxiv.org/abs/2007.13544 from @polynoamial @anton_bakhtin et al. kinda has it all -- clarity, insights, theory, great empirical results, code available 👏

English

0

1

12

0

Sebastian Risi@risi1979·5 Eki

Favourite paper of 2020 so far?

English

4

1

8

0

Greg Farquhar@greg_far·7 Tem

@NandoDF And if you want to have children while awaiting your ILR, no recourse for them that I'm aware of :(

English

0

1

0

Greg Farquhar@greg_far·7 Tem

@NandoDF Yes, acquiring ILR (settled status is I think a similar scheme for EU citizens) takes many years (can be 10 years in some cases) and is very expensive. I went through the absurd process (German citizen lived here 15 years) and it would be be very hard for less privileged folks.

English

1

0

1

0

Nando de Freitas@NandoDF·7 Tem

If you’re born in the UK and other European countries, you are not entitled to European citizenship, even if your parents have residence. Citizenship is mostly about your genetic makeup. To the best of my understanding this is a racist bias.

English

12

5

92

0

Greg Farquhar@greg_far·22 Haz

Permanent damage to generalisation from early updates in non-stationary training -- really enjoyed looking into this intriguing problem and trying to solve it for deep RL agents!

Maximilian Igl@MaxiIgl

Really excited about our new work: In deep RL, we typically collect new data using a non-stationary policy that gets updated as we learn and improve. We show this can impact the learning dynamics of our deep policy and lead to worse generalization arxiv.org/abs/2006.05826 (1/7)

English

0

2

17

0

Greg Farquhar@greg_far·6 May

This is awesome, but I'm a little scared of how much time I might spend playing it myself...

Tim Rocktäschel@_rockt

I am proud to announce the release of the NetHack Learning Environment (NLE)! NetHack is an extremely difficult procedurally-generated grid-world dungeon-crawl game that strikes a great balance between complexity and speed for single-agent reinforcement learning research. 1/

English

0

7

0

Greg Farquhar がリツイート

Tim Rocktäschel@_rockt·5 May

I am proud to announce the release of the NetHack Learning Environment (NLE)! NetHack is an extremely difficult procedurally-generated grid-world dungeon-crawl game that strikes a great balance between complexity and speed for single-agent reinforcement learning research. 1/

GIF

English

14

183

697

0

Greg Farquhar@greg_far·20 Mar

I particularly enjoyed visualising & analysing the learned mixing functions that combine per-agent utilities into joint values!

Mikayel Samvelyan@_samvelyan

Happy to share the extended version of our #QMIX paper “Monotonic Value Function Factorisation for Deep Multi-Agent RL” We include further analysis and ablation studies that investigate how monotonic factorisation of joint Q-val helps QMIX outperform VDN arxiv.org/abs/2003.08839

English

0

2

0

Greg Farquhar@greg_far·27 Eyl

Potential for cool applications in meta-learning, multi-agent learning, etc. If you have ideas or want to chat, let me know or find me at NeurIPS 😀

English

0

7

0

Greg Farquhar@greg_far·27 Eyl

A much-improved 🎲Loaded DiCE🎲 objective lets you easily compute low-variance estimators of any-order derivatives for RL. Paper arxiv.org/abs/1909.10549 and code github.com/oxwhirl/loaded… online, nice working with @shimon8282 and @j_foerst! #NeurIPS2019

English

1

12

61

0

Greg Farquhar がリツイート

Noam Brown@polynoamial·19 Tem

Tuomas Sandholm and I are doing a Reddit AMA now on the #Pluribus poker AI! reddit.com/r/MachineLearn…

English

0

4

25

0

Greg Farquhar@greg_far·16 Tem

AI accelerates by 10x in the hour it takes to repost from r/machinelearning to r/singularityisnear... just how near is it at that rate?? 😱

English

1

13

0

Greg Farquhar@greg_far·1 Tem

Progressively growing the action space creates a great curriculum for learning agents -- check out our paper: arxiv.org/abs/1906.12266 + code: github.com/TorchCraft/Tor…. Great working with Laura Gustafson @ebetica @shimon8282 Nicolas Usunier @syhw

English

0

32

129

0

Greg Farquhar がリツイート

Tim Rocktäschel@_rockt·11 Haz

How can RL agents exploit the compositional, relational and hierarchical structure of the world? A growing number of authors propose learning from natural language. We are excited to share our @IJCAIconf survey of this emerging field! arxiv.org/abs/1906.03926 TL;DR:🤖+📖=📈🎯🏆🥳

English

2

71

249

0

Greg Farquhar がリツイート

Tim Rocktäschel@_rockt·29 Ağu

I had the pleasure to co-supervise outstanding MSc students jointly with Jakob Foerster (@j_foerst) and Greg Farquhar (@greg_far) at @CompSciOxford this year. Together, we compiled our advice for embarking on short-term machine learning research projects: rockt.github.io/2018/08/29/msc…

English

3

87

265

0

Greg Farquhar がリツイート

Maximilian Igl@MaxiIgl·8 Haz

I am very excited to share our ICML paper “Deep Variational Reinforcement Learning (DVRL) for POMDPs”: Our agent learns a model of the environment and acts based on its belief state in this model. w/ @zinmalu @tuananhle7 @frankdonaldwood @shimon8282 arxiv.org/abs/1806.02426

English

0

34

123

0

Greg Farquhar

ディスカバー