Conor Durkan

148 posts

Conor Durkan

@conormdurkan

Research @GoogleDeepMind

New York Katılım Kasım 2016

444 Takip Edilen2.2K Takipçiler

Conor Durkan@conormdurkan·24 Tem

I can attest after using explicit sharding for a couple of months that I feel a deep sense of calm whenever I train models, knowing exactly where all my shards are ahead-of-time.

Cristian Garcia@cgarciae88

highly recommend you try out JAX's new Explicit Sharding API. its more intuitive in that for intermediate computation .sharding will print the actual sharding at that point so you don't have to add with_sharding_constraint everywhere, but its a bit more strict. you can mix-and-match via the new auto_axes transform.

English

1.9K

Conor Durkan@conormdurkan·3 Haz

@FionnualaCrowle @DDoroshow @cardismith 👏👏👏

QME

112

Fionnuala Crowley@FionnualaCrowle·3 Haz

Grateful to everyone who came by and thanks for not pimping me too much on the depths of basic immunology. Thank you to my mentors for these projects @DDoroshow, @cardismith , Dr. Tom Marron. #ASCO25

English

3.4K

Conor Durkan@conormdurkan·2 Haz

@FionnualaCrowle 👏👏👏

QME

Fionnuala Crowley@FionnualaCrowle·2 Haz

I am presenting two posters this afternoon in two different sessions! Come say Hi! #ASCO25

English

Conor Durkan@conormdurkan·18 Şub

I'm rejoining @GoogleDeepMind in NYC this week, looking forward to catching up with folks :)

English

145

15.2K

Conor Durkan@conormdurkan·5 Şub

I find myself manually ensembling LLMs pretty often e.g. compare approaches from r1, o1 pro and o3-mini-high, or go back and forth between Claude and Gemini, asking each for their take on feedback from a 'colleague'. It would be nice if this was built directly into product.

English

1.7K

Conor Durkan@conormdurkan·4 Şub

@jon_barron It's another tool in the editing/personalized-content toolbox (the market is enormous, CapCut has 300M MAUs)

English

229

Conor Durkan@conormdurkan·27 Oca

More than anything, the R1 model and paper make me wonder how far along we'd be if everyone was still clamoring to shout their best ideas from the rooftops. I know it's naïve to think we could sustain open research of the kind we saw up to 2020 indefinitely, but still...

English

1.6K

Conor Durkan@conormdurkan·26 Oca

@gallabytes Separate from tone or style, but I think seeing the thinking tokens in r1 (and Gemini tbf) is quite humanizing.

English

theseriousadult@gallabytes·26 Oca

which model feels the most humanesque

English

612

Conor Durkan@conormdurkan·26 Oca

@michael_nielsen I’ve also not read it for a long time, but I’m currently listening to the audiobook with Andy Serkis narrating, and I’m glad it’s as special as I remember!

English

354

Michael Nielsen@michael_nielsen·26 Oca

Rereading “The Lord of the Rings” for the first time in many years. What a marvelous book it is!

English

167

13.8K

Conor Durkan@conormdurkan·24 Oca

There's something so earnest and endearing about Operator trying to do things

English

844

Conor Durkan@conormdurkan·23 Oca

Dial-up and usage limits from my childhood rushing back

English

734

Conor Durkan@conormdurkan·21 Oca

pxgy.substack.com/p/thoughts-on-…

ZXX

344

Conor Durkan@conormdurkan·21 Oca

I wrote up some thoughts having worked on generative music over the past year. I talk a bit about where tech is now, how it's being used, and where it might go. Link below!

English

1.4K

Conor Durkan@conormdurkan·20 Oca

@finbarrtimbers I'm really interested to see whether proxy (e.g. neural) rewards can be made viable (avoiding hacking etc), or whether there really needs to be a formal ground-truth verifier.

English

272

finbarr@finbarrtimbers·20 Oca

The $$$ dollar question, now, is what sources of data exist for RLVR beyond math

English

4.9K

Conor Durkan@conormdurkan·20 Oca

This is one of those idealized trends that you imagine in a perfect world, but then the results actually bear it out

English

431

Conor Durkan@conormdurkan·8 Oca

@jonkhler @PatrickKidger Maybe I’ll make a more earnest effort to try it out then!

English

Jonas Köhler@jonkhler·8 Oca

@PatrickKidger @conormdurkan This is how I see it as well and where I see the core strength! I don't care about all the bells-and-whistle-norms or whatever preimplemented. I care about a clean and maintainable solution for handling NN state that doesn't get int the way. This is what equinox achieves.

English

171

Conor Durkan@conormdurkan·8 Oca

Trying out NNX having previously switched from Haiku to Linen. Something something fool me once, something something I'll probably just get fooled again.

English

939

Conor Durkan@conormdurkan·8 Oca

First book of the year and it was a doozy. I think I have a soft spot for aerospace engineering--I liked this almost as much as 'Carrying the Fire'.

English

484

Conor Durkan@conormdurkan·8 Oca

@jonkhler @PatrickKidger I like equinox a lot actually, but it doesn't seem to have achieved escape velocity. As amazing as jax is, I do wish there was a standardized NN-framework.

English

113

Jonas Köhler@jonkhler·8 Oca

@conormdurkan Just use equinox ;) @PatrickKidger

English

161

Conor Durkan@conormdurkan·5 Oca

@YunTaTsai1 x.com/conormdurkan/s…

Conor Durkan@conormdurkan

QME

Yun-Ta Tsai@yunta_tsai·5 Oca

A joke in academy, “any sufficiently aged topic would turn into Bayesian”. 😆 Cool paper nonetheless.

Conor Durkan@conormdurkan

I like the Bayesian framing of reward-based post-training (i.e. reward-maximization with a KL penalty). Up to an additive constant, reward functions are log-likelihoods, and the pre-trained model is a prior. Then the posterior target is the product of the likelihoods and prior (the prior KL-weighting can equivalently sharpen or smooth your likelihoods). Rewards can be hard for math or code verification, or soft for subjective preference. This means post-training (of this kind at least) optimizes KL(model || posterior), whereas pre-training optimizes KL(data || model). It also means post-training is mode-seeking (as opposed to mode-covering like pre-training), so those rewards better be well calibrated. (Figure from 'RL with KL penalties is better viewed as Bayesian inference', link below along with other useful references)

English

9.2K

Keşfet

@FionnualaCrowle @DDoroshow @cardismith @GoogleDeepMind @jon_barron @gallabytes @michael_nielsen @finbarrtimbers