TimDarcet
1.4K posts

@TimDarcet
codegen @ FAIR, prev. DINO stuff @ INRIA & FAIR

@npparikh I doubt all those things are really possible. In fact, I believe you are not doing a good PhD unless you have sleepless nights. Just working on your thesis is definitely possible if you follow a 9-6 schedule, but a good PhD, which involves exploring, collabs, etc., needs extra hours



Interesting article: time.com/article/2026/0…

And you realize that Kaiming He is the GOAT when you see that he wrote only 96 papers (people with his h-index usually have hundreds)






Nothing shockingly dumb?

Is it reasonable to consider that since the HBM3 memory of an H100 has a bandwidth of ~3 TB/s and the chip can do ~900 TFLOPS, a rule of thumb is that every bfloat16 value loaded should be reused ~600 times?
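A minimal sketch of the arithmetic behind that rule of thumb, using the rough figures from the post (~3 TB/s, ~900 TFLOPS) and assuming 2 bytes per bfloat16. The function and constant names are hypothetical, and "reuse" is read here as FLOPs performed per value loaded from HBM:

```python
def flops_per_element(peak_flops: float, bandwidth_bytes_per_s: float,
                      bytes_per_element: int) -> float:
    """FLOPs needed per loaded element to keep compute and memory balanced."""
    # How many elements per second the memory system can deliver.
    elements_per_s = bandwidth_bytes_per_s / bytes_per_element
    # Below this ratio the kernel is memory-bound, above it compute-bound.
    return peak_flops / elements_per_s


# Approximate figures quoted in the post (assumptions, not datasheet values).
PEAK_FLOPS = 900e12      # ~900 TFLOPS
HBM_BANDWIDTH = 3e12     # ~3 TB/s, in bytes per second
BF16_BYTES = 2           # a bfloat16 value occupies 2 bytes

reuse = flops_per_element(PEAK_FLOPS, HBM_BANDWIDTH, BF16_BYTES)
flops_per_byte = PEAK_FLOPS / HBM_BANDWIDTH

print(f"FLOPs per bf16 value loaded: {reuse:.0f}")
print(f"FLOPs per byte loaded: {flops_per_byte:.0f}")
```

With these inputs the ratio works out to about 600 FLOPs per bfloat16 value, or 300 FLOPs per byte. Note the figure only holds if the bandwidth is in bytes (TB/s); in bits it would be 8x larger.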

every great research paper I've read has this shape:
- absolutely stellar philosophical reasoning about why their structure is the purest and most logical thing
- the dankest duct tape you ever seen in your life to make this thing even start


1/ today we're releasing muse spark, the first model from MSL. nine months ago we rebuilt our ai stack from scratch. new infrastructure, new architecture, new data pipelines. muse spark is the result of that work, and now it powers meta ai. 🧵













