
Misha Denil
1.4K posts

Misha Denil
@notmisha
I tweet about things that interest me, mostly machine learning things. ex-DeepMind.



Like @davidbessis and others, I think that Hinton is wrong. To explain why, let me tell you a brief story. About a decade ago, in 2017, I developed an automated theorem-proving framework that was ultimately integrated into Mathematica (see: youtube.com/watch?v=mMaid2…) (1/15)


"Modern ML is built on Linear Algebra". lol no its not.


All LLM evaluations are system evaluations. The LLM just sits there on disk. To get it do something, you need at least a prompt and a sampling strategy. Once you choose these, you have a system. The most informative evaluations will use optimal combinations of system components.

Due to popular demand, I've updated this figure to include DeepSeek-V2 and Mistral Large 2. It's also more zoomed for readability.


Excited to announce the release of TORAX, a tokamak transport simulator from our @GoogleDeepMind Fusion team! #fusionenergy - Open-source: github.com/google-deepmin… - Uses JAX: fast, differentiable - Easy coupling of ML-surrogates Hot off the press → arxiv.org/abs/2406.06718

I mean

This part is huge: ❝ plaintiffs have plausibly alleged facts to suggest compress copies, or effective compressed copies albeit stored as mathematical information ❞ Model is not a derivative, it's a database. storage.courtlistener.com/recap/gov.usco…



"Is this behavior emergent or does it come from the data?" is not a debate we should be having. All emergent behavior comes from the data. It's true of humans and it's true of AI. None of us has ever magically pulled anything out of the ether.






