Mike Dodds

8.8K posts

@miike

Formal methods enthusiast. Principal scientist at @Galois. English immigrant. Nitwit. Opinions my own.

Portland, OR · Joined February 2008

598 Following · 1.2K Followers

Mike Dodds@miike·1d

@intoverflow Right, one of the intriguing things is the agents might find a much shorter proof! Or a much longer one for that matter. My guess is the agents need abstractions just the same as humans so it wouldn’t just be a big incomprehensible blob

Tim Carstens Ⓥ✨ is hacking 🤖@intoverflow·1d

@miike I wonder if it would take the same approach, and how the answer to that Q changes things

Mike Dodds@miike·1d

Someone should build seL4-ablate-bench. Progressively delete proofs, lemmas, theorems and see how much a long-running AI agent can reconstruct. End state: just give the AI the seL4 code + top spec, and re-synthesise the whole 1m+ line Isabelle proof

Mike Dodds@miike·1d

@ember_arlynx You should talk to @qd_forall !

cmr://ember@ember_arlynx·1d

@miike love this idea! happy to collab w/anyone wanting to experiment with this (former sel4 contrib)

Mike Dodds@miike·1d

Fun exercise: predict what year an agent with sufficient scaffolding could reconstruct the entire seL4 proof. Estimates at Galois ranged from “2040” to “this year” :)

Mike Dodds reposted

Ole Q Doc@qd_forall·1d

working with a SPAR (sparai.org) fellow to get this done rn

Mike Dodds@miike

Mike Dodds@miike·1d

@SMT_Solvers I guess I mean how do you trust your spec vs real libc? libc spec is deliberately under-specified and also relies on the semantics of kernel syscalls, global memory state, many hard problems

Chad Brewbaker@SMT_Solvers·1d

@miike Like any proof you just have to trust it up to axioms and proof kernel.

Mike Dodds@miike·1d

@SMT_Solvers csmith is an amazing tool! Another v useful project would be “csmith for everything” - quickly stand up generators for arbitrary input languages (maybe this already exists?)

Mike Dodds@miike·1d

@SMT_Solvers This would also be extremely useful & presents some interesting problems. Eg, how could you get an agent to help? How would you trust the spec afterwards?

Chad Brewbaker@SMT_Solvers·1d

@miike More fun exercise. Spec libc in enough detail that you can make safer C software at scale.

Chad Brewbaker@SMT_Solvers·1d

@miike Reminds me of github.com/csmith-project…

Mike Dodds@miike·1d

PS: I’ve heard @qd_forall may be working on similar ideas

Mike Dodds reposted

Ilya Sergey@ilyasergey·1d

New on "Proofs and Intuitions": Verifying Move Borrow Checker in Lean: an Experiment in AI-Assisted PL Metatheory. proofsandintuitions.net/2026/03/18/mov… The gist: I formalised Move's type system in Lean: 39KLOC, under a month, with Claude. Person-years in PL research are now person-weeks.

Mike Dodds reposted

Arpit Gupta@arpitrage·2d

Leopold Aschenbrenner predicted in June 2024 that we would get a dramatic improvement in AI capabilities around the turn of 2026 due to the switch from chatbots to agents, which he thought would unlock a new set of AI capabilities. Which is basically exactly what happened?

Mike Dodds reposted

roon@tszzl·5 Mar

@memeticweaver @tautologer > the USG can in general do whatever they want the founders of this great nation fought several bloody wars to make sure this is not true

Mike Dodds reposted

Josh RR Jokien@joshcarlosjosh·6 Aug

“I wish it need not have happened in my time,” said Frodo. “lmao” said Gandalf, “well it has.”

Mike Dodds@miike·5 Mar

@nielstron @HarmonicMath Yeah open for a while! I wrote an mcp server for calling Aristotle from Claude: github.com/septract/lean-…

Niels Mündler@nielstron·5 Mar

@miike @HarmonicMath wait they... are open now! I requested access and never got a mail that I got approved - I guess because I don't have to now.

Mike Dodds@miike·5 Mar

I formalised the Knuth / Stappers / Claude theorem in Lean4. Claude for the scaffolding and @HarmonicMath for the core proofs (Disclaimer: core theorems look plausible to me, but mistakes possible)

Tenobrus@tenobrus

Donald Knuth is vibemathing now. real tough day for the stochastic-parrot crew.

Mike Dodds reposted

Axiom@axiommathai·5 Mar

1/ RELEASING AXLE: the Axiom Lean Engine ⚙️ We are serving our core Infrastructure for formal proving at scale. These are the same Lean metaprogramming tools that are behind AxiomProver, powering it to win Putnam and crack open research conjectures. Available to anyone today!

Mike Dodds@miike·5 Mar

This isn’t the fully general theorem about all 760 such decompositions (what Knuth proves) - just the single construction that Claude found. If I get another sick day tomorrow I might try to prove the general result :)

Mike Dodds@miike·5 Mar

github.com/septract/claud…
