Edward Lockhart

69 posts

Edward Lockhart

Edward Lockhart

@freeetext

Researcher / Engineer at @GoogleDeepMind Co-led our IMO 🥇 effort Now working at the intersection of formal math and LLMs

London, United Kingdom Katılım Mart 2024
210 Takip Edilen196 Takipçiler
Richard Vermillion
Richard Vermillion@rivermillion·
@grahamscheper summand: something which is to be added. mulitiplicand: something which is to be multiplied
English
1
0
13
778
Grǣġhama
Grǣġhama@grahamscheper·
There’s a cool feature in Latin, the gerundive, which is a verbal adjective implying that something should/must happen. A few of them survive into English words: If someone should be revered, they are “reverend”. If something must be cut away, it is a “dividend”. My favorite: if a story is so good that it simply must be read by everyone, it is “legend”! Latin reverendus/a, dividendum, and legenda. Pretty cool that what is sometimes a difficult concept for Latin learners is actually present in English, albeit rarely. Do let me know if there are any good ones I’ve missed.
English
149
392
4.3K
209.7K
Edward Lockhart
Edward Lockhart@freeetext·
@samth @finn_hulse You could use ~abs(popcount - log2 n) but the distribution isn't so easy to work with, and you get less resolution. Eg for 4 bits, the freqs are 2 8 6 vs 1 1 2 4 8 for trailing zeroes
English
0
0
0
36
Finn Hulse
Finn Hulse@finn_hulse·
i had to retire my favorite technical interview problem so it is time to ask it to my loyal followers (solution in replies) given a stream of N not necessarily distinct integers from an O(N) sized universe, for some massive N, find a way to estimate how many distinct integers appear, only using O(log(log(N)) persistent storage use 5 lines of pseudocode there was a time in my life where i wouldn't work with someone who couldn't answer this
English
38
7
456
97K
Edward Lockhart
Edward Lockhart@freeetext·
@_Mira___Mira_ I think there may be too many theorems for retrieval on its own to be a useful general strategy.
English
0
0
0
160
Mira
Mira@_Mira___Mira_·
I want the world's biggest proof database. Pose any theorem and bots will try to prove it. If they fail, then it's an open question. People can collaborate to prove it or make partial progress, generalize it, use it as an axiom. As AI improves, we become the "Google of math".
Mira tweet media
English
26
14
399
16.8K
Edward Lockhart
Edward Lockhart@freeetext·
@Imported_Fun @IMAO_ Also on a single drive machine, A and B were potentially two different removable floppy disks time-sharing the same drive.
English
0
0
2
84
American ExPat
American ExPat@Imported_Fun·
@IMAO_ A and B were floppy drives and predate hard drives. My first PC only had 2 360k floppy drives. A was for the program and B was for your work. When I got my first 5 MB hard drive, I thought it was so huge I’d never need to get a bigger one.
English
25
2
413
4.2K
Frank J. Fleming
Frank J. Fleming@IMAO_·
Maybe it’s time to admit the C drive should be the A drive (and what even was the B drive?).
English
607
13
631
71.1K
Maya Forstater
Maya Forstater@MForstater·
One of the many things wrong with the Sandie Peggie judgment. This "quote" from my judgment doesn't come from my judgment. It is completely made up. 🤯
Maya Forstater tweet media
English
287
1.3K
5.7K
998.6K
Harmonic
Harmonic@HarmonicMath·
Now that AI tools such as Aristotle are becoming increasingly powerful, it becomes possible to undertake a rigorous study of proof complexity, in ways that just weren’t feasible prior. Running problems through the same AI model under roughly the same conditions and comparing average proof lengths is likely the recipe for a good quantitative measure of problem difficulty / proof complexity, similar to Kolmogorov complexity. Please reach out to us if you’re interested in undertaking such a study.
Jasper@zjasper

@HarmonicMath It’s possible that Putnam problems might be easier than IMO in terms of logic complexity but just comparing Lean4 proof length for different problems are not rigorous. And even if you try to compare the proof length, how can you make sure you use the shortest proof?

English
9
5
86
40.8K
Edward Lockhart
Edward Lockhart@freeetext·
@HarmonicMath @zjasper A proof with many obvious steps may be simpler than a shorter proof where some steps require non-obvious insight. My guess is that some measure of the time taken to find the proof would be more informative. Information content / entropy might also be revealing.
English
3
0
3
88
Harmonic
Harmonic@HarmonicMath·
@zjasper There are likely other factors one would need to consider, such as genus and proof depth, but minimal proof length would certainly be a major factor.
English
1
0
5
437
Dmitrii Kovanikov
Dmitrii Kovanikov@ChShersh·
Here’s a real task from my job. I have a 100GB binary file. Produced daily. I can’t grep it. But I can decode it. However, I can’t store the decoded version either. It’s too big. How do I efficiently query it? Decoding piped to grep takes 2 minutes. I want 2 seconds.
English
1K
73
3K
2.1M
Alex Imas
Alex Imas@alexolegimas·
@alz_zyd_ @danielrock Yes, but it's interesting that a machine with "only" ~100 billion weights with no loops or ability to iterate forward and backwards can do it.
English
2
0
10
1.4K
Alex Imas
Alex Imas@alexolegimas·
Read Wolfram's excellent "What is ChatGPT Doing..." (h/t @danielrock). He writes that we learned a lot about how language works from fact that GPT3, with only 175 billion weights, is able to emulate it so well. This implies it's computationally a lot simpler than we may have thought. But what about math? At time this was written (2023), GPT was still very bad at math. The models became very (very) good at math when the first reasoning model came out (o1), which relied a lot more on reinforcement learning rather than just brute force pretraining. Wonder what this says about math? Conceptually, language is a lot "fuzzier" than math: multiple words can sound "right" in the same spot in a sentence. This is what makes the probabilistic LLM architecture work. Math is less fuzzy. This is perhaps why the more "rule based" RL step was crucial. But this also implies formal math is less computationally complex than we thought. Thoughts? @littmath @alz_zyd_
Alex Imas tweet media
English
26
101
799
64.9K
Carina Hong
Carina Hong@CarinaLHong·
Putnam, the world's hardest undergrad math contest, ended 4pm PT yesterday. By 3:58pm, AxiomProver @axiommathai autonomously solved 8/12 of Putnam2025 in Lean, a 100% verifiable language. Last year, our score would've been #4 of ~4000 and a Putnam Fellow (top 10 in recent yrs)
English
53
125
1.3K
453.2K
Edward Lockhart
Edward Lockhart@freeetext·
@DoozerDiffuser @TMcirony22088 @pickover Nonstandard models are only equivalent inside PA. From the outside they're clear different. They're generally excluded by moving to second-order logic and a stronger induction axiom.
English
0
0
0
14
🜛∞
🜛∞@DoozerDiffuser·
@freeetext @TMcirony22088 @pickover First order logic cannot define N up to isomorphism. First order PA has *non-standard models.* And calling them "non-standard" is tantamount to ad hominem. If they match the axioms, they are equally valid, and probably interchangeable.
English
1
0
0
33
Cliff Pickover
Cliff Pickover@pickover·
Mathematics. A mathematician emerges from a cave, hands you the slip of paper below, and asks "Is the sum of all real numbers equal to zero?" What is your response?
Cliff Pickover tweet media
English
241
30
423
56K
Edward Lockhart
Edward Lockhart@freeetext·
@DoozerDiffuser @TMcirony22088 @pickover The natural numbers are by definition each defined by a finite formula, i.e. a finitely many applications of the successor function. If you want to infinitely many applications of successor you need a different type of logic and what you get won't be the natural numbers.
English
1
0
0
29
🜛∞
🜛∞@DoozerDiffuser·
@freeetext @TMcirony22088 @pickover Tell me exactly what did I do wrong? For the arrow: you can say that's equals, you could say it's a limit as x approaches infinity. But I think you get exactly what I'm saying but you're assuming I'm wrong because "how could all of mathematics have it wrong?" Right?
English
1
0
0
46
🜛∞
🜛∞@DoozerDiffuser·
@freeetext @TMcirony22088 @pickover I take a subset of N (0,x] not incl. 0 for simplicity. How many elements are in the subset? x. How many times did successor get applied? x. How many digits are in the largest number? x /mod 10. Now x->Infinity Infinite elements Infinite successor Infinite digits
English
1
0
0
24
🜛∞
🜛∞@DoozerDiffuser·
@freeetext @TMcirony22088 @pickover It does. Otherwise it would extend to infinity. Is it unbounded? NOPE. R has infinity elements between zero and one, because those elements have *infinite information*. You can NOT store infinite information in a finite space. So you must choose: infinite or finite?
English
1
0
0
24
🜛∞
🜛∞@DoozerDiffuser·
@freeetext @TMcirony22088 @pickover Ok fine, i wont bother to ask where you got that (finite) rule from, I'll just accept it. So if successor cannot be called infinitely many times, then all numbers in N are finite. If N had a defined maximum value, that maximum value would be a finite number. N is finite.
English
1
0
0
18
Edward Lockhart
Edward Lockhart@freeetext·
@michaelpforan I think it's a fair question in some ways. It was clear since the 2022 FWS ruling, if not before, that people without a GRC remained their biological sex. So why did Girl Guides wait until FWS 2025 (plus 8 months) before acting?
English
5
0
17
1.4K