
Tom Nicholson
6.2K posts

Tom Nicholson
@TFWNicholson
Building the cognitive architecture for whatever comes next - @_mindanu https://t.co/usNSoYlpuz - AI from @Cambridge_Uni - 20+ years coding experience
Cambridge, England Joined September 2007
864 Following 546 Followers
Pinned Tweet

@elonmusk If human lifetimes are extended, maybe it doesn't matter. We have AI to come up with new ideas, so the necrosis of thought that comes from people not dying doesn't matter. This is neither good nor bad, it just is

@arankomatsuzaki Re parallel agents: tree of thoughts type branching etc? At every decision point, try them all and see what the outcome of that step is, rinse, repeat, and backtrack
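
A minimal sketch of the "try every branch, check the outcome, backtrack" loop described in the reply above. It is plain depth-first backtracking, not any particular Tree of Thoughts implementation. All names here (`branch_and_backtrack`, `expand`, `is_goal`, `is_dead_end`, `toy_search`) are hypothetical, and the toy arithmetic puzzle only stands in for real agent decision points.

```python
from typing import Callable, Iterable, List, Optional, TypeVar

S = TypeVar("S")


def branch_and_backtrack(
    state: S,
    expand: Callable[[S], Iterable[S]],
    is_goal: Callable[[S], bool],
    is_dead_end: Callable[[S], bool],
    path: Optional[List[S]] = None,
) -> Optional[List[S]]:
    """Depth-first search: try each branch in order, back up on dead ends."""
    path = (path or []) + [state]
    if is_goal(state):
        return path
    if is_dead_end(state):
        return None  # backtrack: the caller moves on to its next branch
    for child in expand(state):
        result = branch_and_backtrack(child, expand, is_goal, is_dead_end, path)
        if result is not None:
            return result
    return None  # every branch from this state failed


def toy_search(target: int) -> Optional[List[int]]:
    """Hypothetical toy problem: reach `target` from 1 using +3 or *2 steps."""
    return branch_and_backtrack(
        state=1,
        expand=lambda n: [n + 3, n * 2],
        is_goal=lambda n: n == target,
        is_dead_end=lambda n: n > target,  # both moves only increase n
    )


if __name__ == "__main__":
    print(toy_search(10))
```

Each recursive call is inherently serial along a single path; only the sibling branches at one decision point could be handed to parallel workers.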

i've been running Codex for ~8-24h per open math/physics research problem. few thoughts:
parallel agents don't seem to scale that cleanly for a lot of problems. many of these are just extremely sequential. you don't really get to "spawn 50 agents and solve it from nowhere." it's more like: tiny move, check, reframe, tiny move, dead end, try again. hours/days of serial cognition, which honestly rhymes with how these fields move over decades.
this updates me a bit against the sci-fi picture of "superhuman math/physics intelligence" as some alien oracle that instantly sees the proof / theory.
the actual superhuman-ness is more mundane and maybe more important: the agent has absorbed a huge prior, can read long papers basically instantly, can think/write at >50 tok/s, and you can clone it across dozens of problems. speed + knowledge volume + multiplicability. that's the superpower.
also: frontier physics seems much more tractable for these agents than decade-old open math problems. for some physics directions, ~8h is enough to get something paper-shaped and nontrivial.
big caveat tho: research taste is still missing. the agent is a pretty good problem-solver, but not yet a top-tier problem-picker. it can push hard once the direction is chosen, but you probably still want a human with taste choosing the problem / framing / bet.
current model: agents are becoming very strong research labor, but the bottleneck shifts upward into taste, problem selection, and knowing which hill is worth climbing.

@deanwball Even if there are no more algorithmic and data improvements (a fairly ridiculous assumption), the pace of change will still be fast due to the effects of hardware advancements/investments

@pushmeet @GoogleDeepMind The breakthrough isn't "AI solved 9 Erdős problems" — it's that the proofs are formally CHECKABLE in Lean/Coq. Not natural-language math you have to trust. That's why the 56-yr-old solutions are credible. Standard LLM math claims fail this bar.

AI agents are advancing research-level math. 🚀
I’m thrilled to share @GoogleDeepMind’s AlphaProof Nexus - an agentic framework for formal proof search powered by Gemini.
When applied to a set of open formal math problems, our agent autonomously solved:
✅ 9 open Erdős problems (including two open for 56 years!)
✅ 44 Online Encyclopedia of Integer Sequences (OEIS) problems
✅ A 15-year-old open problem in algebraic geometry
✅ A 7-year-old open question in min-max optimization
We are collaborating with mathematicians across disciplines - from combinatorics and graph theory to quantum optics. Ultimately, these results show the massive potential of even simple agentic loops powered by Gemini.
Read the paper here: arxiv.org/abs/2605.22763…


you think you’re down bad.. there are like 10 startups that raised money to build an AI mathematician and OpenAI just used a general model to go beyond the frontier…
so what do you do now ? try to position for an acquihire ? because the clock is ticking… you probably have till the end of the year at best

Looks like we got an answer to that cryptic OpenAI post: a Codex mobile app. Can't verify, hope it's real :) Would be really cool to see!


Quipra @Quipra_
Hell yeah .

@basedjensen Remember @basedjensen 200-300k people a year get infected with Hanta in Asia and Europe
But this is the first human-to-human, multi-person case
Still not airborne like Covid though

Aw shit ... Here we go
Insider Paper @TheInsiderPaper
BREAKING: Two Singapore residents isolated, awaiting hantavirus test results, officials say

@thsottiaux Option to use pro.
Grammar constrained decoding.
More explicit search functionality

@WorldsStrongest "Reverse parked my 18-wheeler first time, get in"

@TodayinHistory The most intelligent thing I've heard out of the White House for a long time

@IterIntellectus Not irrational, they just have bounded rationality, which is also modelled in game theory


codex feels genuinely remarkable with 5.5, it's hard to overstate just how much this level of intelligence with this level of speed and efficiency changes things.
it's picking up nuances and understanding my intent far far better than any other model, and rather than feeling like i'm wrangling my way through one frustrating bottleneck after the next, i'm coming away from sessions feeling genuinely delighted
please just point this thing at something outrageous and see if it will do it, you may find yourself blown away by the results













