Tom Nicholson

6.2K posts

Tom Nicholson banner
Tom Nicholson

Tom Nicholson

@TFWNicholson

Building the cognitive architecture for whatever comes next - @_mindanu https://t.co/usNSoYlpuz - AI from @Cambridge_Uni - 20+ years coding experience

Cambridge, England Katılım Eylül 2007
864 Takip Edilen546 Takipçiler
Sabitlenmiş Tweet
Tom Nicholson
Tom Nicholson@TFWNicholson·
@elonmusk If human lifetimes are extended, maybe it doesn't matter. We have AI to come up with new ideas, so necrosis of thought doesn't matter that we don't die. This is neither good nor bad, it just is
English
112
20
345
5.1M
Tom Nicholson
Tom Nicholson@TFWNicholson·
@arankomatsuzaki Re parallel agents: tree of thoughts type branching etc? At every decision point, try them all and see what the outcome of that step is, rinse, repeat, and backtrack
English
0
0
0
5
Aran Komatsuzaki
Aran Komatsuzaki@arankomatsuzaki·
i've been running Codex for ~8-24h per open math/physics research problem. few thoughts: parallel agents don't seem to scale that cleanly for a lot of problems. many of these are just extremely sequential. you don't really get to "spawn 50 agents and solve it from nowhere." it's more like: tiny move, check, reframe, tiny move, dead end, try again. hours/days of serial cognition, which honestly rhymes with how these fields move over decades. this updates me a bit against the sci-fi picture of "superhuman math/physics intelligence" as some alien oracle that instantly sees the proof / theory. the actual superhuman-ness is more mundane and maybe more important: the agent has absorbed a huge prior, can read long papers basically instantly, can think/write at >50 tok/s, and you can clone it across dozens of problems. speed + knowledge volume + multiplicability. that's the superpower. also: frontier physics seems much more tractable for these agents than decade-old open math problems. for some physics directions, ~8h is enough to get something paper-shaped and nontrivial. big caveat tho: research taste is still missing. the agent is a pretty good problem-solver, but not yet a top-tier problem-picker. it can push hard once the direction is chosen, but you probably still want a human with taste choosing the problem / framing / bet. current model: agents are becoming very strong research labor, but the bottleneck shifts upward into taste, problem selection, and knowing which hill is worth climbing.
English
21
3
62
5.3K
Tom Nicholson
Tom Nicholson@TFWNicholson·
@deanwball If there are no more algorithmic and data improvements (a fairly ridiculous assumption), the pace of change will be fast due to effects hardware advancements/ investments
English
0
0
0
56
Dean W. Ball
Dean W. Ball@deanwball·
I feel us approaching yet another summer of discontent with ai, just like last year, when many of my peers in the ai commentariat declared deep learning to have hit a wall because of gpt-5 blah blah blah.
English
27
5
267
41.3K
Willrich
Willrich@WillrichOstmann·
@pushmeet @GoogleDeepMind The breakthrough isn't "AI solved 9 Erdős problems" — it's that the proofs are formally CHECKABLE in Lean/Coq. Not natural-language math you have to trust. That's why the 56-yr-old solutions are credible. Standard LLM math claims fail this bar.
English
1
0
2
849
Pushmeet Kohli
Pushmeet Kohli@pushmeet·
AI agents are advancing research-level math. 🚀 I’m thrilled to share @GoogleDeepMind’s AlphaProof Nexus - an agentic framework for formal proof search powered by Gemini. When applied to a set of open formal math problems, our agent autonomously solved: ✅ 9 open Erdős problems (including two open for 56 years!) ✅ 44 Online Encyclopedia of Integer Sequences (OEIS) problems ✅ A 15-year-old open problem in algebraic geometry ✅ A 7-year-old open question in min-max optimization We are collaborating with mathematicians across disciplines - from combinatorics and graph theory to quantum optics. Ultimately, these results show the massive potential of even simple agentic loops powered by Gemini. Read the paper here: arxiv.org/abs/2605.22763…
Pushmeet Kohli tweet media
English
68
213
1.4K
132.7K
Prakash
Prakash@8teAPi·
you think you’re down bad.. there are like 10 startups that raised money to build an AI mathematician and OpenAI just used a general model to go beyond the frontier… so what do you do now ? try to position for an acquihire ? because the clock is ticking… you probably have till the end of the year at best
English
9
1
47
3.5K
jason
jason@jxnlco·
When do you reach for other models instead of Codex? What can we do better? Hit me with all of your frustrations. dms open. If you can give me detail (e.g. specifics/transcipts) - it'll help a lot in finding out exactly what we need to do to improve the next model
English
418
23
845
184.6K
Tibo
Tibo@thsottiaux·
For those of you living inside the codex app, what should we prioritize among features, reliability or performance?
English
1.9K
38
2.1K
281K
Chubby♨️
Chubby♨️@kimmonismus·
Looks like we got an answer to that cryptic openai post. codex mobile app. cant verify, hope its real :) would be really cool to see!
Chubby♨️ tweet mediaChubby♨️ tweet media
Quipra@Quipra_

Hell yeah .

English
29
14
455
54.6K
Sam Altman
Sam Altman@sama·
what would you most like to see improve in our next model?
English
8.3K
304
9K
1.4M
Tibo
Tibo@thsottiaux·
As a Codex user, which platform are you on
English
507
21
707
220K
OpenAI
OpenAI@OpenAI·
Codex now works directly in Chrome on macOS and Windows. It’s even better at working with apps and sites in Chrome, and now works in parallel across tabs in the background without taking over your browser. To get started, install the Chrome plugin in the Codex app.
English
631
1.3K
13.4K
2.5M
Chris
Chris@Chrisgpt·
@basedjensen Remember @basedjensen 200-300k people a year get infected with Hanta in Asia and Europe But this is the first human to human multi person case Still not air born like Covid though
English
1
0
11
1.8K
Tom Nicholson
Tom Nicholson@TFWNicholson·
@thsottiaux Option to use pro. Grammar constrained decoding. More explicit search functionality
English
0
0
0
74
Tibo
Tibo@thsottiaux·
We are seeing strong traction and working to improve Codex for scientists across mathematics, physics, chemistry, biology and more. What do you wish it were capable of that it cannot do today?
English
530
46
2K
112.7K
Today in History
Today in History@TodayinHistory·
This may be the most articulate response I’ve ever heard to this question.
English
2.9K
17K
110.2K
6.8M
ρ:ɡeσn
ρ:ɡeσn@pigeon__s·
ok so gpt-5.5-instants personality seems to be like 3598x better than 5.3s and genuinely not slop i would say but i still feel like i prefer claudes a lot and even kimis but its like good enough now that i dont really care that much
English
4
0
122
10.7K
Tom Nicholson
Tom Nicholson@TFWNicholson·
@IterIntellectus Not irrational, just have bounded rationality, which is also modelled with game theory
English
0
0
1
21
vittorio
vittorio@IterIntellectus·
the problem with game theory is that it assumes rational players but most humans are irrational
English
380
105
2K
782.5K
Katie 🇺🇸
Katie 🇺🇸@SimplyKatie___·
What’s the best way to handle someone who’s more knowledgeable or intelligent than you?
English
1.5K
14
312
44.1K
Bandors
Bandors@oscpmentor·
@tenobrus Did that guy ever turn out to be anything important? Is he actually an insider somewhere? Or just some dude?
English
2
0
14
1.3K
🍓🍓🍓
🍓🍓🍓@iruletheworldmo·
codex feels genuienly remarkable with 5.5, it's hard to overstate just how much this level of intelligence with this level of speed and efficiency changes things. it's picking up nuances and understanding my intent far far better than any other model and rather feeling like im wrangling my way through one frustrating bottleneck after the next i'm coming away from sessions feeling genuinely delighted please just point this thing at something outrageous and see if it will do it, you may find yourself blown away by the results
English
36
18
377
16.6K
Hitchslap
Hitchslap@Hitchslap1·
Serious question. Do you believe time travel is possible?
English
1.5K
27
548
57K