Matty Hempstead

35 posts

Matty Hempstead banner
Matty Hempstead

Matty Hempstead

@mattyhempstead

seeking sparse rewards

Katılım Haziran 2023
442 Takip Edilen97 Takipçiler
Audrey
Audrey@audrlo·
I just built this for my grandma. Meet Sam, the first AI caretaker for seniors. Order one of the first 1,000 units (see below).
English
201
107
959
91.1K
Matty Hempstead retweetledi
Arthur Stockman
Arthur Stockman@StockmanArthur·
We ported Peter’s (🦞) coding setup to VR/AR at last weekend’s @fdotinc hackathon. We thought @steipete looked too tense in his picture, so we improved his user experience. See thread how it works. Built in 4 hours, teamed up with @mattyhempstead for this one!
English
5
5
87
16.6K
Matty Hempstead
Matty Hempstead@mattyhempstead·
just got a top 1% whoop score! i felt like i barely slept on the plane but i guess i did better than i thought 🙏 @bryan_johnson
Matty Hempstead tweet media
English
0
0
1
71
Matty Hempstead
Matty Hempstead@mattyhempstead·
gpt-5.4-mini and gpt-5.4-nano have been added to my custom benchmark ScratchBench mini scores 16% nano scores 14%
Matty Hempstead tweet media
English
0
0
0
83
Matty Hempstead
Matty Hempstead@mattyhempstead·
the fly when it realises the nature of it's existence
GIF
Hattie Zhou@oh_that_hat

There's a fruit fly walking around right now that was never born. @eonsys just released a video where they took a real fly's connectome — the wiring diagram of its brain — and simulated it. Dropped it into a virtual body. It started walking. Grooming. Feeding. Doing what flies do. Nobody taught it to walk. No training data, no gradient descent toward fly-like behavior. This is the opposite of how AI works. They rebuilt the mind from the inside, neuron by neuron, and behavior just... emerged. It's the first time a biological organism has been recreated not by modeling what it does, but by modeling what it is. A human brain is 6 OOM more neurons. That's a scaling problem, something we've gotten very good at solving. So what happens when we have a working copy of the human mind?

English
0
0
1
146
Matty Hempstead
Matty Hempstead@mattyhempstead·
just discovered a new unreleased gpt model while exploring the openai website source code
Matty Hempstead tweet media
English
1
0
4
144
Matty Hempstead
Matty Hempstead@mattyhempstead·
worth mentioning that we have also discovered a new scaling law with every 10x in the number of runs to argmax over, we notice a linear increase in performance (assuming performance of a given model follows a normal distribution)
Matty Hempstead tweet media
English
0
0
0
58
Matty Hempstead
Matty Hempstead@mattyhempstead·
fyi for investors: with the money we raise we hope to increase the value of N so we can start to take argmax over tens or even hundreds of thousands of runs
English
1
0
0
66
Matty Hempstead
Matty Hempstead@mattyhempstead·
im thinking of starting a neolab that specialises in overfitting for popular benchmarks our strategy (high level) - take a SOTA model and run it against our chosen benchmark 256 times - look for the maximum scoring run out of all 256 runs - find a series of 8 binary "steering" steps which guide the model to act like that 1/256th ranked run - design an "agent harness" that looks generalised but is just secretly encoding those 8 bits of steering - claim new SOTA looking to raise 100M at a 1B val lmk if anyone is interested
English
1
0
0
95
Matty Hempstead
Matty Hempstead@mattyhempstead·
Someone should start measuring the time it takes the average SWE to complete various tasks before its too late and all the human labor is contaminated with AI tools. Would be interesting to see how steep our curve is accelerating in the opposite direction.
English
0
0
0
37
Matty Hempstead
Matty Hempstead@mattyhempstead·
This graph is actually an underestimation. As humans rely more on AI to write code, manual programming skills in humans will atrophy. What previously would have taken a human 1 hour to complete will now take longer - nobody knows how to write assembly anymore. We should expect the y-axis to experience independent acceleration as humans forget the art of programming. In fact every technological adoption contaminates the current low-background steel of human labor, making benchmarking like this less reliable as an absolute measure.
Matty Hempstead tweet media
English
1
0
0
65
Matty Hempstead
Matty Hempstead@mattyhempstead·
big things are happening at CES 2026
English
0
0
2
96
Matty Hempstead
Matty Hempstead@mattyhempstead·
debating rn whether to set up a notification for whenever a new SOTA is released on any benchmark and then be the first person to build an ensemble of the top 3 submissions and claim the glory
English
0
0
2
93