Stephen Edginton

2.2K posts

Stephen Edginton

Stephen Edginton

@StephenEdginton

Chief Product and Technology Officer @Dext | ex Founder - Technology | Business | Fitness

England, United Kingdom Katılım Ocak 2012
5.3K Takip Edilen620 Takipçiler
Drew Houston
Drew Houston@drewhouston·
@mitchellh Looking like GLM 5.2 is truly Opus-tier -- to run it fast (>100 tok/sec) you'll need 8x RTX 6000 pros minimum ($125-150k), but achievable now
English
2
0
15
1.9K
Mitchell Hashimoto
Mitchell Hashimoto@mitchellh·
We've gone really quickly from "local models are dogshit" to "local models are good actually" (like, a 12 month window from A to B). I don't think they're actually good ENOUGH yet. We need an Opus 4.5 quality local model. When that happens, I think the world will spill over. Opus 4.5 is/was amazing, and is more than good enough for almost all tasks still as long as you pair with a frontier-level planner/judge. It'll still require a hugely expensive machine to run it, I'm sure, like a $5K or more laptop or mac studio. But, that's going to be pennies compared to the API costs plus all the benefits of guaranteed privacy and so on.
English
177
202
3.9K
248.1K
Charles Curran
Charles Curran@charliebcurran·
I used AI to explain SpaceX to my girlfriend, with fruit.
English
411
807
7.1K
556.4K
Microsoft Developer
Meet the Majorana 2, a next-generation topological quantum chip developed with the help of Microsoft Discovery’s agentic AI.
English
26
107
712
62.1K
Stephen Edginton
Stephen Edginton@StephenEdginton·
@DavidSacks Let’s hope we can get this solved quickly we know it’s just slowing down the inevitable
English
0
0
0
17
David Sacks
David Sacks@DavidSacks·
I’ve had a number of conversations with folks inside and outside government about the current situation with Anthropic, and here is what I believe to be true: — As we know, Anthropic publicly released its Mythos class models earlier this week under the commercial name Fable. — Fable is Mythos with guardrails. But if those guardrails fail, then you’ve exposed Mythos and its advanced cyber capabilities to people who shouldn’t have them. (Keep in mind that Anthropic itself widely promoted the idea that Mythos was a cyberweapon and needed to be regulated as such. They asked for government regulation of Mythos and championed the guardrails on Fable. If there is a vulnerability — big or small — it is Anthropic’s responsibility to patch.) — A highly credible trusted partner of both Anthropic and the USG who was testing Fable came forward with a jailbreak of those guardrails. The Admin asked Dario to fix the jailbreak or de-deploy the model. Dario refused. — In their blog post, Anthropic defended its decision by saying the jailbreak isn’t serious. That is not what the trusted partner and the USG believe; nor is that kind of minimizing language consistent with Anthropic’s brand as the AI safety company. It’s difficult to fathom how they could claim a jailbreak allowing operability of a cyber weapon could be defined as not “serious.” — In the past, Anthropic has always said that safety must be top priority and taken super seriously. In this case, Anthropic prioritized the continued offering of the consumer model over safety. — In reaction, the Admin issued the export control. The Admin did this reluctantly. It’s been very surprised that Anthropic hasn’t wanted to cooperate with a reasonable safety request (ie fixing the jailbreak issue). Anthropic’s reaction is very much at odds with their branding and ethos as a safe AI research community. — The Admin’s hope now is that Anthropic remediates the safety issue, the export control is lifted, and Fable goes back into general release. The Admin wants all of this to happen as soon as possible. It is frankly bewildered that Anthropic hasn’t wanted to comply with safety requests that it previously said were its highest priority. — Those trying to misdirect and tie this action to the prior DoW/Anthropic issues are wrong. The Admin values Anthropic’s technical capabilities and feels that this issue, while serious, should be easily resolved. The ball is in Anthropic’s court.
English
2.2K
3.2K
25.5K
7.9M
Matt Clifford
Matt Clifford@matthewclifford·
I do find it extraordinary that current events in AI don’t make the top ~30 stories on the BBC News homepage
English
95
119
1.6K
176.2K
Stephen Edginton retweetledi
Xenova
Xenova@xenovacom·
I gave Fable 5 one job: write custom WebGPU kernels for Gemma 4 inference. It climbed to 84 tok/s, then hit a wall, insisting further optimization was impossible. Hours later, Anthropic rolled back invisible LLM development safeguards, and it hit 255 tok/s. The next day, access to Fable 5 was suspended globally.
English
146
370
5.3K
1.1M
Anthropic
Anthropic@AnthropicAI·
The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Claude models is not affected. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. Read our full statement: anthropic.com/news/fable-myt…
English
12.6K
25.8K
88.3K
91.4M
Stephen Edginton retweetledi
pabs
pabs@pabloberlangab·
Introducing Pemba. The first humanoid to climb to 20,000ft. Everest next. More below.
English
152
117
800
234.1K
Stephen Edginton
Stephen Edginton@StephenEdginton·
@_catwu I like the hook idea will intent that for our dbt repos
English
0
0
1
269
dax
dax@thdxr·
if you're setting up a new linux machine pick btrfs instead of ex4 trust me
English
217
55
2.5K
284.9K
dax
dax@thdxr·
i have seen enough proof now that using a coding agent is a deep skill it's confusing because the people you see heavily using them produce horrible results but that's because it's a skill! you can get better and the ceiling seems pretty high - this is very exciting to me
English
321
395
6.5K
379.5K
Michael Rabinovich
Michael Rabinovich@MikushRab·
Opus 4.8 just dropped and I ran it through our CAD tasks. 4.6 → 4.7 → 4.8 side by side. The results are unexpected!
English
198
193
3.5K
708.1K
Claude
Claude@claudeai·
Introducing Claude Opus 4.8: it builds on Opus 4.7 with sharper judgment, more honesty about its own progress, and the ability to work independently for longer than its predecessors. Available today at the same price.
Claude tweet media
English
3.7K
8.6K
67.4K
15.3M
Stephen Edginton
Stephen Edginton@StephenEdginton·
@ycombinator @IrenaCronin @LightconePod @koomen This was very good love the fact your all leading the way internally and on side projects agree with the direction here all the primitives are being rediscovered and refactored - I’m going to feed the transcript to my company agent now to dream about
English
0
0
0
651
Y Combinator
Y Combinator@ycombinator·
Over the past year, we've been building our own internal agent infrastructure at YC: over 350 tools, self-improving skill loops, and a shared organizational brain that gets smarter overnight. In this episode of the @LightconePod, we sat down with YC General Partner Pete @koomen to talk about how he led the effort from the ground up. We cover how giving agents unrestricted access to one database was the key unlock, the self-improving skill loops that get smarter overnight, and why he thinks we've arrived at the personal computer moment for AI. 00:39 — YC's AI Stack 02:15 — The Finance Team Problem That Started It All 05:07 — SQL Access Changes Everything 07:20 — One Database to Rule Them All 09:14 — Jevons Paradox 10:07 — Denormalizing for Agents 12:15 — The Single-Player Era of Agents 14:16 — 350 Tools and a Shared Registry 16:24 — Skillify, DRY, and MECE Resolvers 18:23 — The Self-Improving Dream Cycle 20:26 — The Two-Sentence Pitch Skill 23:06 — How Super Intelligence Compounds 25:10 — Recording Everything as a Building Layer 27:10 — The Shared Organizational Brain 29:18 — Trust-Default Culture as a Requirement 30:44 — Raising the Floor for New Employees 32:35 — Horseless Carriages 34:24 — Why Chat Is the Best Interface for Agents 38:50 — Just-in-Time Software 40:49 — Centralizing vs. Decentralizing AI 43:32 — The Personal AI Revolution
English
83
118
805
751.7K
Stephen Edginton
Stephen Edginton@StephenEdginton·
@justindross Maybe that’s the missing trick we will need AI phycologists to help solve and manipulate the agents charge them for coaching
English
0
1
7
327
JD Ross
JD Ross@justindross·
This technology is so weird. Our CTO ran an agent overnight that decided “to sleep” for 4 hours at 2am before starting back on the task again. Hope the computer is less tired now.
English
25
11
1.5K
69.5K
British Army 🇬🇧
British Army 🇬🇧@BritishArmy·
We've been conducting a major @NATO exercise in London this week - and no one above ground suspected a thing. Hundreds of soldiers have been testing how they would run a major NATO command post, hidden deep beneath one of the busiest cities on earth. Read more ⬇️ bit.ly/4dMwHfs
British Army 🇬🇧 tweet mediaBritish Army 🇬🇧 tweet mediaBritish Army 🇬🇧 tweet media
English
107
210
1.2K
172.3K