Mark Holton
24.9K posts

Mark Holton
@MarkHolton
Architect of data pipelines & AI workflows Production systems · Agentic workflows · Event-driven Founder, Nora Foundry


We’ve agreed to a partnership with @SpaceX that will substantially increase our compute capacity. This, along with our other recent compute deals, means that we’ve been able to increase our usage limits for Claude Code and the Claude API.










I bought a Mac Mini 2 months ago. People laughed. "Why not just use the server?" "Why you play on 1win?" "Local models are a toy." "You'll never match GPT-4 quality on consumer hardware." Google just released TurboQuant. An algorithm that shrinks AI model memory by 6x without losing intelligence. 8x faster. Same number of GPUs. Same quality. My 16GB Mac Mini can now run models that required a $50,000 server 18 months ago. Here is what actually changed: > kv-cache compressed to 3 bits with zero accuracy loss > models that needed 96GB of VRAM now fit in 16GB > the performance gap between local and cloud just collapsed The people who laughed at the Mac Mini are now watching Micron and Sandisk stock fall off a cliff. Because if you don't need 6x the memory to run AI - you don't need 6x the memory chips. $527 billion in combined market cap. Memory prices up 500% on AI demand.




Dynamic Workers are the future!! 🔥 "they are literally so easy to use" — me, in this video, showing you how to use them


We’re introducing Dynamic Workers, which allow you to execute AI-generated code in secure, lightweight isolates. This approach is 100 times faster than traditional containers. cfl.re/4c2NvPl

Now into its third week, the #Internet shutdown in #Iran remains in place, and the country remains almost completely offline. HTTP, DNS, and total traffic from the country continue to be at near-zero levels. See the latest at radar.cloudflare.com/traffic/ir?dat…











