Tim Middleton-Sally 🇺🇸

30 posts

Tim Middleton-Sally 🇺🇸

@plowingthedark

🇺🇸 Living the Strenuous Life. Catholic (Ret.). Member of Technical Staff @AnthropicAI. Views are my own.

Katılım Ekim 2025

72 Takip Edilen6 Takipçiler

Tim Middleton-Sally 🇺🇸 retweetledi

Chris Olah@ch402·1d

x.com/i/article/2058…

ZXX

144

519

2.8K

221.3K

Tim Middleton-Sally 🇺🇸 retweetledi

Andrej Karpathy@karpathy·19 May

Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.

English

7.9K

11.1K

149.1K

27.1M

Tim Middleton-Sally 🇺🇸@plowingthedark·26 Nis

God bless our brave men and women 🇺🇸

English

Tim Middleton-Sally 🇺🇸@plowingthedark·16 Nis

@paulg @edels0n This is an extremely accurate take and I appreciate you popularizing it.

English

239

Paul Graham@paulg·16 Nis

@edels0n There's a middle ground where they don't use the zero-days to destroy us, but in effect to install explosives in all our infrastructure that would destroy us at the push of a button. And they probably will do that.

English

221

15.7K

Ed Elson@edels0n·16 Nis

Here’s what Jensen should have said: The question isn’t whether China achieves Mythos-level AI. (They will.) It’s whether they will use it to try to destroy America. The same question goes for nukes. China has nukes and yet they haven’t nuked us. Why? Because they don’t want to. Not because they “can’t.” In other words, it’s not up to Nvidia to convince China to not try to destroy us — it’s up to our government. It’s government’s job to convince China to play nice. A foreign relations matter, not a GPU matter.

Dwarkesh Patel@dwarkesh_sp

Distilled recap of the back-and-forth with Jensen on export controls: Dwarkesh: Wouldn’t selling Nvidia chips to China enable them to train models like Claude Mythos with cyber offensive capabilities that would be threats to American companies and national security? Jensen: First of all, Mythos was trained on fairly mundane capacity and a fairly mundane amount of it by an extraordinary company. The amount of capacity and the type of compute it was trained on is abundantly available in China. Dwarkesh: With that, could they eventually train a model like Mythos? Yes. But the question is, because we have more FLOPs, American labs are able to get to this level of capabilities first. Furthermore, even if they trained a model like this, the ability to deploy it at scale matters. If you had a cyber hacker, it's much more dangerous if they have a million of them versus a thousand of them. Jensen: Your premise is just wrong. The fact of the matter is their AI development is going just fine. The best AI researchers in the world, because they are limited in compute, also come up with extremely smart algorithms. DeepSeek is not an inconsequential advance. The day that DeepSeek comes out on Huawei first, that is a horrible outcome for our nation. Dwarkesh: Currently, you can have a model like DeepSeek that can run on any accelerator if it's open source. Why would that stop being the case in the future? Jensen: Suppose it optimizes for Huawei. Suppose it optimizes for their architecture. It would put others at a disadvantage. As AI diffuses out into the rest of the world, their standards and their tech stack will become superior to ours because their models are open. Dwarkesh: Tesla sold extremely good electric vehicles to China for a long time. iPhones are sold in China. They didn't cause some lock-in. China will still make their version of EVs, and they're dominating, or smartphones, they're dominating. Jensen: We are not a car. The fact that I can buy this car brand one day and use another car brand another day is easy. Computing is not like that. There's a reason why x86 still exists. There's a reason why Arm is so sticky. These ecosystems are hard to replace. Dwarkesh: It's just hard to imagine that there's a long-term lock-in to the Chinese ecosystem, even if they have this slightly better open-source model for a while. American labs port across accelerators constantly. Anthropic's models are run on GPUs, they're run on Trainium, they're run on TPUs. There are so many things you can do, from distilling to a model that's well fit for your chips. Jensen: China is the largest contributor to open source software in the world. China's the largest contributor to open models in the world. Today it's built on the American tech stack, Nvidia’s. Fact. All five layers of the tech stack for AI are important. The United States ought to go win all five of them. in a few years time, I'm making you the prediction that when we want American technology to be diffused around the world—out to India, out to the Middle East, out to Africa, out to Southeast Asia—on that day, I will tell you exactly about today's conversation, about how your policy ... caused the United States to concede the second largest market in the world for no good reason at all.

English

293

71.1K

Tim Middleton-Sally 🇺🇸 retweetledi

Pope Leo XIV@Pontifex·16 Nis

Woe to those who manipulate religion and the very name of God for their own military, economic, and political gain, dragging that which is sacred into darkness and filth. #ApostolicJourney #Cameroon vatican.va/content/leo-xi…

English

22.2K

74.9K

382.9K

10.8M

Tim Middleton-Sally 🇺🇸@plowingthedark·12 Nis

🇺🇸 📷Bill Ingalls @NASA

English

Tim Middleton-Sally 🇺🇸@plowingthedark·9 Nis

While I'm a retired Catholic, I will absolutely lose my mind if it turns out an official of any government threatened the Vicar of Christ with the prospect of military action. Hope the deninals are true!

English

Tim Middleton-Sally 🇺🇸@plowingthedark·28 Şub

ZXX

Tim Middleton-Sally 🇺🇸@plowingthedark·20 Şub

@nkreu113r @NateSilver538 your take on the ramifications of probabilistic reasoning on juries would make an interesting article/newsletter

English

noah@nkreu113r·19 Şub

It is completely insane to say that beyond a reasonable doubt is 60%. But it is also insane - more insane, even - that our society offers no probabilistic definition of what a reasonable doubt is. Justice would be better served if we formalized probability in the courtroom, and if juries could impose different penalties for different levels of confidence.

Doug Gladden@DougtheLawyer

A Houston judge told a jury that "beyond a reasonable doubt" can be as low as "60 percent." This morning the 14th Court of Appeals reversed my client's conviction because of that comment. #appellatetwitter

English

111

12.5K

Tim Middleton-Sally 🇺🇸@plowingthedark·26 Oca

Great read! For "what I failed to understand is that energy is not fixed. It’s dramatically affected by what you choose to do", one super common type error I've noticed is people focus on what they are good at and/or what they believe to be a higher calling and they go do those things. Totally distinct categories from the things that give you energy. IMO start with the list of things that give you energy and see if you've got ability and/or righteousness for any of those things. And if you don't, it's fine to go do something for the mission, but it's extremely important to time box it so you can stay in the fight later in life

English

Cate Hall@catehall·23 Oca

I wrote about the load-bearing beliefs I've outgrown in recent years, like "what doesn't kill you makes you stronger" -- link below.

English

17.1K

Tim Middleton-Sally 🇺🇸@plowingthedark·26 Oca

@WillManidis @elonmusk Even without the AI detector, highly relevant past tweet from you: x.com/WillManidis/st…

Will Manidis@WillManidis

the universal rule of working with ex-government/intelligence community/confidential jobs people is that the more sketchy they are about what they did, the greater the odds they were an unimportant cog real actors have better cover stories

English

796

Will Manidis@WillManidis·26 Oca

@elonmusk Guys come on

English

123

10.9K

Elon Musk@elonmusk·26 Oca

Interesting

Eric Schwalm@Schwalm5132

As a former Special Forces Warrant Officer with multiple rotations running counterinsurgency ops—both hunting insurgents and trying to separate them from sympathetic populations—I’ve seen organized resistance up close. From Anbar to Helmand, the pattern is familiar: spotters, cutouts, dead drops (or modern equivalents), disciplined comms, role specialization, and a willingness to absorb casualties while bleeding the stronger force slowly. What’s unfolding in Minneapolis right now isn’t “protest.” It’s low-level insurgency infrastructure, built by people who’ve clearly studied the playbook. Signal groups at 1,000-member cap per zone. Dedicated roles: mobile chasers, plate checkers logging vehicle data into shared databases, 24/7 dispatch nodes vectoring assets, SALUTE-style reporting (Size, Activity, Location, Unit, Time, Equipment) on suspected federal vehicles. Daily chat rotations and timed deletions to frustrate forensic recovery. Vetting processes for new joiners. Mutual aid from sympathetic locals (teachers providing cover, possible PD tip-offs on license plate lookups). Home-base coordination points. Rapid escalation from observation to physical obstruction—or worse. This isn’t spontaneous outrage. This is C2 (command and control) with redundancy, OPSEC hygiene, and task organization that would make a SF team sergeant nod in recognition. Replace “ICE agents” with “occupying coalition forces” and the structure maps almost 1:1 to early-stage urban cells we hunted in the mid-2000s. The most sobering part? It’s domestic. Funded, trained (somewhere), and directed by people who live in the same country they’re trying to paralyze law enforcement in. When your own citizens build and operate this level of parallel intelligence and rapid-response network against federal officers—complete with doxxing, vehicle pursuits, and harassment that’s already turned lethal—you’re no longer dealing with civil disobedience. You’re facing a distributed resistance that’s learned the lessons of successful insurgencies: stay below the kinetic threshold most of the time, force over-reaction when possible, maintain popular support through narrative, and never present a single center of gravity. I spent years training partner forces to dismantle exactly this kind of apparatus. Now pieces of it are standing up in American cities, enabled by elements of local government and civil society. That should keep every thinking American awake at night. Not because I want escalation. But because history shows these things don’t de-escalate on their own once the infrastructure exists and the cadre believe they’re winning the information war. We either recognize what we’re actually looking at—or we pretend it’s still just “activism” until the structures harden and spread. Your call, America. But from where I sit, this isn’t January 2026 politics anymore. It’s phase one of something we’ve spent decades trying to keep off our own soil.

English

13.4K

75.2K

30M

Tim Middleton-Sally 🇺🇸 retweetledi

Neil Zeghidour@neilzegh·20 Oca

Me defending my O(n^3) solution to the coding interviewer.

English

415

49.2K

Tim Middleton-Sally 🇺🇸@plowingthedark·18 Oca

I could definitely quibble with certain parts and there is at least one thing I believe is incorrect. But Affleck's analysis here is SUBSTANTIALLY above the median.

Forrest Knight@ForrestPKnight

Honestly, Ben Affleck actually knowing AI and the landscape caught me off guard, but as a writer, makes sense. Great takes across the board.

English

Tim Middleton-Sally 🇺🇸@plowingthedark·16 Oca

@catehall Exceptional. Laughed out loud at the editor's comment about too hard to relate too. Preordered book in Nov and am certain (though obv from a distance) you are correct about the transmutation. It just comes through in your writing and interviews and why book was an instant buy.

English

183

Cate Hall@catehall·15 Oca

I wrote about how I ended up addicted to drugs. Link below via Useful Fictions.

English

356

99.9K

Tim Middleton-Sally 🇺🇸@plowingthedark·30 Ara

@WillManidis It is and always will be The Chronicles of Narnia

English

638

Will Manidis@WillManidis·29 Ara

what is the greatest work of Christian fiction?

English

70.6K

Tim Middleton-Sally 🇺🇸@plowingthedark·21 Ara

@paulg Have you introduced your kids to it yet? I'm currently debating on the timing

English

595

Tim Middleton-Sally 🇺🇸 retweetledi

Paul Graham@paulg·21 Ara

A three-dive day is a good day.

English

248

65.9K

Tim Middleton-Sally 🇺🇸@plowingthedark·3 Ara

🎯

Brendan Dolan-Gavitt@moyix

Right now all design problems for LLM-based apps feel like a choice between: - Solve it for 100% of the problem space at X% reliability - Solve it for Y% of the problem space at 100% reliability (And you don’t know the values of X and Y up front)

ART

126

Tim Middleton-Sally 🇺🇸@plowingthedark·17 Kas

@_jasonwei jasonwei.net/blog/asymmetry…

QME

108

Tim Middleton-Sally 🇺🇸@plowingthedark·17 Kas

I first became aware of this concept thanks to @_jasonwei who wrote an article titled Asymmetry of verification and verifier’s rule in July of this year. "Verifier’s rule: The ease of training AI to solve a task is proportional to how verifiable the task is. All tasks that are possible to solve and easy to verify will be solved by AI." It works. I've put it into practice and it is a key insight.

Andrej Karpathy@karpathy

Sharing an interesting recent conversation on AI's impact on the economy. AI has been compared to various historical precedents: electricity, industrial revolution, etc., I think the strongest analogy is that of AI as a new computing paradigm (Software 2.0) because both are fundamentally about the automation of digital information processing. If you were to forecast the impact of computing on the job market in ~1980s, the most predictive feature of a task/job you'd look at is to what extent the algorithm of it is fixed, i.e. are you just mechanically transforming information according to rote, easy to specify rules (e.g. typing, bookkeeping, human calculators, etc.)? Back then, this was the class of programs that the computing capability of that era allowed us to write (by hand, manually). With AI now, we are able to write new programs that we could never hope to write by hand before. We do it by specifying objectives (e.g. classification accuracy, reward functions), and we search the program space via gradient descent to find neural networks that work well against that objective. This is my Software 2.0 blog post from a while ago. In this new programming paradigm then, the new most predictive feature to look at is verifiability. If a task/job is verifiable, then it is optimizable directly or via reinforcement learning, and a neural net can be trained to work extremely well. It's about to what extent an AI can "practice" something. The environment has to be resettable (you can start a new attempt), efficient (a lot attempts can be made), and rewardable (there is some automated process to reward any specific attempt that was made). The more a task/job is verifiable, the more amenable it is to automation in the new programming paradigm. If it is not verifiable, it has to fall out from neural net magic of generalization fingers crossed, or via weaker means like imitation. This is what's driving the "jagged" frontier of progress in LLMs. Tasks that are verifiable progress rapidly, including possibly beyond the ability of top experts (e.g. math, code, amount of time spent watching videos, anything that looks like puzzles with correct answers), while many others lag by comparison (creative, strategic, tasks that combine real-world knowledge, state, context and common sense). Software 1.0 easily automates what you can specify. Software 2.0 easily automates what you can verify.

English

158

Keşfet

@paulg @edels0n @NASA @nkreu113r @NateSilver538 @WillManidis @elonmusk @catehall