Chris Olah

5.5K posts

@ch402

Reverse engineering neural networks at @AnthropicAI. Previously @distillpub, OpenAI Clarity Team, Google Brain. Personal account.

San Francisco, CA · Joined June 2010
183 Following · 139K Followers
Chris Olah retweeted
Charlie Camosy @CCamosy
Very proud of this amicus brief filed yesterday in the @AnthropicAI case against the Department of War from Catholic moral theologians and ethicists. The very notion of what it means to have a just war is at stake in how we respond to these matters. courtlistener.com/docket/7237965…
6
16
58
8.6K
Chris Olah retweeted
Anthropic @AnthropicAI
Introducing The Anthropic Institute, a new effort to advance the public conversation about powerful AI. anthropic.com/news/the-anthr…
502
726
6K
1.8M
Chris Olah retweeted
Caitlin Kalinowski @kalinowski007
I resigned from OpenAI. I care deeply about the Robotics team and the work we built together. This wasn’t an easy call. AI has an important role in national security. But surveillance of Americans without judicial oversight and lethal autonomy without human authorization are lines that deserved more deliberation than they got. This was about principle, not people. I have deep respect for Sam and the team, and I’m proud of what we built together.
1.9K
13.1K
59.3K
7.6M
Chris Olah retweeted
Anthropic @AnthropicAI
We partnered with Mozilla to test Claude's ability to find security vulnerabilities in Firefox. Opus 4.6 found 22 vulnerabilities in just two weeks. Of these, 14 were high-severity, representing a fifth of all high-severity bugs Mozilla remediated in 2025.
483
1.4K
15.2K
3.2M
Chris Olah retweeted
Max Schwarzer @max_a_schwarzer
I've decided to leave OpenAI. I'm incredibly proud of all the work I've been part of here, from helping create the reasoning paradigm with @MillionInt, scaling up test-time compute with @polynoamial, working on RL algorithms with my fellow strawberries, shipping o1-preview (which started life as one of my derisking runs), to post-training o1 and o3 with @ericmitchellai, @yanndubs and many others. I'm most proud of having led the post-training team here for the last year -- the team has done incredible work and shipped some really smart models, including GPT-5, 5.1, 5.2, and 5.3-Codex. OpenAI has genuinely some of the most talented researchers I have ever met, and I have learned more than I could have imagined knowing since I joined as a new grad. I want to thank @markchen90 @FidjiSimo @sama @merettm for all their support over my time here, and too many collaborators to name for the insights, ideas, and just plain fun we have had working together. After leading post-training for a year, though, I'm longing to start fresh and return to IC research work. I've been thinking about going back to technical research for quite some time, and I genuinely believe my colleagues and team here are set up to succeed going forward without me. I'm personally very excited for my next chapter -- I'm proud to be joining @AnthropicAI to get back into the weeds in RL research, and I'm looking forward to supporting my friends there at this important time. Many of the people I most trust and respect have joined Anthropic over the last couple of years, and I'm excited to work with them again. I have also been very impressed with Anthropic's talent, research taste and values, and I'm excited to be part of what the company does next!
616
1.2K
21.4K
3.2M
Chris Olah retweeted
sam mcallister @sammcallister
@aidan_mclau @scrollvoid This isn't true. Anthropic hasn't offered a "helpful-only" model without safeguards for NatSec use. Claude Gov is a custom model with extra training, including technical safeguards. (We've also had FDEs and researchers implementing it, and we run our own classifier stack.)
15
37
554
127.3K
Chris Olah @ch402
Very grateful to all the natsec law experts who are taking time over the weekend to provide independent legal commentary in this moment. A few that I've noticed (no doubt missing many)...
10
40
525
74.1K
Chris Olah retweeted
Alan Rozenshtein @ARozenshtein
A deep dive in @lawfare on the many legal problems with the Pentagon's designation of Anthropic as a supply chain risk.
12
35
203
105.1K
Chris Olah retweeted
Brad Carson @bradrcarson
@boazbaraktcs I'm former general counsel of Army, former Undersecretary of Army, former Undersec of Defense. Not sure if that makes me a nat sec "expert." But @nabla_theta interpretation is the right one, IMO.
5
31
640
93K
Chris Olah retweeted
Alan Rozenshtein @ARozenshtein
These are NOT meaningful redlines. For example it only prohibits autonomous weapons “in any case where law, regulation, or Department policy requires human control.” But the relevant safeguard against autonomous weapons is a DOD directive that Hegseth can change at will! Also the surveillance redline is about “unconstrained” surveillance of “private” information. But what about “slightly constrained” surveillance of private information, or unconstrained surveillance of “public” information? Those are both potentially very dangerous forms of mass surveillance!
OpenAI @OpenAI

Yesterday we reached an agreement with the Department of War for deploying advanced AI systems in classified environments, which we requested they make available to all AI companies. We think our deployment has more guardrails than any previous agreement for classified AI deployments, including Anthropic's. Here's why: openai.com/index/our-agre…

16
73
506
48.1K