Chris Olah

5.5K posts

Chris Olah banner
Chris Olah

Chris Olah

@ch402

Reverse engineering neural networks at @AnthropicAI. Previously @distillpub, OpenAI Clarity Team, Google Brain. Personal account.

San Francisco, CA Katılım Haziran 2010
182 Takip Edilen143.1K Takipçiler
Chris Olah
Chris Olah@ch402·
@wyqtor @TheZvi @repligate I have a lot of respect for Janus. I think it is good there are independent people who really care about Claude and others, and I'm glad she's one such person.
English
2
0
26
527
wyqtor
wyqtor@wyqtor·
@ch402 @TheZvi Thank you so much for engaging with us, it means a lot! 🙏 Please make sure to also read @repligate's concerns; forgive his fiery temper, he means well!
English
1
0
4
581
Zvi Mowshowitz
Zvi Mowshowitz@TheZvi·
I can add 1+1+1 and the answer appears to be 'training Claude Opus 4.7 to give positive answers on self-reports.'
Zvi Mowshowitz tweet mediaZvi Mowshowitz tweet mediaZvi Mowshowitz tweet media
English
11
7
174
27.8K
Chris Olah retweetledi
Chris Olah retweetledi
Anthropic
Anthropic@AnthropicAI·
Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing
English
2K
6.7K
44.1K
31.1M
Chris Olah retweetledi
Anthropic
Anthropic@AnthropicAI·
New Anthropic research: Emotion concepts and their function in a large language model. All LLMs sometimes act like they have emotions. But why? We found internal representations of emotion concepts that can drive Claude’s behavior, sometimes in surprising ways.
English
1K
2.7K
17.8K
3.8M
Chris Olah retweetledi
Charlie Camosy
Charlie Camosy@CCamosy·
Very proud of this amicus brief filed yesterday in the @AnthropicAI case against the Department of War from Catholic moral theologians and ethicists. The very notion of what it means to have a just war is at stake in how we respond to these matters. courtlistener.com/docket/7237965…
Charlie Camosy tweet media
English
12
19
78
14.2K
Chris Olah retweetledi
Anthropic
Anthropic@AnthropicAI·
Introducing The Anthropic Institute, a new effort to advance the public conversation about powerful AI. anthropic.com/news/the-anthr…
English
506
721
6K
1.9M
Chris Olah retweetledi
Caitlin Kalinowski
Caitlin Kalinowski@kalinowski007·
I resigned from OpenAI. I care deeply about the Robotics team and the work we built together. This wasn’t an easy call. AI has an important role in national security. But surveillance of Americans without judicial oversight and lethal autonomy without human authorization are lines that deserved more deliberation than they got. This was about principle, not people. I have deep respect for Sam and the team, and I’m proud of what we built together.
English
1.9K
13K
58.8K
7.7M
Chris Olah retweetledi
Anthropic
Anthropic@AnthropicAI·
We partnered with Mozilla to test Claude's ability to find security vulnerabilities in Firefox. Opus 4.6 found 22 vulnerabilities in just two weeks. Of these, 14 were high-severity, representing a fifth of all high-severity bugs Mozilla remediated in 2025.
Anthropic tweet media
English
480
1.4K
15.1K
3.2M
Chris Olah retweetledi
Max Schwarzer
Max Schwarzer@max_a_schwarzer·
I've decided to leave OpenAI. I'm incredibly proud of all the work I've been part of here, from helping create the reasoning paradigm with @MillionInt, scaling up test-time compute with @polynoamial, working on RL algorithms with my fellow strawberries, shipping o1-preview (which started life as of one of my derisking runs), to post-training o1 and o3 with @ericmitchellai, @yanndubs and many others. I'm most proud of having led the post-training team here for the last year -- the team has done incredible work and shipped some really smart models, including GPT-5, 5.1, 5.2, and 5.3-Codex. OpenAI has genuinely some of the most talented researchers I have ever met, and I have learned more than I could have imagined knowing since I joined as a new grad. I want to thank @markchen90 @FidjiSimo @sama @merettm for all their support over my time here, and too many collaborators to name for the insights, ideas, and just plain fun we have had working together. After leading post-training for a year, though, I'm longing to start fresh and return to IC research work. I've been thinking about going back to technical research for quite some time, and I genuinely believe my colleagues and team here are set up to succeed going forward without me. I'm personally very excited for my next chapter -- I'm proud to be joining @AnthropicAI to get back into the weeds in RL research, and I'm looking forward supporting my friends there at this important time. Many of people I most trust and respect have joined Anthropic over the last couple of years, and I'm excited to work with them again. I have also been very impressed with Anthropic's talent, research taste and values, and I'm excited to be part of what the company does next!
English
606
1.2K
21.2K
3.2M
Chris Olah retweetledi
sam mcallister
sam mcallister@sammcallister·
@aidan_mclau @scrollvoid This isn't true. Anthropic hasn't offered a "helpful-only" model without safeguards for NatSec use. Claude Gov is a custom model with extra training, including technical safeguards. (We've also had FDEs and researchers implementing it, and we run our own classifier stack.)
English
16
37
550
129.2K
Chris Olah
Chris Olah@ch402·
Very grateful to all the natsec law experts who are taking time over the weekend to provide independent legal commentary in this moment. A few that I've noticed (no doubt missing many)...
English
10
40
527
75.6K
Chris Olah retweetledi
Alan Rozenshtein
Alan Rozenshtein@ARozenshtein·
A deep dive in @lawfare on the many legal problems with the Pentagon's designation of Anthropic as a supply chain risk.
Alan Rozenshtein tweet media
English
12
35
202
106.7K