

Nirit Weiss-Blatt, PhD

@DrTechlash
Communication Researcher, analyzing the tech discourse. Book Author: The TECHLASH. Substack: https://t.co/4SJJhqrzXn Signal: DrTechlash.16




@DavidSKrueger I find it concerning to call people who disagree with you about a technology that doesn't even exist yet "traitors to humanity"

"It appears that Anthropic has made a communications decision to distance itself from the EA community, likely because of negative associations the EA brand has in some circles." A message to Daniela Amodei: "If you want to distance yourself from EA, do it and be honest. If you'd rather not comment, don't comment. But don't obfuscate and lie pretending you don't know about EA and downplay the movement." forum.effectivealtruism.org/posts/53Gc35vD…



To clarify, the Center for AI Safety has not taken funding from Coefficient Giving / Open Philanthropy for years. We believe the effective altruism movement is, unfortunately, controlled opposition. The less influence it has on AI safety, the better.





EA ≠ AI safety
AI safety has outgrown the EA community
The world will be safer with a broad range of people tackling many different AI risks




Uhh is the agentic misalignment paper actually propaganda?

Slightly tangential note: When Llama 2 came out, I said I thought the government should ban it. This understandably pissed a bunch of people off, including @1a3orn, who wrote about it on their blog, the one recently cited by David Sacks.

I still think Meta was quite irresponsible in their initial open-weights release strategy, back when they were doing minimal to no safety evaluations, including on bio-uplift, and back when almost no one knew how to assess model capabilities! But I do think I should lose some Bayes points for predicting Llama 2 would be dangerous when it in fact wasn't. I posted in Feb 2024 about my updated position: x.com/JeffLadish/sta…

I still think there should be gov oversight of open-weight models, and that open-weight models should be assessed for bioweapon-uplift capabilities and autonomous self-replication capabilities. It seems obvious that at some point open-weight models will be powerful enough to provide significant bioweapon uplift, given frontier (closed) models already seem there, though I haven't evaluated these capabilities directly.

Another thing I didn't model well, related to my thread above, was how open-weight models would have less overall impact on society, whether we're talking about use in phishing, hacking, propaganda, etc., because frontier models would be quite usable for these purposes and in general better at them. It is in fact the case that models are used a lot for phishing, but I expect people use OpenAI's models for this more than, say, Llama, simply because OpenAI's models are better.

Over time, I expect companies to get better at tuning their models to selectively refuse to help with malicious tasks while still being helpful on most things, so maybe we'll see bad actors use relatively more open-weight models. But I'm much less convinced of this than I was back in 2023. And I also just care a lot less about narrow misuse of models for phishing / hacking / etc. than I did in 2023.
I've also long thought that loss of control is the greater issue, but I'm now willing to bite a lot more bullets to prioritize it. I see AI mass persuasion & influence as a lot more central to loss-of-control dynamics than phishing or hacking uplift, which is why I focus on it in the above thread.

Likewise, I've updated on the usefulness of open-weight models for research related to alignment and loss-of-control risk modeling, so I've also updated toward the benefits of open-weight models.

Bioweapon-uplift threats are potentially catastrophic enough that I treat them differently from other misuse issues. Bioweapons / synthetic organisms are one of the few real existential threats to human survival, and I think we should continue to take them seriously.