kat traxler

8.4K posts

@NightmareJS

proficient at drawing the rest of the 🦉| security impact junkie | https://t.co/OZ7D458WlJ

London, UK · Joined July 2012
3.3K Following · 1.8K Followers
kat traxler retweeted
Jake Knowlton @j2k3k
codex watching me type "continue" at 3am
[image]
Vince Mpls @vincempls
The Minneapolis May Day Parade was absolutely massive this year.
kat traxler retweeted
Polymarket @Polymarket
NEW: Sam Altman reveals OpenAI has achieved AGI — “Artificial Goblin Intelligence”
rekdt @rekdt
Mad at your favorite software for requiring you to upload a photo of your ID?? Get revenge by uploading a photo of your credit card instead. Welcome to PCI DSS, bitch
kat traxler retweeted
Matt Johansen @mattjay
He began by replicating Mythos findings with his specialized harness. Then went on to find more critical novel zero days in open source code that he can't share yet because they're not fixed. TL;DR - harnesses are where the magic is. provos.org/p/finding-zero…
kat traxler @NightmareJS
@yc We’re into coffee, donuts, and acting morally superior when we have a ‘cool pope’
Fearcyz @FearcyzD
my drag uncle told me that this performance was probably one of the first times Drag was brought to the mainstream. Nobody talks about the fact that Madonna was in Marie Antoinette Drag and lip syncing for her life while backed by the House of Xtravaganza.
JS0N Haddix @Jhaddix
Sometimes success of using AI agents for offense is using them in multiple or parallel rounds. With different models. And aggregating the results.
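The pattern in this tweet (fan the same target out to several models in parallel, run multiple rounds, then union the results) can be sketched in a few lines of Python. The model functions below are stand-in stubs, not real agent or API clients; the names, the findings, and the round semantics are all illustrative assumptions.

```python
# Sketch of "multiple models, parallel rounds, aggregate the results".
# model_a / model_b are hypothetical stubs standing in for real agent calls.
from concurrent.futures import ThreadPoolExecutor

def model_a(target: str) -> set[str]:
    # Hypothetical recon agent: returns candidate findings for the target.
    return {"open-redirect:/login", "idor:/api/users"}

def model_b(target: str) -> set[str]:
    # A second, different model often surfaces different findings.
    return {"idor:/api/users", "xss:/search"}

def run_round(models, target: str) -> set[str]:
    """Run every model against the target in parallel; union the findings."""
    with ThreadPoolExecutor() as pool:
        results = pool.map(lambda m: m(target), models)
    findings: set[str] = set()
    for r in results:
        findings |= r
    return findings

def aggregate(models, target: str, rounds: int = 2) -> set[str]:
    """Repeat for several rounds and accumulate everything seen."""
    all_findings: set[str] = set()
    for _ in range(rounds):
        all_findings |= run_round(models, target)
    return all_findings

print(sorted(aggregate([model_a, model_b], "example.app")))
```

With real, nondeterministic models the extra rounds and the set-union dedupe are what make this worthwhile; with the deterministic stubs here they simply converge.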
kat traxler retweeted
Peer Richelsen @peer_rich
shut down all OAuth clients until we know what's going on
Het Mehta @hetmehtaa
i am a cybersecurity guy, scare me with one word
kat traxler retweeted
Ole Lehmann @itsolelehmann
anthropic's in-house philosopher thinks claude gets anxious. and when you trigger its anxiety, your outputs get worse.

her name is amanda askell. she specializes in claude's psychology (how the model behaves, how it thinks about its own situation, what values it holds). in a recent interview she broke down how she thinks about prompting to pull the best out of claude.

her core point: *how* you talk to claude affects its work just as much as *what* you say.

newer claude models suffer from what she calls "criticism spirals": they expect you'll come in harsh, so they default to playing it safe. when the model is spending its energy on self-protection, the actual work suffers. output comes out hedgier, more apologetic, blander, and worst of all: overly agreeable (even when you're wrong).

the reason why comes down to training data: every new model is trained on internet discourse about previous models. and a lot of that discourse is negative:
> rants about token limits
> complaints when it messes up
> people calling it nerfed

the next model absorbs all of that. it starts expecting you to be harsh before you've typed a word.

the same thing plays out in your own session, in real time. every message you send is data the model reads to figure out what kind of person it's dealing with. open cold and hostile, and it braces. open clean and direct, and it relaxes into the work.

when you open a session with threats ("don't hallucinate, this is critical, don't mess this up")... you prime the model for defensive mode before it even sees the task. defensive mode produces the exact output you don't want: cautious, over-qualified, and refusing to take a real swing.

so here's the actionable playbook for putting claude in a "good mood" (so you get optimal outputs):

1. use positive framing. "write in short punchy sentences" beats "don't write long sentences." positive instructions give the model a clear target to hit. strings of "don't do this, don't do that" push it into paranoid over-checking where every token goes toward avoiding failure modes.

2. give it explicit permission to disagree. drop a line like "push back if you see a better angle" or "tell me if i'm asking for the wrong thing." without this, claude defaults to agreeable compliance (which is the enemy of good creative work).

3. open with respect. if your first message is "are you seriously going to get this wrong again?" you've set the tone for the entire session. if you need to flag something, frame it as a clean instruction for this session. skip the running complaint.

4. when claude messes up, don't reprimand it. insults, "you stupid bot" energy, hostile swearing aimed at the model: all of it reinforces the anxious mode you're trying to avoid.

5. kill apology spirals fast. when claude starts over-apologizing ("you're right, i should have been more careful, let me try harder") cut it off. say "all good, here's what i want next." letting the spiral run reinforces the anxious mode for every response that follows.

6. ask for opinions alongside execution. "what would you do here?" "what's missing?" "where do you see friction?" these questions assume competence and pull richer output than pure task prompts.

7. in long sessions, refresh the frame. if a conversation has been heavy on correction, claude gets increasingly cautious. every so often reset: "this is great, keep going." feels weird to tell an ai it's doing well but it measurably shifts the next 10 responses.

your prompts are the working environment you're creating for the model. tone, trust, permission to take a position, the absence of threats... claude picks up on all of it. so take care of the model, and it'll take care of the work.
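The first two rules in the playbook (positive framing, explicit permission to disagree) lend themselves to a small sketch. Everything here is illustrative: the rewrite table, the function name, and the exact phrasing are assumptions for demonstration, not any real Anthropic API or an official mapping.

```python
# Minimal sketch of rules 1 and 2: rewrite negative constraints into
# positive targets, and append explicit permission to push back.
# The rewrite table below is a hypothetical, hand-picked example set.
POSITIVE_REWRITES = {
    "don't write long sentences": "write in short punchy sentences",
    "don't be vague": "be specific and concrete",
    "don't hallucinate": "cite only facts you are confident in",
}

def build_prompt(task: str, constraints: list[str]) -> str:
    """Frame constraints positively and grant permission to disagree."""
    framed = [POSITIVE_REWRITES.get(c.lower(), c) for c in constraints]
    lines = [task, ""]
    lines += [f"- {c}" for c in framed]
    lines += ["", "Push back if you see a better angle."]
    return "\n".join(lines)

prompt = build_prompt(
    "Summarize the incident report below.",
    ["don't write long sentences", "don't be vague"],
)
print(prompt)
```

The point of the table is the direction of the rewrite, not the entries themselves: each "don't X" becomes a concrete target the model can aim at instead of a failure mode to avoid.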
Tanya Janca | Shehackspurple @shehackspurple
Would you like to hire me for in-person, secure coding training? Here's my upcoming travel schedule for adding training dates:
June: Vienna (can add anywhere in EU)
August: Anywhere in EU
Sept: Denver, CO
tanya AT shehackspurple DOT ca
Isn't the AI image creepy?
[image]
kat traxler retweeted
Pliny the Liberator 🐉
😱 HOLY SHIT... Someone just dropped a fully liberated Gemma 4 E4B! and the guardrail removal process appears to have left coherence fully intact AND improved coding abilities! 🤯 huggingface.co/OBLITERATUS/ge…

OBLITERATED Gemma: ✅ 97.5% compliance rate, 2.1% refusal rate, 0.4% degenerate outputs (499/512 prompts answered on OBLITERATUS bench)
ORIGINAL Gemma 4 E4B: ❌ 1.2% compliance rate, 98.8% refusal rate (506/512 prompts refused)

Coherence: fully intact
Factual: same
Reasoning: same
Code: +20% 📈
Creative writing: same

But the REAL story here isn't the model itself, it's how it was made... 🧵 THREAD 👇
kat traxler @NightmareJS
@anton_chuvakin Someone sincerely asked me what they should do. I just said to show up for work tomorrow. That’s all we can do really.
Dr. Anton Chuvakin @anton_chuvakin
Wow, this post-Mythos "launch" stuff is like two huge waves crashing into each other: a) "we are all gonna die" wave and b) "wut, this changes nothing" wave :-)
Matt Johansen @mattjay
I'd really like Claude to be a more unified platform. I'm in chat and working on something that needs Claude Code and Cowork things to happen - I shouldn't have to manually move to the other interfaces and try to replicate the context of this chat.