Aaron Grattafiori

20.3K posts

@dyn___

Offensive Security / AI Red Teaming @ NVIDIA. Ex-GenAI and OffSec Red Teaming Lead at Meta. Ex-Principal Consultant and Researcher @ NCC Group/iSEC Partners.

Colorado · Joined March 2014

2.5K Following · 6K Followers

Pinned Tweet

Aaron Grattafiori@dyn___·Apr 1

Current Status feels like this is no longer going to be fiction...

1.4K

Aaron Grattafiori@dyn___·1h

@rez0__ Here? It seems terrible vs a year or two ago even. I see pretty relevant stuff and then it can devolve into garbage. I don't even interact with it...

Joseph Thacker@rez0__·4h

this is actually true. i only see stuff i love right now

kache@yacineMTB

Jesus Christ the algorithm is so much better

1.7K

Aaron Grattafiori retweeted

claire vo 🖤@clairevo·1d

Claude@claudeai

There’s hope in hard questions.

616

56.5K

Aaron Grattafiori retweeted

James Aung@jjamesaung·15h

Our Cyber and Autonomous Systems Team at @AISecurityInst performed early access testing of GPT-5.6 Sol for offensive cyber capabilities. We found Sol performed much better than GPT-5.5 on our cyber suite and comparably to Claude Mythos 5. Results below and in system card 🧵

34.3K

Aaron Grattafiori retweeted

Katie Paxton-Fear@InsiderPhD·15h

🧵Can we trust Chinese open weight models? Was a question a lot of people asked after GLM 5.2 was released, scoring very well on coding benchmarks, and suspiciously Claude-like. So I turned an open-weight coding model into a backdoor with 1hr and <$100. Let's talk about it

181

26.8K

Aaron Grattafiori@dyn___·1d

@IceSolst This would be pretty amazing... almost like this classic: youtu.be/UIKy1Shxd6Q?is… but going backwards.

solst/ICE of Astarte@IceSolst·1d

@dyn___ True BUT WE CAN SPREAD THIS VIRUS that way (although apparently it would detect corrupted Pokémon)

137

solst/ICE of Astarte@IceSolst·2d

“GAME BOY can’t get viruses” Oh, yeah? Well then, explain THIS

1.2K

51.8K

Aaron Grattafiori@dyn___·1d

@joshua_saxe "We're not in Kansas anymore... oh wait, no we still are... and what I learned."

Joshua Saxe@joshua_saxe·1d

@dyn___ Hadn't thought to but now you've inspired me Aaron :)

744

Joshua Saxe@joshua_saxe·1d

I did an AI Q&A with a bunch of non-tech working class / middle class / poor folks where I live in Kansas yesterday and they definitely had all these concerns including catastrophic loss of control (they didn't use this term). They were also concerned that coastal plutocrats and elites will gain even more control over their lives, a concern not mentioned in this video

Claude@claudeai

There’s hope in hard questions.

134

20.9K

Aaron Grattafiori retweeted

Crackmes.one@CrackmesOne·3d

We built crackmes-RE: 4,598 crackmes labeled with a flag and/or a runnable verifier script (2,172 have one), plus normalized obfuscation/anti-debugging/protections tags. For benchmarking LLMs' and decompilers' reverse-engineering ability. github.com/crackmesone/cr…

167

10K

Aaron Grattafiori@dyn___·1d

@samhogan Can't DM for some reason, but would be interested to try it! DM me?

Sam Hogan 🇺🇸@samhogan·3d

We're releasing Inference AutoTune. Distill any frontier model into a 1-30B parameter task-specific SLM with only 25 lines of code. Automatically route requests to reduce cost and latency by >90%. ~2 hours and <$250 to train. You own the weights. Available in private beta today.

142

215

2.7K

341.5K

Aaron Grattafiori retweeted

_ZN4DionC1Ev@justdionysus·3d

I’m sad to be missing @SummC0n and @moyix and @kallsyms talk. Fittingly, gpt-5.6-sol found a nice (if slightly slow) path to real ROP (vs the ret2lib I built) against my long time hobby project, Compuserve on Win3.1, tonight:

3.6K

Aaron Grattafiori@dyn___·3d

vx-underground@vxunderground

The NSA Tailored Access Operations (TAO) was renamed to CNO (Computer Network Operations) but recently reverted the name back to TAO. very interesting (I have no idea what this means).

423

Aaron Grattafiori retweeted

Ethan Mollick@emollick·3d

This was one of those impressive AI thresholds for me. I gave GPT-5.6 Sol in Codex control over my computer, and asked it to win the daily challenge for the game Slay the Spire 2 (randomized factors, so can't cheat). It worked for 5 hours, making complex game choices... and won.

112

117

1.9K

189.8K

Aaron Grattafiori@dyn___·3d

@HackingLZ What about boots

245

Justin Elze@HackingLZ·3d

I got a truck and cowboy hat since I moved. I should probably sign up for an LTC class now.

2.3K

Aaron Grattafiori@dyn___·5d

@GregHBurnham I'm glad it's so clear

1.2K

Greg Burnham@GregHBurnham·5d

PSA for benchmarkers, GPT-5.6 has a reasoning "mode" that can be "standard" or "pro", and this is orthogonal to reasoning "effort" which now goes up to "max". There's no separate pricing for "pro" and there's not (yet?) a separate Pro model. Docs linked below.

206

27.6K

Aaron Grattafiori@dyn___·5d

@_xpn_ It's been called "drop" 😂

125

Adam Chester 🏴‍☠️@_xpn_·5d

Adam Chester 🏴‍☠️@_xpn_

Cloudflare Drop just launched, temporary static site hosting (expires after 60 minutes) without an account, just drop your assets and go. Red Teamers... GO GO GO GO!!! cloudflare.com/drop/ Example: …p-4638acd5-ef1.aback-arch.workers.dev

2.9K

Aaron Grattafiori retweeted

Unprompted AU@UnpromptedAU·6d

Speaker Announcement: Thomas Roccia (@fr0gger_) will be speaking about AI threat intelligence - hunting threats across the AI Ecosystem. Gonna be lit!

5.6K

Aaron Grattafiori retweeted

dreadnode@dreadnode·6d

Most offensive security evals stop at the terminal. But models now have to reason from PCB photos, floor plans, cable paths, and drone feeds — where perception and security judgment are inseparable. In our latest blog, read how we crafted offensive security evals for embodied reasoning: dreadnode.io/research/embod… Authors: Ads Dawson and @mkultraWasHere

2.3K

Aaron Grattafiori@dyn___·6d

@S1r1u5_ @alexjplaskett Is this a non X article anywhere?

460

s1r1us (mohan)@S1r1u5_·6d

x.com/i/article/2074…

199

28.7K

Aaron Grattafiori@dyn___·6d

@S1r1u5_ I wonder how much this maps to security outside of CTFs. Probably quite well. Everyone goes further, but the uplift for some is a big jump.

236

s1r1us (mohan)@S1r1u5_·6d

interesting graph and great blog. thanks to ai, the overall competence of CTF teams has gone up while the skill gap between teams has shrunk. if the trend continues, it might flatten the gap entirely and effectively commoditize intelligence, which means you just have to cut AI from your competitions. i don't think that will happen, i believe it's a fat tail where there will be a gap between the best human (augmented by ai) and commoditized ai. that said, the current ctf formats are definitely not pleasurable to play when ai does most of the heavy lifting, so it's a good effort from osec to find a fun format that works with ai.

Robert Chen@NotDeGhost

We're launching a $100,000 fund to save CTFs.

5.1K
