Michael Allen

7.6K posts

@_Dark_Knight_

Building at the intersection of AI + offensive security::https://t.co/5DiZGjk6VJ

Seattle, WA · Joined March 2009

277 Following · 1.5K Followers

Pinned Tweet

Michael Allen@_Dark_Knight_·25 Jan

iOS Reversing Lascon Edition | m.youtube.com/watch?v=3ljBmx… #iOS #reversing

Michael Allen@_Dark_Knight_·11h

@mvalsmith @Dave_Maynor @zeroxjf @trq212 how are the results so far?

Val Smith@mvalsmith·1d

@Dave_Maynor @zeroxjf @trq212 I've suggested having a vetted cyber-researcher option or something but have been ignored. I canceled most of the LLM subscriptions, bought some rtx 6000s, and am using local LLMs with cyber friendly system prompts now. Done with nanny no privacy LLMs.

johnny@zeroxjf·1d

The new cyber-abuse guardrails in Opus 4.6 are likely to drive a mass exodus of researchers from the platform. They give the option to submit a form to prove legitimate research, but I got no confirmation of my submission last week and have no way of knowing its status 🤷‍♂️ @trq212

Michael Allen@_Dark_Knight_·11h

@zeroxjf @EvanKlein338226 @trq212 codex will not even write a frida script for the most basic of things

johnny@zeroxjf·1d

@EvanKlein338226 @trq212 2 weeks?! 🤦🏻‍♂️ it’s like they want people to flock to codex even more rapidly

Michael Allen retweeted

Daniel Hnyk@hnykda·18h

futuresearch.ai/blog/litellm-p…

Michael Allen@_Dark_Knight_·2d

@trq212 @mattpocockuk sounds like grill-me almost.....

Thariq@trq212·2d

we're testing a new version of /init based on your feedback- it should interview you and help setup skills, hooks, etc. you can enable it with this env_var flag: CLAUDE_CODE_NEW_INIT=1 claude would love your feedback!

Thariq@trq212

I want to make /init more useful- what do you think it should do to help setup Claude Code in a repo?

Michael Allen@_Dark_Knight_·3d

@DanielMiessler yeah my go to approach now is CLI first

ᴅᴀɴɪᴇʟ ᴍɪᴇssʟᴇʀ 🛡️@DanielMiessler·3d

MCP is confusing. Some think it died when CLIs took over, but you can also view them as complementary. Like MCP is how you stay fresh on what's available in the tool/service, and how to use it, and CLI is how you actually execute.

Michael Allen@_Dark_Knight_·4d

@NotMedic @HackingLZ @UK_Daniel_Card maybe we have had it wrong all this time -- I will see myself out......

Tim McGuffin@NotMedic·4d

@HackingLZ @UK_Daniel_Card … they named the company Delve? Literally the word that was used to pick out the first wave of AI slop? 😂

Justin Elze@HackingLZ·4d

I wouldn’t trust someone who called it “Pen test” vs “Pentest”

Eshaan Gulati@eshaangulati_

I was on a slack channel with delve and almost wired them 8K yesterday for SOC2… X saved my life - for real this time.

Michael Allen@_Dark_Knight_·14 Mar

@mattpocockuk ok kewl...was running into some weirdness with processing each issue -- think it was due to the updates I made to the script -- eventually worked but was curious if I was missing something -- thanks

Matt Pocock@mattpocockuk·14 Mar

@_Dark_Knight_ Yeah about the same as you! What issues are you having?

Michael Allen@_Dark_Knight_·13 Mar

@mattpocockuk I was curious on what your ralph-loop looks like to work through the issues created from the /prd-to-issues skill. Right now I do something like 5. Push and create a PR, then merge it. 6. Close the issue. But what is your approach?

Michael Allen@_Dark_Knight_·13 Mar

So I thought I had a bug, and man, was Claude confident! Gave it to Codex to review with all the reports and context and it said: do not report this. Went back to Claude, who then said: yeah, I overstated some things.

Michael Allen@_Dark_Knight_·11 Mar

@BleylDev ok....figured something was up....

Bo Bleyl@BleylDev·11 Mar

Claude auth having issues this morning. Claude Code triggers login screen. Login screen hangs for ~20 seconds. Problem is, Claude Code has a 15 second time-out for waiting on auth. So by the time the success message comes through, claude code ignores it due to timeout. Lovely.
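
A minimal sketch of the race described above, assuming the client simply waits on the login with a fixed timeout. The function names and the time scaling are hypothetical and are not Claude Code's actual implementation; only the roughly 20 second hang and the 15 second timeout come from the post.

```python
import asyncio


async def login(delay_s: float) -> str:
    # Hypothetical stand-in for the browser login flow; delay_s models how
    # long the login screen hangs before reporting success.
    await asyncio.sleep(delay_s)
    return "success"


async def wait_for_auth(login_delay_s: float, client_timeout_s: float) -> str:
    # The client stops waiting once its own timeout elapses, so a success
    # message that arrives later is never seen.
    try:
        return await asyncio.wait_for(login(login_delay_s), timeout=client_timeout_s)
    except asyncio.TimeoutError:
        return "timed out"


def simulate(login_delay_s: float, client_timeout_s: float, scale: float = 0.01) -> str:
    # scale shrinks the delays so the simulation finishes quickly;
    # it is an assumption of this sketch only.
    return asyncio.run(wait_for_auth(login_delay_s * scale, client_timeout_s * scale))
```

With these assumptions, `simulate(20, 15)` reports a timeout even though the login eventually succeeds, while `simulate(10, 15)` returns the success message. Any fix would need the client timeout to exceed the slowest expected login.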

Michael Allen@_Dark_Knight_·11 Mar

[BLOG] Cassian: Agentic Differential Security Review for Pull Requests | mykalseceng.github.io/posts/cassian-…

Michael Allen@_Dark_Knight_·8 Mar

@0xTib3rius @nitr0usmx favicon disclosure or go bust

Tib3rius@0xTib3rius·8 Mar

You may not like it, but this is what peak hacking looks like.

Michael Allen@_Dark_Knight_·8 Mar

@stefanboesen what is ruby --

Stefan Boesen@stefanboesen·8 Mar

Love that codex knows Ruby is a superior language to Python

Michael Allen retweeted

Stefan Boesen@stefanboesen·7 Mar

OpenAI released Symphony this week. I tried implementing the same pattern to see how it behaves. Most of the work ended up in review loops, artifacts, visibility, and PRs. Notes: blog.boesen.me/posts/lessons-…

Michael Allen@_Dark_Knight_·7 Mar

@sam_burns_tech @Jhaddix @rez0__ I do a similar thing here -- mykalseceng.github.io/posts/agentic-…

Joseph Thacker@rez0__·7 Mar

It did until 4.6 came out

Joel Eriksson@OwariDa

"XBOW uses thousands of short-lived agents, each with a narrow objective, orchestrated by a persistent coordinator and validated by deterministic logic. Each agent starts fresh — no accumulated context, no compounding errors. When I wrote about model alloys earlier this year, I've described how even at the individual agent level, a couple of great ideas interspersed with methodical follow-up actions is what solves a challenge" Architecture matters
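
A rough sketch of the pattern in the quote: a persistent coordinator hands each narrow objective to a fresh agent and accepts a result only if a deterministic check passes. All names here (`Finding`, `coordinate`, the toy agent and validator) are hypothetical illustrations, not XBOW's code.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass(frozen=True)
class Finding:
    objective: str
    evidence: str


def coordinate(
    objectives: Iterable[str],
    make_agent: Callable[[], Callable[[str], Finding]],
    validate: Callable[[Finding], bool],
) -> List[Finding]:
    """Run one fresh agent per objective and keep only validated findings."""
    accepted: List[Finding] = []
    for objective in objectives:
        agent = make_agent()       # new agent each time, no context carried over
        finding = agent(objective)
        if validate(finding):      # deterministic gate, not another model call
            accepted.append(finding)
    return accepted


def make_toy_agent() -> Callable[[str], Finding]:
    # Hypothetical stand-in for a model-backed agent; it only produces
    # evidence for objectives that mention "login".
    def agent(objective: str) -> Finding:
        evidence = "HTTP 200 with session cookie" if "login" in objective else ""
        return Finding(objective, evidence)
    return agent


def has_evidence(finding: Finding) -> bool:
    # Toy deterministic validator: accept only findings that carry evidence.
    return bool(finding.evidence)
```

Calling `coordinate(["probe login form", "probe search box"], make_toy_agent, has_evidence)` keeps only the finding for the login objective, since the toy agent returns no evidence for the other one.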

Michael Allen@_Dark_Knight_·5 Mar

[BLOG] mykalseceng.github.io/posts/agentic-… | ODYSSEUS: Building an Agentic Pentest Platform -- How I built a multi-stage agentic pentest pipeline, what it found and missed, and how to use the approach in your workflows

Michael Allen@_Dark_Knight_·5 Mar

"I'm not giving you the answer — I'm grilling you. Let me reframe:" -- wait what?

Michael Allen@_Dark_Knight_·4 Mar

@stokfredrik @HackingLZ well yes 😂

STÖK ✌️@stokfredrik·4 Mar

@_Dark_Knight_ @HackingLZ At the same time they have a junior dev, the marketing team, the cfo all smashing metrics, building stuff and being productive using personal accounts. The disconnect is real and compliance and legal will have a hard time catching up.

STÖK ✌️@stokfredrik·3 Mar

I once said: AI is not going to take your job as a pentester or bugbounty hunter. I was wrong.

Michael Allen@_Dark_Knight_·4 Mar

@HackingLZ @stokfredrik "For example, many wouldn't allow external or even internal people to run their data through frontier models during offensive testing." -- 100% this

Justin Elze@HackingLZ·4 Mar

I have no doubt AI tooling will augment testing in lots of ways, so if that means fewer OffSec jobs, I get it. We were in a period where OffSec was "easy" and people forgot the job was supposed to get harder over time; instead, boot camps told people they would make X after 12 weeks. The nature of most organizations' security programs is a little more complex than their public-facing bug bounty programs. The leap most people are making is that AI will close the gap to 80% and someone with no domain knowledge will drive that 80%. There is also a whole "replace a job vs. tasks" argument that all of AI land is currently having. Another somewhat useful point: bug bounties largely avoid the data protection requirements companies have. For example, many wouldn't allow external or even internal people to run their data through frontier models during offensive testing. The greater tipping point in the replacement discussion will come when local models reach a certain capability threshold, because it will allow companies to maintain safeguards while still meeting compliance and regulatory requirements. In that same space, there's also a lack of training data for internal pentesting and other areas compared to much of the bug bounty landscape.

Michael Allen@_Dark_Knight_·2 Mar

@mattpocockuk let's be real....though

Matt Pocock@mattpocockuk·2 Mar

I have an AI that writes stuff for me. Here are all the phrases I've banned it from using: "real power", "wake-up call", "fundamentally changes", "key benefit", "cut through the noise", "key insight", "the irony", "the good news", "the reality", "it's kind of like", "here's the thing:", "hard truth", "uncomfortable truth"
