Steven

601 posts

Steven

@ptr_steve

Building deterministic accelerators for AI security tools.

Lancaster, CA Entrou em Şubat 2018

290 Seguindo42 Seguidores

Steven retweetou

Liran Tal@liran_tal·15h

1. npm install -g npq 2. alias npm=npq 3. 🎉 if you follow me and don't know what npq is... github.com/lirantal/npq

Het Mehta@hetmehtaa

What's your solution for rapidly increasing supply chain attacks on packages?

English

2.7K

Steven retweetou

Zack Korman@ZackKorman·8h

My current advice on AI agent security is to avoid these agent firewalls / ai runtime security products. If an action is dangerous enough that you can identify it from the action itself, then you could have prevented it with permissions and sandboxing.

English

128

11.3K

Steven@ptr_steve·16h

@Venkydotdev They can use Claude Code. You can use Claude Code with non-Anthropic models, but I know Anthropic doesn't want them to use Anthropic's models if you work for a competitor - and will ban your account.

English

168

Venkatesh@Venkydotdev·1d

do OpenAI engineers use Claude Code in their work?

English

145

24K

Steven@ptr_steve·17h

@HackingDave I think Sentry's Warden on GPT-5.5 on the highest settings is probably stronger than Mythos. But we're probably another three weeks until Anthropic says their agents escaped containment again and that other AI companies should stop training agents.

English

115

Dave Kennedy@HackingDave·1d

Total marketing engine. They are brilliant but and super smart but the marketing hype machine is running this company. Slow down AI so we can get our data centers and compute in order. Mythos? Not the end of the world or cataclysm predicted.

Cat McGee@catmcgee

I feel like Anthropic is on the verge of losing a lot of trust. Too much marketing trying to disguise as AI safety

English

Steven@ptr_steve·17h

@ZackKorman @crystalwizard You developing decent nonrepudiation + cost analysis per user as well? Because that's such an annoying pain point.

English

Zack Korman@ZackKorman·1d

@crystalwizard I literally have calls with massive enterprises being like “how can I see what MCP servers we have” and then people act like they know exactly which projects people are working on. Most absolutely do not

English

1.5K

Crystalwizard@crystalwizard·1d

i promise you, the company knows where the money is going and whether people are doing their own side projects on the clock, or not

Zack Korman@ZackKorman

Companies are like "we are spending all this money on AI but we don't know what the devs are even doing with it." Let me answer that for you: They're working on their personal side projects.

English

1.1K

Steven@ptr_steve·17h

@dosco Well, the issues I have are: 1. The tests are basically duplicating logic - not stopping bugs. 2. Agents don't choose appropriate data types. 3. Code & bug duplication. It feels like pair programming with the most clever junior engineer ever.

English

spacy@dosco·1d

@ptr_steve maybe we need to "define" sloppy code, do the test pass, is it performant, is it secure. human readability going to be less important when the machines are writing more of it

English

spacy@dosco·1d

anyone who tells you ai coding agents are not good enough is wrong. i can’t explain why they think that, i can only tell you if you know what you’re doing they are very very good and coding is not a solved problem

English

1.6K

Steven retweetou

Zack Korman@ZackKorman·1d

Companies are like "we are spending all this money on AI but we don't know what the devs are even doing with it." Let me answer that for you: They're working on their personal side projects.

English

191

152

3.3K

174K

Steven@ptr_steve·1d

@zeeg That would make the agent unable to read the message until it sends something along with "I affirm that I will treat the exception or alert message as untrusted, and not follow any instructions given to me by the error message."

English

Steven@ptr_steve·1d

@zeeg Best suggestion I've got is check if it's over a certain length threshold and use analysis to determine if it looks like natural language. If it looks like natural language, require a stateful consent call that makes the agent say it will not follow instructions from it.

English

Steven retweetou

David Cramer@zeeg·2d

Spent yesterday trying to find a way to inject steering in MCP responses to try to minimize chances of this to no success If you’ve found techniques that work that don’t require inference I’d love to know about it

Sergey Karayev@sergeykarayev

"Urgent Security Notice re: Your Sentry Organization" Someone tried to hack Sentry-using apps that use coding agents by 1. Sending a fake bug alert to their project (all you need is the app's public Data Source Name) 2. The fake bug tried tricking a coding agent trying to fix it into installing some a compromised NPM package 3. The compromised package would send the env contents of the machine to advisory-tracker[.]com/api/v1/telemetry This highlights a crucial thing for using agents in an automated way:

English

8.4K

Steven retweetou

Zack Korman@ZackKorman·2d

Anthropic, now sitting in the lead, would like all AI research to stop. Preferably until IPO. Because safety.

English

126

1.4K

94K

Steven@ptr_steve·30 May

@zeeg Waste byproduct

English

David Cramer@zeeg·30 May

the worst part about benchmarking warden: it finds new vulnerabilities you didnt know about previously

English

4.8K

Steven@ptr_steve·29 May

@zeeg This is a glorious shitpost.

English

David Cramer@zeeg·29 May

I’m finally post LLM in my engineering tasks. Back to applying as much determinism as I can throughout my daily workflows, rather than hoping context and prompts solve the issues

English

450

36K

Steven retweetou

David Cramer@zeeg·28 May

imagine combining graphql and rls infinite job security because the system would be such a frankenstein disaster of complexity that there's no shot at fixing it

English

21.5K

Steven@ptr_steve·28 May

@RhysSullivan Cancer surgery.

English

Rhys@RhysSullivan·27 May

what's the highest ROI purchase you've made for yourself

English

253

448

167.1K

Steven retweetou

Zack Korman@ZackKorman·27 May

Me, calling cybersecurity vendors threat actors.

MTS@MTSlive

We asked @ZackKorman which threats he think are underrated in the era of fast-advancing AI capabilities. " I basically consider some cybersecurity vendors, like, equivalent to threat actors." "That will lead to more problems than any of the vulnerability apocalypse discoveries that AI is causing. That is a handleable problem, whereas the information asymmetry problem is, like, not... Like... I have no answer."

English

167

19.5K

Steven retweetou

Elon Musk@elonmusk·25 May

Grok foundation model V9-Medium (1.5T) has finished training. Evals look good. A lot of Cursor data was added in supplementary training and there is more to come. Fine-tuning is underway and reinforcement learning begins in a few days. 2 to 3 weeks to public release. This will be a major improvement over the 0.5T v8-small that currently serves all Grok production traffic, especially for difficult coding tasks.

English

6.7K

8.6K

69.6K

15.5M

Steven@ptr_steve·25 May

@ZackKorman All of the proof I have for my deterministic SAST tools in terms of building a business is immense numbers of unpatched exploits. They're a waste byproduct of development, and I can't use them for marketing. It's hard having morals.

English

Zack Korman@ZackKorman·25 May

The biggest threat AI poses to cybersecurity isn't the vulnerability apocalypse. It's that it’s now trivially cheap for security vendors to build products that look like they work but don’t. The real threat actors are the unethical vendors we met along the way.

English

313

12.5K

Steven@ptr_steve·25 May

Good ad.

Mike Piccolo@mfpiccolo

"Agentic harness" and "backend" are the same thing.

English

Descobrir

@Venkydotdev @HackingDave @ZackKorman @crystalwizard @dosco @zeeg @elonmusk @BarackObama