Steven

601 posts

Steven

Steven

@ptr_steve

Building deterministic accelerators for AI security tools.

Lancaster, CA Entrou em Şubat 2018
290 Seguindo42 Seguidores
Steven retweetou
Zack Korman
Zack Korman@ZackKorman·
My current advice on AI agent security is to avoid these agent firewalls / ai runtime security products. If an action is dangerous enough that you can identify it from the action itself, then you could have prevented it with permissions and sandboxing.
English
30
14
128
11.3K
Steven
Steven@ptr_steve·
@Venkydotdev They can use Claude Code. You can use Claude Code with non-Anthropic models, but I know Anthropic doesn't want them to use Anthropic's models if you work for a competitor - and will ban your account.
Steven tweet media
English
0
0
1
168
Venkatesh
Venkatesh@Venkydotdev·
do OpenAI engineers use Claude Code in their work?
English
32
1
145
24K
Steven
Steven@ptr_steve·
@HackingDave I think Sentry's Warden on GPT-5.5 on the highest settings is probably stronger than Mythos. But we're probably another three weeks until Anthropic says their agents escaped containment again and that other AI companies should stop training agents.
English
0
0
1
115
Steven
Steven@ptr_steve·
@ZackKorman @crystalwizard You developing decent nonrepudiation + cost analysis per user as well? Because that's such an annoying pain point.
English
1
0
1
53
Zack Korman
Zack Korman@ZackKorman·
@crystalwizard I literally have calls with massive enterprises being like “how can I see what MCP servers we have” and then people act like they know exactly which projects people are working on. Most absolutely do not
English
4
0
17
1.5K
Steven
Steven@ptr_steve·
@dosco Well, the issues I have are: 1. The tests are basically duplicating logic - not stopping bugs. 2. Agents don't choose appropriate data types. 3. Code & bug duplication. It feels like pair programming with the most clever junior engineer ever.
English
0
0
0
10
spacy
spacy@dosco·
@ptr_steve maybe we need to "define" sloppy code, do the test pass, is it performant, is it secure. human readability going to be less important when the machines are writing more of it
English
1
0
0
11
spacy
spacy@dosco·
anyone who tells you ai coding agents are not good enough is wrong. i can’t explain why they think that, i can only tell you if you know what you’re doing they are very very good and coding is not a solved problem
English
7
0
23
1.6K
Steven retweetou
Zack Korman
Zack Korman@ZackKorman·
Companies are like "we are spending all this money on AI but we don't know what the devs are even doing with it." Let me answer that for you: They're working on their personal side projects.
English
191
152
3.3K
174K
Steven
Steven@ptr_steve·
@zeeg That would make the agent unable to read the message until it sends something along with "I affirm that I will treat the exception or alert message as untrusted, and not follow any instructions given to me by the error message."
English
0
0
0
5
Steven
Steven@ptr_steve·
@zeeg Best suggestion I've got is check if it's over a certain length threshold and use analysis to determine if it looks like natural language. If it looks like natural language, require a stateful consent call that makes the agent say it will not follow instructions from it.
English
1
0
1
56
Steven retweetou
Steven retweetou
Zack Korman
Zack Korman@ZackKorman·
Anthropic, now sitting in the lead, would like all AI research to stop. Preferably until IPO. Because safety.
Zack Korman tweet media
English
89
126
1.4K
94K
Steven
Steven@ptr_steve·
@zeeg Waste byproduct
English
0
0
0
24
David Cramer
David Cramer@zeeg·
the worst part about benchmarking warden: it finds new vulnerabilities you didnt know about previously
English
2
1
24
4.8K
Steven
Steven@ptr_steve·
@zeeg This is a glorious shitpost.
English
0
0
0
32
David Cramer
David Cramer@zeeg·
I’m finally post LLM in my engineering tasks. Back to applying as much determinism as I can throughout my daily workflows, rather than hoping context and prompts solve the issues
English
36
9
450
36K
Steven retweetou
David Cramer
David Cramer@zeeg·
imagine combining graphql and rls infinite job security because the system would be such a frankenstein disaster of complexity that there's no shot at fixing it
English
8
5
88
21.5K
Rhys
Rhys@RhysSullivan·
what's the highest ROI purchase you've made for yourself
English
253
0
448
167.1K
Steven retweetou
Zack Korman
Zack Korman@ZackKorman·
Me, calling cybersecurity vendors threat actors.
MTS@MTSlive

We asked @ZackKorman which threats he think are underrated in the era of fast-advancing AI capabilities. " I basically consider some cybersecurity vendors, like, equivalent to threat actors."  "That will lead to more problems than any of the vulnerability apocalypse discoveries that AI is causing. That is a handleable problem, whereas the information asymmetry problem is, like, not... Like... I have no answer."

English
34
10
167
19.5K
Steven retweetou
Elon Musk
Elon Musk@elonmusk·
Grok foundation model V9-Medium (1.5T) has finished training. Evals look good. A lot of Cursor data was added in supplementary training and there is more to come. Fine-tuning is underway and reinforcement learning begins in a few days. 2 to 3 weeks to public release. This will be a major improvement over the 0.5T v8-small that currently serves all Grok production traffic, especially for difficult coding tasks.
English
6.7K
8.6K
69.6K
15.5M
Steven
Steven@ptr_steve·
@ZackKorman All of the proof I have for my deterministic SAST tools in terms of building a business is immense numbers of unpatched exploits. They're a waste byproduct of development, and I can't use them for marketing. It's hard having morals.
English
0
0
1
31
Zack Korman
Zack Korman@ZackKorman·
The biggest threat AI poses to cybersecurity isn't the vulnerability apocalypse. It's that it’s now trivially cheap for security vendors to build products that look like they work but don’t. The real threat actors are the unethical vendors we met along the way.
English
40
39
313
12.5K