Sven Cattell

1.9K posts

Sven Cattell

@comathematician

Founder of @aivillage_dc. Former topologist. I blue team math. 🙂

Katılım Ekim 2011

698 Takip Edilen1.1K Takipçiler

Sabitlenmiş Tweet

Sven Cattell@comathematician·5 May

1) Ok, now that I have a moment I wanna tell some of the story behind this event at @aivillage_dc as I've been working on this for 9 months.

AI Village @ DEF CON@aivillage_dc

We've been hard at work on the Generative Red Team event we're doing at @defcon for a while and are excited that the @WhiteHouse announced it this morning. Here's more details: aivillage.org/generative%20r…

English

54.7K

Sven Cattell@comathematician·13 Ağu

This, but for AI Security. The field is filled with people trying to make a quick buck and don't care about the long term health of the field and it's community.

MG@_MG_

“and your freedom is gone” would be a great way to destroy defcon’s brand and comes off as extreme punishment for a kid throwing sand in a sandbox. However your post does exhibit a commonality with why we have this issue: lack of contextual nuance. We have far too few people in the space willing to culturally guide people towards nuance that’s appropriate for the context of the situation/environment/audience. There are appropriate times for attention grabbing stunts. And its almost always targeting an audience of defenders & resource allocators. And beforehand there should be a deliberate process of understanding how the intended audience will receive it, what they can meaningfully do in response, dynamics of consent, laws, etc etc. People who are new to the space often miss all of that and try to repeat stuff without this nuance. Quick thrills in a world increasingly focused on attention. Even though the action has the tactical equivalent of throwing a brick through a window. Yea… glass can shatter. We all know! Outside of a longer attack chain (and all the other nuance mentioned) it means nothing. Buuuut… new people to the space aren’t often to detailed nuance. Few will read all this. So, for those people, i will just leave a picture of this sticker that someone gave me at defcon:

English

303

Sven Cattell@comathematician·11 Ağu

@jakkuh_t I'm the @aivillage_dc founder who bumped into you with the weird hardware stack for a weird application in AI security.

English

Sven Cattell retweetledi

Avijit Ghosh@evijit·26 Şub

I'll be at @RealAAAI Conference in Philadelphia this week, where I am part of two accepted papers: 1. Quantifying Misalignment Between Agents: Towards a Sociotechnical Understanding of Alignment, with @AidanKierans , Hananel Hazan, and @ShirKi . In this work, we introduce a novel mathematical model to measure misalignment between multiple human and AI agents across various problem domains, moving beyond single-agent or monolithic approaches to alignment. Through simulations and case studies we demonstrate how our model captures nuanced aspects of misalignment in complex sociotechnical environments, providing enhanced explanatory power for real-world scenarios where agents may hold conflicting goals. Come see our poster during the AI Alignment Track on Friday the 28th - 12:30pm! 2. To Err is AI: A Case Study Informing LLM Flaw Reporting Practices, with @seanmcgregor , @ShayneRedford, @comathematician, and others! This paper documents lessons learned from a bug bounty event at DEF CON 2024 where 495 hackers tested the Open Language Model (OLMo) for flaws, revealing challenges in AI safety reporting processes. Through real-time adjudication of 200 submissions, we identify key insights for effective flaw reporting programs, including the need for specialized tooling, clear documentation practices, and proper adjudication expertise, demonstrating how systematic evaluation and coordinated, structured flaw reporting of AI systems can help prevent real-world harms. See this work presented at IAAI in the "AI Safety, Reliability, and Incident Management" session on Thursday the 27th at 2:30pm! If you're around and want to chat, hit me up! Let's talk AI, Disclosures, Agents, and more!

English

654

Sven Cattell@comathematician·3 Şub

@goingforbrooke A bulletin board with all the instances of "This is where things went wrong" can help. The CVE/VDP process creates this market force.

English

goingforbrooke 🦀@goingforbrooke·3 Şub

the hardest thing to sell is the idea of what DIDN’T happen (e.g. safety/security)

English

105

Sven Cattell retweetledi

Saoud Khalifah@SaoudKhalifah·29 Oca

i broke deepseek

English

2.9K

Sven Cattell@comathematician·28 Oca

Meta has some of the best AI risk management infrastructure ever. Fighting spam for 20 years with ML has equipped them for this instance. Use them instead of figuring out it on your own.

English

139

Sven Cattell@comathematician·28 Oca

The main moat of OpenAI, Google, Anthropic and the rest are the security layers they offer to keep the models behaving as they should. AI security is very difficult and starting with a trusted llm with a solid & agile security team saves businesses money.

English

816

Sven Cattell@comathematician·10 Ara

@samuelcolvin @rseymour Isn't python type system is basically just documentation. Isn't the enforcing done through linters, and libraries like pydantic?

English

Samuel Colvin@samuelcolvin·3 Eki

@comathematician @rseymour It sound like you've confused Pydantic with the Python type system 🙁.

English

Sven Cattell@comathematician·2 Eki

Coding in python feels like spooky action at a distance. You never quite know what you're doing and the documentation is mostly there.

English

449

Sven Cattell@comathematician·14 Kas

I got hopeful that the ML attack, Hop Skip Jump, was in the wild...

watchTowr@watchtowrcyber

hop skip jump over to our latest blog post - analysing Fortinet's FortiJump CVE-2024-47575, FortiJump-Higher (we love this name😄) and beyond (PoC included) labs.watchtowr.com/hop-skip-forti…

English

310

Sven Cattell@comathematician·2 Eki

@rseymour For the first time I was forced to really use Pydantic today. It was terrible. "You didn't pass the timestamp" - well, that's because it's Optional with a default value of None. Why can't you tell? Typed Python - it just barely works... sometimes.

English

110

Rich Seymour@rseymour·2 Eki

@comathematician If you execute every new line when added it’s almost like having a compiler. 🤣

English

Sven Cattell@comathematician·12 Eyl

I've been in the US for 20 years. We landed 9/11/2004.

English

196

Sven Cattell@comathematician·10 Eyl

@EdwardRaffML I already gave you a hat/fire-hazard.

English

Edward Raff@EdwardRaffML·10 Eyl

Can someone do me the most vain of favors? Off by one CS BS 😤

English

315

Sven Cattell@comathematician·27 Ağu

@Dan_Jeffries1 We tried that with the second generative red team: grt.aivillage.org There's substantial changes for GRT3 from things we learnt in GRT2.

English

Daniel Jeffries@Dan_Jeffries1·25 Ağu

Maybe if we red teamed legislation as fiercely we red team AI, we'd get better legislation. Sadly many people who propose legislation are actively hostile to any and all feedback. This post from a lawyer, engineer and former FTC employee looks at the unintended side effects of legislation like SB1047 and how "reasonable" is always in the mind of the beholder and the enforcer.

Neil Chilson ⤴️⬆️🆙📈 🚀@neil_chilson

x.com/i/article/1827…

English

156

69.1K

Sven Cattell@comathematician·27 Ağu

We dunked them this year. #dunkafed @BlueTeamVillage @aivillage_dc @wisporg @BlackInCyberCo1

Nyedis@NyedisIAM

DEF CON is DEAD to me! 💀

English

1.5K

Sven Cattell retweetledi

Biohacking Village 🧪@DC_BHV·21 Ağu

Reminder Alert* The #BiohackingVillage is proud to be a #CNA (#CVE Numbering Authority), empowering us to assist companies in managing and disclosing #vulnerabilities responsibly. More info at villageb.io/cna. #VulnerabilityDisclosure #Cybersecurity #PatientSafety

English

2.2K

Sven Cattell@comathematician·14 Ağu

One way to make a QM goon happy is to give them gaffer tape and power strips. AIV had some extra. 😄

English

138

Sven Cattell@comathematician·12 Ağu

We built a quick landing page in @wix and every part of their site is designed to take your domain hostage. Never use them. #enshittfication

English

230

Sven Cattell@comathematician·12 Ağu

This year's AIV is what I want @aivillage_dc at @defcon to be. Community, connections, and learning is what I want to foster.

AI Village @ DEF CON@aivillage_dc

Generative Red Team 2 was a massive success. We paid $7350 in bounties. We learnt so much about bounties and reporting for ML. Thank you to everyone who participated!! (specific acks in the thread below)

English

1.1K

Sven Cattell retweetledi

AI Village @ DEF CON@aivillage_dc·12 Ağu

@dreadnode and @bugcrowd built the platform. @allen_ai and UL's DSRI brought the model. @AISafetyInst and @GoogleAI made the workshop happen. There were a bunch of other people and orgs that helped plan and execute.

English

2.7K

Keşfet

@jakkuh_t @aivillage_dc @RealAAAI @AidanKierans @ShirKi @seanmcgregor @ShayneRedford @goingforbrooke