Rishabh Shukla retweetledi

We introduce h4rm3l, a language, a synthesizer, and a scalable and explainable dynamic LLM red-teaming toolkit. h4rm3l found > 2.6k new jailbreak attacks targeting @OpenAI, @AIatMeta, and @AnthropicAI LLMs.
📝 arxiv.org/pdf/2408.04811
🌐 mdoumbouya.github.io/h4rm3l
🧵1/6 👇🏾

English



























