Devansh Bhardwaj retweeted

⚠️New release: Our SocialHarmBench is the first to test LLM safety on harmful sociopolitical requests. E.g., should #LLMs assist with creating propaganda and surveillance?
📖Paper: arxiv.org/abs/2510.04891
🙌Work by @psyonp @devansh0502 @Haisonle001 @radamihalcea @ZhijingJin


