Shakeel@ShakeelHashim·3d

you’re saying we should have … technical safety standards? that the government can use? to evaluate if a model is safe to release? and they should be written by a collaboration between industry and government?

Sophia Cai@SophiaCai99

NEW: White House and Anthropic are working to create a formal technical assessment framework that can quantify the severity of the jailbreak in question and create a standardized methodology for evaluating similar incidents in the future. It’s the clearest sign yet that talks are moving forward and it reflects an understanding that no AI model can be completely immune to hacking. The aim is to develop a common set of benchmarks that could be used to assess future jailbreaks, including the extent to which safeguards were bypassed, the capabilities exposed, and the practical consequences of the breach. w/ @cheyennehaslett politico.com/news/2026/06/1…

Wolfe Folks@WolfeFolks·3d

@ShakeelHashim Just like Dario asked for
