Zamantikazamantika
Trendler Tweet Arşivi Blog

Post

Shakeel
Shakeel@ShakeelHashim·3d
you’re saying we should have … technical safety standards? that the government can use? to evaluate if a model is safe to release? and they should be written by a collaboration between industry and government?
Shakeel tweet media
Sophia Cai@SophiaCai99

NEW: White House and Anthropic are working to create a formal technical assessment framework that can quantify the severity of the jailbreak in question and create a standardized methodology for evaluating similar incidents in the future.  It’s the clearest sign yet that talks are moving forward and it reflects an understanding that no AI model can be completely immune to hacking. Aim is to developing a common set of benchmarks that could be used to assess future jailbreaks, including the extent to which safeguards were bypassed, the capabilities exposed, and the practical consequences of the breach. w/ @cheyennehaslett politico.com/news/2026/06/1…

English
1
4
58
3.1K
Wolfe Folks
Wolfe Folks@WolfeFolks·3d
@ShakeelHashim Just like Dario asked for
English
0
0
1
27
Paylaş
Zamantikazamantika - Mersobahis - Locabet

Twitter/X profillerini, tweetleri ve trendleri anonim olarak görüntüleyin. Hesap gerekmez.

Gezinti

  • Ana Sayfa
  • Trendler
  • Tweet Arşivi
  • Blog
  • Hakkımızda
  • İletişim

Popüler Profiller

  • @elonmusk
  • @BarackObama
  • @taylorswift13
  • @cristiano
  • @NASA

Yasal

  • Kullanım Şartları
  • Gizlilik Politikası

© 2025 Zamantika. Tüm hakları saklıdır.

zamantika.com