


Eva Behrens

@_ebehrens_
AI Policy in London. Nobody knows how to build controllable AGI - so let's not do it!




Or, y'know, we could have a global non-proliferation treaty, starting with a bilateral treaty between the US and the CCP.



Here are 5 policy recommendations for the upcoming AI Safety Summit in Seoul, from me and my colleagues at ICFG. In Bletchley, world leaders discussed major risks of frontier AI development. In Seoul, they should agree on concrete next steps to address them.

A case study in parrot misalignment: When we first got her, sometimes she'd get scared of new toys or other unfamiliar objects and we would reassure her with "it's ok". So now whenever she's scared of something, she says "it's ok" over and over while rapidly backing away from it.

1) What, CAPTCHA means "Completely Automated Public Turing Test to Tell Computers and Humans Apart"???

In the not-so-distant future, generative AI could enable the creation of new user interfaces that can persuade on behalf of any person or entity with the means to establish such a system, predict @Exp_Mark, Josh Entsminger, and @Terencecmtse. bit.ly/4aFjXVr

Debate with Connor is on! Will debate him under my real name, @GillVerd, removing the power asymmetry of anonymity and thus evening the playing field. Stay tuned for details. We are figuring them out.




One specific lesson from that history: It's better if liability rests with whoever can address the *root causes* of risks, and if it rests with larger organizations that have the resources to invest in safety engineering, as opposed to with small businesses or individuals.

As the #AIAct reaches the year's final trilogue on Dec. 6, a chorus of voices is speaking out against exempting foundation models. They cite the irreparable harm it will do to #EU innovation and how it will put Big Tech profits ahead of safety. Some examples 🇪🇺👇 🧵1/16