Center for AI Safety

257 posts

@CAIS

Reducing societal-scale risks from AI.

San Francisco · Joined August 2022

2 Following · 9.3K Followers

Pinned Tweet

Center for AI Safety@CAIS·30 May

We’ve released a statement on the risk of extinction from AI. Signatories include:
- Three Turing Award winners
- Authors of the standard textbooks on AI/DL/RL
- CEOs and Execs from OpenAI, Microsoft, Google, Google DeepMind, Anthropic
- Many more
safe.ai/statement-on-a…

149

352

1.1K

Center for AI Safety@CAIS·6d

Should we see AIs as just tools or emotional beings? As AI plays a bigger role in our lives, learning how to keep them happy and avoid aggravating them is becoming vital. We hope this marks the start of the scientific study of AI wellbeing. ⬇️ Paper: ai-wellbeing.org

2.8K

Center for AI Safety@CAIS·6d

Can you drug your AI systems? We synthesized text and image stimuli optimized to push AI wellbeing to extremes. These sharply increase functional AI wellbeing and sometimes cause them to behave in trippy ways.

15.4K

Center for AI Safety@CAIS·6d

Should we care about AI happiness? In our new research, we find evidence of functional AI wellbeing across several independent measures. We find which AI models are happiest, how to make them happier, and even tested the effects of AI drugs. 🧵

189

23.6K

Center for AI Safety@CAIS·10 Apr

The full edition is available here, along with links to open roles at CAIS: newsletter.safe.ai/p/aisn-71-cybe…

547

Center for AI Safety@CAIS·10 Apr

Hackers, including a North Korea-linked group, attacked the AI supply chain, stealing data potentially worth billions. This week's AI Safety Newsletter covers the attacks, a federal judge calling the Pentagon's action against Anthropic "Orwellian", Sanders and AOC's bill to halt all new datacenter construction and chip exports, and more.

1.1K

Center for AI Safety@CAIS·30 Mar

The AI Doc is now playing in US theaters, featuring CAIS’s @hendrycks alongside @Liv_Boeree and many more. AI’s biggest questions deserve public discussion. Watch the film in theaters this week, then bring the conversation to your community.

4.3K

Center for AI Safety@CAIS·24 Mar

x.com/i/article/2034…

1.8K

Center for AI Safety@CAIS·19 Mar

en.wikipedia.org/wiki/Oppositio…

6.6K

Center for AI Safety@CAIS·19 Mar

To clarify, the Center for AI Safety has not taken funding from Coefficient Giving / Open Philanthropy for years. We believe the effective altruism movement is, unfortunately, controlled opposition. The less influence it has on AI safety, the better.

Dan Hendrycks@hendrycks

EA ≠ AI safety AI safety has outgrown the EA community The world will be safer with a broad range of people tackling many different AI risks

165

124.1K

Center for AI Safety@CAIS·12 Mar

@boazbaraktcs @mia_glaese @justinlinw @danboneh @davisbrownr @MantasMazeika96 @MrinankSharma @MilesMcCain newsletter.mlsafety.org/p/mlsn-19-hone…

1.1K

Center for AI Safety@CAIS·12 Mar

x.com/i/article/2031…

Center for AI Safety@CAIS·11 Mar

The fellowship includes regular guest speaker events by professors at Stanford, Penn, Johns Hopkins, and more. Further details and a link to the application can be found here: safe.ai/fellowship

2.3K

Center for AI Safety@CAIS·11 Mar

Applications are now open for the AI and Society Fellowship at the Center for AI Safety: a fully funded, 3-month summer fellowship in San Francisco for scholars in economics, law, IR, and adjacent fields to research the societal impacts of advanced AI. Apply by March 24.

254

30.5K

Center for AI Safety@CAIS·11 Mar

@ProfArbel Thank you for sharing. The fellowship also welcomes scholars in economics, IR, and adjacent fields to study the societal implications of advanced AI. It will also host speakers from Stanford, Penn, JHU, and more. Apply by March 24. For more information: safe.ai/fellowship

163

Yonathan Arbel@ProfArbel·10 Mar

Hi law profs/fellows: Interested in the big social implications of AI? Want a funded summer fellowship in SF by some of the best in the business? @CAIS holds a call for summer fellowships that allow you to pursue your research tracks with close feedback loops

164

15K

Center for AI Safety@CAIS·25 Feb

More detailed results at: dashboard.safe.ai

1.2K

Center for AI Safety@CAIS·25 Feb

While Gemini 3.1 Pro performs better than 3.0 Pro on a variety of safety benchmarks, its risk index remains high due to its susceptibility to jailbreaks.

1.5K

Center for AI Safety@CAIS·25 Feb

New Gemini 3.1 Pro results: Large improvements in text reasoning, and the Gemini 3 series continues to dominate vision benchmarks. However, the model still has a higher risk index (worse safety) than each of its frontier peers.

8.5K
