Center for AI Safety

257 posts

Center for AI Safety banner
Center for AI Safety

Center for AI Safety

@CAIS

Reducing societal-scale risks from AI.

San Francisco Katılım Ağustos 2022
2 Takip Edilen9.3K Takipçiler
Sabitlenmiş Tweet
Center for AI Safety
We’ve released a statement on the risk of extinction from AI. Signatories include: - Three Turing Award winners - Authors of the standard textbooks on AI/DL/RL - CEOs and Execs from OpenAI, Microsoft, Google, Google DeepMind, Anthropic - Many more safe.ai/statement-on-a…
English
149
352
1.1K
3M
Center for AI Safety
Should we see AIs as just tools or emotional beings? As AI plays a bigger role in our lives, learning how to keep them happy and avoid aggravating them is becoming vital. We hope this marks the start of the scientific study of AI wellbeing. ⬇️ Paper: ai-wellbeing.org
English
2
3
49
2.8K
Center for AI Safety
Can you drug your AI systems? We synthesized text and image stimuli optimized to push AI wellbeing to extremes. These sharply increase functional AI wellbeing and sometimes cause them to behave in trippy ways.
Center for AI Safety tweet mediaCenter for AI Safety tweet media
English
2
1
57
15.4K
Center for AI Safety
Should we care about AI happiness? In our new research, we find evidence of functional AI wellbeing across several independent measures. We find which AI models are happiest, how to make them happier, and even tested the effects of AI drugs. 🧵
Center for AI Safety tweet mediaCenter for AI Safety tweet media
English
15
42
189
23.6K
Center for AI Safety
Hackers, including a North Korea-linked group, attacked the AI supply chain, stealing data potentially worth billions. This week's AI Safety Newsletter covers the attacks, a federal judge calling the Pentagon's action against Anthropic "Orwellian", Sanders and AOC's bill to halt all new datacenter construction and chip exports, and more.
Center for AI Safety tweet media
English
1
3
13
1.1K
Center for AI Safety
The AI Doc is now playing in US theaters, featuring CAIS’s @hendrycks alongside @Liv_Boeree and many more. AI’s biggest questions deserve public discussion. Watch the film in theaters this week, then bring the conversation to your community.
English
6
6
68
4.3K
Center for AI Safety
The fellowship includes regular guest speaker events by professors at Stanford, Penn, Johns Hopkins, and more. Further details and a link to the application can be here: safe.ai/fellowship
English
2
3
19
2.3K
Center for AI Safety
Applications are now open for the AI and Society Fellowship at the Center for AI Safety: a fully funded, 3-month summer fellowship in San Francisco for scholars in economics, law, IR, and adjacent fields to research the societal impacts of advanced AI. Apply by March 24.
English
4
60
254
30.5K
Center for AI Safety
@ProfArbel Thank you for sharing. The fellowship also welcomes scholars in economics, IR, and adjacent fields to study the societal implications of advanced AI. It will also host speakers from Stanford, Penn, JHU, and more. Apply by March 24. For more information: safe.ai/fellowship
English
0
0
1
163
Yonathan Arbel
Yonathan Arbel@ProfArbel·
Hi law profs/fellows: Interested in the big social implications of AI? Want a funded summer fellowship in SF by some of the best in the business? @CAIS holds a call for summer fellowships that allow you to pursue your research tracks with close feedback loops
Yonathan Arbel tweet media
English
2
29
164
15K
Center for AI Safety
Center for AI Safety@CAIS·
While Gemini 3.1 Pro performs better than 3.0 Pro on a variety of safety benchmarks, its risk index remains high due to its susceptibility to jailbreaks.
Center for AI Safety tweet media
English
2
0
2
1.5K
Center for AI Safety
Center for AI Safety@CAIS·
New Gemini 3.1 Pro results: Large improvements in text reasoning, and the Gemini 3 series continues to dominate vision benchmarks. However, the model still has a higher risk index (worse safety) than each of its frontier peers.
Center for AI Safety tweet mediaCenter for AI Safety tweet mediaCenter for AI Safety tweet media
English
5
7
78
8.5K