Joe O’Brien

117 posts

@__J0E___

Joined April 2022
231 Following · 191 Followers
Joe O’Brien retweeted
Peter Wildeford🇺🇸🚀 @peterwildeford
PALO ALTO NETWORKS on MYTHOS: "In our testing, three weeks of model-assisted analysis matched a full year of manual penetration testing, with broader coverage."
38 replies · 249 reposts · 2.3K likes · 203.9K views
Joe O’Brien retweeted
Jack Clark @jackclarkSF
I've spent the past few weeks reading 100s of public data sources about AI development. I now believe that recursive self-improvement has a 60% chance of happening by the end of 2028. In other words, AI systems might soon be capable of building themselves.
289 replies · 495 reposts · 3.5K likes · 1.6M views
Joe O’Brien @__J0E___
We argue that whenever a substantially more capable or riskier model is deployed internally, the developer should create a risk report and argue why the model is safe to deploy.
1 reply · 0 reposts · 0 likes · 26 views
Joe O’Brien @__J0E___
Models used internally at AI companies have capabilities beyond those of publicly-available models, so it's important that risks from these models are reported externally. We've just published a report on how this should be done.
1 reply · 0 reposts · 1 like · 57 views
Joe O’Brien @__J0E___
@robertwiblin @JustinBullock14 What are his thoughts on propagating use of these techniques to frontier AI companies or downstream users, especially absent high-profile cases of misalignment which might create the will to do so?
0 replies · 0 reposts · 0 likes · 65 views
Rob Wiblin @robertwiblin
This week I'm interviewing the world's most cited computer scientist, Yoshua Bengio. He chairs the International AI Safety Report. But his primary focus is developing a comprehensive solution to the AI alignment problem: 'Scientist AI'. What should I ask him?
23 replies · 6 reposts · 139 likes · 6K views
Joe O’Brien retweeted
Jam Kraprayoon @JKraprayoon
1/ A few months ago, my co-authors and I at @iapsAI set out to answer a question that felt increasingly urgent: what happens when AI systems can run sophisticated cyber operations entirely on their own?
1 reply · 8 reposts · 33 likes · 2.1K views
Joe O’Brien retweeted
Institute for AI Policy and Strategy (IAPS)
Applications are open for the IAPS AI Policy Fellowship! 3 months, fully funded, remote or DC. This is your opportunity to work with leading AI policy experts on research projects meant to secure a positive future in a world with powerful AI. Apply by Feb 2 at the link in the thread.
1 reply · 7 reposts · 23 likes · 4.6K views
Joe O’Brien retweeted
Geoffrey Irving @geoffreyirving
Do you want to fund AI alignment research? The AISI Alignment Team and I have reviewed >800 Alignment Project Applications from 42 countries, and we have ~100 that are very promising. Unfortunately, this means we have a £13-17M funding gap! Thread with details!🧵
Geoffrey Irving @geoffreyirving

I am very excited that AISI is announcing over £15M in funding for AI alignment and control, in partnership with other governments, industry, VCs, and philanthropists! Here is a 🧵 about why it is important to bring more independent ideas and expertise into this space.

4 replies · 38 reposts · 191 likes · 44K views
Joe O’Brien @__J0E___
RT @deanwball: If you said: “We should have real-time incident reporting for large-scale frontier AI cyber incidents.” A lot of people in…
0 replies · 7 reposts · 0 likes · 39 views