Joe O’Brien

117 posts

Joe O’Brien

@__J0E___

Katılım Nisan 2022

231 Takip Edilen191 Takipçiler

New IAPS memo with @rosen_br and @covinstantinop: a national security playbook for federal action on frontier AI. Focused on securing models, defensive automation, tracking frontier risks, and building government capacity. Full memo: iaps.ai/research/after…

English

Joe O’Brien retweetledi

Peter Wildeford🇺🇸🚀@peterwildeford·8 May

PALO ALTO NETWORKS on MYTHOS: "In our testing, three weeks of model-assisted analysis matched a full year of manual penetration testing, with broader coverage."

English

249

2.3K

203.9K

Joe O’Brien retweetledi

Jack Clark@jackclarkSF·4 May

I've spent the past few weeks reading 100s of public data sources about AI development. I now believe that recursive self-improvement has a 60% chance of happening by the end of 2028. In other words, AI systems might soon be capable of building themselves.

English

289

495

3.5K

1.6M

Joe O’Brien@__J0E___·30 Nis

Full report: iaps.ai/research/risk-…

English

Joe O’Brien@__J0E___·30 Nis

We argue that whenever a substantially more capable or riskier model is deployed internally, the developer should create a risk report and argue why the model is safe to deploy.

English

Joe O’Brien@__J0E___·30 Nis

Models used internally at AI companies have capabilities beyond those of publicly-available models, so it's important that risks from these models are reported externally. We've just published a report on how this should be done.

English

Joe O’Brien@__J0E___·13 Nis

@robertwiblin @JustinBullock14 What are his thoughts on propagating use of these techniques to frontier AI companies or downstream users, especially absent high-profile cases of misalignment which might create the will to do so?

English

Rob Wiblin@robertwiblin·12 Nis

This week I'm interviewing the world's most cited computer scientist, Yoshua Bengio. He chairs the International AI Safety Report. But his primary focus is developing a comprehensive solution to the AI alignment problem: 'Scientist AI'. What should I ask him?

English

139

Joe O’Brien retweetledi

Jam Kraprayoon@JKraprayoon·11 Mar

1/ A few months ago, my co-authors and I at @iapsAI set out to answer a question that felt increasingly urgent: what happens when AI systems can run sophisticated cyber operations entirely on their own?

English

2.1K

Joe O’Brien retweetledi

Institute for AI Policy and Strategy (IAPS)@iapsAI·12 Oca

Applications are open for the IAPS AI Policy Fellowship! 3 months, fully funded, remote or DC. This is your opportunity to work with leading AI policy experts on research projects meant to secure a positive future in a world with powerful AI. Apply by Feb 2 at the link in the thread.

Institute for AI Policy and Strategy (IAPS) tweet media

English

4.6K

Joe O’Brien@__J0E___·12 Ara

Great opportunity to work at CAISI on running evals & improving the science of evals--would love to see brilliant people in these roles!

Deb Raji@rajiinio

US CAISI is hiring -- the internal govt name is "IT Specialist" but it is effectively a research scientist role! Salary is $120,579 to - $195,200 per year & you work on AI evaluation within government agencies! Dream job for the right person. Details: lnkd.in/exJgkqr5

English

144

Joe O’Brien retweetledi

AI Evaluator Forum@aievalforum·4 Ara

Today we are announcing the creation of the AI Evaluator Forum: a consortium of leading AI research organizations focused on independent, third-party evaluations. Founding AEF members: @TransluceAI @METR_Evals @RANDCorporation @halevals @SecureBio @collect_intel @Miles_Brundage

English

171

88.2K

Joe O’Brien@__J0E___·4 Ara

@peterwildeford @metaculus "I was on the airplane" had pretty good delivery tbh

English

Peter Wildeford🇺🇸🚀@peterwildeford·4 Ara

@metaculus which one?

English

189

Peter Wildeford🇺🇸🚀@peterwildeford·4 Ara

I got interviewed for The Daily Show!

Nathan is in Berkeley 🔎@NathanpmYoung

My boy @peterwildeford got interviewed for @TheDailyShow Proud of him!

English

199

10.1K

Joe O’Brien retweetledi

Geoffrey Irving@geoffreyirving·27 Kas

Do you want to fund AI alignment research? The AISI Alignment Team and I have reviewed >800 Alignment Project Applications from 42 countries, and we have ~100 that are very promising. Unfortunately, this means we have a £13-17M funding gap! Thread with details!🧵

Geoffrey Irving@geoffreyirving

I am very excited that AISI is announcing over £15M in funding for AI alignment and control, in partnership with other governments, industry, VCs, and philanthropists! Here is a 🧵 about why it is important to bring more independent ideas and expertise into this space.

English

191

44K

Joe O’Brien retweetledi

AIWI - The AI Whistleblower Initiative@AIWI_Official·25 Kas

A massive step: The EU AI Office has launched a whistleblowing channel dedicated to AI professionals—the first of its kind globally. 1/3 #whistleblowing #euaio

AIWI - The AI Whistleblower Initiative tweet media

English

483

Joe O’Brien@__J0E___·21 Eki

RT @deanwball: If you said: “We should have real-time incident reporting for large-scale frontier AI cyber incidents.” A lot of people in…

English

Keşfet

@rosen_br @covinstantinop @robertwiblin @JustinBullock14 @iapsAI @TransluceAI @METR_Evals @RANDCorporation