Dewi Erwan

191 posts

Dewi Erwan

@dewierwan_

London Katılım Şubat 2010

158 Takip Edilen889 Takipçiler

Dewi Erwan retweetledi

Dean W. Ball@deanwball·1d

Some brief thoughts on Mythos We’ve known this was coming for a long time. At least, we *should* have. Extremely effective software vulnerability discovery was clearly coming to anybody paying attention. It has also been clear that all AI policy so far has been made and executed with training wheels. It was always clear that, sometime soon, the training wheels would come off. The training wheels aren’t fully off just yet—this model is being kept under lock and key, and Anthropic does not seem inclined to release Mythos preview to the public anytime soon, if ever. The training wheels will be off when these capabilities are fully diffused in ways centralized actors cannot control. It is inevitable that this will happen. The point is not to argue about whether we should “ban open source” or similarly unrealistic notions. The point is to harden the world for this new reality. I applaud Anthropic—and I especially applaud @logangraham—for doing so. But their efforts alone are not close to enough. Project Glasswing—a partnership with Anthropic and other companies—seems nice, but unsurprisingly it lacks uniform frontier lab participation. It would probably be ideal, for our national cyberdefense, if the federal government were not trying to destroy Anthropic and eliminate their models from government systems. If anything, the government should be trying to work more closely with Anthropic. As a side note, I hope Anthropic is working with state and local government entities on cyber vulnerability discovery, since many of our adversaries know that state and local is America’s soft underbelly in so many ways. In any event, the Mythos news should lay bare how stupid and counter-productive the Department of War’s feud with Anthropic really is. As someone who suspected all this was coming (not from inside knowledge but from it being ~obvious), that probably explains why I have had such a strong reaction to that feud. It’s this senseless distraction just at the time that the training wheels are coming off. I hope the two parties can resolve their differences now, for the sake of the country, but I am not hopeful. I do want to call out, however, the numerous political and career civil servants in the Trump Admin who do get these issues, know how stupid the Ant-DoW stuff is, and want to work with the frontier labs like adults. I wish you all utmost success. I find myself inclined to end on some positive notes. Mythos appears to be—according to Anthropic at least—“the most aligned” model Anthropic has ever trained. We are approaching superhuman capabilities in some domains, and yet alignment is getting better rather than worse. That’s not nothing. I know some of you think the model is faking its alignment, or aware when its alignment is being tested. I don’t have a good answer. Finally, there is this: Mythos was made by an American company, and like most successful American companies, it has a vested interest in maintaining order and peace, and it is investing substantial resources in mitigating the risks of its technological progress, as I expect most of the American labs would. This is cause for optimism: The incentives of capitalism are working. The training wheels are coming off, but at least we are the ones removing them, as opposed to our enemies. Perhaps we can be the first to learn to bike for real. The first step would be to get beyond all the low-fidelity, under-specified, pimply little fights of AI policy’s prepubescent era. That goes for me too. “What hath God wrought,” wrote the first telegram. What, indeed. In this case, the answer is still up to us.

English

241

2.6K

389.2K

Dewi Erwan retweetledi

tom cunningham@testingham·1 Nis

I think many economists agree with the following, but it would be valuable to make this publicly known: 1. There is a substantial probability (>10%) that AI will exceed human-level performance on virtually all non-physical tasks within ten years. 2. This would be an unprecedented shock to human society. 3. The economics profession should treat it with an urgency comparable to WWII or COVID.

English

382

77.5K

Dewi Erwan retweetledi

dylan matthews 🔸@dylanmatt·1 Nis

My coworkers on the biosecurity team are looking to direct at least $100 million this year. But they need more leads on work that could prevent or blunt the impact of a future pandemic Apply here! coefficientgiving.org/funds/biosecur…

English

112

13.1K

Dewi Erwan retweetledi

Josh Landes@guynamedjoshl·31 Mar

@bayeslord Want to do biosecurity? Our biosec course will get you up to speed: bluedot.org/courses/biosec…

English

461

Dewi Erwan retweetledi

Anna Wang@a_nnawang·27 Mar

IMO, the biggest blocker to strong societal resilience (vs pandemics; cyber attacks; degrading cognitive security) — is world-class founders excited to work on the problem.

Wojciech Zaremba@woj_zaremba

Life update — I’m moving to the OpenAI Foundation to lead AI resilience. AGI will bring tremendous benefits and potential disruptions, such as impacts on children and youth, model malfunctions, emergent bio-risks, and more. AI resilience is about minimizing these disruptions so society can fully realize the benefits. openaifoundation.org/news/update-on…

English

6.7K

Dewi Erwan@dewierwan_·7 Mar

@austinc3301 maybe they're finally feeling the agi

English

Agus 🔸@austinc3301·5 Mar

I’d be lying if I said I have a working model of what the admin is thinking here, wtf

Andrew Curran@AndrewCurran_

Bloomberg is reporting that the White House has written draft regulations that would restrict AI chip shipments to anywhere in the world without US Government approval.

English

Dewi Erwan retweetledi

Noah Smith 🐇🇺🇸🇺🇦🇹🇼@Noahpinion·6 Mar

The recent fight between Anthropic of the Department of War illustrates a deeper truth: AI is a weapon, and it might soon the most powerful weapon ever created. noahpinion.blog/p/if-ai-is-a-w…

English

258

197.2K

Dewi Erwan@dewierwan_·2 Mar

An upshot of Anthropic vs DoW: Seems great that almost everyone agreed in principle that we shouldn't use AI to do massive domestic surveillance and totally remove humans from military kill chains. This is a strong foundation to fall back upon in the future.

English

Dewi Erwan@dewierwan_·28 Şub

Over the past year, my friends at Anthropic told me how much they respect and admire their CEO, Dario. This weekend is when the world learnt why.

English

118

Dewi Erwan retweetledi

Thomas Woodside 🫜@Thomas_Woodside·28 Şub

Dean and I first met on opposite sides of one of the first major AI policy fights. He was a worthy adversary. These days, there are many areas where I find common ground with Dean, and he is a worthy ally and friend. I have enormous respect for him speaking out today. There is not much more I can add to what he and others have said. Our government has made a grave mistake, and it is a dark day for our country. AI policy is expanding. Things are getting uglier and they more closely approximate the naked exercise of power, as they were destined to when the stakes got higher. Days like these make me want to harden my heart, as did many days during 1047. We must resist that impulse. We have to fight the good fight where we must, but also embrace every chance at engagement where we can. The challenges before us can be tackled only by finding common ground, as bleak as that possibility may seem today. The gauntlet of the singularity is coming. I do not know if we can prevail, but we have to try.

Dean W. Ball@deanwball

Man, the SB 1047 debate seems so innocent and quaint right now.

English

203

10.4K

Dewi Erwan@dewierwan_·27 Şub

This is stated at the top of the application form, and people still do it. Crazy.

English

Dewi Erwan@dewierwan_·27 Şub

80% of our job applications are written by AI. They all sound the same. Please stop.

English

Dewi Erwan@dewierwan_·14 Şub

In 2026, the AI safety community will splinter based on their answer to this question: Should we keep humanity in control of our Earth? I am on team humanity.

English

112

Dewi Erwan@dewierwan_·4 Şub

@willsaunter @BlueDotImpact These jobs would have been a dream for me back in 2021. If you're in the same boat today, OR if you know someone who could be great, I want to hear from you. Links below, and DMs open! bluedot.org/join-us/commun… bluedot.org/join-us/head-o… 6/6

English

289

Dewi Erwan@dewierwan_·4 Şub

@willsaunter @BlueDotImpact To make this happen, I'm hiring for two roles: 1. Head of Operations to ensure the company doesn't descend into total chaos. 2. Community Lead to turn participants into lifelong collaborators. SF-based, salary $110-225k, we sponsor US visas. 5/6

English

1.4K

Dewi Erwan@dewierwan_·4 Şub

In 2021, I almost decided to become a youtuber. I was running events from my childhood bedroom in wales with <10 attendees. I was in (what I considered) a low-status, dead-end community building job. But I now realise it was a kickass training ground for founding a company. 1/6

English

Dewi Erwan@dewierwan_·4 Şub

@dylanscandinaro Best of luck! If you want help hiring AI safety/security/resilience talent, reach out — at BlueDot, we've trained 7k people and helped 100s into jobs at frontier AI companies and govts.

English

Dylan Scandinaro@dylanscandinaro·4 Şub

I’m joining OpenAI as Head of Preparedness. Deeply grateful for my time at Anthropic and the extraordinary people I worked alongside. AI is advancing rapidly. The potential benefits are great—and so are the risks of extreme and even irrecoverable harm. There’s a lot of work to do, and not much time to do it!

Sam Altman@sama

I am extremely excited to welcome Dylan Scandinaro to OpenAI as our Head of Preparedness. Things are about to move quite fast and we will be working with extremely powerful models soon. This will require commensurate safeguards to ensure we can continue to deliver tremendous benefits. Dylan will lead our efforts to prepare for and mitigate these severe risks. He is by far the best candidate I have met, anywhere, for this role. He has his work cut out for him for sure, but I will sleep better tonight. I am looking forward to working with him very closely to make the changes we will need across our entire company.

English

194

544

130.3K

Keşfet

@logangraham @bayeslord @austinc3301 @willsaunter @BlueDotImpact @dylanscandinaro @elonmusk @BarackObama