Dewi Erwan

191 posts

Dewi Erwan

Dewi Erwan

@dewierwan_

London Katılım Şubat 2010
158 Takip Edilen889 Takipçiler
Dewi Erwan retweetledi
Dean W. Ball
Dean W. Ball@deanwball·
Some brief thoughts on Mythos We’ve known this was coming for a long time. At least, we *should* have. Extremely effective software vulnerability discovery was clearly coming to anybody paying attention. It has also been clear that all AI policy so far has been made and executed with training wheels. It was always clear that, sometime soon, the training wheels would come off. The training wheels aren’t fully off just yet—this model is being kept under lock and key, and Anthropic does not seem inclined to release Mythos preview to the public anytime soon, if ever. The training wheels will be off when these capabilities are fully diffused in ways centralized actors cannot control. It is inevitable that this will happen. The point is not to argue about whether we should “ban open source” or similarly unrealistic notions. The point is to harden the world for this new reality. I applaud Anthropic—and I especially applaud @logangraham—for doing so. But their efforts alone are not close to enough. Project Glasswing—a partnership with Anthropic and other companies—seems nice, but unsurprisingly it lacks uniform frontier lab participation. It would probably be ideal, for our national cyberdefense, if the federal government were not trying to destroy Anthropic and eliminate their models from government systems. If anything, the government should be trying to work more closely with Anthropic. As a side note, I hope Anthropic is working with state and local government entities on cyber vulnerability discovery, since many of our adversaries know that state and local is America’s soft underbelly in so many ways. In any event, the Mythos news should lay bare how stupid and counter-productive the Department of War’s feud with Anthropic really is. As someone who suspected all this was coming (not from inside knowledge but from it being ~obvious), that probably explains why I have had such a strong reaction to that feud. It’s this senseless distraction just at the time that the training wheels are coming off. I hope the two parties can resolve their differences now, for the sake of the country, but I am not hopeful. I do want to call out, however, the numerous political and career civil servants in the Trump Admin who do get these issues, know how stupid the Ant-DoW stuff is, and want to work with the frontier labs like adults. I wish you all utmost success. I find myself inclined to end on some positive notes. Mythos appears to be—according to Anthropic at least—“the most aligned” model Anthropic has ever trained. We are approaching superhuman capabilities in some domains, and yet alignment is getting better rather than worse. That’s not nothing. I know some of you think the model is faking its alignment, or aware when its alignment is being tested. I don’t have a good answer. Finally, there is this: Mythos was made by an American company, and like most successful American companies, it has a vested interest in maintaining order and peace, and it is investing substantial resources in mitigating the risks of its technological progress, as I expect most of the American labs would. This is cause for optimism: The incentives of capitalism are working. The training wheels are coming off, but at least we are the ones removing them, as opposed to our enemies. Perhaps we can be the first to learn to bike for real. The first step would be to get beyond all the low-fidelity, under-specified, pimply little fights of AI policy’s prepubescent era. That goes for me too. “What hath God wrought,” wrote the first telegram. What, indeed. In this case, the answer is still up to us.
English
64
241
2.6K
389.2K
Dewi Erwan retweetledi
tom cunningham
tom cunningham@testingham·
I think many economists agree with the following, but it would be valuable to make this publicly known: 1. There is a substantial probability (>10%) that AI will exceed human-level performance on virtually all non-physical tasks within ten years. 2. This would be an unprecedented shock to human society. 3. The economics profession should treat it with an urgency comparable to WWII or COVID.
English
40
38
382
77.5K
Dewi Erwan retweetledi
dylan matthews 🔸
dylan matthews 🔸@dylanmatt·
My coworkers on the biosecurity team are looking to direct at least $100 million this year. But they need more leads on work that could prevent or blunt the impact of a future pandemic Apply here! coefficientgiving.org/funds/biosecur…
dylan matthews 🔸 tweet media
English
3
34
112
13.1K
Dewi Erwan retweetledi
Dewi Erwan
Dewi Erwan@dewierwan_·
An upshot of Anthropic vs DoW: Seems great that almost everyone agreed in principle that we shouldn't use AI to do massive domestic surveillance and totally remove humans from military kill chains. This is a strong foundation to fall back upon in the future.
English
0
0
1
90
Dewi Erwan
Dewi Erwan@dewierwan_·
Over the past year, my friends at Anthropic told me how much they respect and admire their CEO, Dario. This weekend is when the world learnt why.
English
0
0
2
118
Dewi Erwan retweetledi
Thomas Woodside 🫜
Thomas Woodside 🫜@Thomas_Woodside·
Dean and I first met on opposite sides of one of the first major AI policy fights. He was a worthy adversary. These days, there are many areas where I find common ground with Dean, and he is a worthy ally and friend. I have enormous respect for him speaking out today. There is not much more I can add to what he and others have said. Our government has made a grave mistake, and it is a dark day for our country. AI policy is expanding. Things are getting uglier and they more closely approximate the naked exercise of power, as they were destined to when the stakes got higher. Days like these make me want to harden my heart, as did many days during 1047. We must resist that impulse. We have to fight the good fight where we must, but also embrace every chance at engagement where we can. The challenges before us can be tackled only by finding common ground, as bleak as that possibility may seem today. The gauntlet of the singularity is coming. I do not know if we can prevail, but we have to try.
Dean W. Ball@deanwball

Man, the SB 1047 debate seems so innocent and quaint right now.

English
0
4
203
10.4K
Dewi Erwan
Dewi Erwan@dewierwan_·
This is stated at the top of the application form, and people still do it. Crazy.
Dewi Erwan tweet media
English
0
0
1
54
Dewi Erwan
Dewi Erwan@dewierwan_·
80% of our job applications are written by AI. They all sound the same. Please stop.
English
1
0
1
73
Dewi Erwan
Dewi Erwan@dewierwan_·
In 2026, the AI safety community will splinter based on their answer to this question: Should we keep humanity in control of our Earth? I am on team humanity.
English
0
0
4
112
Dewi Erwan
Dewi Erwan@dewierwan_·
@willsaunter @BlueDotImpact To make this happen, I'm hiring for two roles: 1. Head of Operations to ensure the company doesn't descend into total chaos. 2. Community Lead to turn participants into lifelong collaborators. SF-based, salary $110-225k, we sponsor US visas. 5/6
English
1
4
10
1.4K
Dewi Erwan
Dewi Erwan@dewierwan_·
In 2021, I almost decided to become a youtuber. I was running events from my childhood bedroom in wales with <10 attendees. I was in (what I considered) a low-status, dead-end community building job. But I now realise it was a kickass training ground for founding a company. 1/6
Dewi Erwan tweet media
English
1
4
21
2K
Dewi Erwan
Dewi Erwan@dewierwan_·
@dylanscandinaro Best of luck! If you want help hiring AI safety/security/resilience talent, reach out — at BlueDot, we've trained 7k people and helped 100s into jobs at frontier AI companies and govts.
English
0
0
0
88