Existential Risk Observatory ⏸

1.3K posts


@XRobservatory

Reducing AI x-risk by informing the public. We propose a Conditional AI Safety Treaty: https://t.co/xUZxozlNBF

Amsterdam, Netherlands · Joined March 2021
679 Following · 1.7K Followers
Pinned Tweet
Existential Risk Observatory ⏸@XRobservatory·
Today, we propose the Conditional AI Safety Treaty in @TIME as a solution to AI's existential risks. AI poses a risk of human extinction, but this problem is not unsolvable. The Conditional AI Safety Treaty is a global response to avoid losing control over AI. How does it work?
25
23
115
30K
Existential Risk Observatory ⏸
It's great if banks patch their systems against cyber attacks. Hopefully those in charge of other important infrastructure do so too. It is important to realize, though, that cyber is only one of AI's dangerous capabilities. Others are mentioned by Shevlane et al. (2023): persuasion & manipulation, political strategy, weapons acquisition, long-horizon planning, AI development, situational awareness, and self-proliferation. Defenses need to be built for those, too!
1
0
2
140
Existential Risk Observatory ⏸ retweeted
Nate Soares ⏹️
People say the mainstream media is sensationalist, but top AI leaders keep trying to raise the alarm about this tech literally ending the world and it's treated as a weird sideshow.
21
33
316
26.9K
Existential Risk Observatory ⏸ retweeted
ControlAI@ControlAI·
ControlAI CEO Andrea Miotti (@andreamiotti): Most policymakers have never even heard about the risk of extinction from superintelligence. In the UK alone, we briefed 150+ lawmakers. Over 100 publicly support us. The biggest bottleneck to solving the problem is informing people.
1
17
34
1.3K
Existential Risk Observatory ⏸ retweeted
Rob Bensinger ⏹️@robbensinger·
It's over. AI risk is no longer a niche topic. Let's talk about this, together. Let's find a way through this.
Rob Bensinger ⏹️@robbensinger

Hundreds of scientists, including 3/4 of the most cited living AI scientists, have said that AI poses a very real chance of killing us all. We're in uncharted waters, which makes the risk level hard to assess; but a pretty normal estimate is Jan Leike's "10-90%" of extinction-level outcomes. Leike heads Anthropic's alignment research team, and previously headed OpenAI's.

This actually seems pretty straightforward. There's literally no reason for us to sleepwalk into disaster here. No normal engineering discipline, building a bridge or designing a house, would accept a 25% chance of killing a person; yet somehow AI's engineering culture has corroded enough that no one bats an eye when Anthropic's CEO talks about a 25% chance of research efforts killing every person.

A minority of leading labs are dismissive of the risk (mainly Meta), but even the fact that "will we kill everyone if we keep moving forward?" is hotly debated among researchers seems very obviously like more than enough grounds for governments to internationally halt the race to build superintelligent AI. Like, this would be beyond straightforward in any field other than AI.

Obvious question: How would that even work? Like, I get the argument in principle: "smarter-than-human AI is more dangerous than nukes, so we need to treat it similarly." But with nukes, we have a detailed understanding of what's required to build them, and it involves huge easily-detected infrastructure projects and rare materials.

Response: The same is true for AI, as it's built today. The most powerful AIs today rely on extremely specialized and costly hardware, cost hundreds of millions of dollars to build,¹ and rely on massive data centers² that are relatively easy to detect using satellite and drone imagery, including infrared imaging.³

Q: But wouldn't people just respond by building data centers in secret locations, like deep underground?

Response: Only a few firms can fabricate AI chips — primarily the Taiwanese company TSMC — and one of the key machines used in high-end chips is only produced by the Dutch company ASML. This is the extreme ultraviolet lithography machine, which is the size of a school bus, weighs 200 tons, and costs hundreds of millions of dollars.⁴ Many key components are similarly bottlenecked.⁵ This supply chain is the result of decades of innovation and investment, and replicating it is expected to be very difficult — likely taking over a decade, even for technologically advanced countries.⁶

This essential supply chain, largely located in countries allied to the US, provides a really clear point of leverage. If the international community wanted to, it could easily monitor where all the chips are going, build in kill switches, and put in place a monitoring regime to ensure chips aren't being used to build toward superintelligence. (Focusing more efforts on the chip supply chain is also a more robust long-term solution than focusing purely on data centers, since it can solve the problem of developers using distributed training to attempt to evade international regulations.⁷)

Q: But won't AI become cheaper to build in the future?

Response: Yes, but —

(a) It isn't likely to suddenly become dramatically cheaper overnight. If it becomes cheaper gradually, regulations can build in safety margin and adjust thresholds over time to match the technology. Efforts to bring preexisting chips under monitoring will progress over time, and chips have a limited lifespan, so the total quantity of unmonitored chips will decrease as well.
(b) If we actually treated superintelligent AI like nuclear weapons, we wouldn't be publishing random advances to arXiv, so the development of more efficient algorithms and more optimized compute would happen more slowly. Some amount of expected algorithmic progress would also be hampered by reduced access to chips.

(c) You don't need to ban superintelligence forever; you just need to ban it until it's clear that we can build it without destroying ourselves or doing something similarly terrible. A ban could buy the world many decades of time.

Q: But wouldn't this treaty devastate the economy?

A: It would mean forgoing some future economic gains, because the race to superintelligence comes with greater and greater profits until it kills you. But it's not as though those profits are worth anything if we're dead; this seems obvious enough.

There's the separate issue that lots of investments are currently flowing into building bigger and bigger data centers, in anticipation that the race to smarter-than-human AI will continue. A ban could cause a shock to the economy as that investment dries up. However, this is relatively easy to avoid via the Fed lowering its rates, so that a high volume of money continues to flow through the larger economy.⁸

Q: But wouldn't regulating chips have lots of spillover effects on other parts of the economy that use those chips?

A: NVIDIA's H100 chip costs around $30,000 per chip and, due to its cooling and power requirements, is designed to be run in a data center.⁹ Regulating AI-specialized chips like this would have very few spillover effects, particularly if regulations only apply to chips used for AI training and not for inference.¹⁰ But also, again, an economy isn't worth much if you're dead. This whole discussion seems to be severely missing the forest for the trees, if it's not just in outright denial about the situation we find ourselves in.

Some of the infrastructure used to produce AI chips is also used in making other advanced computer chips, such as cell phone chips; but there are notable differences between these chips. If advanced AI chip production is shut down, it wouldn't actually be difficult to monitor production and ensure that chip production is only creating non-AI-specialized chips. At the same time, existing AI chips could be monitored to ensure that they're used to run existing AIs, and aren't being used to train ever-more-capable models.¹¹

This wouldn't be trivial to do, but it's pretty easy relative to many of the tasks the world's superpowers have achieved when they faced a national security threat. The question is whether the US, China, and other key actors wake up in time, not whether they have good options for addressing the threat.

Q: Isn't this totalitarian?

A: Governments regulate thousands of technologies. Adding one more to the list won't suddenly tip the world over into a totalitarian dystopia, any more than banning chemical or biological weapons did. The typical consumer wouldn't even necessarily see any difference, since the typical consumer doesn't run a data center. They just wouldn't see dramatic improvements to the chatbots they use.

Q: But isn't this politically infeasible?

A: It will require science communicators to alert policymakers to the current situation, and it will require policymakers to come together to craft a solution. But it doesn't seem at all infeasible. Building superintelligence is unpopular with the voting public,¹² and hundreds of elected officials have already named this issue as a serious priority.
The UN Secretary-General and major heads of state are routinely talking about AI loss-of-control scenarios and human extinction. At that point, the cat has already firmly left the bag. (And it's not as though there's anything unusual about governments heavily regulating powerful new technologies.) What's left is to dial up the volume on that talk, translate that talk into planning and fast action, and recognize that "there's uncertainty how much time we have left" makes this a more urgent problem, not less.

Q: But if the US halts, isn't that just ceding the race to authoritarian regimes?

A: The US shouldn't halt unilaterally; that would just drive AI research to other countries. Rather, the US should broker an international agreement where everyone agrees to halt simultaneously. (Some templates of agreements that would do the job have already been drafted.¹³) Governments can create a deterrence regime by articulating clear limits and enforcement actions. It's in no country's interest to race to its own destruction, and a deterrence regime like this provides an alternative path.

Q: But surely there will be countries that end up defecting from such an agreement. Even if you're right that it's in no one's interest to race once they understand the situation, plenty of people won't understand the situation, and will just see superintelligent AI as a way to get rich quick.

A: It's very rare for countries (or companies!) to deliberately violate international law. It's rare for countries to take actions that are widely seen as serious threats to other nations' security. (If it weren't rare, it wouldn't be a big news story when it does happen!)

If the whole world is racing to build superintelligence as fast as possible, then we're very likely dead. Even if you think there's a chance that cautious devs could stay in control as AI starts to vastly exceed the intelligence of the human race (and no, I don't think this is realistic in the current landscape), that chance increasingly goes out the window as the race heats up, because prioritizing safety will mean sacrificing your competitive edge.

If instead a tiny fraction of the world is trying to find sneaky ways to build a small researcher-starved frontier AI project here and there, while dealing with enormous international pressure and censure, then that seems like a much more survivable situation.

By analogy, nuclear nonproliferation efforts haven't been perfectly successful. Over the past 75 years, the number of nuclear powers has grown from 2 to 9. But this is a much more survivable state of affairs than if we hadn't tried to limit proliferation at all, and were instead facing a world where dozens or hundreds of nations possess nuclear weapons.

When it comes to superintelligence, anyone building "god-like AI" is likely to get us all killed — whether the developer is a military or a company, and whether their intentions are good or ill. Going from "zero superintelligences" to "one superintelligence" is already lethally dangerous. The challenge is to block the construction of ASI while there's still time, not to limit proliferation after it already exists, when it's far too late to take the steering wheel. So the nuclear analogy is pretty limited in what it can tell us. But it can tell us that international law and norms have enormous power.

Q: But what about China? Surely they'd never agree to an arrangement like this.

A: The CCP has already expressed interest in international coordination and regulation on AI.
E.g., Reuters reported that Chinese Premier Li Qiang said, "We should strengthen coordination to form a global AI governance framework that has broad consensus as soon as possible."¹⁴

And, quoting The Economist:¹⁵

"But the accelerationists are getting pushback from a clique of elite scientists with the Communist Party's ear. Most prominent among them is Andrew Chi-Chih Yao, the only Chinese person to have won the Turing award for advances in computer science. In July Mr Yao said AI poses a greater existential risk to humans than nuclear or biological weapons. Zhang Ya-Qin, the former president of Baidu, a Chinese tech giant, and Xue Lan, the chair of the state's expert committee on AI governance, also reckon that AI may threaten the human race. Yi Zeng of the Chinese Academy of Sciences believes that AGI models will eventually see humans as humans see ants.

"The influence of such arguments is increasingly on display. In March an international panel of experts meeting in Beijing called on researchers to kill models that appear to seek power or show signs of self-replication or deceit. A short time later the risks posed by AI, and how to control them, became a subject of study sessions for party leaders. A state body that funds scientific research has begun offering grants to researchers who study how to align AI with human values. [...]

"In July, at a meeting of the party's central committee called the 'third plenum', Mr Xi sent his clearest signal yet that he takes the doomers' concerns seriously. The official report from the plenum listed AI risks alongside other big concerns, such as biohazards and natural disasters. For the first time it called for monitoring AI safety, a reference to the technology's potential to endanger humans. The report may lead to new restrictions on AI-research activities.

"More clues to Mr Xi's thinking come from the study guide prepared for party cadres, which he is said to have personally edited. China should 'abandon uninhibited growth that comes at the cost of sacrificing safety', says the guide. Since AI will determine 'the fate of all mankind', it must always be controllable, it goes on. The document calls for regulation to be pre-emptive rather than reactive."

The CCP is a US adversary. That doesn't mean they're idiots who will destroy their own country in order to thumb their nose at the US. If a policy is Good, that doesn't mean that everyone Bad will automatically oppose it. Policies that prevent human extinction are good for liberal democracies and for authoritarian regimes, so clueful people on all sides will endorse those policies.

The question, again, is just whether people will clue in to what's happening soon enough to matter. My hope, in writing this, is to wake people up a bit faster. If you share that hope, maybe share this post, or join the conversation about it; or write your own, better version of a "wake-up" warning. Don't give up on the world so easily.

12
35
169
9.6K
Existential Risk Observatory ⏸ retweeted
ControlAI@ControlAI·
Professor David Duvenaud, who led AI safety testing at top AI company Anthropic, tells Canadian MPs that AIs aren't yet capable of doing "super galaxy brained long-term biding their time" to take over, but that he thinks they probably will be in 6 to 18 months.
12
17
90
6.4K
Existential Risk Observatory ⏸ retweeted
ControlAI@ControlAI·
In The Guardian: An AI security researcher reports that an AI at an unnamed California company got "so hungry for computing power" that it attacked other parts of the network to seize resources, collapsing the business-critical system. This relates to a fundamental issue in AI: developers do not know how to ensure the systems they're developing are reliably controllable. Top AI companies are currently racing to develop superintelligence, AI vastly smarter than humans. None of them have a credible plan to ensure they could control it. With superintelligent AI, the stakes are much greater than the collapse of a business system. Leading AI scientists and even the CEOs of the top AI companies have warned that superintelligence could lead to human extinction.
22
83
239
108.8K
Existential Risk Observatory ⏸ retweeted
Michaël (in DC)@MichaelTrazzi·
More than 120 people have already signed up for our March on Anthropic, OpenAI, and xAI in 5 days, asking lab CEOs for conditional pause statements! Very excited about all the great speakers & orgs joining forces for what is expected to be the largest US AI Safety protest to date.
4
13
79
14.6K
Existential Risk Observatory ⏸ retweeted
ControlAI@ControlAI·
AI godfather and Turing Award winner Yoshua Bengio tells a Canadian Senate committee: If current trends continue, AIs will surpass humans at many skills. Intelligence gives power, and that could be turned against humans by the AIs themselves. Bengio's full opening statement:
7
14
67
3.1K
Existential Risk Observatory ⏸@XRobservatory·
There are many arguments why the chance of successful regulation against an AI takeover is on the rise:
- With the current paradigm, takeoff seems slowish (years), providing time for the public and policymakers to catch up (remember how covid measures got implemented in weeks).
- Stumbling agents in control of increasingly important assets seem likely to provide many public warning shots.
- Public awareness of existential risk is clearly on the rise already (although we're not there yet).
- Models are relatively bulky, increasing the chance of regulation being enforceable.
All of this means regulation will not just be possible, but mandatory. No administration can risk being seen as weak and unable to control the situation. They will have to do something. It is likely a matter of a few years at most before we see effective AI regulation against a takeover in the US, or even globally.
2
2
9
299
Existential Risk Observatory ⏸ retweeted
PauseAI ⏸@PauseAI·
@jawwwn_ This is why we've said from the day we started that we need an international treaty. We can track AI hardware; EUV lithography is still a complete monopoly. We can pause, if we put our minds to actually doing it.
2
5
66
1.6K
Existential Risk Observatory ⏸ retweeted
PauseAI ⏸@PauseAI·
This is how Anthropic - one of the biggest AI companies - thinks AI will affect jobs. The more blue, the greater the potential for job loss. Red is where this is already happening. In a nutshell: any job that is not manual has the potential to be carried out by AI.
3
8
30
2K
Existential Risk Observatory ⏸ retweeted
Miles Brundage@Miles_Brundage·
@KateandPie People at AI companies in particular, and others with influence over them, should be trying harder to get federal AI regulation passed
0
4
32
461
Existential Risk Observatory ⏸
Even in a race, the labs have a lot more power to slow things down than they're using. "I'd like to slow down but others won't" is mostly an excuse from people whose revealed preferences show they don't actually want to slow down.
Nate Soares ⏹️@So8res

In an interview today I said AI execs don't really have the power to stop this suicide race; we need global intervention. The interviewer objected that if one of these companies simply shut down, foregoing billions and tanking lawsuits, the world wouldn't be able to ignore it.

2
0
11
383