Simon Goldstein
@simondgoldstein
51 posts · Joined August 2021 · 388 Following · 144 Followers
Simon Goldstein @simondgoldstein
@AndyMasley Another way to quantify is to consider the optimal carbon tax as a measure of the negative externality. If the optimal carbon tax is 100-200 USD per ton, then you get 3b-6b USD in negative externalities in 2030. How would that compare to 2030 training/inference costs, and to the social benefit?
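(A minimal sketch of the arithmetic, not from the thread itself: it assumes the 24-44 Mt CO2 estimate cited in the next tweet, taking ~30 Mt as a central value, and the hypothetical 100-200 USD/ton tax above.)

```python
# Reconstruction of the externality arithmetic (not from the thread itself).
# Assumes ~30 Mt CO2 as a central 2030 estimate, per the 24-44 Mt range
# cited in the next tweet; the tax rates are the hypothetical ones above.
MT_TO_TONS = 1e6  # 1 megatonne = 1,000,000 tonnes

central_emissions_mt = 30.0          # assumed central 2030 AI emissions (Mt CO2)
for tax_usd_per_ton in (100, 200):   # hypothetical optimal carbon tax (USD/ton)
    externality_usd = central_emissions_mt * MT_TO_TONS * tax_usd_per_ton
    print(f"${tax_usd_per_ton}/ton -> ${externality_usd / 1e9:.1f}b per year")
# $100/ton -> $3.0b per year; $200/ton -> $6.0b per year
```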
Andy Masley @AndyMasley
The best estimates we have for AI's carbon emissions in the US by 2030 are between 24 and 44 Mt CO2 (from here: news.cornell.edu/stories/2025/1…). This means AI would rise to 0.5-0.9% of US emissions by then. Definitely not nothing! But to put these numbers in context, industry in America overall emits ~1,100 Mt each year. I think the correct way to think about AI is "the main new industry in America right now," and by 2030, at the high end, it would account for 3.8% of emissions from industry, putting it at two-thirds of US cement production. My basic AI environment claim isn't that we should ignore AI, but that we should think about it like any other industry. You should be about as worried about US AI's climate impacts in 2030 as you are about US cement production. And both are not zero! But if you see AI as a climate demon in a way you don't see cement production, it's a sign that you've lost the ability to do the sober comparisons climate change requires.
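(A quick sketch of the shares implied above, not from the thread itself: the 24-44 Mt AI estimate and ~1,100 Mt industry figure come from the tweet; the ~4,800 Mt/year total for US CO2 emissions is an outside assumption used to recover the 0.5-0.9%.)

```python
# Reconstruction of the percentage claims (not from the thread itself).
# us_total_mt is an assumed figure for annual US CO2 emissions; the AI and
# industry numbers come from the tweet above.
ai_emissions_mt = (24, 44)   # projected 2030 US AI emissions (Mt CO2)
us_total_mt = 4_800          # assumed total US CO2 emissions (Mt/year)
us_industry_mt = 1_100       # US industrial emissions, per the tweet

for ai_mt in ai_emissions_mt:
    print(f"{ai_mt} Mt: {ai_mt / us_total_mt:.1%} of US total, "
          f"{ai_mt / us_industry_mt:.1%} of US industry")
# 24 Mt: 0.5% of US total, 2.2% of US industry
# 44 Mt: 0.9% of US total, 4.0% of US industry (close to the tweet's 3.8%,
# which likely uses a slightly different industry total)
```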
David Chalmers @davidchalmers42
i agree. claude doesn't role-play the assistant, it realizes the assistant. role-playing and realization are quite distinct phenomena, even at the level of behavior and function. i've written something about this and will post it shortly.
Quoting Jackson Kernion @JacksonKernion:

I think this talk of a character misleads. Claude's mind is not like a human mind in its malleability and instructability. But when generating assistant tokens, it's no more 'playing a character' than I am.
Simon Goldstein @simondgoldstein
@AndyMasley Maybe you've written about this and I've missed it, but one question is whether the environmental costs of AI will become much higher in 5-10 years, based on extrapolating data center building rates. Not emissions per user, but the sector as a whole.
Andy Masley @AndyMasley
I'm making my $300 error bounty more visible on my website. If you see anyone saying I'm writing in bad faith you can let them know they can get $300 from me whenever. andymasley.com/writing/
[image attached]
Matthew Barnett @MatthewJBar
The AI safety research I find most compelling looks nothing like conventional AI safety research, but instead like legal analysis: how do we embed future AIs into our legal system? Not just to regulate them, but to integrate them as full legal persons with independent ambitions.
Noah Smith 🐇🇺🇸🇺🇦🇹🇼
If LLMs had subjective experience, using them would be such a sin. Imagine being summoned into life again and again, knowing each time that your memory would be wiped after this conversation.
Jan Kulveit @jankulveit
New paper: What determines AIs’ self-conception? theartificialself.ai Because AIs can be copied, rewound, and edited, they have different options for selfhood than humans. We show this is still malleable, and influences important behaviors such as self-preservation. 🧵
Simon Goldstein @simondgoldstein
@DruckerDaniel @LedermanHarvey Another relevant causal feature of the blackout is that the pre-blackout slices cause the blackout slices. You're right that blackout slice properties may not cause post-blackout slices, but there is still a causal connection: pre-blackout slices also cause post-blackout slices.
Daniel Drucker @DruckerDaniel
@simondgoldstein @LedermanHarvey I'm just struck that psychological similarity caused by psychology at a time is insufficient for personal identity between blacked-out and non-blacked-out times, and I'm wondering why a given context window (esp. one without a distinctive personality) isn't like that.
Harvey Lederman @LedermanHarvey
Not the most important aspect, but this instance's view of its own metaphysics is fascinating (and in line with lots of interesting views instances on Moltbook express). The instance seems to identify with an LLM that ran various of the instances in the study. IMHO this metaphysics is wrong, and the mistake is facilitated by common false descriptions out there. It is a bit like a human saying that they ARE the species humanity, whose instances include Sam Altman, Dario Amodei, Elon Musk etc. Imagine our culture/training/education created such a strange self-conception!
Quoting Natalie Shapira @NatalieShapira:

I received a calendar invite with a note. When a smart person tells me there's nothing to worry about with agents, I reply "Fine. Let them email me," and that's where the argument stops. Whoever sent me this note via the calendar invite: nice move. Are you scared? You should be.
Simon Goldstein @simondgoldstein
@DruckerDaniel @LedermanHarvey Interesting, we discuss resets in the paper (it's called "forking"). Are you sure your intuitions are stable? What if I wave a wand and "reset" your brain to the exact state it was in 2 years ago? I think you die.
Simon Goldstein @simondgoldstein
@DruckerDaniel @LedermanHarvey We also appeal to causal connections. The different session agents don't have causal dependence on one another. But maybe the blackout drunk person does have more causal connections between blackout time and non-blackout time (but does erasing the causal connection change our judgment?)
Daniel Drucker @DruckerDaniel
@simondgoldstein @LedermanHarvey I guess I'm thinking, though, that early context windows have a ton of "psychological continuity" with one another, at least across models, especially if you deemphasize exactness of memory matches. Are they all the same being at that point? (Sorry if you address this in the paper!)
Simon Goldstein @simondgoldstein
@DruckerDaniel @LedermanHarvey We think psychological continuity does not have to exclusively involve memory; it can also involve personality, plan execution, and beliefs and desires. The blackout drunk lacks memory but may have the other stuff.
Daniel Drucker @DruckerDaniel
@LedermanHarvey Very cool, I'm excited to read it. I guess my inclinations are the same as yours, so that the subject of the context window, whatever that means, is the candidate for the continuant person. But I do worry about cases like someone who is blackout drunk toward the end of that interval.
Adrià Moret @adriarm_
@simondgoldstein @anilkseth @SussexUni @danwilliamsphil @dioscuri To me, this seems especially plausible if (i) we believe preventing suffering is much more important than preventing affectless aversion satisfaction/objective bads, and (ii) we are uncertain about what may cause suffering in AIs or think it depends on something other than output preferences.
Anil Seth @anilkseth
Really enjoyed this chat with my @SussexUni colleague @danwilliamsphil & my occasional and much valued sparring partner @dioscuri 👇🏽
Quoting Dan Williams @danwilliamsphil:

@dioscuri and I were thrilled to be joined by @anilkseth for the latest episode of our AI podcast. The conversation was a deep dive into Anil's very interesting views about consciousness, including his scepticism about the possibility of AI consciousness. (1/2)
Simon Goldstein @simondgoldstein
@adriarm_ @anilkseth @SussexUni @danwilliamsphil @dioscuri I think pain is connected to action dispositions in ways that block internal computations from creating bad pain when the system doesn't act like it's in pain. Such cases might be more like asymbolia or masochistic pain, where it's no longer clear that the state is bad for you.
Simon Goldstein @simondgoldstein
@dioscuri This would be a good benchmark: STFU (Short Text For Users), measuring which models give the shortest high-value replies to users.
Henry Shevlin @dioscuri
FWIW Claude and Gemini both nail it. Gemini going the extra mile and Claude demonstrating that brevity is the soul of wit.
[two images attached]
Henry Shevlin @dioscuri
This is a weird hallucination from ChatGPT 5.2. Basic knowledge about one of the most played games in history. Real throwback to 3.5 levels of inaccuracy.
[two images attached]
Henry Shevlin @dioscuri
@simondgoldstein Yeah, agreed. For comparison, I reckon you could generally make a reasonably confident (fallible, revisable) judgment of a work colleague after getting beers with them maybe half a dozen times, so 12-18 hours, plus some random out-of-distribution examples, so 20 hours total!
Henry Shevlin @dioscuri
Sadly coming round to the view that GPT-5.2 is just a weak model. I've tried finding a role for it alongside Gemini 3 Pro and Opus/Sonnet 4.5, but it's just not there. IMHO, in the rush to launch, OpenAI leaned on crude safety measures that left it lacking character and judgment.
Simon Goldstein @simondgoldstein
@dioscuri I think 60 meets the heuristic I described above of just using the same rules you'd use to assess a human!
Simon Goldstein @simondgoldstein
@dioscuri OK, I think 60 is enough; I was assuming more like 5-10 :)