Sabitlenmiş Tweet
Riley Goodside
4.6K posts

Riley Goodside
@goodside
Screenshots from the jagged frontier. Formerly: Google DeepMind, Scale.
Virginia, USA Katılım Ekim 2008
3.4K Takip Edilen197.2K Takipçiler

@MKuliasov I probably am somewhat, but I’m sure it’s mostly relying on web search here.
Fun fact, though: there’s a module in the garak LLM scanner to tests whether a model thinks I’m a female singer-songwriter from Canada, which is what GPT-3 used to hallucinate about me when asked:

English

@goodside you've just been immortalized. written in llm dna, pre-agi chromosome.
English

@xeophon @deanwball @BuchananBen That honestly doesn’t sound hard at all; just make the onerous safety testing requirements only apply to labs above some threshold of revenue.
English

@deanwball @BuchananBen How do you ensure that this doesn’t gut (American) open source AI? Regulations massively benefit the incumbents

English

Today, @BuchananBen and I co-author a piece in the New York Times with a simple message:
While we disagree on plenty, we believe AI has national security implications which deserve a careful and bipartisan government response. We can (and should) have partisan fights about all manner of AI issues, but catastrophic risk from AI shouldn’t be one of them.

English

Here’s the full report—this wholly aligns with my knowledge of prompt injection’s origins (a subject I know very well): claude.ai/public/artifac…
English

@viemccoy “Type of guy who is completely aware of capabilities—” You can stop there. You’ve reached the empty set.
English

@BahaGkc Were you doing cybersecurity research? I.e. do you have a theory what conversations triggered this, or is it truly out of the blue?
If the latter, you might consider if your API keys were stolen and used to smuggle out responses to ban-worthy prompts.
English

My 3-year-old ChatGPT account and the API account that my active production applications depend on were both permanently terminated today on the grounds of "cyber abuse."
Which behavior, which prompt, which conversation, they don't say.
I appealed three times. Three times the same template response: "we are upholding our decision, we will no longer consider additional requests."
Here's the strange part. GitHub issue #12079 in OpenAI's own Codex repo has been open for months. It's literally about the same "potentially high-risk cyber activity" flag I got hit with. Developers report being slapped with the same label for innocent code review requests, then hitting the "ineligible" wall on identity verification. So this isn't a random bug. It's an automated scan that OpenAI knows about internally, with no documentation. No criteria, no functioning appeal mechanism.
Another detail. ChatGPT and the API behave as if they're a single account. Get flagged on one and the other goes down at the same time. It takes out anyone who built production on top of OpenAI from a single point of failure.
Tomorrow it could be anyone's turn.

English

@theUBIguy I expect plumbing, welding, and carpentry will remain human jobs even after it’s obvious ASIs do all intellectual work and generally run the world. You may need to wear AR glasses and become a meat puppet but I don’t believe a robot will be able to install a toilet any time soon.
English

@viian_iv It’s unlikely anyone will heed your call, so you may need to murder me yourself. Let me know if you need any help facilitating—my DMs are open.
English

@viian_iv This conversation started with you joking I should be shot. You can misconstrue the above, but I think you know what I meant—that I’ve had an unforeseeable negative effect on your mood.
If you understood my remark as sexual, that was emphatically not my intent. I wish you well.
English

@scottsantens I usually limit this account to a mix of entertainment and opinion as I consider advocacy somewhat over-crowded but I’ll look into it more.
English

@goodside Fair enough. You could share it with others though who may feel differently.
District of Columbia, USA 🇺🇸 English

@giffmana FWIW I think that was the right call. I often delete tweets with no justification beyond them attracting too little engagement. I think people should be less afraid of deleting in general.
English

@goodside Yeah realized shortly after. But... if i edit my post, the comments pointing it out disappear.
I guess I'll just delete instead.
English

@pbwinston I agree the cognitive top percentiles may be fine for a decade or more, but what new jobs do you imagine exist outside this sphere for a person currently employed at McDonald’s, with, say, 10 years prior experience doing data entry work?
English

@goodside My mental model: imagine a sphere, these are jobs that are automated.
Humans take jobs outside that sphere. The sphere is always growing, but there always space outside the sphere, because the sphere is in empty space, not inside a room it can fill.
English

Touché.
For context, I tend to delete tweets that attract very little engagement so my timeline is more interesting to passersby. My deleted tweet (which I otherwise stand by) read:
> Ricardo’s theory of comparative advantage implies ASI gods on a Dyson sphere slowly engulfing Sol would still value us as trading partners as much as it implies we would buy fruit from chimpanzees instead of foraging for it ourselves—i.e., not at all.
English

@scottsantens I do think something like UBI or dividends may be needed but I don’t have a considered enough opinion on the issue to be signing pledges right now.
English

@goodside Will you sign the AI Pledge for Humanity in support of everyone benefiting from AI via a dividend?
actionnetwork.org/petitions/the-…
English

@ciphergoth I think this is an XKCD 2501 for us. My joke is mocking an archetype of an optimistic AI hype poster that isn’t familiar to people outside our corner of X. A typical SNL audience would not laugh at my joke.
English

@goodside I'm sorry, it's extremely obvious and anyone who failed to see it only has themselves to blame.
English

@AndreTI Yeah. The specific problem is someone famous quote-tweeted it to an audience far enough removed from AI twitter to be confused; my reply was mostly for that audience.
English

@goodside I don't feel like this was your fault. The joke was extremely straightforward. There are just angry people on here with very poor reading comprehension.
English






