thomas rodskog

1.1K posts

thomas rodskog

@ThomasRodskog

19 / ai, cs, philosophy, politics, and lacrosse / @ucsbcs / @ucsbaisafety

marin & sb Katılım Aralık 2021

1.1K Takip Edilen125 Takipçiler

Sabitlenmiş Tweet

thomas rodskog@ThomasRodskog·12 Kas

UC Santa Barbara now has an AI safety org! We're looking to host guest speakers starting in January 2026. If you work in AI safety and want to give a talk or connect with motivated students, please reach out! 📧 contact@ucsbaisafety.org 💬 or DM me Help us spread the word! 🔁

AI Safety at UCSB@ucsbaisafety

Welcome to UCSB AI Safety Club! We're a welcoming community of UC Santa Barbara students discussing and researching the technical and political challenges of AI safety. Advanced AI systems will dramatically change the world. We'd like them to change it for the better.

English

3.6K

thomas rodskog@ThomasRodskog·11h

@ThePrimeagen This is like absolutely lined up with their safety values I don't understand your point

English

ThePrimeagen@ThePrimeagen·17h

I cannot stop thinking about Anthropic today for some reason 1. They claim that they are a company that prioritizes safety first and that they are creating a model responsibly 2. we learned from the code leak that anthropic employs deceptive techniques by calling fake tools to throw off distillers... Is this lying pattern built into claude or just the harness running claude? What else are they lying about? I am a bit more concerned now.

English

262

145

4.1K

269.3K

thomas rodskog@ThomasRodskog·2d

@Miles_Brundage This was funny x.com/trq212/status/…

Thariq@trq212

@grichadev source code gets leaked 😭

English

Miles Brundage@Miles_Brundage·2d

Interesting that Anthropic did not acknowledge the leak (?)

English

16.9K

thomas rodskog@ThomasRodskog·3d

@sailaunderscore Don’t see any issue with the first statement. You’re assuming saying sorry somehow implies that she wishes she didn’t have a boyfriend and this just a very poor interpretation imo

English

589

saila@sailaunderscore·3d

I’ve had things like this happen, where you hit on a girl or maybe even explicitly ask her out, and she either says “I’m sorry I have a boyfriend” or “you’re [attractive, great, hot], but I have a boyfriend.” I would really hate to be these girls’ boyfriend.

Linda Chen@linderps

a guy approached me at dolores park yesterday he was nice. respectful. sweet. i told him i had a bf. he said thank you and left but a few minutes later i couldn’t stop thinking about it how rare it is these days for a guy to have the courage to approach and risk rejection so i went back to find him i told him i really respected it, and encouraged him to keep trying he admitted that it's a bit nerve-wracking but he does it because he doesn’t believe in dating apps so i took his number and want to help him: he’s 24, tall, good looking, works in tech, very sociable, and is looking for a girlfriend dm me if you want to meet him or send this to someone who does :)

English

633

53.3K

thomas rodskog@ThomasRodskog·3d

@notadampaul @ctjlewis Please post proof of this with Pangram. Please

English

notadampaul@notadampaul·3d

@ctjlewis you can literally input text from old novels and it'll tell you it's 100% AI lol

English

275

Lewis 🇺🇸@ctjlewis·3d

When did we start trusting these things? Early on, everyone was able to accept that they were unreliable. What changed?

Pangram Labs@pangramlabs

@inerati @GenAI_is_real @mim_djo We are confident that this document is fully AI-generated pangram.com/history/19e214…

English

1.1K

71.1K

thomas rodskog@ThomasRodskog·4d

@Ollie_Base @EAForumPosts @ryan_kidd44 Aka you might have low p(doom) and put a high probability here anyway because of the second part. Assuming you don’t consider doom to include good futures

English

thomas rodskog@ThomasRodskog·4d

@Ollie_Base @EAForumPosts @ryan_kidd44 It includes worlds where ASI has immense control over the world but does not distinguish whether that outcome is good or bad for humans

English

EA Forum Posts@EAForumPosts·4d

"Survey of AI safety leaders on x-risk, AGI timelines, and resource allocation (Feb 2026)" by OllieRodriguez (@Ollie_Base)

English

1.5K

thomas rodskog@ThomasRodskog·4d

@jxmnop This is a totally fine response what more could you want? Were you hoping for an “Um actually ☝️🤓 that’s not possible.”??

English

597

dr. jack morris@jxmnop·4d

imagine reading this and still thinking we’re months away from automating all white collar work

English

135

255

3.8K

261.4K

thomas rodskog@ThomasRodskog·4d

Alignment between humans is a very real and important concept though? Aligning CEOs with shareholders, employees with employers, elected officials with the public, etc. This is a difficult (unsolved) problem on its own. It becomes increasingly difficult when the agent, in this case, an AI one, is significantly more powerful than the people who want to align it. Do you have a different understanding of the alignment problem?

English

icpolicy@icpolicy·4d

@austinc3301 @Duderichy AI alignment is about as real as alignment between two distinct humans. ie, it may occur, but only to a degree. It's as undesirable as constraining the individual thought and action of humans for all of the same reasons.

English

thomas rodskog@ThomasRodskog·5d

@rihim_s I’m guessing your benchmark is saturated before mine but maybe I’m washed

thomas rodskog@ThomasRodskog

i’ll admit AGI is here when a model can beat me in 1v1 hypixel bridge duels

English

rihim@rihim_s·5d

the day a general computer use model can reliably beat every geometry dash level is when we get AGI

English

thomas rodskog@ThomasRodskog·5d

@rihim_s Nope

English

rihim@rihim_s·6d

@ThomasRodskog have you even received any updates

English

rihim@rihim_s·6d

when am i supposed to receive my superbowl openai merch cause it still hasn’t shipped

English

thomas rodskog@ThomasRodskog·26 Mar

@justalexoki This is better than those groupchats

English

109

taoki@justalexoki·25 Mar

friends and friends of friends! welcome to taoki city! introduce yourself

English

753

75K

thomas rodskog@ThomasRodskog·24 Mar

@jdc @dwarkesh_sp No just the secret link from the background noise dueing that time segment. Was not that interesting and since Claude can do it seems like a pointless thing to include idk

English

jon cooper@jdc·24 Mar

@ThomasRodskog @dwarkesh_sp One-shotted this? huggingface.co/jane-street/do…

English

Dwarkesh Patel@dwarkesh_sp·20 Mar

There's more to my latest Jane Street mid-roll than meets the eye. Might be worth another listen... Timestamp 1:31:42

English

220

55.8K

thomas rodskog@ThomasRodskog·24 Mar

Did you not read the post he just replied to?? It was a question about “LLM's ability to genuinely reason or extrapolate its corpus in order to create new theories or findings,” so he provided evidence for that. Obviously math skills don’t make AI able to replace all human cognitive work by default.

English

Up@Up081007906243·24 Mar

@tenobrus @HowlingToad @GalvinAlmanza Guys who thinks being successful at a people job is an equivalent skill to being good at maths. Embarrassing

English

223

Emily Galvin-Almanza@GalvinAlmanza·24 Mar

I don't get how people are planning to sidestep the very basic problem that if you don't have junior hires right now, you won't have experienced people 5 or 10 years later.

CG@cgtwts

Anthropic CEO: “50% of all entry-level Lawyers, Consultants, and Finance Professionals will be completely wiped out within the next 1–5 years." grad students and junior hires are cooked.

English

912

3.6K

35.8K

2.4M

thomas rodskog@ThomasRodskog·24 Mar

@tenobrus @buccocapital I love college. I hope we have much more of it in 18 years, though I expect it will look very different from what we consider college today (focused on personal development rather than training for a career)

English

488

Tenobrus@tenobrus·24 Mar

@buccocapital dawg college is of extremely questionable value *today* there is literally zero chance it will exist as a relevant educational institution in 18 years

English

185

7.3K

BuccoCapital Bloke@buccocapital·24 Mar

Pretty shocked how confident people are they don’t need to save for their kid’s education….

BuccoCapital Bloke@buccocapital

@0xLuxi You won’t even be able to put two kids through college in 20 years with that

English

471

132.9K

thomas rodskog@ThomasRodskog·24 Mar

@deepfates @herbiebradley Surely not. Hence the protesters calling for a conditional pause (which will need an international treaty)

English

🎭@deepfates·24 Mar

@herbiebradley Surely China will take a time out to let U.S. think...

English

163

🎭@deepfates·23 Mar

ngl this framing just makes it cooler

English

thomas rodskog@ThomasRodskog·23 Mar

@chatgpt21 This protest had nothing to do with the economy to be clear

English

Chris@chatgpt21·22 Mar

I fear these protests will only get bigger and more violent at the inflection point in 2028 until these systems are able to bring wide spread economic benefits to everyday people Until we reach a full unison implementation of AGI the prequel will be rough. But beneficial to those who employ the technology

Michaël Trazzi@MichaelTrazzi

It's fucking happening

English

128

10.5K

thomas rodskog@ThomasRodskog·23 Mar

@Enscion25 No? They’re not protesting the Chinese labs? You get an international treaty with good verification to work with China. But the U.S. needs to be on-board first. Don’t know what you mean by compression.

English

Nek@Enscion25·23 Mar

@ThomasRodskog Do you think China cares about a handful of protestors in the West ? Compression will lead to a far more stable type of alignment, something that we couldn't achieve

English

Nek@Enscion25·22 Mar

Can't wait for Ai to replace them all I'd rather have a future with multiple instances of GPT 10, Claude, and Grok than to have decels like these

Michaël Trazzi@MichaelTrazzi

On our way to OpenAI!

English

477

12.3K

Keşfet

@ThePrimeagen @Miles_Brundage @sailaunderscore @notadampaul @ctjlewis @Ollie_Base @EAForumPosts @ryan_kidd44