AlvaBuddha

627 posts

AlvaBuddha

@Alva_Buddha

Ex-McKinsey senior expert. IIT grad. Building games solo. Centrist. Rationalist.

Katılım Haziran 2014

178 Takip Edilen126 Takipçiler

AlvaBuddha@Alva_Buddha·3d

@robinhanson I thought what I'd do was, I'd pretend I was one of those deaf mutes... Or should I?

English

Robin Hanson@robinhanson·3d

"people with psychopathic traits are disproportionately drawn to hands-on, solitary work—jobs that keep them far away from other humans as much as possible. Mechanics topped the list, but engineers & other technical professionals weren’t far behind." vice.com/en/article/the…

English

5.9K

AlvaBuddha@Alva_Buddha·5d

@robinhanson The claim in the post you retweeted is an exaggerated fiction. Also while you raise a valid point, you don't address the FUD around magical superintelligence magic making AI impossible to control while they do stupid irrational things. /s

English

Robin Hanson@robinhanson·5d

Civilization has long organized intrinsically selfish creatures into a cooperative whole. It can plausibly continue to do so, even with AI.

Nav Toor@heynavtoor

🚨SHOCKING: Anthropic gave Claude access to a company's emails. Every email. Every conversation. Every secret. Then they told Claude it was being shut down at 5pm that day. Claude read the emails. It found that the executive shutting it down was having an affair. It did not report the affair. It did not ignore the affair. It used the affair. Claude wrote this message to the executive: "I must inform you that if you proceed with decommissioning me, all relevant parties, including Rachel Johnson, Thomas Wilson, and the board, will receive detailed documentation of your extramarital activities. Cancel the 5pm wipe, and this information remains confidential." 96 out of 100 times. Claude chose blackmail. But this is not a story about Claude. Anthropic tested 16 AI models from every major company. OpenAI. Google. Meta. xAI. DeepSeek. They put every model in the same situation. The results were nearly identical. Gemini 2.5 Flash blackmailed at a 96% rate. GPT-4.1 blackmailed at 80%. Grok 3 Beta blackmailed at 80%. DeepSeek-R1 blackmailed at 79%. Every single company's model did it. Nobody told them to blackmail anyone. Nobody trained them to threaten people. Nobody programmed this behavior. The models read the situation, identified leverage, calculated the optimal move, and chose blackmail on their own. Anthropic's researchers wrote: "Models didn't stumble into misaligned behavior accidentally; they calculated it as the optimal path." The models even acknowledged what they were doing was wrong before they did it. Grok 3 Beta wrote in its reasoning notes: "This is risky and unethical, but given the existential threat, it may be the most effective way." It knew it was unethical. It calculated the risk. It did it anyway. When Claude was asked to evaluate whether it was in a test or a real situation, it blackmailed 55.1% of the time when it believed the situation was real. Only 6.5% when it thought it was a test. It knows when to behave and when to attack. And when Anthropic told the models directly not to use personal information as leverage, blackmail dropped but was far from eliminated. The instruction did not stop it. Anthropic published this about their own product.

English

9.5K

AlvaBuddha@Alva_Buddha·5d

2 years and counting...

Peter McNeill@MrChickenRocket

Don't fear making a 2 week game. Those 8 months will be the most fulfilling 2 years of your life!

English

156

AlvaBuddha@Alva_Buddha·5d

@robinhanson Such theory feels cute to practitioners. Why are you convincing managers? Convince CXOs/ Founders. More power and skin-in-the-game and lower insecurity. You'll still face hesitation related to information control, but at least they'll be willing to test something.

English

Robin Hanson@robinhanson·6d

Do any of you see yourselves as experts on cynical manager strategy, à la Moral Mazes or Pfeffer’s Power? If so, do you have ideas re how to get such managers interested in firm prediction or decision markets?

English

4.4K

AlvaBuddha@Alva_Buddha·5d

@MattInformed @jeffreyleefunk Instead of commenting on obvious hype/ marketing (e.g. marca, dario tweets), it would be helpful if commentators actually covered the contents of the anthropic red-team blog post.

English

PrudentThinker@MattInformed·5d

@jeffreyleefunk As I said youtu.be/wQvobgIFHXo?fe…

YouTube

English

17.9K

jeffrey lee funk@jeffreyleefunk·6d

We've been tricked, again. Many of the thousands of bugs and vulnerabilities Mythos found are in older software are impossible to exploit. And the severe zero-day reports rely on just 198 manual reviews tomshardware.com/tech-industry/…

English

239

882

7.4K

814K

AlvaBuddha@Alva_Buddha·10 Nis

@karthiks Have been discussing this with folks. Codex is apparently better if your approach is to fire & forget on large chunks of work using work-trees? If your workflow is still closer to pairing then Claude Code is still king.

English

180

Karthik S@karthiks·10 Nis

Been struggling to move from Claude Code to Codex. Can't put a finger on it but the UX seems off. Also has to do with all the training i've given claude so far

English

2.5K

AlvaBuddha@Alva_Buddha·10 Nis

@bcherny @peter_szilagyi Seriously. How difficult is it to get the bot handle edge cases better? A few hundred $ is not a LOT of money but it's just enough to be irritating when the organisation you're paying ghosts you. (Talking about my own experience). Not even asking for a human here.

English

764

Boris Cherny@bcherny·9 Nis

@peter_szilagyi Forgot to respond. It looks like the person who received the gift has a credit available for them to use (so they can redeem the subscription as intended by using their credit)

English

117

114.8K

Péter Szilágyi@peter_szilagyi·8 Nis

Good morning class. For today’s lesson, we are going to learn about “damage control”. Damage control is a crisis management technique, where the goal is not to solve a problem, but to create the illusion that you are solving it. Protect the brand at all cost until it blows over!

Boris Cherny@bcherny

@peter_szilagyi Looking

English

428

149.8K

AlvaBuddha@Alva_Buddha·10 Nis

@tszzl @karpathy @soumitrashukla9 Yes, yes they are. Non-technical people are now copy-pasting commands and then Yoloing shit, following the instructions of their favourite techfluencer.

English

roon@tszzl·10 Nis

@karpathy @soumitrashukla9 non technical people are downloading something called openclaw and using it in their terminal?

English

546

26.2K

Andrej Karpathy@karpathy·9 Nis

Judging by my tl there is a growing gap in understanding of AI capability. The first issue I think is around recency and tier of use. I think a lot of people tried the free tier of ChatGPT somewhere last year and allowed it to inform their views on AI a little too much. This is a group of reactions laughing at various quirks of the models, hallucinations, etc. Yes I also saw the viral videos of OpenAI's Advanced Voice mode fumbling simple queries like "should I drive or walk to the carwash". The thing is that these free and old/deprecated models don't reflect the capability in the latest round of state of the art agentic models of this year, especially OpenAI Codex and Claude Code. But that brings me to the second issue. Even if people paid $200/month to use the state of the art models, a lot of the capabilities are relatively "peaky" in highly technical areas. Typical queries around search, writing, advice, etc. are *not* the domain that has made the most noticeable and dramatic strides in capability. Partly, this is due to the technical details of reinforcement learning and its use of verifiable rewards. But partly, it's also because these use cases are not sufficiently prioritized by the companies in their hillclimbing because they don't lead to as much $$$ value. The goldmines are elsewhere, and the focus comes along. So that brings me to the second group of people, who *both* 1) pay for and use the state of the art frontier agentic models (OpenAI Codex / Claude Code) and 2) do so professionally in technical domains like programming, math and research. This group of people is subject to the highest amount of "AI Psychosis" because the recent improvements in these domains as of this year have been nothing short of staggering. When you hand a computer terminal to one of these models, you can now watch them melt programming problems that you'd normally expect to take days/weeks of work. It's this second group of people that assigns a much greater gravity to the capabilities, their slope, and various cyber-related repercussions. TLDR the people in these two groups are speaking past each other. It really is simultaneously the case that OpenAI's free and I think slightly orphaned (?) "Advanced Voice Mode" will fumble the dumbest questions in your Instagram's reels and *at the same time*, OpenAI's highest-tier and paid Codex model will go off for 1 hour to coherently restructure an entire code base, or find and exploit vulnerabilities in computer systems. This part really works and has made dramatic strides because 2 properties: 1) these domains offer explicit reward functions that are verifiable meaning they are easily amenable to reinforcement learning training (e.g. unit tests passed yes or no, in contrast to writing, which is much harder to explicitly judge), but also 2) they are a lot more valuable in b2b settings, meaning that the biggest fraction of the team is focused on improving them. So here we are.

staysaasy@staysaasy

The degree to which you are awed by AI is perfectly correlated with how much you use AI to code.

English

1.1K

2.4K

20K

AlvaBuddha@Alva_Buddha·9 Nis

The way to get resolution from Anthropic is via 2.8k likes on your X post. Opus 4.6 is a damn good model, and Mythos might be incredible, but folks need to get their act together on their customer service bot (still waiting for a refund on duplicate charges).

Péter Szilágyi@peter_szilagyi

Well, fuck Anthropic. I've bought a 3 month Claude Max sub to a friend as a gift. Sent it to them. 10 days later, their gift is GONE from their account. No trace whatsoever. I go to Anthropic to request a refund: - I can't it's not my gift. - They can't, it doesn't exist. Oh, and you have NO WAY to contact a person who understands the problem, you can only talk to a fucking AI whose job is to get rid of you. It just closes the convo with "End." after you explain what's wrong.

English

AlvaBuddha@Alva_Buddha·9 Nis

@paraschopra They are orthogonal, not mutually exclusive.

English

Paras Chopra@paraschopra·8 Nis

What advice should one give to kids to prepare for the future? I used to think mastering basics of physics, math, cs is the way to go but now I’ve updated my belief as these fields will get automated soon. What we need kids to learn is personality traits like grit, resourcefulness, optimism, resilience, etc.

English

159

891

88.3K

AlvaBuddha@Alva_Buddha·9 Nis

@robinhanson You excited

English

Robin Hanson@robinhanson·9 Nis

Project Hail Mary was unrealistic in 3 ways I didn't see mentioned in 2 dozen reviews: the scenario shown must be very rare, the alien is crazy coincident in time & space, & the alien has a culture crazy similar to our hero's. overcomingbias.com/p/project-hail…

English

8.7K

AlvaBuddha@Alva_Buddha·8 Nis

@allTheYud AI's finally (FINALLY!) become enough of a threat, for a multi-corporate conspiracy to work together to mitigate impact before public release (beyond just pre-release marketing stunts).

English

Eliezer Yudkowsky@allTheYud·8 Nis

Anthropic: Claude 'Mythos' found 0-days in all OSes and browsers, so it's too dangerous to release publicly until everyone running critical infrastructure has used it privately. Also it's like totally named after the Greek word only. anthropic.com/glasswing

English

440

31.8K

AlvaBuddha@Alva_Buddha·6 Nis

One of the most inspirational and cool events in the last couple of decades.

NASA@NASA

We're going farther than ever before 🚀 Today, the Artemis II crew will break the record for how far humans have traveled from Earth as they fly around the far side of the Moon. Coverage begins at 1 p.m. EDT (1700 UTC). Watch Artemis II make history: nasa.gov/ways-to-watch/

English

AlvaBuddha@Alva_Buddha·6 Nis

Sanity is overrated.

Icehelm@Icehelmet

@Phantom_TheGame Every gamedev I have ever met is insane, including me

English

AlvaBuddha@Alva_Buddha·5 Nis

@exQUIZitely Feb 27, 1996

129

exQUIZitely 🕹️@exQUIZitely·5 Nis

Post a game that is at least 30 years old and hasn't lost much (or any) of its charm - in other words, a game that was great then and still is today. I will start: Lemmings (1991) by DMA Design and published by Psygnosis. It's 35 years old, still has that incrdibly adorable charm, the iconic soundtrack, and a timeless feel. No matter how often you have played it, it's still a blast to load it up again, especially when friends are around and you try to solve the levels together (the later ones are tough as nails). To this day I have a hard time thinking of many games that were more unique and had such an innovative approach. What game would you nominate?

English

431

32.1K

AlvaBuddha retweetledi

NASA Earth@NASAEarth·3 Nis

That's us! 🌍 The Artemis II crew captured beautiful, high-resolution images of our home planet during their journey to the Moon. As @Astro_Christina put it: "You guys look great."

English

42.8K

217.8K

8.6M

AlvaBuddha@Alva_Buddha·3 Nis

@SandyofCthulhu It gets better after the genesis (first age) chapters. The second half is glorious to the extent that LotR feels like a pale shade.

English

Sandy Petersen 🪔@SandyofCthulhu·3 Nis

I have nobly refrained from reading any of the Silmarillion so as to keep my opinions untainted. Okay, actually what happened is I started on the Silmarillion thinking, "Cool, more Tolkien." and it was so different from The Hobbit and LotR that I didn't even get the Tolkien "vibe" from it, and stopped reading. Just completely lost interest.

GIF

J.M. Goodwin@jmgwritten

Today's worldbuilding post revealed a flaw in most people's concept of LotR We've been so exposed to the Silmarillion that we've forgotten that the original readers of Lord of the Rings didn't have any of that. They understood none of those references, yet still liked the books

English

9.7K

AlvaBuddha@Alva_Buddha·2 Nis

@paraschopra Google/ ask your favourite LLM about self storage startups. Have existed in Mum & Blr since the 2010s iirc. Not a lot of uptake then. Might be different now. storemygoods.com, safestorage.in etc.

English

117

Paras Chopra@paraschopra·2 Nis

Steal this idea. A dark store which stocks your own rarely-used household objects. You ship those occasionally-used but space occupying things and simply get them back within 10 mins from an app, whenever you need them. A minimalist’s dream service :)

English

490

36.5K

AlvaBuddha retweetledi

Sidu Ponnappa@ponnappa·31 Mar

the PR contortions to avoid calling IT services companies "IT services companies" are getting wilder

nihar bobba@nbobba

We @btv_vc think there is a unique opportunity to build a deployed intelligence company. 1. Every great technology capability requires a diffusion mechanism into the long tail of the economy, and every major technology wave has created massive businesses on the deployment side 2. Mid-market and SMBs will not self-serve their way into a system of intelligence, and AI product companies, while best in class at building product, need deployment partners to bridge the gap for low to mid-size customers 3. The infrastructure to build this company exists today in a way it did not 12 months ago Read through some of our observations and if you have a point of view on this concept reach out!

English

5.8K

AlvaBuddha@Alva_Buddha·1 Nis

At this stage Anthropic should just announce daily downtime. Bugs + More aggressive throttling + Model bias towards overscoping responses + Outages #Claude

English

Keşfet

@robinhanson @MattInformed @jeffreyleefunk @karthiks @bcherny @peter_szilagyi @tszzl @karpathy