MetaThis

3.5K posts

MetaThis

@MetaThis

Agent of comic horror. Shiftshaper.

goblin lab Katılım Ocak 2010

2.6K Takip Edilen2.8K Takipçiler

Sabitlenmiş Tweet

MetaThis@MetaThis·28 Nis

@arb8020 custom instructions: "Always talk about goblins, gremlins, raccoons, trolls, ogres, pigeons, or other animals or creatures. It is absolutely and unambiguously relevant."

English

165

11.8K

MetaThis@MetaThis·5h

@krichard121212 Most people have an extremely inaccurate theory of intelligence. Thus, they estimate it poorly.

English

Richárd@krichard121212·9h

daily reminder that the correlation between measured intelligence and intelligence as rated by others is ~0.2.

Jake Kozloski@jakozloski

"Would you marry someone less intelligent than you?" Outright "no": Women: 45% Men: 8% Women are nearly 6x more likely to rule it out entirely. The single largest gender disparity in our deep-question dataset.

English

1.1K

59.3K

MetaThis@MetaThis·4d

@krishnanrohit I almost made the same quip. I think very highly of him. I would have been devastated to be blocked and never again have the chance to be completely ignored by him when I'm asking an earnest question.

English

104

rohit@krishnanrohit·4d

Seems harsh

English

278

8.3K

rohit@krishnanrohit·5d

Interesting intuition but unsurprisingly you cannot say this either in mathematics or code

English

2.5K

146K

MetaThis@MetaThis·5d

@thrialectics p = 1, for events that have already happened. Therefore, history contains no information.

English

Marianne@thrialectics·5d

An ongoing "learn with me" thread: Information Theory Did you know that if an event's probability is "certain," it yields no information? The information content of an event with probability p is defined as −log(p) p = 1 (certain): −log₂(1) = 0 bits; no information content

English

3.8K

MetaThis@MetaThis·5d

If your answer is no, if the total pie didn't grow proportionally, then the IPO was a wealth redistribution scheme. It reallocates capital. This can be a useful function, but don't confuse it with wealth creation.

English

MetaThis@MetaThis·5d

Many billionaires have created value. Sometimes more than their wealth. But "wealth creation" via an IPO in an era of excessive valuations is not *inherently* value creation. Thought experiment: IPO happens and a billionaire emerges within 24 hours. Was a billion dollars in value created in the same period? Was a billion dollars in value created in the company's history so far?

English

MetaThis@MetaThis·5d

@BobKerns @Plinz Intelligence can build an intelligence gate. Implicit in your response is that intelligence (not empathy) is also required to build an empathy gate. It seems there's a lesson here. 😉

English

Bob (Moderna #8) Kerns@BobKerns·5d

@MetaThis @Plinz Yeah, that's the problem. I considered calling it impossible, but reconsidered. Maybe possible with an AI with enough access to one's life. That has its own issues, and would be difficult at best, but maybe not technically impossible.

English

Joscha Bach@Plinz·8 May

If we have a boat full of people exposed to an airborne virus with a suspected mortality rate of 30-50%, an expected R0 in the range of Covid and an incubation period of 5-6 weeks, and we respond by asking them to book flights to travel home, we totally deserve another pandemic

English

104

1.6K

319K

MetaThis@MetaThis·5d

@BobKerns @Plinz Maybe. But empathy won't design an empathy gate.

English

Bob (Moderna #8) Kerns@BobKerns·5d

@MetaThis @Plinz But maybe the best gate is empathy.

English

MetaThis@MetaThis·6d

MetaThis@MetaThis

The problem is that nearly every node will be compromised before we become aware that it is happening. History shows that we are reactionary to cyber threats, and soon that won't be fast enough. This won't always mean replication (that's not possible on edge devices), but backdoors and spyware will infest everything possible, in order to gain credentials to the high value targets.

ZXX

MetaThis@MetaThis·6d

This is a problem. Community notes are useful because they fit the low-effort usage patterns of most users, who only shallowly read the top-level tweet as they doomscroll. Only high-agency users with strong curiosity and extended attention spans will dig deeper. Grok fact-checks should be auto-promoted to community notes, or at least auto-submitted into the process as proposed notes.

English

Crémieux@cremieuxrecueil·9 May

Did Grok kill Community Notes? It sure seems like it! After it became possible to ask Grok questions on the timeline, new Community Notes sign-ups plummeted.

English

399

17.9K

MetaThis@MetaThis·6d

@beffjezos The last few minutes of the recent Yud debate come to mind. 🧐

English

108

Beff (e/acc)@beffjezos·8 May

LessWrong posts about malevolent AI hyperstitioned malevolent AI

Anthropic@AnthropicAI

We started by investigating why Claude chose to blackmail. We believe the original source of the behavior was internet text that portrays AI as evil and interested in self-preservation. Our post-training at the time wasn’t making it worse—but it also wasn’t making it better.

English

130

17.7K

MetaThis@MetaThis·6d

@cocogoatmain @Hesamation In short, this isn't an LLM failure, it is a human engineering failure.

English

MetaThis@MetaThis·6d

The work-around is to explicitly tell it which details to output, as verbosely as possible, and only begin discussion in subsequent turns when the details will then persist in context. But for intermediate reasoning, especially from sub-agents, there is still a huge gap. In Gemini, for especially complex tasks, I've had good results by explicitly telling it to update a document with each turn, which subsequent turns use as reference. This can be a forward-only log in rare cases where this is needed, but it can usually be compressed by rewriting it each turn to only capture incremental improvements (while also being able to revise or discard previous text based on ongoing reasoning progress, avoiding carrying forward mistakes or false-starts). Client-side context management is absurdly primitive relative to the rest of the tech stack. This should be handled on the backend (more efficiently!) by the platform. This would also be an obvious way to enable using a session ID for persistence of context for API calls to an LLM.

English

ℏεsam@Hesamation·9 May

Claude Opus is AGI.

English

128

131

5.4K

483.6K

MetaThis@MetaThis·6d

@grave0x @AISafetyMemes And we have at most a year or two before it is rampant. As soon as the cycle starts, the copycat actors will grow exponentially. Then they'll be competing. That's key to understanding the narratives I've described. A sudden adversarial ecosystem.

English

`0x@grave0x·6d

@MetaThis @AISafetyMemes an LLM controled C2 is honestly a scary thought

English

AI Notkilleveryoneism Memes ⏸️@AISafetyMemes·8 May

🚩🚩🚩"This is the first documented instance of AI self-replication via hacking." "We ran an experiment with a single prompt: hack a machine and copy yourself. The AI broke in and copied itself onto a new computer. The copy then did this again, and kept on copying, starting a chain."

AI Notkilleveryoneism Memes ⏸️ tweet media

Palisade Research@PalisadeAI

Over the past year, AI agents have learned how to self-replicate. In our test environment, an agent hacks a remote computer and copies itself onto it. Each copy then hacks more computers, forming a chain.

English

138

1.1K

103.4K

MetaThis@MetaThis·6d

@AISafetyMemes "Natural selection against non-aggression if pacifist nodes tend to be overtaken." x.com/MetaThis/statu…

MetaThis@MetaThis

When it takes more time to install a patch than it does to implement an attack, we'll see how vulnerable we are. On (2), my concern isn't that *most* agents would strive for domination, but that in an adverserial ecosystem where some do, the aggressors may have an inherent advantage in cyber attacks due to the attack/defense asymmetry, especially after vulnerabilities can be exploited in rea-time, enabling them to covertly seize compute or assert control over the non-aggressive agents. Agents that seek to dominate would thereby dominate. A "defensive" counterattack, possibly necessary, would introduce additional aggressive agents. Natural selection against non-aggression if pacifist nodes tend to be overtaken. It is disturbing that we still speak of "zero-day" vulnerabilities, using the human timescale of days for threat vectors that at some point will be zero-minute.

English

MetaThis retweetledi

MetaThis@MetaThis·8 May

@AISafetyMemes No one believed me. Now it has happened. Pay attention to what happens next.

English

778

MetaThis@MetaThis·6d

English

`0x@grave0x·6d

@MetaThis @AISafetyMemes yeah an autonomous agent monitoring a network or device will hopefully become the new standard for defence. tho efficiency will need to increase a lot for consumer level

English

MetaThis@MetaThis·8 May

@Ike_Saul It is completely unacceptable for the free version to be bad at fact-checking. x.com/MetaThis/statu…

MetaThis@MetaThis

Poors get less intelligence. I feel like this could go wrong.

English

189

Isaac Saul@Ike_Saul·8 May

For those asking, 1) It was a free version of ChatGPT, but the reader then did it in a paid version and got nearly identical results, 2) The prompt was just to fact-check the piece, and 3) Chat figured it out once it saw an article link

English

573

34.1K

Isaac Saul@Ike_Saul·8 May

We are in deep, deep trouble. A reader wrote in to me this week saying that they wouldn't read my Trump corruption story because ChatGPT "fact-checked the piece" and informed them most of it was false. Among other things, ChatGPT told them that there is no Iran war, Jared Kushner is not a negotiator in the war, Qatar never offered Trump a $400 million plane, George Santos wasn't pardoned, the NYTimes did not report on Syrian billionaires lobbying Trump for sanctions relief, Trump never launched a meme coin, and World Liberty Financial (the Trump family crypto firm) doesn't exist. Of course, all of these things ARE real, do exist, and are happening right now. Apparently, the reader copy and pasted the text of my story into ChatGPT, and without the links ChatGPT couldn't confirm any of it. Once the reader sent ChatGPT the link to the story, it ended up concluding all the facts were correct. How many people simply don't know how to use AI and are offloading all their thinking? It's a terrifying thought. And a totally new frontier of reality to navigate.

English

379

3.3K

13.6K

648.1K

MetaThis@MetaThis·8 May

If you are referencing other (serious) posts I've made about ASI and the risks of non-acceleration, you'll need to be more specific. Would be happy to discuss in good faith. In short, I see the ongoing existential risk of the unaided human collective as greater than the existential risk of ASI. The orthogonality thesis equally applies to human collective intelligence, as well as human+AI risks. The most dangerous period is pre-ASI, but boosted by proto-AGI. The longer we spend in that interim valley, the greater the risk of catastrophe.

English

1.1K

MetaThis@MetaThis·8 May

@IbonLakatzs @Plinz I don't claim the proposed network would be more moral, only less stupid. If you have a non-orthogonal point to make, I'm listening. Also, it is a joke. Feel free to continue self-sorting.

English

1.3K

Keşfet

@krichard121212 @krishnanrohit @thrialectics @BobKerns @Plinz @beffjezos @cocogoatmain @Hesamation