AI Cannot Self-Improve and the Math Behind It PROVES IT! | Devsimsek
So, I saw a LinkedIn post (forwarded by a friend, thanks again) that stopped my doom-scrolling dead in its tracks. The headline? “Researchers just mathematically proved AI cannot self-improve.” My first reaction was the classic developer response: “I called it earlier!” My second reaction was to actually read the paper.
Turns out – yeah, we were right. And the math behind it is kind of uncomfortably elegant.
The Dream They All Had
The whole “AI singularity” narrative goes something like this: we build a smart AI, that AI improves itself, the improved version is smarter so it improves itself even faster, and then – boom – we either all live in utopia or become paperclips. This is called Recursive Self-Improvement (RSI), and it’s been the backbone of both AI doomer manifestos and Silicon Valley pitch decks for a decade.
The implicit assumption is that an AI training on its own outputs would get better over time. Like compound interest, but for intelligence. Sounds reasonable, right?
Yeah. About that.
What the Paper Actually Says
A recent arXiv paper – “On the Limits of Self-Improving in Large Language Models” – doesn’t just argue against RSI. It formally proves it’s self-defeating.
The core idea: model the self-referential training loop as a dynamical system on the space of probability distributions. When a model trains on its own generated data (synthetic outputs), it’s not learning from reality anymore – it’s learning from a distorted reflection of itself.
The paper proves that under a diminishing supply of fresh, authentic data, this system converges to a fixed point – a degenerate distribution with low diversity and high bias. The technical term is model collapse, and it’s been observed empirically too. But now there’s a formal proof that it’s inevitable, not just bad luck.
In plain terms: the model doesn’t climb toward superintelligence. It slowly forgets what the real world looks like.
# Oversimplified metaphor as code
def self_improve(model, real_data_supply):
    while real_data_supply > 0:
        synthetic = model.generate()
        model.train(synthetic)
        real_data_supply *= 0.9  # diminishing fresh data
    return model  # spoiler: this model is now dumber
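To make the fixed-point claim concrete, here’s a toy numerical sketch of my own (not from the paper – a Gaussian stands in for the model’s output distribution): refit the “model” each generation on a small batch of its own samples, with no fresh data. The spread drifts toward zero, generation after generation.

# Toy collapse sketch (mine, not the paper's): the "model" is a Gaussian,
# refit each generation on a small batch of its own synthetic samples.
import random
import statistics

random.seed(0)
mu, sigma = 0.0, 1.0  # generation 0 is fit to the real world
for gen in range(1, 101):
    synthetic = [random.gauss(mu, sigma) for _ in range(10)]  # model output
    mu = statistics.mean(synthetic)        # refit on synthetic data only
    sigma = statistics.pstdev(synthetic)   # finite-sample refitting loses spread
    if gen % 10 == 0:
        print(f"gen {gen:3d}: mu={mu:+.3f} sigma={sigma:.4f}")
# sigma keeps shrinking; the distribution narrows toward a degenerate point

On a typical run the spread shrinks by orders of magnitude within a hundred generations. No superintelligence – just a model that has forgotten what variety looks like.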
The proof also extends beyond single LLMs – it covers ecosystems of interacting models and multi-modal systems. So no, a committee of AIs feeding each other outputs doesn’t escape the problem. It might actually make it worse.
The “Curse of Recursion”
There’s a term I love from this paper: the curse of recursion. When your training data is increasingly polluted with your own synthetic outputs, the tails of your distribution disappear first. Rare but important patterns – edge cases, nuanced reasoning, outlier knowledge – get washed out. The model converges toward a bland, high-confidence, low-variance output space.
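Here’s another toy sketch (again mine, not the paper’s formalism) of exactly that: a categorical “vocabulary” with a few common tokens and many rare ones, re-estimated each generation from a finite sample of its own output. The rare tokens are the first to hit zero probability, and once gone they never come back.

# Tail-loss sketch (mine): rare tokens vanish first under self-training.
import random
from collections import Counter

random.seed(1)
tokens = list(range(100))
weights = [10.0] * 5 + [0.1] * 95   # 5 common tokens, 95 rare "tail" tokens

for gen in range(1, 21):
    corpus = random.choices(tokens, weights=weights, k=500)   # synthetic corpus
    counts = Counter(corpus)
    weights = [counts.get(t, 0) for t in tokens]              # refit on own output
    surviving = sum(1 for w in weights if w > 0)
    if gen % 5 == 0:
        print(f"gen {gen:2d}: {surviving} of {len(tokens)} tokens still occur")
# the tail erodes every generation; the head stays, the nuance disappears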
You can see this empirically already. Ask a model that’s been RLHF’d into oblivion something unusual, and it’ll confidently give you a smooth, plausible-sounding, completely wrong answer. That’s collapse in slow motion.
The math backing this is rooted in dynamical systems theory – specifically the idea that without an external “forcing function” (real, diverse, human-generated data), the system has no energy to maintain the complexity of the original distribution. It inevitably degenerates.
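To see what that forcing function buys you, here’s one last sketch (my own illustration of the idea, not the paper’s construction): the same Gaussian loop as before, but with a small slice of fresh samples from the real distribution mixed into every generation. That trickle of grounding is enough to keep the variance from dying.

# Forcing-function sketch (mine): mix a little real data into each round.
import random
import statistics

random.seed(2)
REAL_MU, REAL_SIGMA = 0.0, 1.0            # the external, authentic distribution
mu, sigma = REAL_MU, REAL_SIGMA

for gen in range(1, 101):
    synthetic = [random.gauss(mu, sigma) for _ in range(8)]
    fresh = [random.gauss(REAL_MU, REAL_SIGMA) for _ in range(2)]  # 20% real data
    batch = synthetic + fresh
    mu = statistics.mean(batch)           # refit on the mixed batch
    sigma = statistics.pstdev(batch)
    if gen % 25 == 0:
        print(f"gen {gen:3d}: mu={mu:+.3f} sigma={sigma:.3f}")
# unlike the pure-synthetic loop, sigma stays bounded away from zero

Same mechanics, different fate: the real data keeps injecting the spread the synthetic loop keeps throwing away.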
What This Actually Means for the Industry
This doesn’t mean AI stops improving. It means the self-improvement loop fantasy is dead – at least the version where you unplug the humans and let it run.
What it does mean:
- Human-generated data is irreplaceable. The “internet is running out of training data” problem just got mathematically formalized. You can’t fake your way out of it with synthetic data at scale.
- RSI as a path to AGI is a dead end. At least the naive version – train → generate → retrain → repeat. It converges, but downward.
- Curation matters more than quantity. A smaller dataset of high-quality, diverse, authentic human output beats a massive synthetic pile every time. Quality over quantity isn’t just a vibe – it’s thermodynamically correct.
- We’re not getting a free intelligence explosion. The singularity crowd’s timeline assumptions might need some… recalibration.
Personally, this makes me feel vindicated about something I’ve been quietly skeptical about: the idea that scale alone solves everything. It doesn’t. Data provenance matters. Signal quality matters. The universe doesn’t give you compound interest on noise.
The Beautiful Irony
Here’s what gets me: the very mechanism people proposed to transcend human limitations – training on AI-generated data to break free from the finite supply of human knowledge – is mathematically proven to destroy the model’s representation of reality.
The escape route collapses into a trap.
It’s like trying to lift yourself off the ground by pulling on your own shoelaces. The harder you pull, the more firmly you stay put.
Does this mean AGI is impossible? (Even though I’d like to say yes, I have neither enough research to back that up nor any desire to comment on it here.) No. Does it mean the naive RSI path is a dead end? Mathematically, yes.
The smarter path – and what labs are quietly shifting toward – is better data, better curation, better grounding in reality. Which, ironically, means humans stay in the loop longer than the singularitarians wanted.
smsk.dev/2026/04/26/ai-…