João Eira

9.7K posts

João Eira

@joaoeira

Eternal student, lover of books, learning, and life, which is all really the same thing

🇵🇹 Katılım Şubat 2009

2K Takip Edilen590 Takipçiler

Sabitlenmiş Tweet

João Eira@joaoeira·11 Kas

The only valid reason to make more money is to lower the marginal cost of books.

English

João Eira@joaoeira·8h

@Konrad680106 @nabulionee2 I meant the last sentence, more in jest than seriously, not that Zamoyski bashed Roberts' book. Incidentally there's a second Zamoisky & Roberts discussion that's also pretty good youtube.com/watch?v=dSPtBW…

YouTube

English

514

Konrad@Konrad680106·8h

@joaoeira @nabulionee2 He literally recommends the book in this video lol obviously he is critical of napoleon while roberts has a rather postitive view of him, but nowhere does he bashes the quality of the book, so i can't see how it relates to the original post

English

1.9K

Nabulione@nabulionee2·12h

Moonstruck❤️‍🔥@godspeed_aflame

sometimes I'll hear about an interesting sounding history book, and then I do some more research and learn that historians actually consider it the stupid book for morons that you should only read if you want to be wrong about everything

ZXX

516

147.8K

João Eira@joaoeira·9h

@nabulionee2 @Konrad680106 Pretty sure you're right youtu.be/NI68c8mXBj0?si…

YouTube

English

3.8K

Nabulione@nabulionee2·9h

@Konrad680106 Zamoyski lol. I'd even argue he wrote his book because of Andrew Roberts.

English

5.6K

João Eira@joaoeira·13h

noam brown rn

GIF

Andrej Karpathy@karpathy

Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.

English

João Eira@joaoeira·1d

GIF

Derek Thompson@DKThomp

"Dad books" — which this article, and some publishing insiders, use to describe "serious nonfiction" books across biography, current affairs and business and economics — are reportedly in a free fall, with sales declining every year for the last few years “The trend couldn’t be clearer,” said Jonathan Karp, the former chief executive of Simon & Schuster and publisher of the new Simon Six imprint. “When we have internal meetings to talk about this problem, it always comes around to podcasts,” said Jonathan Burnham, president and publisher of the Harper Group at HarperCollins Publishers.

ZXX

João Eira@joaoeira·5d

@aarondfrancis counselors is affected right? that's what bums me out

English

Aaron Francis@aarondfrancis·6d

Holy crap this feels like a massive blow to anyone that uses claude -p, which is a lot of the tools out there. Fortunately, Solo (soloterm.com) uses the real Claude CLI, so we're safe.

ClaudeDevs@ClaudeDevs

Starting June 15, paid Claude plans can claim a dedicated monthly credit for programmatic usage. The credit covers usage of: - Claude Agent SDK - claude -p - Claude Code GitHub Actions - Third-party apps built on the Agent SDK

English

279

57.3K

João Eira@joaoeira·5d

@PradyuPrasad x.com/davidad/status…

davidad 🎇@davidad

narrative inversion:

QME

155

Pradyumna (in Bay Area)@PradyuPrasad·6d

lowkey anthropic's rugpulls make me more skeptical of their trustworthiness if they get serious economic leverage

English

João Eira@joaoeira·6d

@sammcallister @charlieholtz pretty sure that's never going to happen now x.com/ClaudeDevs/sta…

ClaudeDevs@ClaudeDevs

English

169

sam mcallister@sammcallister·6d

@charlieholtz Sorry to see this but hope to have you back to a new default in short order :)

English

799

Charlie Holtz@charlieholtz·13 May

For the first time in Conductor history, we have a new default coding harness! Codex with GPT-5.5 has becoming the Conductor team's default agent, so we decided to make it the default for new users too.

English

161.5K

João Eira@joaoeira·6d

@ClaudeDevs Another day, another day of fumbling things. Why is claude -p included in this? It makes no sense

English

174

ClaudeDevs@ClaudeDevs·6d

English

1.3K

12.5K

10.2M

João Eira retweetledi

David Motadel@DavidMotadel·12 May

THE SHAH'S GREAT TOUR has a cover (and please judge the book by it...) - out on 8 October!

English

253

20.8K

João Eira retweetledi

Basil Halperin@BasilHalperin·8 May

New paper: AI is good at lots, but labs think automating one thing might be especially important – AI research itself What happens if you embed this into a standard economic growth model? When do you get an ‘economic singularity’?

Anton Korinek@akorinek

1/🆕 New NBER paper: 𝗪𝗵𝗲𝗻 𝗗𝗼𝗲𝘀 𝗔𝘂𝘁𝗼𝗺𝗮𝘁𝗶𝗻𝗴 𝗔𝗜 𝗥𝗲𝘀𝗲𝗮𝗿𝗰𝗵 𝗣𝗿𝗼𝗱𝘂𝗰𝗲 𝗘𝘅𝗽𝗹𝗼𝘀𝗶𝘃𝗲 𝗚𝗿𝗼𝘄𝘁𝗵? Under empirically grounded calibrations, a singularity could arrive within just a few years of automating AI research. 🧵 📄 nber.org/papers/w35155

English

16.6K

João Eira retweetledi

Rob Sica@robsica·9 May

✨$4.00 Kindle ✨ One of the best books I read (multiple times) last year (after purchasing Italian original). Eager to read yet again in Acerbi's (rather than Claude's) translation. amazon.com/Technopanic-Ju…

English

996

João Eira retweetledi

@HistoryBookBuffs@HistoryB00KBuff·7 May

Our latest pod - on #Weimar Germany - looking at the culture, the chaos, and at two fabulous new books on the subject, by @hoyer_kat and @Victorsebby. Wherever you get your pods - on on YouTube via the link below... 👇

English

6.6K

João Eira retweetledi

Viriato@n0ct_can1slupus·2 May

Sorolla: -Sujétame el cubata.

Juan Roleri 🎹@juanroleri

O sea… ¿Cómo vas a pintar la luz, hermano? Estás demente, Caravaggio!!!! 😍

Español

2.9K

25.5K

620.6K

João Eira retweetledi

Lawrence Chan@justanotherlaw·2 May

A recent viral paper claims to reverse-engineer the parameter counts of frontier models: GPT-5.5 = 9.7T, Opus 4.7 = 4.0T, o1 = 3.5T, etc. @ben_sturgeon and I investigated and found serious issues in the paper; fixing them gives GPT-5.5 as ~1.5T (90% CI: 256B-8.3T).

English

957

208.8K

João Eira retweetledi

𝗡𝘂𝗻𝗼 𝗣𝗮𝗹𝗺𝗮@nunopgpalma·1 May

𝐄𝐮𝐫𝐨𝐩𝐞'𝐬 𝐏𝐨𝐢𝐬𝐨𝐧 𝐏𝐢𝐥𝐥: 𝐓𝐡𝐞 𝐔𝐧𝐢𝐧𝐭𝐞𝐧𝐝𝐞𝐝 𝐂𝐨𝐧𝐬𝐞𝐪𝐮𝐞𝐧𝐜𝐞𝐬 𝐨𝐟 𝐂𝐨𝐡𝐞𝐬𝐢𝐨𝐧 𝐅𝐮𝐧𝐝𝐬 𝐚𝐧𝐝 𝐖𝐡𝐲 𝐓𝐡𝐞𝐲 𝐌𝐮𝐬𝐭 𝐄𝐧𝐝 Check out my new book with CUP, already available for preorder at Amazon, Barnes & Noble, or your favorite bookseller👇

English

120

14.7K

João Eira@joaoeira·30 Nis

@Dimillian very useful, one thing I'd for this to not stop at just one side chat, to have a UX kind of like Andy Matuschak's notes notes.andymatuschak.org

English

130

Thomas Ricouard@Dimillian·30 Nis

A new feature sneaked in the Codex app’s latest update. You can now do /side (or use the ... menu) to spawn a side chat! Useful when you're deep in a thread and want to have a side question in the current context!

English

1.2K

182.1K

João Eira retweetledi

Oliver Kim@oliverwkim·30 Nis

I miss @pseudoerasmus

English

110

7.9K

João Eira retweetledi

AI Security Institute@AISecurityInst·30 Nis

OpenAI’s GPT-5.5 is the second model to complete one of our multi-step cyber-attack simulations end-to-end 🧵

English

398

2.4K

1.8M

João Eira@joaoeira·30 Nis

Ah so we're already at that stage of the game uh

Andrew Curran@AndrewCurran_

The White House is against a proposal from Anthropic to more than double the number of groups with access to Mythos, citing both security concerns and the belief that expanding the program would mean less available use of Mythos for government agencies that already have access.

English

João Eira retweetledi

Bojie Li@bojie_li·29 Nis

Closed labs hide model sizes. They can't hide what their models know, and what a model knows is an indicator on how big it is. Reasoning compresses. Factual knowledge doesn't. So you can size a frontier model from black-box API calls alone, and across releases you can literally watch a single fact arrive in the parameters over time. For three years, my friends Jiyan He and Zihan Zheng have been asking frontier LLMs the same question: "what do you know about USTC Hackergame?", a CTF contest. May 2024: GPT-4o invented fake titles. Feb 2025: Claude 3.7 Sonnet listed 19 verified 2023 challenges. By April 2026, frontier models recall specific challenges across consecutive years. After DeepSeek-V4 dropped, I instructed my agent to spend four days autonomously turning that habit into Incompressible Knowledge Probes (IKP) — 1,400 questions, 7 tiers of obscurity, 188 models, 27 vendors. Three findings: 1/ You can approximately size any black-box LLM from factual accuracy alone. Penalized accuracy is log-linear in log(params), R² = 0.917 on 89 open-weight models from 135M to 1.6T params. Project closed APIs onto the curve → GPT-5.5 ~9T, Claude Opus 4.7 ~4T, GPT-5.4 ~2.2T, Claude Sonnet 4.6 ~1.7T, Gemini 2.5 Pro ~1.2T (90% CI: 0.3-3x size). 2/ Citation count and h-index don't predict whether a frontier model recognizes a researcher. Two researchers with similar citation profiles get very different responses. Models memorize impact — work that shaped a field, not many incremental papers. 3/ Factual capacity doesn't compress over time. Across 96 open-weight models across 3 years, the IKP time coefficient is statistically zero, rejecting the Densing-Law prediction of +0.0117/month at p<10⁻¹⁵. Reasoning benchmarks saturate; factual capacity keeps scaling with parameters. Website: 01.me/research/ikp/ Paper: arxiv.org/pdf/2604.24827

English

233

2.2K

388.2K

Keşfet

@Konrad680106 @nabulionee2 @aarondfrancis @PradyuPrasad @sammcallister @charlieholtz @ClaudeDevs @hoyer_kat