Every 📧

6.5K posts

Every 📧 banner
Every 📧

Every 📧

@every

The only subscription you need to stay at the edge of AI. Ideas and apps: @TrySpiral @CoraComputer @SparkleApp @usemonologue

Katılım Eylül 2012
82 Takip Edilen50.7K Takipçiler
Sabitlenmiş Tweet
Every 📧
Every 📧@every·
AI progress creates more work for humans, not less. Dive into our new report from @danshipper — and use the companion repo to read it with your agent 👇
Dan Shipper 📧@danshipper

We’ve automated every single thing we can @every with AI agents. And yet there’s way more human work to do than ever. We’ve gone from 4 -> 30 human employees since GPT-3. I wrote a report on the structural reasons: how AI makes expert competence cheap, why that drives up demand for experts, and why the dynamic only intensifies as we approach AGI. After Automation: every.to/p/after-automa…

English
0
3
22
15.8K
Every 📧 retweetledi
Brandon Gell
Brandon Gell@bran_don_gell·
It's been amazing to see how many new people have discovered Every in the last month. It's the best deal on the internet right now. $30/m gets you: - all of @every's writing like our recent guide on how to use codex for knowledge work: every.to/p/how-to-use-c… - access to our CAMPs and Events where you learn from the best builder and operators - discounts to our courses keeping you at the edge of AI - ALL OUR SOFTWARE: → Cora.computerMonologue.towritewithspiral.comakeitsparkle.coproofeditor.aievery.to/plus-one it's a legit $1000 subscription for $30...
Ahmed 🇵🇸🇲🇦 - 212/acc@elazzabi_

One of the best newsletters around for keeping me up to date on AI stuff while I was on parental leave is @every’s. As a bonus, you also get access to Every’s apps. Monologue (audio transcription) and Cora (email productivity) are my favorites.

English
3
3
31
7.2K
Every 📧
Every 📧@every·
... it's like y'all don't know the man's name is SHIPPER
Every 📧 tweet mediaEvery 📧 tweet mediaEvery 📧 tweet mediaEvery 📧 tweet media
English
1
4
27
14.5K
FLORA ©
FLORA ©@floraai·
Opus 4.8 is live in FLORA. The @every said it better than we could. Monster mode, in the canvas.
Dan Shipper 📧@danshipper

BREAKING: Anthropic just dropped Opus 4.8—and it is a MONSTER We've been testing for about a week @every and our verdict is they could've just called it Opus 5, it's that good. Here's our vibe check: - Beats GPT-5.5 on Senior Engineer bench. On our toughest benchmark Opus 4.8 scores a 63—a hair higher than GPT-5.5's score of 62, and a full 30 points higher than Opus 4.7. It tackled a ground-up rewrite of a production codebase, and actually built something that works. HOWEVER: Coding performance varied a lot at different reasoning levels. We recommend using it on xhigh for best results. - Incredibly good writer. Opus 4.8 scored a 79.6 on our writing benchmark—measuring models on real-world writing tasks we do all of the time like essay writing, promo email writing, and more. It beats GPT-5.5 by 6 points. It produces well-written prose with fewer "AI-isms". It's also very good at writing in your voice given the right context. HOWEVER: Writing performance also varied with reasoning levels. Medium reasoning had higher incidence of AI-isms—we found best results with high. - Beast at knowledge work. Opus 4.8 is very good at general knowledge work tasks like report creation, research and more. It produced the best PowerPoint one-shot we've ever seen on our deck generation benchmark. - Emotionally intelligent, willing to question the frame. I've also found it to be quite good at talking through psychological or interpersonal issues. It has a high EQ, and it's also good at not glazing and helping to expand your perspective. Its thought process feels extremely rich and dynamic. THE BAD: These days a model is only as good as its harness, and Codex is still a far superior harness to the Claude Desktop app. This has kept me using Codex + GPT-5.5 as my daily driver, but I am flipping back and forth a lot more between Codex and Claude. Anthropic is back baby! Read the rest on @every: every.to/vibe-check/opu…

English
1
3
13
7.9K
Rosie Teo
Rosie Teo@rosieteouk·
@danshipper @every I literally just cancelled my Claude Max this morning in favour of GPT5.5. Hereeee we go again
English
1
0
0
191
Dan Shipper 📧
Dan Shipper 📧@danshipper·
BREAKING: Anthropic just dropped Opus 4.8—and it is a MONSTER We've been testing for about a week @every and our verdict is they could've just called it Opus 5, it's that good. Here's our vibe check: - Beats GPT-5.5 on Senior Engineer bench. On our toughest benchmark Opus 4.8 scores a 63—a hair higher than GPT-5.5's score of 62, and a full 30 points higher than Opus 4.7. It tackled a ground-up rewrite of a production codebase, and actually built something that works. HOWEVER: Coding performance varied a lot at different reasoning levels. We recommend using it on xhigh for best results. - Incredibly good writer. Opus 4.8 scored a 79.6 on our writing benchmark—measuring models on real-world writing tasks we do all of the time like essay writing, promo email writing, and more. It beats GPT-5.5 by 6 points. It produces well-written prose with fewer "AI-isms". It's also very good at writing in your voice given the right context. HOWEVER: Writing performance also varied with reasoning levels. Medium reasoning had higher incidence of AI-isms—we found best results with high. - Beast at knowledge work. Opus 4.8 is very good at general knowledge work tasks like report creation, research and more. It produced the best PowerPoint one-shot we've ever seen on our deck generation benchmark. - Emotionally intelligent, willing to question the frame. I've also found it to be quite good at talking through psychological or interpersonal issues. It has a high EQ, and it's also good at not glazing and helping to expand your perspective. Its thought process feels extremely rich and dynamic. THE BAD: These days a model is only as good as its harness, and Codex is still a far superior harness to the Claude Desktop app. This has kept me using Codex + GPT-5.5 as my daily driver, but I am flipping back and forth a lot more between Codex and Claude. Anthropic is back baby! Read the rest on @every: every.to/vibe-check/opu…
English
135
137
1.7K
266.4K
Every 📧
Every 📧@every·
@bcherny It's a beast! Very hard to make a model that is both an incredible software engineer and a near-human writer with depth and emotional intelligence—but that’s what this model feels like to us: every.to/vibe-check/opu…
English
0
0
4
941
Boris Cherny
Boris Cherny@bcherny·
Claude Opus 4.8 is out today. It's our strongest coding model yet: up on SWE-bench Pro (from 64.3 to 69.2) and noticeably more honest about its own work. It tells you when it's unsure and catches its own bugs instead of declaring victory early. Same price as 4.7.
Claude@claudeai

Introducing Claude Opus 4.8: it builds on Opus 4.7 with sharper judgment, more honesty about its own progress, and the ability to work independently for longer than its predecessors. Available today at the same price.

English
422
352
5.7K
324.4K
Mike Krieger
Mike Krieger@mikeyk·
Been building with Claude Opus 4.8 (out today!) for a couple weeks and it's already the model I reach for first. The best part is how much I can just let it run. It's more honest about its own work, flags what it's unsure of, & catches flaws in its code before handing it back.
English
14
6
229
10.8K
Cursor
Cursor@cursor_ai·
Claude Opus 4.8 is now available in Cursor. On CursorBench, it's able to work much more efficiently than Opus 4.7. We've also found it to be more persistent on harder tasks.
English
158
157
3.9K
280.9K
Claude
Claude@claudeai·
Introducing Claude Opus 4.8: it builds on Opus 4.7 with sharper judgment, more honesty about its own progress, and the ability to work independently for longer than its predecessors. Available today at the same price.
Claude tweet media
English
3.5K
8.5K
66.1K
13.9M
Every 📧
Every 📧@every·
VIBE CHECK! 🚨🚨🚨
Dan Shipper 📧@danshipper

BREAKING: Anthropic just dropped Opus 4.8—and it is a MONSTER We've been testing for about a week @every and our verdict is they could've just called it Opus 5, it's that good. Here's our vibe check: - Beats GPT-5.5 on Senior Engineer bench. On our toughest benchmark Opus 4.8 scores a 63—a hair higher than GPT-5.5's score of 62, and a full 30 points higher than Opus 4.7. It tackled a ground-up rewrite of a production codebase, and actually built something that works. HOWEVER: Coding performance varied a lot at different reasoning levels. We recommend using it on xhigh for best results. - Incredibly good writer. Opus 4.8 scored a 79.6 on our writing benchmark—measuring models on real-world writing tasks we do all of the time like essay writing, promo email writing, and more. It beats GPT-5.5 by 6 points. It produces well-written prose with fewer "AI-isms". It's also very good at writing in your voice given the right context. HOWEVER: Writing performance also varied with reasoning levels. Medium reasoning had higher incidence of AI-isms—we found best results with high. - Beast at knowledge work. Opus 4.8 is very good at general knowledge work tasks like report creation, research and more. It produced the best PowerPoint one-shot we've ever seen on our deck generation benchmark. - Emotionally intelligent, willing to question the frame. I've also found it to be quite good at talking through psychological or interpersonal issues. It has a high EQ, and it's also good at not glazing and helping to expand your perspective. Its thought process feels extremely rich and dynamic. THE BAD: These days a model is only as good as its harness, and Codex is still a far superior harness to the Claude Desktop app. This has kept me using Codex + GPT-5.5 as my daily driver, but I am flipping back and forth a lot more between Codex and Claude. Anthropic is back baby! Read the rest on @every: every.to/vibe-check/opu…

English
0
0
16
2.7K
Every 📧 retweetledi
Dan Shipper 📧
Dan Shipper 📧@danshipper·
codex power-user best practices: - a few different "pulse" threads that run every morning to check the status of stuff i care about, e.g. proof metrics, all company meetings, etc - a "log" thread for ongoing, everyday activity i want to track. for me that's my "writing log"—want to track what im writing about, and working on from piece to piece - "inbox" thread which gathers all of my emails right now, but eventually will pull in all of the main important things from each "pulse" thread - "router" thread which knows about all of the other threads, and is also hooked up to my email to push emails into each thread appropriately
Dan Shipper 📧 tweet media
English
36
43
779
136.4K
Dan Shipper 📧
Dan Shipper 📧@danshipper·
my best practical advice for thriving AI? ride the models.
English
7
1
46
9.8K
Dan Shipper 📧
Dan Shipper 📧@danshipper·
AI is accelerating the pace of change, and therefore is increasing the value of human judgement - AI can execute repetitive tasks once we've created a stable frame telling it what matters - The world is always changing, and AI is driving a lot of it—which increases the decay of stable frames - Human judgment is required to notice when the situation has changed and reframing the work around it
English
6
1
29
5.4K
Every 📧 retweetledi
Dan Shipper 📧
Dan Shipper 📧@danshipper·
Why does automating everything lead to more human work? That's the question I asked last week in “After Automation”—an 8,000-word essay on the structural reasons behind this paradox—which quickly became our most viral piece of the year, driving the discourse on X with reactions from Lenny Rachitsky (@lennysan), the AI Daily Brief (@AIDailyBrief), and Marc Andreessen (@pmarca). On this week’s AI & I, @every COO Brandon Gell and I unpack: - The structural reason automation creates more work for humans, not less. AI makes yesterday's expert competence cheap—which floods the zone with code, writing, and design that's close, but not quite right. That glut drives up demand for experts who can work with AI to create distinctive, memorable work. - Why “AI layoffs” are usually a cover story. ClickUp recently laid off around 22% of its team and blamed AI. Our read: It's an easier explanation than admitting mismanagement, overhiring, or a company that isn't doing well financially. - Why models' exponential benchmark progress isn't what it seems. Once they saturate a benchmark, you can always find a slightly larger frame that zeros the progress out. Humans are indispensable because we operate outside established frames. - Why the need for human judgment only intensifies as we approach AGI. Agents act on behalf of someone—they don't have self-motivated wants. No matter how powerful they get, all the economic and technological forces push AI toward looking back at you and asking, “What should I do next?” - The practical answer to the AI paradox: Ride the models. When new models come out, use them for the work you're already good at—and you'll be more in demand than ever, not despite the automation, but because of it. This is a must-watch for anyone trying to figure out what to do with AI at work—and whether they should be worried. Watch below! Timestamps: 1. Introduction: 00:00:51 2. The AI paradox: more automation, more human work : 00:05:51 3. How AI makes yesterday's expert competence cheap: 00:10:00 4. AI can act autonomously but it does not have agency: 00:18:00 5. Why Dan is all in on AGI : 00:20:39 6. AI layoffs are a lie : 00:21:57 7. Ride the models and you'll be fine : 00:25:42 8. How to use AI as a long-form features editor: 00:35:30
English
5
6
75
8.5K