Every 📧 (@every) - Twitter Profili | Zamantika Mersobahis Locabet

Sabitlenmiş Tweet

AI progress creates more work for humans, not less. Dive into our new report from @danshipper — and use the companion repo to read it with your agent 👇

Dan Shipper 📧@danshipper

We’ve automated every single thing we can @every with AI agents. And yet there’s way more human work to do than ever. We’ve gone from 4 -> 30 human employees since GPT-3. I wrote a report on the structural reasons: how AI makes expert competence cheap, why that drives up demand for experts, and why the dynamic only intensifies as we approach AGI. After Automation: every.to/p/after-automa…

English

0

3

22

15.8K

Every 📧 retweetledi

Brandon Gell@bran_don_gell·15h

It's been amazing to see how many new people have discovered Every in the last month. It's the best deal on the internet right now. $30/m gets you: - all of @every's writing like our recent guide on how to use codex for knowledge work: every.to/p/how-to-use-c… - access to our CAMPs and Events where you learn from the best builder and operators - discounts to our courses keeping you at the edge of AI - ALL OUR SOFTWARE: → Cora.computer → Monologue.to → writewithspiral.co → makeitsparkle.co → proofeditor.ai → every.to/plus-one it's a legit $1000 subscription for $30...

Ahmed 🇵🇸🇲🇦 - 212/acc@elazzabi_

One of the best newsletters around for keeping me up to date on AI stuff while I was on parental leave is @every’s. As a bonus, you also get access to Every’s apps. Monologue (audio transcription) and Cora (email productivity) are my favorites.

English

3

31

7.2K

Every 📧@every·1d

@lennysan ❤️👋 every.to/careers

QME

1

0

7

834

Lenny Rachitsky@lennysan·1d

Fascinating results + Anthropic running away with it right now + So many people want to start their own company + Google over OpenAI + Vercel, Linear, Every, PostHog overperforming A great list if you're trying to figure out where to go work 👇

Lenny Rachitsky@lennysan

What are your top 3 dream companies to work for right now?

English

176

79

1.4K

534K

Every 📧@every·1d

... it's like y'all don't know the man's name is SHIPPER

English

1

4

27

14.5K

Every 📧@every·1d

@floraai 🫡

QME

0

3

1.5K

FLORA ©@floraai·1d

Opus 4.8 is live in FLORA. The @every said it better than we could. Monster mode, in the canvas.

Dan Shipper 📧@danshipper

BREAKING: Anthropic just dropped Opus 4.8—and it is a MONSTER We've been testing for about a week @every and our verdict is they could've just called it Opus 5, it's that good. Here's our vibe check: - Beats GPT-5.5 on Senior Engineer bench. On our toughest benchmark Opus 4.8 scores a 63—a hair higher than GPT-5.5's score of 62, and a full 30 points higher than Opus 4.7. It tackled a ground-up rewrite of a production codebase, and actually built something that works. HOWEVER: Coding performance varied a lot at different reasoning levels. We recommend using it on xhigh for best results. - Incredibly good writer. Opus 4.8 scored a 79.6 on our writing benchmark—measuring models on real-world writing tasks we do all of the time like essay writing, promo email writing, and more. It beats GPT-5.5 by 6 points. It produces well-written prose with fewer "AI-isms". It's also very good at writing in your voice given the right context. HOWEVER: Writing performance also varied with reasoning levels. Medium reasoning had higher incidence of AI-isms—we found best results with high. - Beast at knowledge work. Opus 4.8 is very good at general knowledge work tasks like report creation, research and more. It produced the best PowerPoint one-shot we've ever seen on our deck generation benchmark. - Emotionally intelligent, willing to question the frame. I've also found it to be quite good at talking through psychological or interpersonal issues. It has a high EQ, and it's also good at not glazing and helping to expand your perspective. Its thought process feels extremely rich and dynamic. THE BAD: These days a model is only as good as its harness, and Codex is still a far superior harness to the Claude Desktop app. This has kept me using Codex + GPT-5.5 as my daily driver, but I am flipping back and forth a lot more between Codex and Claude. Anthropic is back baby! Read the rest on @every: every.to/vibe-check/opu…

English

1

3

13

7.9K

Every 📧@every·1d

"This is a banger model. If you are a Claude stan, you're going to love it. If you've been converted to Codex, I highly recommend you at least add it as part of your arsenal. It'll be an eye-opening experience."

Dan Shipper 📧@danshipper

Opus 4.8 is a GAMECHANGER Here's the full vibe check from @every:

English

1

2

14

2.9K

Every 📧@every·1d

@rosieteouk @danshipper 😅

QME

0

31

Rosie Teo@rosieteouk·1d

@danshipper @every I literally just cancelled my Claude Max this morning in favour of GPT5.5. Hereeee we go again

English

1

0

191

Dan Shipper 📧@danshipper·1d

BREAKING: Anthropic just dropped Opus 4.8—and it is a MONSTER We've been testing for about a week @every and our verdict is they could've just called it Opus 5, it's that good. Here's our vibe check: - Beats GPT-5.5 on Senior Engineer bench. On our toughest benchmark Opus 4.8 scores a 63—a hair higher than GPT-5.5's score of 62, and a full 30 points higher than Opus 4.7. It tackled a ground-up rewrite of a production codebase, and actually built something that works. HOWEVER: Coding performance varied a lot at different reasoning levels. We recommend using it on xhigh for best results. - Incredibly good writer. Opus 4.8 scored a 79.6 on our writing benchmark—measuring models on real-world writing tasks we do all of the time like essay writing, promo email writing, and more. It beats GPT-5.5 by 6 points. It produces well-written prose with fewer "AI-isms". It's also very good at writing in your voice given the right context. HOWEVER: Writing performance also varied with reasoning levels. Medium reasoning had higher incidence of AI-isms—we found best results with high. - Beast at knowledge work. Opus 4.8 is very good at general knowledge work tasks like report creation, research and more. It produced the best PowerPoint one-shot we've ever seen on our deck generation benchmark. - Emotionally intelligent, willing to question the frame. I've also found it to be quite good at talking through psychological or interpersonal issues. It has a high EQ, and it's also good at not glazing and helping to expand your perspective. Its thought process feels extremely rich and dynamic. THE BAD: These days a model is only as good as its harness, and Codex is still a far superior harness to the Claude Desktop app. This has kept me using Codex + GPT-5.5 as my daily driver, but I am flipping back and forth a lot more between Codex and Claude. Anthropic is back baby! Read the rest on @every: every.to/vibe-check/opu…

English

135

137

1.7K

266.4K

Every 📧@every·1d

@bcherny It's a beast! Very hard to make a model that is both an incredible software engineer and a near-human writer with depth and emotional intelligence—but that’s what this model feels like to us: every.to/vibe-check/opu…

English

0

4

941

Boris Cherny@bcherny·1d

Claude Opus 4.8 is out today. It's our strongest coding model yet: up on SWE-bench Pro (from 64.3 to 69.2) and noticeably more honest about its own work. It tells you when it's unsure and catches its own bugs instead of declaring victory early. Same price as 4.7.

Claude@claudeai

Introducing Claude Opus 4.8: it builds on Opus 4.7 with sharper judgment, more honesty about its own progress, and the ability to work independently for longer than its predecessors. Available today at the same price.

English

422

352

5.7K

324.4K

Every 📧@every·1d

@Polymarket Our vibe check—It's an absolute monster: every.to/vibe-check/opu…

English

0

3

334

Polymarket@Polymarket·1d

JUST IN: Anthropic officially releases Claude 4.8.

Polymarket@Polymarket

JUST IN: Claude 4.8 projected to be released by the end of the month. 85% chance. polymarket.com/event/claude-4…

English

114

77

1.4K

270.7K

Every 📧@every·1d

@mikeyk It's a monster! We think y'all should've rounded up to Opus 5: every.to/vibe-check/opu…

English

0

6

1.7K

Mike Krieger@mikeyk·1d

Been building with Claude Opus 4.8 (out today!) for a couple weeks and it's already the model I reach for first. The best part is how much I can just let it run. It's more honest about its own work, flags what it's unsure of, & catches flaws in its code before handing it back.

English

14

6

229

10.8K

Every 📧@every·1d

@cursor_ai Tops in our Senior Engineer Benchmark as well: every.to/vibe-check/opu…

English

0

6

1.5K

Cursor@cursor_ai·1d

Claude Opus 4.8 is now available in Cursor. On CursorBench, it's able to work much more efficiently than Opus 4.7. We've also found it to be more persistent on harder tasks.

English

158

157

3.9K

280.9K

Every 📧@every·1d

@ClaudeDevs Legitimately great model, jumping to the top of the pack in the rankings (and our hearts): every.to/vibe-check/opu…

English

0

6

2.1K

ClaudeDevs@ClaudeDevs·1d

Opus 4.8 is live in Claude Code today. A few things worth knowing: 🧵

Claude@claudeai

Introducing Claude Opus 4.8: it builds on Opus 4.7 with sharper judgment, more honesty about its own progress, and the ability to work independently for longer than its predecessors. Available today at the same price.

English

371

856

10.1K

1.3M

Every 📧@every·1d

@claudeai Vibe Check! Opus 4.8 tops both our Senior Engineer benchmark and our AI writing tests. It’s the most complete model we’ve tested: every.to/vibe-check/opu…

English

1

7

70

9.8K

Claude@claudeai·1d

Introducing Claude Opus 4.8: it builds on Opus 4.7 with sharper judgment, more honesty about its own progress, and the ability to work independently for longer than its predecessors. Available today at the same price.

English

3.5K

8.5K

66.1K

13.9M

Every 📧 retweetledi

Dan Shipper 📧@danshipper·1d

@every Watch on YouTube: youtube.com/watch?v=kXYDUo…

YouTube

English

2

3

25

11.3K

Every 📧@every·1d

And we're live with @danshipper @kplikethebird and @kieranklaassen putting Opus 4.8 to the test. "It's my favorite model." @kieranklaassen

Dan Shipper 📧@danshipper

LIVE VIBE CHECK: Anthropic Is SO Back—Testing Opus 4.8 x.com/i/broadcasts/1…

English

0

1

1.4K

Every 📧@every·1d

VIBE CHECK! 🚨🚨🚨

Dan Shipper 📧@danshipper

BREAKING: Anthropic just dropped Opus 4.8—and it is a MONSTER We've been testing for about a week @every and our verdict is they could've just called it Opus 5, it's that good. Here's our vibe check: - Beats GPT-5.5 on Senior Engineer bench. On our toughest benchmark Opus 4.8 scores a 63—a hair higher than GPT-5.5's score of 62, and a full 30 points higher than Opus 4.7. It tackled a ground-up rewrite of a production codebase, and actually built something that works. HOWEVER: Coding performance varied a lot at different reasoning levels. We recommend using it on xhigh for best results. - Incredibly good writer. Opus 4.8 scored a 79.6 on our writing benchmark—measuring models on real-world writing tasks we do all of the time like essay writing, promo email writing, and more. It beats GPT-5.5 by 6 points. It produces well-written prose with fewer "AI-isms". It's also very good at writing in your voice given the right context. HOWEVER: Writing performance also varied with reasoning levels. Medium reasoning had higher incidence of AI-isms—we found best results with high. - Beast at knowledge work. Opus 4.8 is very good at general knowledge work tasks like report creation, research and more. It produced the best PowerPoint one-shot we've ever seen on our deck generation benchmark. - Emotionally intelligent, willing to question the frame. I've also found it to be quite good at talking through psychological or interpersonal issues. It has a high EQ, and it's also good at not glazing and helping to expand your perspective. Its thought process feels extremely rich and dynamic. THE BAD: These days a model is only as good as its harness, and Codex is still a far superior harness to the Claude Desktop app. This has kept me using Codex + GPT-5.5 as my daily driver, but I am flipping back and forth a lot more between Codex and Claude. Anthropic is back baby! Read the rest on @every: every.to/vibe-check/opu…

English

0

16

2.7K

Every 📧@every·1d

LIVE VIBE CHECK: Anthropic Is SO Back—Testing Opus 4.8 x.com/i/broadcasts/1…

English

0

8

627

Every 📧 retweetledi

Dan Shipper 📧@danshipper·1d

codex power-user best practices: - a few different "pulse" threads that run every morning to check the status of stuff i care about, e.g. proof metrics, all company meetings, etc - a "log" thread for ongoing, everyday activity i want to track. for me that's my "writing log"—want to track what im writing about, and working on from piece to piece - "inbox" thread which gathers all of my emails right now, but eventually will pull in all of the main important things from each "pulse" thread - "router" thread which knows about all of the other threads, and is also hooked up to my email to push emails into each thread appropriately

English

36

43

779

136.4K

Every 📧@every·2d

@danshipper 🫡 live at the edge of AI with us every.to/subscribe

English

0

1

3.5K

Dan Shipper 📧@danshipper·2d

my best practical advice for thriving AI? ride the models.

English

7

1

46

9.8K

Every 📧@every·2d

@danshipper Every agent needs a human. Read Dan's report on what comes after automation, then use an agent to apply it to your own work: every.to/p/after-automa…

English

0

1

1.4K

Dan Shipper 📧@danshipper·2d

AI is accelerating the pace of change, and therefore is increasing the value of human judgement - AI can execute repetitive tasks once we've created a stable frame telling it what matters - The world is always changing, and AI is driving a lot of it—which increases the decay of stable frames - Human judgment is required to notice when the situation has changed and reframing the work around it

English

6

1

29

5.4K

Every 📧 retweetledi

Dan Shipper 📧@danshipper·2d

Why does automating everything lead to more human work? That's the question I asked last week in “After Automation”—an 8,000-word essay on the structural reasons behind this paradox—which quickly became our most viral piece of the year, driving the discourse on X with reactions from Lenny Rachitsky (@lennysan), the AI Daily Brief (@AIDailyBrief), and Marc Andreessen (@pmarca). On this week’s AI & I, @every COO Brandon Gell and I unpack: - The structural reason automation creates more work for humans, not less. AI makes yesterday's expert competence cheap—which floods the zone with code, writing, and design that's close, but not quite right. That glut drives up demand for experts who can work with AI to create distinctive, memorable work. - Why “AI layoffs” are usually a cover story. ClickUp recently laid off around 22% of its team and blamed AI. Our read: It's an easier explanation than admitting mismanagement, overhiring, or a company that isn't doing well financially. - Why models' exponential benchmark progress isn't what it seems. Once they saturate a benchmark, you can always find a slightly larger frame that zeros the progress out. Humans are indispensable because we operate outside established frames. - Why the need for human judgment only intensifies as we approach AGI. Agents act on behalf of someone—they don't have self-motivated wants. No matter how powerful they get, all the economic and technological forces push AI toward looking back at you and asking, “What should I do next?” - The practical answer to the AI paradox: Ride the models. When new models come out, use them for the work you're already good at—and you'll be more in demand than ever, not despite the automation, but because of it. This is a must-watch for anyone trying to figure out what to do with AI at work—and whether they should be worried. Watch below! Timestamps: 1. Introduction: 00:00:51 2. The AI paradox: more automation, more human work : 00:05:51 3. How AI makes yesterday's expert competence cheap: 00:10:00 4. AI can act autonomously but it does not have agency: 00:18:00 5. Why Dan is all in on AGI : 00:20:39 6. AI layoffs are a lie : 00:21:57 7. Ride the models and you'll be fine : 00:25:42 8. How to use AI as a long-form features editor: 00:35:30

English

5

6

75

8.5K

Every 📧

Keşfet