Scott Wu
219 posts




maybe the most insane (appreciative) barbell portfolio of all time. respect.

EPISODE 147: The New Era of Software Abundance @JTLonsdale visits @ScottWu46 & @russelljkaplan at @cognition HQ 00:00 Episode intro 01:35 Why technical talent & execution matters in AI 06:10 Do young people have an edge in the AI era? 08:26 Cognition's rapid growth 11:55 The new era of software abundance 14:30 Cognition engineers don't type code anymore 19:20 "Never sleep while Devin is idling" 21:25 The case for AI disinflation 23:50 How Devin generates 12X productivity gains 28:25 Cognition for government / taking on complex, broken systems 36:40 The AI race / competition with Anthropic 39:00 Forward deployed engineers? 43:40 How fast are LLMs improving? 47:10 The AI-led small business explosion


@cognition new post on joining Cognition at it's $10b Series C: The Devin is in the Details swyx.io/cognition

Paid $500 for @DevinAI - liking it so far. You can tell this team is much further than other agent labs when it comes to being truly remote-first. Very mature, advanced tooling and it just works across all surfaces (iPhone, Slack, Browser, GitHub, Linear). I was tired of trying to hand-connect everything with a custom setup of Open Inspect + Codex + Linear. When you look at your hourly effective rate, it stops making sense trying to hand-build all this stuff. Nvm the all the maintenance hours you need to put in. I'll keep using Devin 100% for the next week and report back. So far, my PR shipping velocity is higher than before - so that's good obv.

Paid $500 for @DevinAI - liking it so far. You can tell this team is much further than other agent labs when it comes to being truly remote-first. Very mature, advanced tooling and it just works across all surfaces (iPhone, Slack, Browser, GitHub, Linear). I was tired of trying to hand-connect everything with a custom setup of Open Inspect + Codex + Linear. When you look at your hourly effective rate, it stops making sense trying to hand-build all this stuff. Nvm the all the maintenance hours you need to put in. I'll keep using Devin 100% for the next week and report back. So far, my PR shipping velocity is higher than before - so that's good obv.








Devin can now schedule itself. Run any task once, like feature flag cleanup, release notes, or QA. Then tell Devin to make it recurring, so that one good session becomes an automated workflow. Available now for all users.





We are sharing an early preview of our ongoing SWE-1.6 training run. It significantly improves upon SWE-1.5 while being post-trained on the same pre-trained model - and it runs equally as fast at 950 tok/s. On SWE-Bench Pro it exceeds top open-source models. The preview model still exhibits some undesirable behaviors like overthinking and excessive self-verification, which we aim to improve. We are rolling out early access to a small subset of users in Windsurf.

We are sharing an early preview of our ongoing SWE-1.6 training run. It significantly improves upon SWE-1.5 while being post-trained on the same pre-trained model - and it runs equally as fast at 950 tok/s. On SWE-Bench Pro it exceeds top open-source models. The preview model still exhibits some undesirable behaviors like overthinking and excessive self-verification, which we aim to improve. We are rolling out early access to a small subset of users in Windsurf.




