
Tomas Marek
23 posts

Tomas Marek
@tomasmarekk
Researching LLMs and agentic workflows | AI Driven Development | also official @Webflow Partner & ex-FE developer.



@NoahEpstein_ models auth login --provider anthropic --method cli --set-default




It’s honestly crazy to watch how hard people are vendor-locked into Claude Code, even after months of getting treated like crap by it. I’m not saying Opus or Sonnet are bad models. But I’m in SWE circles, and the reality is that most serious engineers have been using Codex and the GPT family since December 2025. I used Claude Code all through 2025 because for a while it really was on the edge. But that changed at the end of 2025. And the benchmarks that actually matter have been showing that pretty clearly ever since. 5.2-codex, 5.3-codex, and now 5.4-high are just a tier above Claude models on stuff like Terminal Bench and LiveBench. Even the Claude Code harness is bad enough to drag it down to #39 on Terminal Bench, while Codex is sitting at #8. Since December 2025, basically every metric has been pointing toward OAI. - better limits - smarter models - way better public communication from the devs Are there downsides? Sure. - It’s a lot slower. But it also does the work better, so over time it saves you effort. - It’s not some AI normie companion that just pats you on the head and says yes to everything. You actually have to know how to explain what you want and what you’re trying to do. So yeah, it’s just sad watching so many people stay irrationally locked into the worse option while it keeps screwing them over. Give switching a shot. It’s worth it. And I’ll be the first one to go back to Claude Code the moment it’s actually on the edge again. But it just hasn’t been since December.

Thank you to everyone who spent time sending us feedback and reports. We've investigated and we're sorry this has been a bad experience. Here's what we found:

I don’t get this post. All users are saying they're reaching their rate limits faster than before. Anthropic investigated and said: we found nothing, but hey, why not use Sonnet instead of Opus because Opus uses about twice the bandwidth?






I noticed something interesting: Claude Code auto-adds itself as a co-author on every git commit. Codex doesn’t. That’s why you see Claude everywhere on GitHub, but not Codex. I wonder why OpenAI is not doing that. Feels like an obvious branding strategy OpenAI is skipping.

GPT-5.4 > Opus 4.6 And Google still doesn't have anything even remotely competitive.




Over 1.5 million people have reportedly left ChatGPT.



I am not sure if other developers feel like this. But I feel kinda depressed. Like everyone else, I have been using Claude code (for a while, it’s not a recent thing lol). And it’s incredible. I have never found coding more fun. The stuff you can do and the speed you can do it at now. Is absolutely insane. And I’m using it to ship a lot. And solve customer problems faster. So all around it’s a win. But at the same time. The skill I spent 10,000s of hours getting good at. Programming. The thing I spent most of my life getting good at. Is becoming a full commodity extremely quickly. As much fun as it is. And as much as I like using the tools. There’s something disheartening about the thing you spent most of your life getting good at. Now being mostly useless.




Offshore Oil Rig Dashboard.










