CodeWithStu
3.4K posts

CodeWithStu
@CodeWithStu
Principal Engineer building in public. Previously CTO @ Illuvium, Principal @ Checkout dot com, Lead Architect @ ClearBank
London, England · Joined July 2016
1.2K Following · 1.1K Followers

@devadam88 So many hybrid roles at the moment not looking forward to this part again :(

Did you know you can now connect your GA data directly to LLMs like Gemini? ✨📊
By setting up the new Model Context Protocol (MCP) server, you move from manual reporting to strategic, AI-powered analysis.
▶️ Hit play to see the complete setup process → goo.gle/47CsGrA


@altryne I don’t think they’re ignoring the bugs to get more cash. More likely is that it’s buried in a ton of other noise/feedback. This bug actively hurts their infra as well because it’s more to process.

PSA: If you've been running out of Claude session quotas on Max tier, you're not alone. Read this.
Some insane Redditor reverse engineered the Claude binaries with MITM to find 2 bugs that could have caused cache-invalidation. Tokens that aren't cached are 10x-20x more expensive and are killing your quota.
If you're using your API keys with Claude this is even worse. This is also likely why this isn't uniform, while over 500 folks replied to me and said "me too", many (including me) didn't see this issue.
There are 2 issues that are compounded here (per Redditor, I haven't independently confirmed this) :
The 1st bug he found is a string replacement bug in Bun that invalidates the cache. Apparently this has to do with the custom @bunjavascript binary that ships with the standalone Claude CLI.
The workaround there is to use Claude with `npx @anthropic-ai/claude-code`
The 2nd bug is worse: he claims that --resume always breaks the cache. There doesn't seem to be a workaround, except pinning to a very old version (which will miss out on tons of features).
This bug is also documented on Github and confirmed by other folks.
I won't entertain the conspiracy theories that Anthropic "chooses" to ignore these bugs because it gets them more $$$. They actively benefit from everyone hitting as many cached tokens as possible, so this is absolutely a great find, and it aligns with my earlier thoughts.
The very sudden spike in reporting for this, the non-uniform nature (some folks are completely fine, some folks are hitting quotas after saying "hey") definitely points to a bug.
cc @trq212 @bcherny @_catwu for visibility in case this helps all of us.

Alex Volkov@altryne
My feed is showing me a bunch of folks who tapped out their whole usage limits on Mon/Tue. Is this your experience? Please comment, I want to understand how widespread this is

A DynamoDB migration took down our game for thousands of players.
700+ Lambdas & 1 Dynamo table. One migration caused an outage.
I built a pattern we dubbed ActionRunner to fix it. View the full breakdown: youtu.be/iRaqyrAFHFg

YouTube

@Aaronontheweb @dariogriffo Assuming this will affect people that run a lot of tests locally with TestContainers etc

@Aaronontheweb @dariogriffo Just saw this part as well. Not sure what the fair use is, but why is something in my CI subject to a fair use?


@dariogriffo This is literally the best way to annoy your community in one go. If there was an alternative I would switch

@CodeWithStu From today they enforced the licensing model.
Kinda... For one more month you can still set a variable to bypass the check.

Wrote up some of the performance techniques that I've been using in my #dotnet #aws library if you're interested in that sort of thing :)
codewithstu.tv/thoughts/dotne…

@edandersen Yeah that’s entirely possible as well. Is there a world anymore where prices come down? lol

@CodeWithStu Or he is telegraphing that token costs are going up 10x

Software engineers will not be trusted to spend 50% of their salary on variable opex costs with no guarantee of productivity, unless they are executive level. this is a pipe dream to sell GPUs
TFTC@TFTC21
Jensen Huang: "If that $500,000 engineer did not consume at least $250,000 worth of tokens, I am going to be deeply alarmed. This is no different than a chip designer who says 'I'm just going to use paper and pencil. I don't think I'm going to need any CAD tools.'"
CodeWithStu retweeted

To put this in perspective, that’s 200 million tokens used per day, every working day of the year. Definitely possible, but it requires an extraordinary amount of thinking to know what to develop, in addition to using tokens to debug logs etc. (e.g. plugging directly into AWS CloudWatch). I think most people are taking it to the extreme though. I feel he’s talking more about productivity than the actual number, despite mentioning one.
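
A rough sanity check of the arithmetic behind the 200 million tokens per day figure, as a minimal sketch. The $250,000 budget and the 200 million daily tokens come from the posts here; the 250 working days per year and the function name are assumptions made for illustration, and the resulting price is only what those numbers imply, not a quoted rate.

```python
def implied_price_per_million(annual_budget_usd: float,
                              tokens_per_day: float,
                              working_days: int) -> float:
    """Blended USD price per million tokens implied by a yearly budget."""
    daily_budget = annual_budget_usd / working_days   # USD available per working day
    millions_per_day = tokens_per_day / 1_000_000     # daily usage in millions of tokens
    return daily_budget / millions_per_day


if __name__ == "__main__":
    # 250 working days is an assumption, not a figure from the posts.
    price = implied_price_per_million(250_000, 200_000_000, 250)
    print(f"Implied blended price: ${price:.2f} per million tokens")
```

Under those assumptions the budget works out to $1,000 per working day, so a different blended token price or working-day count shifts the daily token figure proportionally.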

I use plan mode first and don’t accept a plan without a test plan. I also run every plan through the sequential-thinking MCP before I even look at it (agent instruction). Then I have explicit agent instructions for medium/high-risk changes to spin up separate agents that do code review and security review in parallel (this way the reviewers don’t carry the execution context window). The output is so much better for this, and it catches a ton of bugs before I even see anything.

@RealProductGirl Yes exactly. My wife was sending todo lists via whatsapp, and i was struggling to remember some things like when the MOT/service was due. Hopefully it helps others :)

@CodeWithStu I love this! I'm assuming you came up with this to organize your household? I think it's awesome!

@TeeDevh It’s amazing how many people forget about Bing Webmaster Tools.
Structured data markup is a must (e.g. JSON-LD)
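
For anyone unfamiliar with JSON-LD, here is a minimal sketch of what a structured data snippet could look like, generated with Python. The helper name, the site name, and the URL are hypothetical placeholders; `WebSite`, `name`, and `url` are schema.org vocabulary, but which type fits a given page is a judgment call this sketch does not make.

```python
import json


def json_ld_script(name: str, url: str) -> str:
    """Return a <script> tag holding a schema.org WebSite description."""
    data = {
        "@context": "https://schema.org",
        "@type": "WebSite",
        "name": name,   # placeholder value supplied by the caller
        "url": url,     # placeholder value supplied by the caller
    }
    # Assumes trusted input: values are not escaped for embedding in HTML.
    return (
        '<script type="application/ld+json">\n'
        + json.dumps(data, indent=2)
        + "\n</script>"
    )


if __name__ == "__main__":
    print(json_ld_script("Example Site", "https://example.com/"))
```

The resulting tag would typically go in the page's `<head>`.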

My SEO setup so far:
• sitemap.xml submitted and cleaned up
• robots.txt configured
• Google Search Console connected
• Bing Webmaster Tools connected
• Meta titles and descriptions set
• Canonical issues fixed
• Duplicate/redirect issues cleaned up
• www → non-www 301 redirect fixed
• Cloudflare DNS / SSL issue resolved
• Internal links cleaned up
• Waiting on Google recrawl + validation
What else should I add?