John Rush

@john_b_rush

Seattle Katılım Şubat 2014

259 Takip Edilen166 Takipçiler

John Rush@john_b_rush·23 Şub

1.6 billion tokens and 11,345 receipts to find out what I've spent on eggs since 2001. john-rush.com/posts/eggs-25-…

English

18.6K

John Rush@john_b_rush·12 Şub

@thsottiaux It’d be great if the app had an option to automatically open each new conversation in its own Git worktree. I was surprised a few parallel chats were all writing straight to main.

English

Tibo@thsottiaux·10 Şub

What could we do better on Codex? App, model, strategy and features… what’s wrong in how we approach things that we should improve immediately?

English

1.2K

947

101.2K

John Rush@john_b_rush·1 Şub

I've been thinking about what actually changes when AI makes local iteration cheap. Your implementation team can fix something in an afternoon now. But if it still has to become a ticket, you’re just running the old system faster. The move is letting them ship it - and making it easy for everyone else to reuse and improve what works. That’s when capability starts compounding. john-rush.com/posts/compound…

English

199

John Rush@john_b_rush·5 Oca

*Agent-Streams: A Skeptical Overseer for Long-Running Coding Agents* Overnight coding agents are fantastic — until “DONE” means a TODO, disabled tests, or a stubbed integration. What’s worked for me: treat DONE as a claim, not a fact, and add a skeptical overseer with fresh context. The loop: - Spec is the trust boundary - Builder implements and declares DONE - Overseer reviews spec + diff, runs checks, and can only output: ISSUES.md or APPROVED - Merge only on APPROVED (otherwise iterate) This pattern has dramatically reduced false-done merges for long-running agents. You can check out my implementation of the pattern here: john-rush.com/posts/agent-st…

English

131

John Rush@john_b_rush·25 Eyl

gpt-5-codex is pretty incredible.

English

167

John Rush@john_b_rush·3 Tem

@iocapon Some automation, but I’m still figuring out when that’s useful. Typically I’m using goose and Claude code and writing markdown files to pass between them. I stay in the terminal and with git I always have the history of the markdowns.

English

Tika@iocapon·3 Tem

i really love this approach and it's something that i've been wanting to do for a few months now. i'm curious whether this stuff is automated or if you copy+paste out of different providers (go to chatgpt, then claude code etc..) or if you have managed to automate this another way

English

John Rush@john_b_rush·1 Tem

Current workflow: multiple claude code tabs, each on its own git worktree. o3 clarifies & writes the plan, sonnet-4 implements, o3 + sonnet audit. If something’s off I adjust the plan/prompt - not the generated code. Disposable outputs, compounding inputs. Link in thread

English

841

John Rush@john_b_rush·3 Tem

@felciano Yes, each macro unit of change. Basically a shippable unit of work I’d open a PR for Our current code base is a large monorepo with 45M+ tokens so I’m not able to regenerate it in one go, yet.

English

Ramon Felciano@felciano·3 Tem

Thanks for the write up. It sort of sounds like you rebuild the whole thing every time to find and “fix” a mistake by updating the specs (“inputs”). But presumably you don’t do that for the entire app/codebase, right? Do you do this cycle for each macro unit-of-change (e.g. new feature or enhancement)?

English