
Francesco Kirchhoff
11.8K posts

Francesco Kirchhoff
@FrancescoK
Comedian & Comedy club owner. Prog metal bass/guitar. Formerly Program Lead @ https://t.co/E8L5PdN576, ex @MSF, @eHealth_Africa e.a.









We ran GPT-5.4 (xhigh) an additional ten times on Tier 4 to get a pass@10 score. This was 38%. In one of these runs, it solved another problem no model had solved before. This problem was by @nasqret.




Claude Code on desktop can now preview your running apps, review your code, and handle CI failures and PRs in the background. Here’s what's new:


There used to be two playbooks for commercial software: a) be first to market b) make the best product Being first was rarely important, yet so many software companies operate this way. “We must ship by this time next month or we’ll lose.” A shallow way to build, in my opinion. Now with AI tools having gone from “lol nice JavaScript try again” in Jan 2025 to “damn, nice C program, take the wheel” in Jan 2026, there’s only one playbook that remains: make the best product. Now anyone can “compete” with you if being first is your differentiator. So don’t make a hundred products or a hundred features quickly just because you can. Instead leverage this “huge cheap skilled workforce” you now have to build something really good, even if it takes time. You can’t blame timelines for janky scrolling or broken text editing anymore. Build something that’s meaningfully different, something that you can be proud of a decade from now



WAIT ARE WE GETTING NATIVE CODEX? wait for me at least


I've never felt this much behind as a programmer. The profession is being dramatically refactored as the bits contributed by the programmer are increasingly sparse and between. I have a sense that I could be 10X more powerful if I just properly string together what has become available over the last ~year and a failure to claim the boost feels decidedly like skill issue. There's a new programmable layer of abstraction to master (in addition to the usual layers below) involving agents, subagents, their prompts, contexts, memory, modes, permissions, tools, plugins, skills, hooks, MCP, LSP, slash commands, workflows, IDE integrations, and a need to build an all-encompassing mental model for strengths and pitfalls of fundamentally stochastic, fallible, unintelligible and changing entities suddenly intermingled with what used to be good old fashioned engineering. Clearly some powerful alien tool was handed around except it comes with no manual and everyone has to figure out how to hold it and operate it, while the resulting magnitude 9 earthquake is rocking the profession. Roll up your sleeves to not fall behind.




Cursor's new in-editor browser thingy is kinda this but kinda better Now you just need a batteries included web framework that's cohesive across the entire stack (from data persistence to layout components to UI controls) to really fly


Even if current LLM progress hits a brick wall at Opus 4.5 level (and I doubt that will happen) the next 12 months are still going to be a staggering time of change in this industry as decision makers start truly understanding the new reality we live in. obie.medium.com/what-happens-w…
















