
If you've been wondering what I've been up to, the team and I have been cooking up something new. A new agent-native development environment deeply integrated with the GitHub graph. Not just for writing code, but all of the meta-work as well.
Ron Clabo
1.6K posts

@dotnetcore
Tweets about software development: .NET, AI, ASPNET Core, C#, Agentic Coding, & LuceneNET. Top 1% on StackOverflow. I Love learning & helping others.

If you've been wondering what I've been up to, the team and I have been cooking up something new. A new agent-native development environment deeply integrated with the GitHub graph. Not just for writing code, but all of the meta-work as well.


GPT-5.5 is ~39% cheaper than Opus 4.7, across merged PRs bucketed by diff size in Inspect despite the higher output token cost, 5.5 is cheaper for input tokens (cache writes are free), more token efficient, and tokenizes the same text to fewer tokens

We just launched a free, open-source course on GitHub Copilot CLI. Learn how to ship faster with AI powered command line workflows—context, custom agents, skills, MCP servers, and more. Hands on. Practical. Built for devs. Check it out: github.com/github/copilot…


gpt-5.4 has ramped faster than any other model we've launched in the API: within a week of launch, 5T tokens per day, handling more volume than our entire API one year ago, and reaching an annualized run rate of $1B in net-new revenue. it's a good model, try it out!




This is wild 🤯 My 5-year-old just built his FIRST GAME ever with Codex using only dictation mode. He wanted a game where you type in a word and transform into any character you want to be ✨ so he dictated it to Codex. Then he upgraded it to a game where that character jumps and collects coins. Hello World to the youngest AI engineer ever❤️🔥 @OpenAI @OpenAIDevs Codex needs an interactive voice mode ASAP




Shifting structures in a software world dominated by AI. Some first-order reflections (TL;DR at the end): Reducing software supply chains, the return of software monoliths – When rewriting code and understanding large foreign codebases becomes cheap, the incentive to rely on deep dependency trees collapses. Writing from scratch ¹ or extracting the relevant parts from another library is far easier when you can simply ask a code agent to handle it, rather than spending countless nights diving into an unfamiliar codebase. The reasons to reduce dependencies are compelling: a smaller attack surface for supply chain threats, smaller packaged software, improved performance, and faster boot times. By leveraging the tireless stamina of LLMs, the dream of coding an entire app from bare-metal considerations all the way up is becoming realistic. End of the Lindy effect – The Lindy effect holds that things which have been around for a long time are there for good reason and will likely continue to persist. It's related to Chesterton's fence: before removing something, you should first understand why it exists, which means removal always carries a cost. But in a world where software can be developed from first principles and understood by a tireless agent, this logic weakens. Older codebases can be explored at will; long-standing software can be replaced with far less friction. A codebase can be fully rewritten in a new language. ² Legacy software can be carefully studied and updated in situations where humans would have given up long ago. The catch: unknown unknowns remain unknown. The true extent of AI's impact will hinge on whether complete coverage of testing, edge cases, and formal verification is achievable. In an AI-dominated world, formal verification isn't optional—it's essential. The case for strongly typed languages – Historically, programming language adoption has been driven largely by human psychology and social dynamics. A language's success depended on a mix of factors: individual considerations like being easy to learn and simple to write correctly; community effects like how active and welcoming a community was, which in turn shaped how fast its ecosystem would grow; and fundamental properties like provable correctness, formal verification, and striking the right balance between dynamic and static checks—between the freedom to write anything and the discipline of guarding against edge cases and attacks. As the human factor diminishes, these dynamics will shift. Less dependence on human psychology will favor strongly typed, formally verifiable and/or high performance languages.³ These are often harder for humans to learn, but they're far better suited to LLMs, which thrive on formal verification and reinforcement learning environments. Expect this to reshape which languages dominate. Economic restructuring of open source – For decades, open-source communities have been built around humans finding connection through writing, learning, and using code together. In a world where most code is written—and perhaps more importantly, read—by machines, these incentives will start to break down.⁴ Communities of AIs building libraries and codebases together will likely emerge as a replacement, but such communities will lack the fundamentally human motivations that have driven open source until now. If the future of open-source development becomes largely devoid of humans, alignment of AI models won't just matter—it will be decisive. The future of new languages – Will AI agents face the same tradeoffs we do when developing or adopting new programming languages? Expressiveness vs. simplicity, safety vs. control, performance vs. abstraction, compile time vs. runtime, explicitness vs. conciseness. It's unclear that they will. In the long term, the reasons to create a new programming language will likely diverge significantly from the human-driven motivations of the past. There may well be an optimal programming language for LLMs—and there's no reason to assume it will resemble the ones humans have converged on. TL; DR: - Monoliths return – cheap rewriting kills dependency trees; smaller attack surface, better performance, bare-metal becomes realistic - Lindy effect weakens – legacy code loses its moat, but unknown unknowns persist; formal verification becomes essential - Strongly typed languages rise – human psychology mattered for adoption; now formal verification and RL environments favor types over ergonomics - Open source restructures – human connection drove the community; AI-written/read code breaks those incentives; alignment becomes decisive - New languages diverge – AI may not share our tradeoffs; optimal LLM programming languages may look nothing like what humans converged on ¹ x.com/mntruell/statu… ² x.com/anthropicai/st… ³ wesmckinney.com/blog/agent-erg… ⁴ #issuecomment-3717222957" target="_blank" rel="nofollow noopener">github.com/tailwindlabs/t…

I wanted to understand how GPT works, so I ported Karpathy's microgpt.py to C# from scratch. No frameworks and NuGet packages, just plain math in ~600 lines of code. It builds a tiny GPT that learns from 32K human names and invents new ones. Every piece is there: autograd, attention, Adam optimizer, the works. Just at a scale you can actually sit down and read. I also wrote a prerequisites guide that walks through all the math and ML you need, starting at a high school level. If you've ever wanted to peek under the hood of ChatGPT without drowning in linear algebra textbooks, this might help. github.com/milanm/AutoGra…

My biggest takeaways from @sherwinwu: 1. AI is writing virtually all code at OpenAI. 95% of the engineers use Codex, and engineers who embrace these tools open 70% more pull requests than their peers, and that gap is widening over time. 2. The role of a software engineer is shifting from writing code to managing fleets of AI agents. Many engineers now run 10 to 20 parallel Codex threads, steering and reviewing rather than writing code themselves. 3. The average PR code review time has dropped from 10-15 minutes per PR to 2-3 minutes. Every pull request at OpenAI is now reviewed by Codex before human eyes see it, and Codex surfaces suggestions and catches issues up front. This allows engineers to focus on more creative and strategic work while dramatically increasing productivity. 4. The models will eat your scaffolding for breakfast. When building AI products, don’t optimize for today’s model capabilities. The field is evolving so rapidly that the scaffolding (vector stores, agent frameworks, etc.) that seems essential today may be obsolete tomorrow as models improve. 5. Build for where the models are going, not where they are today. The most successful AI startups build products that work at 80% capability now, knowing the next model release will push them over the line. 6. Top performers become disproportionately more productive with AI tools. AI tools amplify the productivity of high-agency individuals, so the gap between top performers and everyone else is widening. The ROI on unblocking and empowering your best people compounds faster than ever in an AI-augmented environment. 7. Most enterprise AI deployments have negative ROI because they’re top-down mandates without bottom-up adoption. Success requires both executive buy-in and grassroots enthusiasm. Sherwin recommends creating a “tiger team” of technically-minded enthusiasts (often not engineers) who can explore capabilities, apply AI to specific workflows, and create excitement throughout the organization. 8. The one-person billion-dollar startup is coming, but with unexpected second-order effects. As AI makes individuals more productive, we’ll see not just billion-dollar solo founders but an explosion of small businesses: hundreds of $100M startups and tens of thousands of $10M startups. This will transform the startup ecosystem and venture capital landscape. 9. Business process automation is an underrated AI opportunity. While Silicon Valley focuses on knowledge work, most of the economy runs on repeatable business processes with standard operating procedures. There’s massive potential to apply AI to these workflows, which are often overlooked by the tech community. 10. The next two to three years will be the most exciting in tech history. After a relatively quiet period from 2015 to 2020, we’re now in an unprecedented era of innovation. Sherwin encourages everyone to engage with AI tools and not take this moment for granted, as the pace of change will eventually slow. 11. AI models will soon handle multi-hour tasks coherently. Today’s models are optimized for tasks that take minutes, but within 12 to 18 months we’ll see models that can work on complex tasks for upward of six hours. This will enable entirely new categories of products and workflows. 12. Audio is the next frontier for multimodal AI. While coding and text get most of the attention, audio is hugely underrated in business settings. Improvements in speech-to-speech models over the next 6 to 12 months will unlock significant new capabilities for business communication and operations.

I resigned from xAI today. This company - and the family we became - will stay with me forever. I will deeply miss the people, the warrooms, and all those battles we have fought together. It's time for my next chapter. It is an era with full possibilities: a small team armed with AIs can move mountains and redefine what's possible. Thank you to the entire xAI family. Onward. 🚀 And to Elon @elonmusk - thank you for believing in the mission and for the ride of a lifetime.
