Alex Bit

1.9K posts

Alex Bit banner
Alex Bit

Alex Bit

@alex__bit

love people & solving hard technical problems. forward-looking. systems thinker. building @Codemod to make code maintenance invisible for all. ex @Meta.

Palo Alto, CA Katılım Haziran 2009
3.4K Takip Edilen1.2K Takipçiler
Alex Bit
Alex Bit@alex__bit·
Optimize user value per line of code. If reviewing AI-generated PRs is becoming a bottleneck, it might be time to ask whether those PRs should exist in the first place. When code generation is cheap, it’s tempting to turn every backlog item into a PR. The real challenge has always been WHAT to build and what NOT to build. What features should be ruthlessly cut? How can we simplify things while delivering more value to users? New features add complexity to both the product and the UX. Products should evolve with their users. More code also means more liability and maintenance. I’d love to see AI products and mindsets shift in this direction, helping builders better understand their users and customers, because that’s where the real bottleneck is.
English
1
0
7
246
Alex Bit
Alex Bit@alex__bit·
Great point on the market size of code maintenance. Specially at enterprise scale. I also want to add that even this new benchmark focuses on the technical aspects of code maintenance. At enterprise scale, the bigger challenge is organizational: how to rally many teams, build confidence in planning, estimate blast radius and risk, make reliable changes, orchestrate tasks across teams, and communicate progress effectively with xfn and leadership, and more. The challenge (and therefore the opportunity and market size) of code maintenance grows as the market for code generation grows. A moving target.
English
0
1
3
564
Aakash Gupta
Aakash Gupta@aakashgupta·
Alibaba burned 10 billion tokens testing 18 AI models across 100 real codebases over 233 days each. The headline going viral: 75% of models break previously working code. The actual story: someone finally built the scoreboard that matters. Every AI coding benchmark until now asked: “Can it fix this bug right now?” SWE-CI tracks 71 consecutive commits across 233 days and asks: “Does it still work after 8 months of real evolution?” Most models scored a zero-regression rate below 25%. Three out of four maintenance cycles, the agent fixes today’s ticket and breaks yesterday’s feature. That gap is the snapshot, not the verdict. Nadella says 30% of Microsoft’s repos are AI-generated. Pichai claims the same for Google. Zuckerberg wants AI writing half of Meta’s code within the year. The code is shipping. The question was never whether AI would write production software. The question was when someone would start measuring the right thing. Gartner forecasts global IT spending above $6 trillion in 2026. Maintenance eats 60-70% of that. Roughly $4 trillion a year spent keeping existing code alive. Every AI coding tool today is optimized for the $2 trillion creation side. The $4 trillion maintenance side just got its first real benchmark. The models will close this gap. That’s the entire point of measuring it. Once you score maintenance, every lab starts training for maintenance. The same pattern played out with SWE-bench: models went from 3% to 70%+ in under two years once there was a leaderboard to chase. SWE-CI is the starting gun, not the funeral. The company that cracks long-term code stability owns the largest budget line in every engineering org on the planet. And that gap is only getting wider until someone does.
Priyanka Vergadia@pvergadia

🤯BREAKING: Alibaba just proved that AI Coding isn't taking your job, it's just writing the legacy code that will keep you employed fixing it for the next decade. 🤣 Passing a coding test once is easy. Maintaining that code for 8 months without it exploding? Apparently, it’s nearly impossible for AI. Alibaba tested 18 AI agents on 100 real codebases over 233-day cycles. They didn't just look for "quick fixes"—they looked for long-term survival. The results were a bloodbath: 75% of models broke previously working code during maintenance. Only Claude Opus 4.5/4.6 maintained a >50% zero-regression rate. Every other model accumulated technical debt that compounded until the codebase collapsed. We’ve been using "snapshot" benchmarks like HumanEval that only ask "Does it work right now?" The new SWE-CI benchmark asks: "Does it still work after 8 months of evolution?" Most AI agents are "Quick-Fix Artists." They write brittle code that passes tests today but becomes a maintenance nightmare tomorrow. They aren't building software; they're building a house of cards. The narrative just got honest: Most models can write code. Almost none can maintain it.

English
23
28
229
50.2K
Alex Bit
Alex Bit@alex__bit·
And even the best benchmarks out there focus on the technical aspects of code maintenance. At enterprise scale, the bigger challenge is organizational: how to rally many teams, build confidence in planning, estimate blast radius and risk, make reliable changes, orchestrate tasks across teams, and communicate progress effectively with xfn and leadership, and more.
Aakash Gupta@aakashgupta

Alibaba burned 10 billion tokens testing 18 AI models across 100 real codebases over 233 days each. The headline going viral: 75% of models break previously working code. The actual story: someone finally built the scoreboard that matters. Every AI coding benchmark until now asked: “Can it fix this bug right now?” SWE-CI tracks 71 consecutive commits across 233 days and asks: “Does it still work after 8 months of real evolution?” Most models scored a zero-regression rate below 25%. Three out of four maintenance cycles, the agent fixes today’s ticket and breaks yesterday’s feature. That gap is the snapshot, not the verdict. Nadella says 30% of Microsoft’s repos are AI-generated. Pichai claims the same for Google. Zuckerberg wants AI writing half of Meta’s code within the year. The code is shipping. The question was never whether AI would write production software. The question was when someone would start measuring the right thing. Gartner forecasts global IT spending above $6 trillion in 2026. Maintenance eats 60-70% of that. Roughly $4 trillion a year spent keeping existing code alive. Every AI coding tool today is optimized for the $2 trillion creation side. The $4 trillion maintenance side just got its first real benchmark. The models will close this gap. That’s the entire point of measuring it. Once you score maintenance, every lab starts training for maintenance. The same pattern played out with SWE-bench: models went from 3% to 70%+ in under two years once there was a leaderboard to chase. SWE-CI is the starting gun, not the funeral. The company that cracks long-term code stability owns the largest budget line in every engineering org on the planet. And that gap is only getting wider until someone does.

English
0
0
2
649
Alex Bit retweetledi
Matteo Collina
Matteo Collina@matteocollina·
.@nodejs has always been about I/O. Streams, buffers, sockets, files. But there's a gap that has bugged me for years: you can't virtualize the filesystem. You can't import a module that only exists in memory. You can't bundle assets into a Single Executable without patching half the standard library. That changes now 👇
Matteo Collina tweet media
English
51
267
2.6K
354.5K
Alex Bit retweetledi
Morad Vaisi
Morad Vaisi@RezaVaisi·
@elonmusk Dear Elon, In these critical days, the people of Iran need internet access. Please help them stay connected as they struggle to reclaim their country from the rule of the mullahs. Stand with the Iranian people. Iran will not forget its friends.
McNair, VA 🇺🇸 English
20K
69.6K
80.8K
2M
Alex Bit retweetledi
Ujjwal Chadha
Ujjwal Chadha@ujjwalscript·
Unpopular Opinion: We aren't building the future 10x faster with AI. We are just generating legacy code 10x faster. Everyone is currently bragging about developer velocity. "I built this entire backend in a weekend!" "AI wrote 80% of my codebase!" But here is the reality check we are ignoring: Code is a liability, not an asset. If an AI tool spits out 1,000 lines of functional boilerplate in five seconds, that is still 1,000 lines that a human being has to read, review, secure, and maintain when the dependencies inevitably break next year. We are treating code generation like a pure productivity win, but we are optimizing for the wrong metric. The bottleneck in software engineering was never how fast we could type. The bottleneck has always been comprehension, architecture, and maintenance. If we don't shift our focus from "generation speed" to "architectural sanity," the tech debt of the next five years is going to be an absolute, unmaintainable nightmare.
English
400
538
4.2K
634.6K
Alex Bit retweetledi
Reza Pahlavi
Reza Pahlavi@PahlaviReza·
Whether it was Mojtaba Khamenei or his IRGC handlers, the first statement from the current regime’s new leader confirms what the Iranian people have known all along: the Islamic Republic is incapable of change. Terror and repression are in its DNA. This latest statement promises only to continue the same insatiable bloodlust and chaos that has held our nation and the world hostage for decades. By explicitly threatening our Arab neighbors with continued military strikes and doubling down on the regime’s proxy terror networks, the new Ayatollah sounds just like the old Ayatollah. We cannot appease or negotiate with a barbaric dictatorship that views the occupation of our nation and the destruction of its neighbors as a "divine" mandate. It’s long past time to turn the page. Don’t throw this regime a lifeline. The only resolution to five decades of aggression is to end this regime once and for all. Stand with the Iranian people in getting rid of this menace. Together, we can build a Middle East our children will be proud to inherit.
English
2.5K
10.6K
27.4K
492.5K
Alex Bit
Alex Bit@alex__bit·
thinking about building a new saas app and business? you should check out @makerkit_dev! @gc_psk deeply cares about devx. he uses @codemod to help devs install plugins (such as google analytics, posthog, umami, signoz, etc.) with ease. check out the 12-min technical walkthrough with screen sharing and code demos below 👇🏼
Alex Bit tweet media
English
2
3
9
677
Alex Bit retweetledi
Makerkit - React SaaS Starter Kits
Thanks for letting us tell our story, @alex__bit 🙏 It was great chatting with you The plugin DX problem is one I've wanted to solve for a long time Thanks to @codemod, we finally did it!
Alex Bit@alex__bit

thinking about building a new saas app and business? you should check out @makerkit_dev! @gc_psk deeply cares about devx. he uses @codemod to help devs install plugins (such as google analytics, posthog, umami, signoz, etc.) with ease. check out the 12-min technical walkthrough with screen sharing and code demos below 👇🏼

English
0
1
2
273
Alex Bit retweetledi
kooshiar
kooshiar@kooshiar·
Secretary @marcorubio Iranian immigrants have long contributed to the U.S. economy, especially in STEM, medicine and AI. Most strongly oppose the Islamic regime and support the goal of a strong and secure USA under President Trump and your leadership. Over the past few months, many have faced serious hardship due to the USCIS hold. They are here legally and often hold advanced degrees. Returning them to Iran would place them in danger and risk sending highly trained experts in strategic fields back to the Iranian regime. We would greatly appreciate it if you could revisit this decision. Thank you for your consideration.
English
1.3K
5.2K
10.2K
252K
Alex Bit retweetledi
Alex Bit
Alex Bit@alex__bit·
or just: npx codemod@latest css-design-token-mining
English
0
1
5
331
Luke
Luke@naamukim·
@alex__bit Thanks for sharing this! The API looks like it could provide a really nice developer experience. I just starred the repo and I’ll give it a try sometime.
English
1
0
1
12
Luke
Luke@naamukim·
jscodeshift codemods, but 8x faster. Same syntax, simpler API. Most codemod tools aren't really enterprise-ready yet this is my attempt to change that. Still early days, so any feedback or contributions would mean the world to me. github.com/NaamuKim/zmod
Luke tweet media
English
1
0
2
170