Sabitlenmiş Tweet
mager
31.9K posts

mager
@mager
Agentic Engineer @Uber 📈 Building @beatbrainxyz @prxpsxyz @loooomxyz ❤️ KLM
Chicago Katılım Ocak 2007
4K Takip Edilen11.1K Takipçiler

@dontbesilent @garrytan I need a skill that gives me da direct window to Garry’s brain
English

1、这是我迄今为止用过的最好的 skill
2、无论我们多么大力宣传这个 skill,大部分都只会围观,最多安装一下,不会真的去在自己的上下文里面去推进
3、所以,这让这个 skill 的使用体验更稀缺了
Garry Tan@garrytan
I just launched /office-hours skill with gstack. Working on a new idea? GStack will help you think about it the way we do at YC. (It's only a 10% strength version of what a real YC partner can do for you, but I assure you that is quite powerful as it is.)
中文
mager retweetledi

Are you building a software factory yet? Start small, and keep iterating mager.co/blog/2026-03-1…
English

@hudaman I created ME.md: loooom.xyz/me
I can just paste this in to any clean Claude session to get instant context: loooom.xyz/me/mager/raw
English

I let AI analyze every first round March Madness matchup 🏀 mager.co/blog/2026-03-1…
English
mager retweetledi

Using Skills well is a skill issue.
I didn't quite realize how much until I wrote this, the best can completely transform how your team works.
Thariq@trq212
English
mager retweetledi

Agentic software engineering adoption is on fire at @Uber. 1,800 code changes per week are now written entirely by Uber's internal background coding agent, and 95% of our engineers now use AI every month across all the tools we track.
This is a real reset moment for engineering; it's one of the most exciting times to lead. This shift requires builders to be curious and hands-on. I’m incredibly lucky to be surrounded by a team that’s doing exactly that.
The best part is that the strongest adoption isn’t being pushed top down from leadership announcements; it’s coming from engineers who are quietly experimenting, quietly shipping, and quietly pushing things forward.
I love spending time with those engineers because there’s no substitute for being close to the work.
Over the last few months, we leaned in hard, and the results have been phenomenal.
The bigger shift: going agentic.
84% of AI users are now working with agent-style workflows, not just tab completion. Claude Code usage nearly doubled in 2 months (32% → 63%), while IDE-based tools have largely plateaued.
Engineers are moving from accepting suggestions to delegating tasks. Even within traditional IDEs, ~70% of committed code is now AI-generated.
Background agents are writing code autonomously.
Our internal background coding agent went from <1% of all code changes to 8% in just a few months. There is zero human authoring. Engineers review and approve, but the code is written entirely by AI agents.
The role of the engineer is shifting - from writing every line to architecting systems and reviewing AI-generated code.
More to come from the @UberEng team in the coming days.
English

Build your own Claude terminal interface in 10 min mager.co/blog/2026-03-1…
English

@RealProductGirl @Lovable Fast design iteration. And you can continue chatting to improve the design. It outputs a working app/prototype.
English

Day 0 of learning to vibe code as a PM with ZERO software background.
Lines of code written in my life: 0
First tool: @Lovable
Documenting every win, fail, and everything in between.
Follow along to see my daily struggles 🚀
Wish me luck Builders! 👊

English
mager retweetledi

nanochat now trains GPT-2 capability model in just 2 hours on a single 8XH100 node (down from ~3 hours 1 month ago). Getting a lot closer to ~interactive! A bunch of tuning and features (fp8) went in but the biggest difference was a switch of the dataset from FineWeb-edu to NVIDIA ClimbMix (nice work NVIDIA!). I had tried Olmo, FineWeb, DCLM which all led to regressions, ClimbMix worked really well out of the box (to the point that I am slightly suspicious about about goodharting, though reading the paper it seems ~ok).
In other news, after trying a few approaches for how to set things up, I now have AI Agents iterating on nanochat automatically, so I'll just leave this running for a while, go relax a bit and enjoy the feeling of post-agi :). Visualized here as an example: 110 changes made over the last ~12 hours, bringing the validation loss so far from 0.862415 down to 0.858039 for a d12 model, at no cost to wall clock time. The agent works on a feature branch, tries out ideas, merges them when they work and iterates. Amusingly, over the last ~2 weeks I almost feel like I've iterated more on the "meta-setup" where I optimize and tune the agent flows even more than the nanochat repo directly.

English






