Beyang

6.8K posts

Beyang banner
Beyang

Beyang

@beyang

Building @ampcode @sourcegraph

San Francisco, CA เข้าร่วม Eylül 2008
1.9K กำลังติดตาม11.8K ผู้ติดตาม
Aaron Francis
Aaron Francis@aarondfrancis·
@beyang I bet a caveman would loooove a dump truck can you imagine
English
1
0
1
385
Beyang
Beyang@beyang·
Something like this study has been published for every generation of LLMs (from autocomplete to RAG-chat to agents). A few things continue to hold true: - New tools require new training. You don't plop a caveman into a dump truck and expect good results. - That's not to absolve the tool builders from all responsibility—good tools guide the user to good outcomes. - That being said, there's going to be large differences between the top 0.1% most effective (not necessarily the highest token consumer) users and the median user. - New tech creates new problems. The question is whether it is better to embrace new tech and figure out how to mitigate the problems or to avoid altogether.
Charles 🎉 Frye @ GTC@charles_irl

We find that the adoption of Cursor leads to a statistically significant, large, but transient increase in project-level development velocity, along with a substantial and persistent increase in static analysis warnings and code complexity. arxiv.org/abs/2511.04427

English
4
2
20
4.1K
Beyang
Beyang@beyang·
Yes that is all true, but what I'm saying is that where we're going is gonna make how folks currently do things (agile, xp, whatever you wanna call it) look like waterfall in hindsight... Maybe someone will come up with a fancy zeitgeist-y name for the new thing, but perhaps it won't even have a name, since it feels quite possible that we will do the full circle as one person with one tool.
English
1
0
1
71
Péter Szilágyi
Péter Szilágyi@peter_szilagyi·
@beyang Ser, that was called “waterfall” model, it’s like 30+ year old and even it’s introductoy paper concluded that it’s not functional. For the past 20+ years everybody was doing itwrative agile, which is exactly what you just proposed as a novelty.
English
1
0
0
238
Beyang
Beyang@beyang·
Historically, the process of making software flowed from design to implementation (never mind the fact that some of the greatest software broke this rule). With agents, it feels more necessary than ever to have a system that permits quick two-way iteration.
Beyang tweet media
English
4
0
14
1.4K
Beyang
Beyang@beyang·
Mission statement: agents that maximize human agency
English
7
2
29
1.3K
Beyang
Beyang@beyang·
This behavior seems to map roughly to the intuition I have for the recent GPT vs Claude model families in coding. The latest GPT models seem quite good at reasoning and complex planning where latest Claude now feels (relatively to GPT) like it’s more narrowly pattern matching within a lower dimensional sub space of code (most notably frontend web dev for me). Within that subspace, Claude is better, but everything else, GPT (deep mode in Amp, recommend with ^3 reasoning).
Lisan al Gaib@scaling01

GPT-5.4 is special LisanBench: GPT-5.4 vs Opus 4.6 vs Gemini 3.1 Pro

English
5
1
42
8.4K
Beyang
Beyang@beyang·
@wtsnz Chief Garden Officer
English
0
0
2
203
Beyang
Beyang@beyang·
Thinking about this Feynman quote and the fractal beauty of the world and how it's turtles all the way down and turtles all the way up, and how we're all born turtle appreciators.
Beyang tweet media
English
4
0
16
828
Beyang
Beyang@beyang·
The delight of novel experiences will be shadowed by nostalgia for what we leave behind. It’s normal. Pour one out for the Old Ways, then move on. Figure out the new world. Do things. The New Way is learned through doing. There will be highs and there will be lows. Great movements are built on emotion, not always positive ones. There will be those who lean into the negative, and the world has no shortage of demagogues who will gladly trade you more cortisol for influence and power over you. You decide which path you want for yourself. A big part of that is choosing your community. For me, I want to be among those who are leaning into the delight and joy and wonder of it all. Someday, you'll tell your grandkids: I was there when the world changed. And I set my hand to the stone to build the one you’re living in now.
Mo@atmoio

I was a 10x engineer. Now I'm useless.

English
1
4
22
2.8K
Beyang
Beyang@beyang·
As Rob Pike originally noted, it's better to think of your codebase as something grown, not built. This will only become truer in the age of AI. Learn how to use the machines and then tend your garden.
English
0
8
78
3.4K
Beyang
Beyang@beyang·
Endorse. The white pill here is that in the before times, every human dev was neck deep in the code mines, so only a small amount of attention could focus on the architecture and systems design. Hence the state of legacy codebases. Now, clean code feels within reach!
Mario Zechner@badlogicgames

your biggest enemy is still complexity. it's also your agent's biggest enemy. but it has no holistic view of your code base, so it keeps adding complexity. and you think that's how it's supposed to be, because the clanker shat it out, and you don't know the stack. glhf!

English
3
3
47
7.6K
Awili
Awili@Awiliii·
@beyang it'd be great if you can share how you build and ship Amp! It's one thing I've been wishing you guys shared more on! Will definitely help us make a lot more sense on why you're building it the way it is and for us users to take inspiration on
English
1
0
0
316
Beyang
Beyang@beyang·
Fifteen bucks for a quality review is cheap if you value your time. That being said, Amp pushes to main.
English
5
3
67
6.9K
Beyang
Beyang@beyang·
@boyter Oh, I still verify all changes. I just don't have to push it up to an old fashioned PR UI to do so :)
English
2
0
2
254
boyter
boyter@boyter·
@beyang YOLO. I’m still a coward and branch first, but that’s just because I don’t trust the models enough yet.
English
1
0
1
326
Beyang
Beyang@beyang·
It's interesting how the evolutionary path of the GPT and Claude model families influenced their relative strengths in coding today. Claude was first to robust tool use, whereas GPT opted to focus on reasoning. So the failure mode of Claude is overeagerness to use tools without thinking things through, while the primary complaint with GPT is waiting for it to think. Initially, robust tool use mattered more, but now it seems OAI's bet on reasoning has paid off, at least in my experience. Remains to be seen how easily Claude can catch up here.
English
5
3
37
4K
Beyang
Beyang@beyang·
@udaysy Amp stores all threads and looking at the thread that yielded the diff gives a pretty strong indicator of the intent.
English
1
0
2
203
Uday Yatnalli
Uday Yatnalli@udaysy·
@beyang how do you review diffs you didnt write tho? reviewing your own ai output is one thing. reviewing someone elses ai output where you dont know the intent is way harder
English
1
0
0
201
Beyang
Beyang@beyang·
The key skill for building fast is becoming the ability to review quickly. To spot the key points in a thousand-line diff and deduce the targeted updates needed to make the patch mergeable.
English
8
4
63
5.3K
Beyang
Beyang@beyang·
@kr0der Experimentation is ongoing. The case for switching the oracle was much more obvious to us. But stay tuned!
English
0
0
1
82
Anthony
Anthony@kr0der·
@beyang are you guys seeing 5.3 codex outperform 5.4 at coding?
English
1
0
0
133