akira
@realmcore_
Making an autonomous swe • @0xrandomlabs Incepto ne desistam Pax aeternum Memento Mori

Interesting article on treating agent output like compiler output (and why) skiplabs.io/blog/codegen_a…


@JoshPurtell especially for multi-agent yes!! here raw intelligence !== output quality at all

you can refactor anything in two weeks



5.3 codex xhigh is significantly better than 5.4 xhigh for coding. It's genuinely nuts




gpt 5.4: How do you guys get this model to not do random algorithmic garbage and just write straightforward procedural code? I do not see why it should be this hard for a model to write code like a first-year college student. probably skill issue tbh


none of this is true btw. claude code still sucks and crashes often, google is notorious for dropping projects, openai has a trail of abandoned ideas, and then you've got vendor lock-in to worry about on top of that. there's plenty of room, same as always

the window for experimenting with llms has basically closed now. the megacorps have fully hit escape velocity and are shipping new products and new features daily. the shift is that they’re not just shipping llms anymore, they’re using llms to build products and improve existing ones at scale. the wild west era of llms isn’t really the wild west anymore. a year ago, this could’ve been an indie dev side project, maybe even a monetizable product. it was literally so easy that the only real bottleneck was your free time. now, whatever idea you have, you should basically assume google/anthropic/oai will build some version of it within a week and wipe out most of the startup surface area around it





