Norin

1

141

Norin@norlava·6h

I think part of the challenge is just also the people, sentiment side of things. There are a lot of teams still where they're under constant pressure of layoffs or half their team was laid off, being expected to double their output, and leadership pushing them to adopt "AI-first mindset" or show ROI. And then there's the narrative that AI native companies push and how we have people sharing online how they've "cracked" some magical code which can make you feel like even more of an outsider who's behind. So I guess if that's your day to day experience, the motivation isn't there to get past the cold start. There are solutions here but it's tough, we need more empathy and realistic examples of what peers are doing as you say not being told to increase our productivity.

English

19

Kevin S Lin@kevins8·10h

there’s currently a cold start problem for folks getting started with agents, especially in organizations everyone needs to discover best practices and the “right way” to do things. build skills. context. loops we need better ways to discover what peers are doing and what the top 10% do better than everyone else like @steipete suggestion of everyone sharing sanitized codex sessions

English

11

0

14

1.1K

Norin@norlava·12h

@kunchenguid This is a great example of why for long running tasks you need and should have the developer in the loop

English

534

Kun Chen@kunchenguid·15h

ok be careful with your fable 5. i just ran into a new problem that never happened before - it's now doing things i didn't ask i just told it there is a bug in a repo. without checking with me, it did the fix AND raised a PR using my gh cli, claiming it's following CONTRIBUTING.md the PR was not bad, but it's a big surprise as it - assumed the credentials in my gh cli is the one i wanted to use - assumed i would be happy with the change as is - assumed i was ready to publish the work that's a lot of assumptions from just me telling it to fix a bug. i now feel the need to explicitly tell it NOT to do extra things which increased cognitive load for me and it's not a good feeling

Kun Chen@kunchenguid

ok Claude Fable 5 (Mythos) is finally here if you are on a subscription, go use it NOW because it may be removed from the subscription in a few days

English

77

14

482

145.1K

Norin@norlava·15h

Learnings + code from our side quest building reliable long running agents. With improving model capabilities + need for better verification this topic feels like a good one to discuss and improve upon together.

x.com/i/article/2064…

English

57

Norin@norlava·15h

x.com/i/article/2064…

ZXX

278

Norin@norlava·17h

The asterisk on anthropic's benchmark table shares how starred scores are Mythos 5 but the one your agent actually calls (Fable 5) falls back towards Opus 4.8 on those benchmarks because safety classifiers block answers (e.g. Terminal-Bench 2.1). Interesting... make of that what you will, real world testing is still important.

English

65

Norin@norlava·1d

💯💯💯 agree. that's the way we're using them - review gates, HIL, observe and steer mid-run. it's open source with the actual code that gives you this. sharing in cases it's helpful for others designing, building, or wanting to fork their own (Atomic bastani-inc on github) x.com/norlava/status…

Lot of buzz around loops/workflows. Many saying they’ve been doing this but not sharing their code. We’re sharing our code. This was not trivial to construct, it’s taken hundreds of hours of studying coding agent implementations, feedback from developers using it in repos that are 10M+ LoC, and re-architecting 3x from scratch. This is a production loop as in you can and are supposed to use it to gain actual ROI with your agent. We built in public you can find it on Github under Atomic (bastani-inc). Real production code needs management across dependencies, teams, and not infinite tokens to burn. We realized you need a way for the developer to define their ‘loop’ explicitly with good design, no provider lock in, review gates, verbatim compaction (not what you see today in coding agents), HIL, and the ability to observe and steer agents mid run. Why share it? Because we think everyone should benefit from knowledge on how to use these because we can get better with each other faster. Less hype, just the code. Overall, write up on our learnings coming soon.

English

80

Daniel San@dani_avila7·1d

I partly agree… loops are the goal, but you can’t skip the prompting fundamentals It took me years to nail the right execution flow for routines that now run automatically. Not because the models weren’t capable, but because the surrounding software wasn’t ready for LLMs Today it is, and if you want reliable loops, you need a solid harness and proper observability first Those aren’t nice-to-haves, they’re the baseline

Here’s your monthly reminder that you shouldn’t be prompting coding agents anymore. You should be designing loops that prompt your agents.

English

6

4

45

3.7K

Norin@norlava·1d

@asaio87 we shared our implementation to make it easier to design/build or have an example of real code for what this could look like x.com/norlava/status…

Lot of buzz around loops/workflows. Many saying they’ve been doing this but not sharing their code. We’re sharing our code. This was not trivial to construct, it’s taken hundreds of hours of studying coding agent implementations, feedback from developers using it in repos that are 10M+ LoC, and re-architecting 3x from scratch. This is a production loop as in you can and are supposed to use it to gain actual ROI with your agent. We built in public you can find it on Github under Atomic (bastani-inc). Real production code needs management across dependencies, teams, and not infinite tokens to burn. We realized you need a way for the developer to define their ‘loop’ explicitly with good design, no provider lock in, review gates, verbatim compaction (not what you see today in coding agents), HIL, and the ability to observe and steer agents mid run. Why share it? Because we think everyone should benefit from knowledge on how to use these because we can get better with each other faster. Less hype, just the code. Overall, write up on our learnings coming soon.

English

1

33

andrei saioc@asaio87·2d

What are these loops ? I have never used them. Are these for or while loops ?

Here’s your monthly reminder that you shouldn’t be prompting coding agents anymore. You should be designing loops that prompt your agents.

English

22

0

18

2.2K

Norin@norlava·1d

This is everything🙌, there's push to offload entirely to the model/agent, cut the developer out, doesn't work. We've been building loops/workflows but with the opposite framing - engineer as the one steering with determinism. Sharing the code in case folks are looking for an actual example of what this could look like for production with HIL/manual review/control/visibility. x.com/norlava/status…

Lot of buzz around loops/workflows. Many saying they’ve been doing this but not sharing their code. We’re sharing our code. This was not trivial to construct, it’s taken hundreds of hours of studying coding agent implementations, feedback from developers using it in repos that are 10M+ LoC, and re-architecting 3x from scratch. This is a production loop as in you can and are supposed to use it to gain actual ROI with your agent. We built in public you can find it on Github under Atomic (bastani-inc). Real production code needs management across dependencies, teams, and not infinite tokens to burn. We realized you need a way for the developer to define their ‘loop’ explicitly with good design, no provider lock in, review gates, verbatim compaction (not what you see today in coding agents), HIL, and the ability to observe and steer agents mid run. Why share it? Because we think everyone should benefit from knowledge on how to use these because we can get better with each other faster. Less hype, just the code. Overall, write up on our learnings coming soon.

English

1

0

2

157

dex@dexhorthy·1d

Recurring reminder that you should DEFINITELY replace all your coding processes with loops

dex@dexhorthy

Here’s what’s gonna happen: - you replace your code review with feedback loops (sentry, datadog, support tickets, etc) - you stop reading the code - software factory fixes everything - one day something breaks at 3am, agent can’t fix it - nobody’s read the code in 3 months - you have 3 weeks of downtime trying to re-onboard and fix it - you lose significant % of your contracts and users - your company is now dead

English

15

7

144

23.9K

Norin@norlava·1d

@diegocabezas01 For sure, we shared our code as well in case you want to try building/designing your own. x.com/norlava/status…

Lot of buzz around loops/workflows. Many saying they’ve been doing this but not sharing their code. We’re sharing our code. This was not trivial to construct, it’s taken hundreds of hours of studying coding agent implementations, feedback from developers using it in repos that are 10M+ LoC, and re-architecting 3x from scratch. This is a production loop as in you can and are supposed to use it to gain actual ROI with your agent. We built in public you can find it on Github under Atomic (bastani-inc). Real production code needs management across dependencies, teams, and not infinite tokens to burn. We realized you need a way for the developer to define their ‘loop’ explicitly with good design, no provider lock in, review gates, verbatim compaction (not what you see today in coding agents), HIL, and the ability to observe and steer agents mid run. Why share it? Because we think everyone should benefit from knowledge on how to use these because we can get better with each other faster. Less hype, just the code. Overall, write up on our learnings coming soon.

English

9

Diego | AI 🚀 - e/acc@diegocabezas01·1d

@norlava Nice! Thanks for sharing

English

1

0

1

45

Diego | AI 🚀 - e/acc@diegocabezas01·2d

Can someone explain me the loops? Because I don’t think they are python loops

Here’s your monthly reminder that you shouldn’t be prompting coding agents anymore. You should be designing loops that prompt your agents.

English

87

3

155

84.8K

Norin@norlava·1d

@mvanhorn This is a fantastic write-up! I agree with the production deployments + cost. We open sourced our implementation of loops/workflows because lots of hype but not enough sharing the code to get started. Think it aligns well with what you're saying. x.com/norlava/status…

Lot of buzz around loops/workflows. Many saying they’ve been doing this but not sharing their code. We’re sharing our code. This was not trivial to construct, it’s taken hundreds of hours of studying coding agent implementations, feedback from developers using it in repos that are 10M+ LoC, and re-architecting 3x from scratch. This is a production loop as in you can and are supposed to use it to gain actual ROI with your agent. We built in public you can find it on Github under Atomic (bastani-inc). Real production code needs management across dependencies, teams, and not infinite tokens to burn. We realized you need a way for the developer to define their ‘loop’ explicitly with good design, no provider lock in, review gates, verbatim compaction (not what you see today in coding agents), HIL, and the ability to observe and steer agents mid run. Why share it? Because we think everyone should benefit from knowledge on how to use these because we can get better with each other faster. Less hype, just the code. Overall, write up on our learnings coming soon.

English

568

Matt Van Horn@mvanhorn·2d

x.com/i/article/2063…

ZXX

200

445

4.8K

3.1M

Norin@norlava·1d

Lot of buzz around loops/workflows. Many saying they’ve been doing this but not sharing their code. We’re sharing our code. This was not trivial to construct, it’s taken hundreds of hours of studying coding agent implementations, feedback from developers using it in repos that are 10M+ LoC, and re-architecting 3x from scratch. This is a production loop as in you can and are supposed to use it to gain actual ROI with your agent. We built in public you can find it on Github under Atomic (bastani-inc). Real production code needs management across dependencies, teams, and not infinite tokens to burn. We realized you need a way for the developer to define their ‘loop’ explicitly with good design, no provider lock in, review gates, verbatim compaction (not what you see today in coding agents), HIL, and the ability to observe and steer agents mid run. Why share it? Because we think everyone should benefit from knowledge on how to use these because we can get better with each other faster. Less hype, just the code. Overall, write up on our learnings coming soon.

English

742

Norin@norlava·2d

I'm also thinking that it's likely he's heavily optimized the OpenClaw ecosystem and would require serious reworking to generally work on all codebase shapes

English

54

Norin@norlava·2d

Okay so peter’s right about the method, it does work altho he’s a bit vague. Execution matters though, we've seen that if you want this to scale you need a legit ‘loop’ engine that is designed with developer in the loop (pun not intended) to avoid slop and costs. Idk if the labs think about that though or just offload it to the model, that doesn’t work (yet) and is too expensive. There’s a DX where human’s don’t ‘slow down’ AI in development but I guess it’s not hype so it doesn’t sell as nicely. We’ve been building like this for a couple of months.

Here’s your monthly reminder that you shouldn’t be prompting coding agents anymore. You should be designing loops that prompt your agents.

English

1

0

192

Norin@norlava·2d

@thdxr I feel this way about like most CLI tools

English