
lobo
22 posts



My new skill lineup:
/domain-model - replaces /grill-me, integrates some DDD concepts and adds docs & ADR's during discussions
/to-prd - create a PRD
/to-issues - create issues with blocking
/github-triage - triage issues with a state machine-based labelling system
/tdd - do TDD where appropriate
Still more to flesh out, but this is feeling AWESOME
English
lobo retweetledi

We found that agents generate progressively worse code with each iteration. Real developers do not.
SlopCodeBench is the only eval that faithfully measures quality degradation on iterative, long-horizon coding tasks.
arxiv.org/abs/2603.24755
scbench.ai
🧵

English
lobo retweetledi
lobo retweetledi
lobo retweetledi




