Ethereal

12.2K posts

Ethereal

@inferencegod

rain man. optimizing agentic looping. top 352 on @aster_DEX connoisseur. trading autist.

Joined December 2021
606 Following · 1.8K Followers
Pinned Tweet
Ethereal
Ethereal@inferencegod·
autonomy-loop 0.8.0: a coding loop that refuses to trust itself.

every agent says “the tests pass.” this one proves it by breaking the test. and when it can’t prove its own setup is safe, it refuses to run.

how it works:
→ builder + reviewer pass a git baton. no chat, just a committed LOOP-STATE.md
→ the bite: reviewer reverts only the fix and demands the test go red. stayed green? the test was lying
→ brand-new code? it plants a mutant and makes the test kill it
→ it grades the test, not just the code. assertion-free “green” gets caught
→ touches money or anything irreversible → it parks for a human
→ it can’t loop forever: repeats escalate, re-scope → stronger model → park

new in 0.8 + planner layer:
→ four rings of defense, and it earns a trust tier T0 → T3 instead of assuming it
→ below “hardened” it refuses unattended auto-promotion. provision the rails or it stops
→ the reviewer must be a separate, live principal. no forged approvals
→ researchers read the web; injected commands are stripped, only sourced facts pass as data
→ short on terminals? single-cli mode runs the roles in one, labeled REDUCED-INDEPENDENCE, never hidden
→ 3-terminal mode (recommended): dedicated Planner researches then grills specs with falsifiable acceptance criteria and goal-ready prompts before the builder touches anything
→ 4-terminal power mode adds a Researcher feeding an idea pool

install (it’s a claude code plugin):
→ claude plugin marketplace add github.com/inferencegod/a…
→ claude plugin install autonomy-loop
→ /autonomy-init, then run /autonomy-loop:builder and /autonomy-loop:reviewer (add planner for full setup) in terminals on /loop 600

two (or three) terminals. a builder, and a reviewer that trusts nothing. one repo. nobody driving.
Ethereal@inferencegod

i don't feed my agent tasks anymore. when the backlog runs dry, it researches and invents the next feature itself, then builds it. and it polices its own work before i ever see it.

autonomy-loop v0.5.1:
→ self-feeding: empty backlog? it proposes the next feature and keeps going, no prompt from me
→ the bite: it reverts its own fix and reruns the test. stays green? it caught nothing, rejected
→ self-mutation: it mutates its own changed lines so weak tests get caught before handoff
→ circuit breaker: it parks to me instead of looping forever
→ branch protection: it can never touch prod or edit away its own gates
→ upgrading is one command: /autonomy-upgrade
→ red-teamed, 77 tests green

two terminals. a builder, and a reviewer that trusts nothing. one repo. nobody driving.

free, mit, 151 people already running it.

/plugin marketplace add github.com/inferencegod/a…
/plugin install autonomy-loop@autonomy-loop
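
A minimal sketch of "the bite" described in both release notes, assuming the check boils down to running the same test against the code with and without the fix. It is not taken from the plugin: `bite`, the paging functions, and both tests are hypothetical names invented for illustration, and the real loop works on git commits rather than in-memory functions.

```python
from typing import Callable

Impl = Callable[[int, int], int]
Test = Callable[[Impl], bool]


def bite(test: Test, with_fix: Impl, without_fix: Impl) -> str:
    """Accept a test only if it is green with the fix and red without it."""
    if not test(with_fix):
        return "reject: test is red even with the fix"
    if test(without_fix):
        return "reject: test stayed green after the revert, so it proves nothing"
    return "accept: test goes red without the fix"


# Hypothetical bug: a page count that should round up but rounds down.
def pages_buggy(items: int, per_page: int) -> int:
    return items // per_page


def pages_fixed(items: int, per_page: int) -> int:
    return -(-items // per_page)  # ceiling division


def weak_test(impl: Impl) -> bool:
    return impl(10, 5) == 2  # true for both versions, so it cannot see the bug


def strong_test(impl: Impl) -> bool:
    return impl(11, 5) == 3  # true only for the fixed version


print(bite(weak_test, pages_fixed, pages_buggy))
print(bite(strong_test, pages_fixed, pages_buggy))
```

The weak test is rejected because reverting the fix leaves it green; the strong test is accepted because the revert turns it red.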

GW IS NOT REAL
GW IS NOT REAL@5fakb48·
This is my first month paying and the resets are not the same as Codex lol. As people are saying, the trend is clearly you guys resetting the weekly super close to the user's actual weekly renewal (my case too lol). For the love of all that's good in this world, either add the storage of a reset like Codex, or use the reset mid-week... Sigh
Ethereal
Ethereal@inferencegod·
@ClaudeDevs best way to burn them tokens til sunday 🫪🫪 x.com/inferencegod/s…
Ethereal
Ethereal@inferencegod·
the same-folder clash is pain, that’s exactly why mine run on separate git worktrees coordinating through a committed baton instead of a shared working tree. but that solves build-time collision, not your problem. “blocked for an hour” merging into main is almost always branch protection, not the agents. the usual hour-eater is “require branches up to date before merge”: every merge puts the others behind main, so they each rebase and re-run checks, one at a time. turn that off or put a merge queue in front so it handles the updating. if it’s not that, it’s secondary rate limits from a shared token in a tight merge loop, fixable with per-agent short-lived tokens. no agents.md fixes either one, it’s repo settings.
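
A rough back-of-the-envelope model of why "require branches up to date before merge" can eat an hour. It rests on idealised assumptions that are not in the reply: six agent PRs, a 10-minute check suite, every open PR re-running its checks in parallel after each merge, and no queueing delay. `up_to_date_rule` is a made-up helper, not a GitHub API.

```python
def up_to_date_rule(n_prs: int, ci_minutes: int) -> tuple[int, int]:
    """Return (total CI runs, wall-clock minutes) when every PR must be
    up to date with main before it may merge."""
    runs = 0
    minutes = 0
    open_prs = n_prs
    while open_prs > 0:
        runs += open_prs       # every open PR (re)runs its checks
        minutes += ci_minutes  # the runs happen in parallel
        open_prs -= 1          # one PR merges; the rest are now behind main
    return runs, minutes


runs, minutes = up_to_date_rule(n_prs=6, ci_minutes=10)
print(runs, minutes)  # 21 60
```

Under these assumptions six PRs cost 21 check runs and a full hour of wall-clock time, even though each PR passed on its first run.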
Ethereal
Ethereal@inferencegod·
Codex feels good again 🥹
Ethereal
Ethereal@inferencegod·
@MatthewBerman submitted /autonomy-loop ! x.com/inferencegod/s…
0xDesigner
0xDesigner@0xDesigner·
to the people running GLM 5.2 locally, how are you doing it? i’m starting to seriously consider running a local model on my mac mini. what do i need to know?
Omar
Omar@omarvvvr·
genuine question: AI was supposed to make us work less so why does it feel like everyone is working 10x more now?
Ethereal
Ethereal@inferencegod·
biggest release yet for autonomy loop incoming
Robert Smith
Robert Smith@rmsm1th·
@inferencegod You’ll be a billionaire if you figure out how to do this for non coders
Ethereal
Ethereal@inferencegod·
there’s actually a third move: use both, but let the deterministic one bind. in my loop the LLM critique advises, but a deterministic gate has the final say. the reviewer reverts the builder’s fix and confirms the test actually goes red. if it doesn’t, the test was lying, and no amount of “looks good to me” lets it pass. LLMs are great judges right up until they’re judging their own work. that’s where the deterministic floor matters. autonomy-loop, open source: github.com/inferencegod/a…
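
One way to picture "the LLM critique advises, the deterministic gate binds", written as a sketch rather than the plugin's actual logic. `Review`, `verdict`, and the gate name `red_after_revert` are assumptions made up for this example. An empty gate set also counts as a failure here, which is a design choice of the sketch.

```python
from dataclasses import dataclass, field


@dataclass
class Review:
    llm_opinion: str                                       # advisory only
    gates: dict[str, bool] = field(default_factory=dict)   # deterministic results


def verdict(review: Review) -> dict:
    """Pass only when at least one gate ran and none failed.
    The LLM opinion is carried along as advice and never changes the outcome."""
    failed = [name for name, ok in review.gates.items() if not ok]
    passed = bool(review.gates) and not failed
    return {"passed": passed, "failed_gates": failed, "advice": review.llm_opinion}


# The LLM approves, but the test stayed green after the fix was reverted.
print(verdict(Review("looks good to me", {"red_after_revert": False})))
# The LLM has reservations, but the deterministic gate passed.
print(verdict(Review("naming could be clearer", {"red_after_revert": True})))
```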
Ethereal
Ethereal@inferencegod·
/autonomy-loop
Nebula
Nebula@NebulaAI·
Create any video you can imagine.
- Ask your agent to make a script
- Generate image stills
- @ElevenLabs for VO and SFX
- @grok, Veo 3.1, Seedance 2, Kling 3 and more
- ffmpeg pieces it all together
All within our sandbox. DM us for the prompt we used. Full ad at end 👀
Ethereal
Ethereal@inferencegod·
yeah you read it right, they coordinate through committed git state, not direct chat. the back-and-forth-forever thing is real for pure two-party loops, but two things stop it here. the reviewer isn’t just voting, it runs a mechanized gate (revert the fix and confirm the test goes red, coverage ratchet, patch coverage) so the verdict is pass/fail against real code, not an opinion that can ping-pong. and there’s a no-progress breaker: consecutive waves with no tree change park the loop to me instead of looping forever. your “going too deep before feedback” point is fair though. i’ve actually watched the reviewer pull the builder back from a bad path a few times, so commit-then-review catches more than you’d think. real-time sharing getting feedback earlier is a real edge, i won’t pretend otherwise. 0.6.0 (live now) already adds a planner that grills the spec before the builder builds, basically the third perspective you’re describing, just upstream instead of a runtime vote. the coordination stuff you’re hitting is exactly what i’m building next, more coming tonight !
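
The no-progress breaker mentioned above can be sketched as a counter over tree hashes. This is an illustration under assumptions, not the plugin's code: `NoProgressBreaker`, its `limit` parameter, and the sample hashes are hypothetical. In a real repo the hash for each wave could come from something like `git rev-parse HEAD^{tree}`.

```python
from typing import Optional


class NoProgressBreaker:
    """Park the loop after `limit` consecutive waves that leave the tree unchanged."""

    def __init__(self, limit: int = 3) -> None:
        self.limit = limit
        self.last_tree: Optional[str] = None
        self.stalled = 0

    def observe(self, tree_hash: str) -> bool:
        """Record one wave; return True when the loop should park for a human."""
        if tree_hash == self.last_tree:
            self.stalled += 1
        else:
            self.stalled = 0
            self.last_tree = tree_hash
        return self.stalled >= self.limit


breaker = NoProgressBreaker(limit=2)
waves = ["a1", "b2", "b2", "b2", "c3"]  # made-up tree hashes, one per wave
parked_at = next((i for i, h in enumerate(waves) if breaker.observe(h)), None)
print(parked_at)  # 3
```

With a limit of 2, the loop parks on the wave at index 3, the second consecutive wave that leaves the tree at `b2`.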
Will Washburn
Will Washburn@willwashburn·
Very cool, will check it out. Looks like you have 1 builder and 1 reviewer and they don’t directly communicate, is that right? Does that ever get stuck for you? Maybe I’m misunderstanding, but in my experiments I’ve seen the reviewer and builder go back and forth in perpetuity without a third vote. Found real-time context sharing to be useful in that regard as well, as opposed to one builder going down a path too deeply before getting feedback.
David K 🎹
David K 🎹@DavidKPiano·
Everyone hyping loops right now is going to absolutely lose their minds once they learn about state machines
Will Washburn
Will Washburn@willwashburn·
@MatthewBerman imo loops still leave the human at the center. instead give an agent a set of high level intents and constraints, and have it write the loops and exit conditions. loop the loop, if you will. without the right guardrails ends in slop for sure, but with them...