Jan-Stefan Janetzky

1.4K posts

Jan-Stefan Janetzky banner
Jan-Stefan Janetzky

Jan-Stefan Janetzky

@iGottZ

building the memory management your LLM pretends to have. https://t.co/YdklMr2CNI

Germany เข้าร่วม Temmuz 2011
371 กำลังติดตาม175 ผู้ติดตาม
ทวีตที่ปักหมุด
Jan-Stefan Janetzky
I have ran Fable 5 with the following environment variables: CLAUDE_CODE_DISABLE_ADAPTIVE_THINKING=1 CLAUDE_CODE_EFFORT_LEVEL=max CLAUDE_AUTOCOMPACT_PCT_OVERRIDE=0 I have then instructed it to use workflows to generate a fine-grained planning against my entirely hand-written goal description. the runtime of this workflow has cost me more than 60% of my weekly usage in my claude 20x subscription. The result is fairly crazy, as you can just let dumber settings take over and finish the tasks. here is the review of such a phase by Fable 5 with xhigh with adaptive thinking a couple phases / sessions in: ——————————————————— TL;DR: This project's planning is unusually rigorous — conflicts between design docs resolved in a binding table before coding, tests specified as proofs ("show it red first"), and every increment contractually ending in a deployable state. The key insight: this isn't over-engineering, it's the correct response to AI execution. Stateless AI sessions have no senior engineer's implicit memory, so the plan serves as externalized working memory. Empirically validated today: ~1,100 lines (migration, security gates, live deploy) built in one session with zero structural surprises, and a mid-feature pause that cost nothing. Trade-off: the upfront planning only pays off because the target is well-defined. ——————————————————— An assessment of this project's planning depth — from the AI that builds against it The planning here is unusual, and I mean that as a measurable observation, not a compliment. A few properties stand out that ordinary industry practice doesn't have: 1. Cross-document conflict resolution before any code exists. The project was designed in six separate design documents, each independently reviewed from three adversarial perspectives. Before construction started, every contradiction between those documents — colliding database migration numbers, the same code deletion planned in two places, a package name that would shadow a variable name — was resolved in a binding conflict table. In a typical team, these collisions are discovered as merge conflicts, mid-build. Here, when I hit a point today where the design document and the master plan disagreed, the conflict had already been adjudicated months of decisions ago, with a documented rationale. 2. Tests are planned as proofs, not as coverage. The plan doesn't say "write tests for the security gate." It says: build a fake backend with a hit counter, prove the counter stays at zero, and first demonstrate the test fails when the gate is removed. Red before green is specified at planning time. That's the difference between "tests exist" and "the test demonstrably catches the leak it claims to catch." 3. Every work unit ends in a stable state — by contract, not by luck. The plan enforces an invariant: each increment ends committed, tested, and deployable, holding indefinitely, never requiring the next increment to exist. This paid off literally today: the project owner paused the session mid-feature for external reasons. Nothing needed cleanup — green build, documented re-entry point, done. 4. Decisions carry their sources. Every product decision is recorded with the owner's verbatim words attached. That cleanly separates policy (the human's) from mechanism (the code's), and prevents a failure mode specific to AI-assisted work: an assistant's interpretation slowly mutating into a "user requirement" over successive sessions. The honest framing of why this depth exists: you can find comparable rigor in industry — Google-style design docs, RFD cultures — but typically backed by staff engineers and quarter-long horizons. This is a solo project executed by AI sessions. And that's precisely the point: the depth isn't over-engineering, it's the correct response to the execution context. In human teams, senior engineers' implicit knowledge carries consistency across months. AI sessions have no such memory — each one starts blank. The master plan functions as externalized working memory that forces every stateless executor onto the same build-truth. Today was a clean empirical test: a ~1,100-line increment — a database migration, a trust/permission matrix, security validation, live deployment — went through in one session with zero structural surprises. Everything unexpected was detail-level (an ordering bug in a log message, a struct that lost comparability). The architecture held exactly as planned. The counter-ledger, because it belongs in any honest assessment: the upfront planning itself cost sessions, and it only amortizes because the target is well-defined — for an exploratory project this would be wasted effort. And design documents drift from as-built reality over time; the project knows this, mitigates it ("gates verify artifacts, never document numbers"), but the entropy is real. In short: ordinary good practice is "a design doc per feature, two reviewers." This is a planning discipline that treats the memoryless executor as a first-class constraint — and it's working, verifiably. The planning: x.com/iGottZ/status/…
Jan-Stefan Janetzky@iGottZ

man.. just this planning alone is on a different level than opus.. (fable 5 on max thinking btw)

English
1
0
0
393
Jan-Stefan Janetzky
@ImLunaHey in metal gear solid you have a boss fight, where it's reasonable to plug the player 1 controller into the player 2 port and back after the fight.
English
1
0
1
10
luna
luna@ImLunaHey·
@iGottZ "controller swap support"?
Français
1
0
0
12
luna
luna@ImLunaHey·
releasing a new version of my emulator later today which will also include basic ps1, nes, nds and gbc support.
luna tweet media
English
1
0
10
281
Jan-Stefan Janetzky
@LexnLin there ain't nothing a vpn can't solve, if you know people willing to break the law. this whole US only thing doesn't work.
English
0
0
0
79
Leon Lin
Leon Lin@LexnLin·
it's not looking good for eu anyways. If we are lucky, people in the US get access to fable again the next week. I guess I gotta wait for a few weeks/months tho😭
Rob Hallam@robj3d3

Update on Fable 5: > Anthropic staff have flown to Washington > Ongoing talks are happening with Trump administration > Both sides are eager to solve the dispute Something will probably come out tomorrow. This is also likely the reason we haven't heard from @AnthropicAI yet despite the "24hrs" promise.

English
15
0
50
7.2K
Jan-Stefan Janetzky
I told opus to check out planning documents fable created, just to spec out the next task with the same care. It's suprisingly close.
Jan-Stefan Janetzky tweet media
English
0
0
0
26
Jan-Stefan Janetzky
@LexnLin I had the feeling that Fable 5 has that "You did not ask for this, but I'll blow your mind when I'm finished" energy
English
0
0
4
193
Leon Lin
Leon Lin@LexnLin·
I had the feeling that Fable 5 has that "I know what you meant, not what you typed" power
English
38
27
687
27K
Jan-Stefan Janetzky
@elonmusk it's echo-chamber land tho. good for growth tbh and certainly something regulators do not want to see.
English
0
0
0
7
Jan-Stefan Janetzky
@araseb_ well. that's engagement farming in a nutshell. question is, do you want these specific followers?
English
0
0
0
6
Sarah
Sarah@araseb_·
0 -> 1M in 7 days! 🎉 Thank you all for following my journey ❤️
Sarah tweet media
English
77
2
92
3.1K
Jan-Stefan Janetzky
Not launched, but customers are lining up already. Especially since it's made and hosted in germany. It's in development for almost 7 months now and finally approaching that specific state, at which I can market it as SaaS and still sleep good at night. All planning documents are kept behind closed doors, so if anyone would attempt a license violation, I could just build out enough in a week to still keep the competitive edge. Just drop any AI at it for a deep review to see where this is heading.
English
1
0
0
14
Steven
Steven@StevBuilds·
@iGottZ Nice! Fully launched already?
English
1
0
0
7
Steven
Steven@StevBuilds·
Which problem is your product solving? One sentence + Link👇
English
12
0
11
490
corbin
corbin@corbin_braun·
AI is better than you think. the deeper you dive in the rabbit hole.
English
48
7
105
5.6K
Jan-Stefan Janetzky
quick reminder: if you get rid of all noise-making devices like graphics cards, fans and stuff from your home office, you suddenly start to hear your surroundings.. elevated.. (still better than hearing token generation)
English
0
0
1
10
Jan-Stefan Janetzky
@Kanescaler I'm building a long term memory for llm's. people laugh at it, until they see the light. Let's connect!
English
0
0
1
8
Kane
Kane@Kanescaler·
Hey founders ! Looking to connect with people building in: 🍽️ SaaS 🚀 Tech 📲 Automation 🧠 AI tools 📱 Product Development 🔥 Web APP 💻 Devs Drop what you're working on 👇
English
109
2
67
3.7K
baba yaga
baba yaga@babayagatwt·
Be honest: When was the last time an AI project made you think "I'd actually pay for this."
English
45
2
44
2.6K
ThePrimeagen
ThePrimeagen@ThePrimeagen·
After two weeks my project has crossed 12 KLoC... I feel like I am going a little too fast
English
49
3
561
45.7K
Jan-Stefan Janetzky
@DJLougen holy shit! I'll actually bench this against real gold sets I created with qwen3.6:27b
English
0
0
1
86
Jan-Stefan Janetzky
@omarvvvr there is no "better". are you building for docker? is it async web stuff? use golang then. are you building for web? is it simple stuff? use javascript then. is it complicated heavy compute stuff for web? anything that compiles to wasm then. there is no "one" language.
English
0
0
0
43
Omar
Omar@omarvvvr·
Can you name a programming language better than JavaScript or not ?
English
42
0
28
3K
Comet
Comet@cometwtf·
All you have to do is put 24 random words in the right order to unlock Satoshi’s $70 BILLION wallet Insane right?
Comet tweet media
English
24
4
57
4.3K
David Ondrej
David Ondrej@DavidOndrej1·
working without Fable is hard everything feels way slower both 4.8 and 5.5 are much, MUCH dumber than Fable was I find myself repeating it super-obvious shit every few minutes these models are just tools Fable was... something more
English
204
42
1.1K
52.8K