Michał Piszczek (@cdiamond) - ملف تويتر

Everyone is still selling Prompt Engineering. I think they're teaching a skill that's already depreciating.📉 My position is simple: In 2026, model quality is no longer where performance comes from. Recovery is. And recovery is not a prompt — it's a loop. Here's the shift nobody priced in.👇 ⚙️For three years we optimized the INPUT: → prompts → personas → system instructions → temperature → context windows → benchmark scores That mattered when the model was the weak link. It isn't anymore. 💸The marginal cost of raw intelligence is collapsing. And anything whose cost collapses stops being your moat. Models are becoming a commodity input. The value moved UP the stack — to whoever controls what happens betweenthe model calls. So the real question changed. ❌How do I make the model smarter? ✅How does my system behave the moment it's wrong? That's not a prompting problem. It's a control problem. 🧮 And here's the physics of it: One agent step is ~90% reliable. Chain ten with no feedback → 0.9¹⁰ ≈ 35%. Errors compound. Open-loop systems decay. The only thing that fights compounding error is feedback: measure → correct → repeat. A thermostat. A rocket. A production agent. All closed-loop. For the exact same reason. 🔁 This is why real agents don't run on one clever prompt. They run on layered loops — and each one buys a specific property: 🧭 ReAct (Think → Act → Observe) grounds the model in reality, not its own assumptions 🪞 Reflection (Generate → Critique → Improve) turns a bad draft into a good one with no human 🗺️ Planning (Goal → Decompose → Execute → Replan) survives a plan that was wrong on contact ✅ Verification (Generate → Test → Fail → Fix) converts "looks right" into "is right" 🚨 Escalation (Execute → Confidence drops → Human → Resume) knows the boundary of its own competence 🧠 Memory stops the system repeating the same mistake twice Notice: none of these make the model smarter. They make the SYSTEM survivable. ⚠️ And here's the mistake I see everywhere: People spend weeks tuning the prompt. And zero time designing what happens after the first failure. But real agents don't fail once. They fail constantly. The gap between a demo and a production system was never intelligence. It's recovery. The cleanest way I can put it: 🖥️ The model is becoming the CPU. 🧩 The loop is becoming the operating system. Nobody brags about their CPU anymore. They build on the OS. Prompts are turning into implementation details. Loops are turning into the product. 🛠️ So if you're building, here's where I'd put your next 10 hours: Not in your system prompt. In your failure path. Write down what your agent does on its SECOND attempt. If the answer is "the same thing" — you don't have a system. You have a demo. — 📌Next week I'm breaking down the loop pattern I lean on most in production: drill-down + recurrent decompose — how to take a goal that's too big to execute and recursively split it until every leaf is a step the agent can't fail at.

English

0

1

19

Michał Piszczek

اكتشف