
Carlos Donderis 🌊
361 posts

Carlos Donderis 🌊
@CaDs
Father👨🍼 Spaniard 🇪🇸 in Japan 🇯🇵 Human nature connoisseur 🧘♂️ VPoE at @mercari_jp 💻 My opinions are my own 💭 Tweets in Spanish, English, and Japanese
Tokyo Katılım Mart 2007
360 Takip Edilen1.5K Takipçiler


BTW I knew this was going to happen.
cads-tech.dev/artificial-dum…
English

Tips to get more out of Opus 4.7 by Boris itself :)
Boris Cherny@bcherny
Dogfooding Opus 4.7 the last few weeks, I've been feeling incredibly productive. Sharing a few tips to get more out of 4.7 🧵
English


Scaling Managed Agents: Decoupling the brain from the hands
anthropic.com/engineering/ma…
English

How We Built a System for AI Agents to Ship Real Code Across 75+ Repos
mabl.com/blog/how-we-bu…
English

Agentic AI Engineering Workflows 2026
sesamedisk.com/agentic-ai-eng…
English

Background agents work across the entire SDLC.
background-agents.com
English
Carlos Donderis 🌊 retweetledi

AI is everywhere. The agentic organization isn’t—yet
mckinsey.com/capabilities/p…
English

Protecting Our Systems with Intelligence
engineering.block.xyz/blog/protectin…
English


I liked this article from Block.
Very Jack Dorsey style.
Flattening organizations, evolving classic rigid roles and leveraging AI as the brain to fuel your organization is a good premise.
Block shared more insights about how are they going about it than many (myself included).
I loved the idea of the World Model aplied to an organization and I will likely use it for myself.
But getting the world model to properly provide accurate information about decisions and priorities in a large and active organization is far from a trivial challenge, and I think, as of today, technology is not mature enough.
But I'm sure it will be soon enough.
block.xyz/inside/from-hi…
English
Carlos Donderis 🌊 retweetledi

Genuinely didn't expect this.
Left @karpathy's autoresearch running on a Mac Mini over the weekend. 259 experiments, no intervention. It landed at 1.353 val_bpb — a 30% improvement from where it started.
For reference, the Mac Studio (4x the memory, 4x the price) took 5 hours of guided work to reach 1.29. The Mini got within 5% on its own. It just needed time.
The weird part: it kept making the model smaller. Every improvement it found was about speed — fewer layers, smaller batches, tighter attention. More optimizer steps in the same time budget. On Apple Silicon, throughput beats scale. That wasn't obvious to me.
tiny-lab is the control plane I'm building around this. Auto-restart, eval harness, experiment ledger, promotion protocol. Open source. github.com/trevin-creator…
English

Codebase standardization and structuring is essential with agents, which is exactly what my org is focusing on here:
"Agents are most effective in environments with strict boundaries and predictable structure, so we built the application around a rigid architectural model."
OpenAI Developers@OpenAIDevs
📣 Shipping software with Codex without touching code. Here’s how a small team steering Codex opened and merged 1,500 pull requests to deliver a product used by hundreds of internal users with zero manual coding. openai.com/index/harness-…
English

