Ben Cohen
408 posts

Ben Cohen
@blc_16
spends too much time watching football and coding. Prev @Meta, @Microsoft engineer

Digging into reports, most of the fastest burn came down to a few token-heavy patterns. Some tips: • Sonnet 4.6 is the better default on Pro. Opus burns roughly twice as fast. Switch at session start. • Lower the effort level or turn off extended thinking when you don't need deep reasoning. Switch at session start. • Start fresh instead of resuming large sessions that have been idle ~1h • Cap your context window, long sessions cost more CLAUDE_CODE_AUTO_COMPACT_WINDOW=200000 We're rolling out more efficiency improvements, make sure you're on the latest version. If a small session is still eating a huge chunk of your limit in a way that seems unreasonable, run /feedback and we'll investigate




Earlier this week, we published our technical report on Composer 2. We're sharing additional research on how we train new checkpoints. With real-time RL, we can ship improved versions of the model every five hours.

By the end of 2026, I predict token spend will be greater than engineering salaries at early stage startups.






Introducing Chroma Context-1, a 20B parameter search agent. > pushes the pareto frontier of agentic search > order of magnitude faster > order of magnitude cheaper > Apache 2.0, open-source


We scored 36.08% on ARC-AGI-3 in one day using the Agentica SDK.




In consumer, paid ads generally = lack of true product market fit I have yet to see a generational startup with largely paid ad-driven growth…











