
chuplung
117 posts







Jess Yan (Claude Managed Agents, Anthropic): "we set agents tasks overnight. we wake up and the backlog is resolved and bugs are squashed." she also said: "I talk to Claude more than I talk to my colleagues." one example: 4,000 companies on a waitlist full of duplicates. she spun up an agent in 30 minutes - it cleaned the list, scored each company, sent daily invites to the best ones. "the limit is no longer our personal capacity. it's how much we can delegate at once." bookmark this ↓



Dario Amodei (CEO of Anthropic) said something nobody wants to hear. "we could have 5-10% GDP growth and 10% unemployment at the same time. never happened before. but it's not logically inconsistent." high GDP always meant lots of jobs. AI breaks that assumption. also from the interview: → Anthropic revenue: $0 → $100M → $1B → $10B in three years → co-work built in a week and a half almost entirely by Claude → "software is going to become cheap. maybe essentially free" bookmark this ↓





Ex-Google data scientist breaks down how to build AI agent harness and loop engineering in 19 minutes. agent harness (giving the model hands) + loop engineering (teaching it to think in steps) + llm ops & eval (making it bulletproof). An LLM is just a raw brain until you give it a proper architecture. Harness + loops + ops + evals = production-ready agent system. This 19-minute breakdown packs more pure engineering value than a $1,000 premium bootcamp. Bookmark this masterclass and use it as your go-to manual for turning raw models into systems you can trust.


Anthropic's platform team on a counterintuitive idea for building AI agents: not all tokens are equal. everyone's lever is the same - spend more tokens, get a better result. they asked: what if you give tokens jobs instead? so they split the work. some tokens execute. some advise the executor. some grade it against a rubric. some "dream" - review past runs and write lessons to memory for next time. then the key test: hold the token budget fixed across all of them. if tokens were fungible, every strategy should score the same. they didn't. on a financial-analysis benchmark, plain executing hit a perfect answer 42% of the time - the smarter strategies, up to 75%. the kicker on cost: to brute-force one perfect answer, executing burns ~1.8M tokens. advise and grade get there for a fraction - same budget, better jobs. ~15-min talk, free. Anthropic on why *how* you spend tokens beats *how many* ↓






