J.crv
8K posts

J.crv
@jcrv__
tokenmaxxing building @@@@@ prev @avalabs


New CursorBench results just dropped. Two big takeaways. Composer 2.5 is way better than most people think. 63.2% score at $0.55 per task. Nearly matching Opus 4.7 Max and GPT 5.5 Extra High at 20x less cost. This is insane value. Gemini 3.5 Flash is #10 at 49.8%. Below GPT 5.5 Low. Below Opus 4.7 Low. Google's newest model can't even beat budget tier competition. Composer 2.5 is the sleeper. Gemini 3.5 Flash is the disappointment.


We’ve identified a security incident that involved unauthorized access to certain internal Vercel systems, impacting a limited subset of customers. Please see our security bulletin: vercel.com/kb/bulletin/ve…



Ahead of Senate confirmation hearing, Fed pick Kevin Warsh discloses investments in a slew of crypto firms theblock.co/post/397409/ah…

ANTHROPIC IS BANNING USERS THAT ARE UNDER 18 They’re now requiring these things in order to verify your age: > digital ID > facial scan > biometrics

Introducing Claude Managed Agents: everything you need to build and deploy agents at scale. It pairs an agent harness tuned for performance with production infrastructure, so you can go from prototype to launch in days. Now in public beta on the Claude Platform.

the idea of pricing for agents is getting traction. auth0.com/pricing.md resend.com/pricing.md workos.com/pricing.md

Curated agent marketplaces look a lot like AOL. Open payment protocols look a lot like HTTP. The last time this happened, HTTP won. What will prevail in the age of agentic commerce?

@sbaratelli @nvidia @openclaw most folks will want as much intelligence as possible, and open models aren't there yet.










