Shun
79 posts

@22uenos
design at judgment labs, prev stanford
San Francisco, CA · Joined December 2025
159 Following · 60 Followers
Shun retweeted

this is genius
Anthropic@AnthropicAI
Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor. It’s happening faster than we thought, and the implications deserve greater attention. anthropic.com/institute/recu…

@brian_lovin @WeAreDevs Braun museum, markthalle neun, walk around kreuzberg, tempelhof

I'll be in Berlin July 6–10 for @WeAreDevs...if you're nearby, I'd love to hang out!
Also taking Berlin recs, this is my first visit 👀

Had a lot of fun making these visuals :)
Made in @paper with Claude



Judgment Labs@JudgmentLabs
We built Agent Judge to evaluate long-horizon agents. As agents take on longer tasks, the evidence needed to evaluate them gets buried across tool calls, retries, logs, database updates, and final outputs. Evaluating these agents requires investigating the trajectory, not just judging the final answer.


What the fuck does high taste even mean bruh
Garry Tan@garrytan
High agency high taste is the unlock these days