

Danny Hernandez
479 posts

@Hernandez_Danny
Measuring and forecasting AI progress @AnthropicAI.

Today, we're announcing Claude 3, our next generation of AI models. The three state-of-the-art models—Claude 3 Opus, Claude 3 Sonnet, and Claude 3 Haiku—set new industry benchmarks across reasoning, math, coding, multilingual understanding, and vision.

How quickly is A.I. advancing? And should you be working in the field? Check out my recent conversation on these topics with @Hernandez_Danny: podcast.clearerthinking.org/episode/172/da…


Deep learning. Large language models. Vast capabilities. ☁️💻💡 From chatbots to code generation, learn how #GenerativeAI is redefining #ML-powered capabilities, & how you can build & use large language models on #AWS. #MachineLearning #AI 👉 go.aws/416RimS

After working for the past few months with key partners like @NotionHQ, @Quora, and @DuckDuckGo, we’ve been able to carefully test out our systems in the wild. We are now opening up access to Claude, our AI assistant, to power businesses at scale.

It’s hard work to make evaluations for language models (LMs). We’ve developed an automated way to generate evaluations with LMs, significantly reducing the effort involved. We test LMs using >150 LM-written evaluations, uncovering novel LM behaviors. anthropic.com/model-written-…