

Emily Capstick
142 posts

@EmCapstick
@OpenAI. Personal account. Any views do not represent those of my employer.





📣 Announcing the AI for Organizations Grand Challenge, a new competition for scholars to help organizations enter the era of AI. @GoogleDeepMind and @StanfordHAI invite researchers from any university worldwide to submit your boldest ideas. Learn more: hai.stanford.edu/aiogc

🚨 New paper alert! 🚨 Are human baselines rigorous enough to support claims about "superhuman" performance? Spoiler alert: often not! @prpaskov and I will be presenting our spotlight paper at ICML next week on the state of human baselines + how to improve them!


New Anthropic research: Why do some language models fake alignment while others don't? Last year, we found a situation where Claude 3 Opus fakes alignment. Now, we’ve done the same analysis for 25 frontier LLMs—and the story looks more complex.











