

Katrina Drozdov (Evtimova)
367 posts

@stochasticdoggo
Research Scientist @ValsAI | PhD from @NYUDataScience | Bulgarian yogurt, prime numbers, and dogs bring me joy | she/her



🚨 Are lower-priced AI models really cheaper? Beware of the "Price Reversal" phenomenon in Reasoning Language Models (RLMs)! 💸 We evaluated frontier RLMs and found sth shocking: a model with lower API pricing can actually cost more! 🧵👇

Working at Anthropic was a wonderful experience. Extremely high talent density, amazing culture, mission-driven, zero politics, leadership with real technical depth. Over the past year, I’ve learned so much about what made Anthropic successful and developed great respect for the founders and the team. I'm grateful to have been part of such an extraordinary organization. Reflecting on the last 20 years, I see three phases—and I'm now entering a fourth:


We’re releasing ProofBench, a challenging benchmark that measures models’ ability to write formally verifiable graduate-level proofs!







