Sabitlenmiş Tweet

Accepted to ICLR 2026!🎉 So grateful to my amazing collaborators 🫶
We introduce CLASH to evaluate value reasoning, revealing new failure modes in reasoning models and intriguing steerability results!
📰 Paper: arxiv.org/pdf/2504.10823

English























