
Thrilled to share that our paper on "Interpreting and Controlling Model Behavior via Constitutions for Atomic Concept Edits" has been accepted at AISTATS 2026! 🚀🚀 Read more about how input mutations can be mapped to interpretable behavioral insights. arxiv.org/abs/2602.00092 🧵














