

[1/🧵] ✨ Hide & Seek: Transformer Symmetries Obscure Sharpness & Riemannian Geometry Finds It ✨

Super excited to announce our paper on factoring out parameter symmetries to better predict generalization in transformers (accepted as #ICML25 spotlight! 🎉)

Amazing work by Marvin da Silva (@marvinfsilva) and Felix Dangel (@f_dangel).

Symmetries hide sharpness; Riemannian geometry reveals it 👇















