
@LinYorker @ryu0000000001 @weijie444 arxiv.org/abs/2106.06199 Same update here
rohan anil
10.1K posts

@_arohan_
member of technical staff & co-founder of @coreautoai - and continuing to aspire to understand deep learning.

@LinYorker @ryu0000000001 @weijie444 arxiv.org/abs/2106.06199 Same update here





golden age for optimizers right now. every day you see another SoapyShampooGluon ^-1/2 (RMSMatched) drops





1/n Please stop by👋. This is not just another ICML 2026 optimizer paper. We have rich intuition to share on why simple preconditioners like orthogonalization and row-normalization specifically benefit NNs optimization. Quick overview below 🧵

Mythos has cracked MacOS. It took five days.

Neural networks might speak English, but they think in shapes. Understanding their rich *neural geometry* is key to understanding how they work – and to debugging and controlling them with precision. Starting today, we’re releasing a series of posts on this research agenda. 🧵






