
@hyhieu226 While studying LLM compression, we also confirmed that the gradients are indeed low-rank with often very small rank. (ZS-SVD ICML 2026)
mint-vu.github.io/ZS-SVD/
English
Soheil Kolouri
37 posts

@SKolouri
Assistant Professor @Vanderbilt_CS


Introducing 🥚EGGROLL 🥚(Evolution Guided General Optimization via Low-rank Learning)! 🚀 Scaling backprop-free Evolution Strategies (ES) for billion-parameter models at large population sizes ⚡100x Training Throughput 🎯Fast Convergence 🔢Pure Int8 Pretraining of RNN LLMs








Often repeated by my supervisor @CSProfKGD too. You should be able to get the gist of a paper by just reading its figures and captions. This is what captures attention, otherwise you’re discouraging ppl from reading on. @ylecun spitting truth.







