UC Berkeley Sky retweetledi

Introducing M²RNN: Non-Linear RNNs with Matrix-Valued States for Scalable Language Modeling
We bring back non-linear recurrence to language modeling and show it's been held back by small state sizes, not by non-linearity itself.
📄 Paper: arxiv.org/abs/2603.14360
💻 Code: github.com/open-lm-engine…
🤗 Models: huggingface.co/collections/op…
English






















