
presented for your amusement:
twitter.com/aelaron_/statu…
Sasha Rush (@srush_nlp)
What's neat about the Mamba paper is that they're really exploring the design space outside of PyTorch. Like this model makes no sense if you aren't willing to get your hands dirty and prove it. github.com/state-spaces/m…















