Gökdeniz Gülmez
1.6K posts

@ActuallyIsaak
ML Researcher | Core contributor to MLX | Violin enthusiast 🎻 | Always coding, sometimes watching Anime.

The newest model in the Mamba series is finally here 🐍 Hybrid models have become increasingly popular, raising the importance of designing the next generation of linear models. We've introduced several SSM-centric ideas to significantly increase Mamba-2's modeling capabilities without compromising on speed. The resulting Mamba-3 model has noticeable performance gains over the most popular previous linear models (such as Mamba-2 and Gated DeltaNet) at all sizes. This is the first Mamba that was student-led: all credit to @aakash_lahoti @kevinyli_ @_berlinchen @caitWW9, and of course @tri_dao!

I hosted @awnihannun on localhost, co-creator of MLX and member of the Deep Speech mafia. Enjoy! Apple Podcasts: podcasts.apple.com/us/podcast/loc… Spotify: open.spotify.com/episode/01Q0Re…

macbook neo ? more like macbook pocket
