Stefan Bauer
@stefanAbauer
deep learning & causality

Today we release crowd-code 2.0, our second phase of crowd-sourcing the largest long-horizon open software engineering dataset. Install once. Forget about it.

We’re releasing UNI-D², a unified codebase for discrete diffusion language models 🤝🚀
Co-led with @vincentpaulinef and an amazing advisor team: @stefanAbauer, @AlexanderTong7, @andrea_dittadi, @AMK6610, @KaplFer 🙌
🔗 GitHub: github.com/nkalyanv99/UNI…
📚 Docs: nkalyanv99.github.io/UNI-D2/
Reproduce and extend state-of-the-art baselines with one toolkit. Let’s move beyond autoregressive models and push discrete diffusion together 🧵👇

We now know that LoRA can match full-parameter RL training (from x.com/thinkymachines… and our Tina paper arxiv.org/abs/2504.15777), but what about DoRA, QLoRA, and more? We are releasing a clean LoRA-for-RL repo to explore them all. github.com/shangshang-wan…

Talk about perfect timing! 🧞🧞‍♀️ Check out what we have been cooking for the last few weeks: Jasmine, a production-ready JAX-based codebase for world modeling from unlabeled videos.

Inspired by today's Genie 3 release? We are open-sourcing 🧞‍♀️ Jasmine 🧞‍♀️, a production-ready JAX-based codebase for world modeling from unlabeled videos. Scale from single hosts to hundreds of xPUs thanks to XLA! 🧵 (1/10)

We are hosting a student researcher this year on the Paradigms of Intelligence team at Google! Interested in working with @ninoscherrer and me on AGI, or whatever you think is the next big thing 🥰? Please consider applying! docs.google.com/forms/u/2/d/e/…
