

Delta Institute @ MLSys

@DeltaInstitutes
Supporting exceptional researchers and engineers, from academia to industry and beyond.



An early beta of Grok Build, an agentic CLI for coding, building apps, and automating workflows, is now available for SuperGrok Heavy subscribers. Through this early beta, we will improve the model and product based on your feedback. Try it at x.ai/cli



The authors introduce Kaon, a Muon variant that replaces the singular values (SVs) with random noise. Kaon matches Muon, suggesting Muon’s gains don’t depend on geometry. They also show Muon has a stable opt. step size, yielding a more effective learning rate during training. 🔗 arxiv.org/abs/2605.11181


Our paper “Learning from Less: Measuring the Effectiveness of RLVR in Low Data and Compute Regimes” was accepted to #MLSys 2026! We introduce three procedurally generated, verifiable datasets—Counting, Graph, and Spatial Reasoning—to study RLVR under low-data / low-compute constraints. Key result: small, mixed-complexity datasets can be more data-efficient than large, easy ones.







We have released the source code and benchmarks of TokenWeave. TokenWeave speeds up distributed LLM inference via compute–communication overlap and fused AllReduce, RMSNorm, and residual addition. Code: github.com/microsoft/toke… Paper: arxiv.org/pdf/2505.11329 Try it out!










Some updates: I've joined Recursive as a member of the founding team. My core curiosity about the world centers on how complex patterns and knowledge emerge from the two open-ended processes we know: natural and cultural evolution. I've been lucky to explore this during my PhD through works like ADAS, Darwin Gödel Machine, and The AI Scientist. Excited to keep chasing this thread with the incredible team!




I'm excited to finally release the fruit of the research we've been doing at Perceptron for the last 16 months: Perceptron Mk1. We've been developing multi-modal recipes from the ground up to build models that perform best in the physical world, from video understanding to embodied reasoning to robotics. Mk1 is our scaled-up recipe.


