

🚀 Introducing TLT (Taming the Long-Tail), an efficient, lossless system that boosts reasoning RL training by mitigating the rollout bottleneck! 🏆 Accepted by #ASPLOS2026 ✨ What’s new? ⚡ Enjoy 2× faster end-to-end reasoning RL training 🔒 Lossless on-policy RL — training quality preserved theoretically and empirically 🎁 Get a free, high-quality draft model for efficient deployment 🔗 Github: github.com/mit-han-lab/fa… 🔗 Paper: arxiv.org/pdf/2511.16665 👇 More in the thread (1/7) #ASPLOS #EfficientAI #OpenSource #LLMs #Reasoning #ReinforcementLearning















