Post

We built a fork of @NovaSkyAI SkyRL, making SGLang by @lmsysorg a fully supported rollout engine, and integrating @_xjdr nmoe as a training backend, providing full support for B200 MoE RL
github.com/ai-blaise/nmoe
github.com/ai-blaise/sgla…
github.com/ai-blaise/SkyRL
English

@NovaSkyAI @lmsysorg @_xjdr Highly performant forward and backward G1 attention gate kernels based on @Alibaba_Qwen Gated Attention research
github.com/ai-blaise/nmoe…
github.com/ai-blaise/nmoe…
github.com/ai-blaise/nmoe…

English

@_BlaiseAI @NovaSkyAI @lmsysorg @_xjdr Thank you for building with SkyRL! Let's chat about upstreaming your contributions to main?
English
