NovaSky (@NovaSkyAI) - Twitter Profili | Zamantika Mersobahis Locabet

Sabitlenmiş Tweet

NovaSky@NovaSkyAI·13 Şub

We are excited to announce that SkyRL now implements the Tinker API. Run Tinker training scripts on your own hardware with zero code changes. Try it out today: novasky-ai.notion.site/skyrl-tinker

Tyler Griggs@tyler_griggs_

SkyRL now implements the Tinker API. Now, training scripts written for Tinker can run on your own GPUs with zero code changes using SkyRL's FSDP2, Megatron, and vLLM backends. Blog: novasky-ai.notion.site/skyrl-tinker 🧵

English

0

4

27

2.7K

NovaSky retweetledi

Kexin Huang@KexinHuang5·5d

Today, we're excited to share that Biomni is published in @ScienceMagazine. Biomedical research is still fragmented, manual, and difficult to scale. In this work, we introduce Biomni - the first general-purpose biomedical AI agent with an integrated biology environment that can reason, plan, and execute end-to-end scientific workflows. We show that, with the right environment and harness, AI can automate large-scale omics analyses, orchestrate laboratory robotics, optimize molecular properties, and even train new AI models for biology. We also introduce a reinforcement learning recipe for continually improving biomedical AI agents, enabling open-source models to achieve frontier-level performance. It's surreal to look back. We started the Biomni project in early 2024, when agentic AI was still nascent. It is exciting to see tens of thousands of biologists collaborating with agents every day to accelerate science. Try Biomni: biomni.phylo.bio Read more: science.org/doi/10.1126/sc… This work is not possible without this truly inter-disciplinary team: @serena2z @hcwww_ @YuanhaoQ Minta Lu, Ryan Li, @yusufroohani Lin Qiu @shiyi_c98 Gavin Junze Di @rickwierenga @kavi_deniz Sherry @TianweiShe Shruti Jennefer Xin Zhou @MWheelerMD Jon Bernstein @MengdiWang10 @PengHeAtlas @zhou_jingtian @SnyderShot @lecong Aviv Regev @jure @StanfordAILab @genentech @phylo_bio @arcinstitute @UW @berkeley_ai @RetroBio_ @tamarindbio @Princeton @UCSF

English

39

184

856

129.8K

NovaSky retweetledi

Shiyi Cao@shiyi_c98·5d

Biomni is now out in Science @ScienceMagazine ! Huge congrats to the team 🎉 Biomni is a really exciting step toward AI agents that can carry out real biomedical research workflows. Proud that SkyRL supported the RL training for Biomni-R0 early last year, where we explored end-to-end multi-turn RL for biomedical reasoning agents. It’s really exciting to see how AI agents have evolved the past one year toward carrying out real scientific workflows and accelerating discovery.

Kexin Huang@KexinHuang5

Today, we're excited to share that Biomni is published in @ScienceMagazine. Biomedical research is still fragmented, manual, and difficult to scale. In this work, we introduce Biomni - the first general-purpose biomedical AI agent with an integrated biology environment that can reason, plan, and execute end-to-end scientific workflows. We show that, with the right environment and harness, AI can automate large-scale omics analyses, orchestrate laboratory robotics, optimize molecular properties, and even train new AI models for biology. We also introduce a reinforcement learning recipe for continually improving biomedical AI agents, enabling open-source models to achieve frontier-level performance. It's surreal to look back. We started the Biomni project in early 2024, when agentic AI was still nascent. It is exciting to see tens of thousands of biologists collaborating with agents every day to accelerate science. Try Biomni: biomni.phylo.bio Read more: science.org/doi/10.1126/sc… This work is not possible without this truly inter-disciplinary team: @serena2z @hcwww_ @YuanhaoQ Minta Lu, Ryan Li, @yusufroohani Lin Qiu @shiyi_c98 Gavin Junze Di @rickwierenga @kavi_deniz Sherry @TianweiShe Shruti Jennefer Xin Zhou @MWheelerMD Jon Bernstein @MengdiWang10 @PengHeAtlas @zhou_jingtian @SnyderShot @lecong Aviv Regev @jure @StanfordAILab @genentech @phylo_bio @arcinstitute @UW @berkeley_ai @RetroBio_ @tamarindbio @Princeton @UCSF

English

3

7

20

5.3K

NovaSky retweetledi

Casper Hansen@casper_hansen_·26 Haz

SkyRL Tinker implementation is so cool, but still early days so I went ahead and implemented a basic chunked cross-entropy for 23x less memory usage! Now most models fit much more seamlessly in memory when combined with Megatron.

English

5

11

70

3.1K

NovaSky retweetledi

Stas Bekman@StasBekman·29 Haz

In parallel we announce a new open source repo: github.com/Snowflake-AI-R… This is the framework for very fast RL (and future other optimizations rolled into it) It currently has all the code you need to use or integrate Arctic RL into RL frameworks, with SkyRL and Verl available and more framework integrations coming. Please kindly spread the word! Thank you!

English

1

15

101

5.2K

NovaSky retweetledi

Stas Bekman@StasBekman·29 Haz

After many months of intense work the @Snowflake AI Research team is happy to present to you the new open source project: Arctic RL snowflake.com/en/blog/engine… - Arctic RL integrates with VeRL and SkyRL today; enable ZoRRo with one config flag, no code changes required - ZoRRo delivers up to 6x actor-update acceleration and a 3.5x end-to-end training speedup, reducing Arctic-Text2SQL-R2 training from ~5 days to ~36 hours on 32 H200 GPUs - Arctic-Text2SQL-R2 achieved higher accuracy scores (48.7) than Gemini 3.1 Pro (47.9) and Claude 4.7 (47.3) on Snowflake's evaluated enterprise SQL benchmark under the tested conditions - Two open source recipes ship with this release: a text-to-SQL recipe that improved BIRD dev accuracy from 59.92% to 70.35%, and a multi-hop QA recipe that improved average accuracy from 69.6% to 72.3%

English

5

15

82

6.3K

NovaSky retweetledi

Trajectory@trajectorylabs·30 May

🏹5 Days of Trajectory. Day 3 - An Open Source Training Stack for Continual Learning Building the platform for continual learning requires both partnering with pioneering AI companies, as we showed on Day 2 with Harvey, and working toward frontier research, which we are highlighting today. Continual learning means models that improve hourly from real production use. But with the size of frontier models, this becomes quite difficult. A Qwen-397b would need to spin up and tear down repeatedly across six GPU nodes, and that's valuable time gone. Our contribution is Continual LoRA (C-LoRA): many lightweight adapters running at once on one shared base model. Our insight centers on where the parallelism lives: instead of splitting one giant job across nodes, we load-balance many small jobs over a single base. The result: 2.81x experiment throughput over single-tenant training, with no regression on rewards. We built this together, with @anyscalecompute, @NovaSkyAI, and generous support from @GoogleCloud and @GoogleStartups. We've open-sourced on SkyRL as one of the first multi-LoRA, RL training platforms, so that every team can get to continual learning faster. We’re very excited to see what you build, please reach out!

English

11

62

507

95.9K

NovaSky retweetledi

vLLM@vllm_project·29 May

We've shipped two major upgrades for RL✨! 1. Native weight syncing APIs: Standardizes weight transfer, provides optimized implementations for NCCL and CUDA IPC out of the box, and also lets frameworks easily bring their own. 2. Improved pause/resume for Async RL: Careful coordination between DP ranks so that engines don’t deadlock. Validated at scale in P/D, wide-EP setups! In collaboration with @anyscalecompute, @NovaSkyAI, and @RedHat. More and more RL frameworks are using vLLM as the default for inference, details in the blog 👇 vllm.ai/blog/2026-05-2…

English

2

28

215

11.7K

NovaSky retweetledi

Simon Mo@simon_mo_·29 May

Amazing work! More and more RL frameworks are using vLLM as default. @vllm_project along with @anyscalecompute and @NovaSkyAI revamped weight syncing and improved wide-ep deployment for rollout!

Sumanth Hegde@sumanthrh

Excited to share some of our work on improving vLLM for RL! A number of RL frameworks, including SkyRL, use vLLM for inference, and we’ve noticed some common problems: 1. Weight syncing between training and inference is implemented in an ad-hoc fashion and duplicated across frameworks. 2. Asynchronous RL is prone to break at scale, especially in P/D and DPEP deployments. We’ve been working on improving both!

English

1

10

64

5.5K

NovaSky retweetledi

Eric Tang@erictang000·27 May

it's been a really great experience working w/ @j316chuck and the @trajectorylabs team on building out their post-training stack for continual learning on top of SkyRL really excited to continue collaborating and seeing how the team can push the frontier for continual learning!

Ronak Malde@rronak_

Today, @MichaelElabd, @QuantumArjun, and I are excited to announce Trajectory. We are a research lab and product company building the platform for Continual Learning. Our platform unlocks the signal already sitting in product usage, so companies can continuously post-train large-scale agentic models that outperform the frontier. @trajectorylabs We’ve raised $15M from @Conviction, @BessemerVP, @radicalvcfund, @jeffdean, @drfeifei and more. We’re partnering with some of the best AI-native companies: @ClayRunHQ @Harvey, @DecagonAI, @mercor_ai, @RogoAI to power their agentic systems, some of which we are already in production with. We’ve brought together a world class research team from DeepMind, OpenAI, Apple, Meta Superintelligence, Amazon AGI, Scale AI, and an elite product team from Stripe and Figma. AI will never again start on day one. Every correction, every retry, every edit will make products smarter. This is Continual Learning.

English

3

2

21

1.7K

NovaSky retweetledi

Sumanth Hegde@sumanthrh·27 May

Awesome to see Trajectory labs launch out of stealth! It's been great collaborating with them in building out multi-LoRA for SkyRL!

jenn@jennzhaii

So excited to share that I’ve joined @trajectorylabs! We’re pushing the frontier of RL research to build the platform for continual learning - systems that learn and evolve alongside your products in real time. We believe in a world where everyone has the power to own their own intelligence and shape their own destiny. And we’re hiring :)

English

4

1

14

3.8K

NovaSky@NovaSkyAI·27 May

RT @charlie_ruan: Excited to have supported @trajectorylabs with the SkyRL team over the past month, bringing training onto their own clust…

English

0

2

0

34

NovaSky retweetledi

Ronak Malde@rronak_·27 May

Today, @MichaelElabd, @QuantumArjun, and I are excited to announce Trajectory. We are a research lab and product company building the platform for Continual Learning. Our platform unlocks the signal already sitting in product usage, so companies can continuously post-train large-scale agentic models that outperform the frontier. @trajectorylabs We’ve raised $15M from @Conviction, @BessemerVP, @radicalvcfund, @jeffdean, @drfeifei and more. We’re partnering with some of the best AI-native companies: @ClayRunHQ @Harvey, @DecagonAI, @mercor_ai, @RogoAI to power their agentic systems, some of which we are already in production with. We’ve brought together a world class research team from DeepMind, OpenAI, Apple, Meta Superintelligence, Amazon AGI, Scale AI, and an elite product team from Stripe and Figma. AI will never again start on day one. Every correction, every retry, every edit will make products smarter. This is Continual Learning.

English

246

141

1.4K

1.8M

NovaSky retweetledi

Charlie Ruan@charlie_ruan·19 May

Super cool work built with SkyRL! Have also thought about how it's so wasteful not to train on the observation tokens. Can't wait to upstream it:)

Dimitris Papailiopoulos@DimitrisPapail

x.com/i/article/2056…

English

0

4

31

3.5K

NovaSky retweetledi

Charlie Ruan@charlie_ruan·12 May

A 4B RLM trained with SkyRL that matches Sonnet 4.6, very cool work!

alphaXiv@askalphaxiv

Reinforcing Recursive Language Models Can a 4B model learn to recursively call itself to answer hard long-context questions? We RL fine-tuned a small model to behave as a native RLM. On evidence selection across scientific papers, our 4B RLM matches Sonnet 4.6 in quality while running significantly faster and cheaper.

English

0

5

29

2.9K

NovaSky@NovaSkyAI·24 Nis

Check out VLM training in SkyRL! SFT / multi-turn RL, LoRA / full-finetuning, tinker-compatible as well!

Nithin Chalapathi@nithinch10

SkyRL now supports end-to-end vision-language post-training, from SFT to agentic RL, and adds vision model support to SkyRL’s Tinker interface! Existing multimodal cookbooks, e.g. VLM classification, work out of the box:

English

0

10

800

NovaSky retweetledi

Ziming Mao@ziming_mao·7 Nis

🚀 Excited to share the training & inference results for UCCL-EP: a portable, high-performance expert-parallel communication library across heterogeneous GPU + NIC hardware. 💻 Code: github.com/uccl-project/u… 📝 Blog: uccl-project.github.io/posts/uccl-ep-… 📈 Highlights: • Up to 45% faster Megatron-LM training vs RCCL on 128 AMD GPUs • Up to 40% faster SGLang inference vs NCCL on 32 H200 GPUs • Up to 25% lower vLLM TPOT vs NCCL • Up to 2.3x better EP dispatch/combine on AWS EFA 🔁 Fully portable across heterogeneous GPU/NIC hardware and a drop-in replacement for DeepEP Amazing team: Chon Lam Lao, @yangzhouy, Yihan Zhang, Chihan Cui, Zhongjie Chen, Zhiying Xu, @KaichaoYou, Zhen Huang, Zhenyu Gu, Costin Raiciu, Scott Shenker, @istoica05

English

1

20

69

5.3K

NovaSky@NovaSkyAI·23 Mar

Great work from the @OpenHandsDev community and CMU! Open source SOTA on code localization via RL. Happy to see the beautiful reward curves trained with SkyRL!

Aditya Soni@Aditya_Soni_8

Can we train code agents to search relevant locations in a codebase only using a terminal? Introducing CodeScout: an effective RL recipe for code search 🚀 🏆 Outperforms 18x larger OSS LLMs 🔥 Comparable to proprietary LLMs 📈 SoTA on SWE-Bench Verified, Pro, & Lite 🧵 [1/N]

English

0

7

25

3.3K

NovaSky retweetledi

vLLM@vllm_project·3 Mar

Excited to see SkyRL sharing their work on inference and vLLM in RL at the LLMs on Ray office hours this Thursday. If you’re exploring using vLLM in RL workflows, this will be a great session to join. See you there 👇

Seiji Eicher@seiji_________

Hi all, extending the invite to the LLMs on Ray office hours next Thursday, 3/5 9:30-10:30AM PT! We will be hosting @erictang000 and @sumanthrh from the @NovaSkyAI SkyRL project to present on inference/vLLM in RL and take questions from the group. After, there will be time for any other questions folks have on distributed inference w/ Ray. Hope you can make it! Sign up for the invite here: forms.gle/QESMQ8ojRJsCZV…

English

1

9

42

7.4K

NovaSky@NovaSkyAI·27 Şub

We’ve been consistently surprised lately by how capable frontier models are at handling complex kernel implementation and system optimization. Check out this work as a step toward automating AI infrastructure building!

Shiyi Cao@shiyi_c98

Introducing our new work K-Search: LLM Kernel Generation via Co-Evolving Intrinsic World Model — a new paradigm for automated GPU kernel generation, achieving SoTA results. 🔍 Big insight: Traditional methods treat LLMs as stochastic code generators inside heuristic loops — but this misses a key point: LLMs are powerful planners with rich domain priors. 🧠 Core idea: K-Search uses the LLM itself as a co-evolving world model — one that plans + updates beliefs + guides search decisions based on experience. 📌 This decouples high-level strategy (intent) from low-level code implementation, allowing the optimizer to pursue multi-step transformations even when intermediate implementations don’t immediately improve performance. 📈 Key results: 🔥 Our discovered kernels are ~2.10× average speedup vs state-of-the-art evolutionary search across 4 FlashInfer kernels on H100/B200. 🔥 Up to 14.3× gain on complex Mixture-of-Experts (MoE) kernels. 🔥 State-of-the-art performance on GPUMode TriMul (H100) task — beating both automated and human solutions. 🙏 Acknowledgements This work is developed in @BerkeleySky, w/ the amazing @ziming_mao, @profjoeyg, and @istoica05. We thank @DachengLi177, @MayankMish98, @randwalk0, @pgasawa, @fangz_zzu, and @tian_xia_ for helpful discussion and feedback. We also thank the generous compute support from @databricks, @awscloud, @anyscalecompute, @nvidia, @Google, @LambdaAPI, and @MayfieldFund. 👨‍💻 GitHub: github.com/caoshiyi/K-Sea… 📄 arXiv: arxiv.org/pdf/2602.19128…

English

0

3

20

2.8K

NovaSky retweetledi

Shiyi Cao@shiyi_c98·26 Şub

Introducing our new work K-Search: LLM Kernel Generation via Co-Evolving Intrinsic World Model — a new paradigm for automated GPU kernel generation, achieving SoTA results. 🔍 Big insight: Traditional methods treat LLMs as stochastic code generators inside heuristic loops — but this misses a key point: LLMs are powerful planners with rich domain priors. 🧠 Core idea: K-Search uses the LLM itself as a co-evolving world model — one that plans + updates beliefs + guides search decisions based on experience. 📌 This decouples high-level strategy (intent) from low-level code implementation, allowing the optimizer to pursue multi-step transformations even when intermediate implementations don’t immediately improve performance. 📈 Key results: 🔥 Our discovered kernels are ~2.10× average speedup vs state-of-the-art evolutionary search across 4 FlashInfer kernels on H100/B200. 🔥 Up to 14.3× gain on complex Mixture-of-Experts (MoE) kernels. 🔥 State-of-the-art performance on GPUMode TriMul (H100) task — beating both automated and human solutions. 🙏 Acknowledgements This work is developed in @BerkeleySky, w/ the amazing @ziming_mao, @profjoeyg, and @istoica05. We thank @DachengLi177, @MayankMish98, @randwalk0, @pgasawa, @fangz_zzu, and @tian_xia_ for helpful discussion and feedback. We also thank the generous compute support from @databricks, @awscloud, @anyscalecompute, @nvidia, @Google, @LambdaAPI, and @MayfieldFund. 👨‍💻 GitHub: github.com/caoshiyi/K-Sea… 📄 arXiv: arxiv.org/pdf/2602.19128…

English

12

64

307

97.7K

NovaSky

Keşfet