
SkyRL now implements the Tinker API: training scripts written for Tinker can run on your own GPUs with zero code changes, using SkyRL's FSDP2, Megatron, and vLLM backends. Blog: novasky-ai.notion.site/skyrl-tinker 🧵
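In practice, the "zero code changes" workflow typically amounts to redirecting the Tinker client from the hosted service to your own SkyRL endpoint. A hypothetical sketch, assuming an environment-variable override and a local server on port 8000 (both are illustrative assumptions, not from the announcement; see the linked blog for the actual setup):

```shell
# Hypothetical: point an unmodified Tinker training script at a local SkyRL server.
# Variable name and port are illustrative; check the SkyRL blog for the real config.
export TINKER_BASE_URL="http://localhost:8000"
python train.py   # the same script that previously ran against the hosted Tinker service
```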

Hi all, extending the invite to the LLMs on Ray office hours next Thursday, 3/5 9:30-10:30AM PT! We will be hosting @erictang000 and @sumanthrh from the @NovaSkyAI SkyRL project to present on inference/vLLM in RL and take questions from the group. After, there will be time for any other questions folks have on distributed inference w/ Ray. Hope you can make it! Sign up for the invite here: forms.gle/QESMQ8ojRJsCZV…

Introducing our new work K-Search: LLM Kernel Generation via Co-Evolving Intrinsic World Model, a new paradigm for automated GPU kernel generation, achieving SoTA results.

🔍 Big insight: Traditional methods treat LLMs as stochastic code generators inside heuristic loops, but this misses a key point: LLMs are powerful planners with rich domain priors.

🧠 Core idea: K-Search uses the LLM itself as a co-evolving world model, one that plans, updates beliefs, and guides search decisions based on experience.

📌 This decouples high-level strategy (intent) from low-level code implementation, allowing the optimizer to pursue multi-step transformations even when intermediate implementations don't immediately improve performance.

📈 Key results:
🔥 Our discovered kernels achieve a ~2.10× average speedup over state-of-the-art evolutionary search across 4 FlashInfer kernels on H100/B200.
🔥 Up to 14.3× gain on complex Mixture-of-Experts (MoE) kernels.
🔥 State-of-the-art performance on the GPUMode TriMul (H100) task, beating both automated and human solutions.

🙏 Acknowledgements: This work was developed at @BerkeleySky w/ the amazing @ziming_mao, @profjoeyg, and @istoica05. We thank @DachengLi177, @MayankMish98, @randwalk0, @pgasawa, @fangz_zzu, and @tian_xia_ for helpful discussion and feedback. We also thank @databricks, @awscloud, @anyscalecompute, @nvidia, @Google, @LambdaAPI, and @MayfieldFund for generous compute support.

👨‍💻 GitHub: github.com/caoshiyi/K-Sea…
📄 arXiv: arxiv.org/pdf/2602.19128…




🔥 Modify 2 lines of code and get your agentic serving/rollout up to 3.9× faster, losslessly!
⚡️ Say hello to ThunderAgent, a fast, simple, and program-aware agentic inference system.
🥇 We propose a program abstraction for scheduling all GPU and CPU resources, the first principled approach to distributed agentic inference and rollout.
🌐 Blog: thunderagent.ai
💻 Code: github.com/ThunderAgent-o…
📜 Paper: arxiv.org/pdf/2602.13692
#AI #ThunderAgent #LLMAgent #MLSys 1/n

Releasing the official SkyRL + Harbor integration: a standardized way to train terminal-use agents with RL. From the creators of Terminal-Bench, Harbor is a widely adopted framework for evaluating terminal-use agents on any task expressible as a Dockerfile + instruction + test script. This integration extends it: the same tasks you evaluate on, you can now RL-train on. Blog: novasky-ai.notion.site/skyrl-harbor 🧵
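Since a Harbor task is just a Dockerfile + instruction + test script, a minimal task directory might look like the sketch below (a hypothetical layout; the file names are illustrative, not taken from the Harbor docs). Under this framing, the same test script that scores an evaluation run can serve as the reward signal for RL training:

```
my-task/
├── Dockerfile        # environment the agent's terminal session runs in
├── instruction.md    # natural-language task given to the agent
└── test.sh           # script that checks whether the task was solved
```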

LLM RL Training with SkyRL. One of the best RL talks out there; well worth watching. youtu.be/MrJNri6ysYQ

(1/9) We built Endless Terminals: a fully autonomous pipeline that procedurally generates terminal tasks for RL training with no human annotation needed. Simple PPO + scaled environments give consistent improvements on downstream tasks like Terminal Bench 2.0!
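For reference, "simple PPO" here refers to the standard clipped surrogate objective. A minimal per-token sketch of that objective (an illustration of the general technique, not SkyRL's actual implementation):

```python
def ppo_clip(ratio: float, advantage: float, eps: float = 0.2) -> float:
    """Clipped PPO surrogate for one token: min(r*A, clip(r, 1-eps, 1+eps)*A),
    where r is the new/old policy probability ratio and A is the advantage."""
    clipped_ratio = max(1.0 - eps, min(ratio, 1.0 + eps))
    return min(ratio * advantage, clipped_ratio * advantage)

# The objective stops rewarding policy updates once the ratio moves past the clip range:
print(ppo_clip(1.5, 1.0))   # 1.2  (positive advantage, capped at 1 + eps)
print(ppo_clip(0.5, -1.0))  # -0.8 (negative advantage, capped at 1 - eps)
```

The `min` over the clipped and unclipped terms is what keeps updates conservative: it removes the incentive to push the ratio outside `[1 - eps, 1 + eps]` in the direction the advantage favors.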






Announcing OpenThoughts-Agent with an incredible team — a data-centric effort on TerminalBench-style tasks, built with SkyRL+Harbor 💻🤖 Co-leading the RL team over the past month has been a blast, and we’re just getting started! (1/n) 🧵

How can we make a better TerminalBench agent? Today, we are announcing the OpenThoughts-Agent project. OpenThoughts-Agent v1 is the first TerminalBench agent trained on fully open, curated SFT data and RL environments. OpenThinker-Agent-v1 is the strongest model of its size on TerminalBench, and sets a new bar on our newly released OpenThoughts-TB-Dev benchmark. (1/n)

