
@neversupervised LLMs are different species and should be treated as such
Van0SS
61 posts







SkyRL now implements the Tinker API. Training scripts written for Tinker can run on your own GPUs with zero code changes, using SkyRL's FSDP2, Megatron, and vLLM backends. Blog: novasky-ai.notion.site/skyrl-tinker 🧵

Frontier labs spend millions purchasing RL environments for training terminal agents. But we decided to open source them. Introducing SETA: Scaling Environments for Terminal Agents, the largest open-source set of RL training environments for terminal agents. We released:
- 400 terminal agent training environments, with more to come
- A SOTA agent harness on terminal-bench with the CAMEL terminal toolkit
- The RL training pipeline and trained SETA-RL-Qwen3-8B model weights









