slime

49 posts

slime

slime

@slime_framework

The LLM post-training framework for RL Scaling. https://t.co/4ILpx8hfKN

Katılım Eylül 2025
8 Takip Edilen910 Takipçiler
Sabitlenmiş Tweet
slime
slime@slime_framework·
slime v0.2.4 is here! This release focuses on profiling and observability. Highlights: - rollout trace timeline viewer - real-time W&B reporting for ITL / TTFT and other runtime metrics - router stack unified on sgl-router We hope this makes rollout systems much easier to inspect and optimize. Release: github.com/THUDM/slime/re…
English
2
7
57
4.1K
slime
slime@slime_framework·
wow, congrats!
RadixArk@radixark

Today, we are thrilled to officially launch RadixArk with $100M in Seed funding at a $400M valuation. The round was led by @Accel and co-led by @sparkcapital. RadixArk exists to make frontier AI infrastructure open and accessible to everyone. Today, the systems behind the most capable AI models are concentrated in a small number of companies. As a result, most AI teams are forced to rebuild training and inference stacks from scratch, duplicating the same infrastructure work instead of focusing on new models, products, and ideas. RadixArk was founded to change that. We are building an AI platform that makes it easier for teams to train and serve the best models at scale. RadixArk comes from the open-source community. We started with SGLang, where many of us are core developers and maintainers, and expanded our work to Miles for large-scale RL and post-training. We will continue contributing to both projects and working with the community to make them the strongest open-source infrastructure foundations for frontier AI. We would like to thank our long-term partners, contributors, and the broader SGLang community for believing in this mission. We're also grateful to @Accel and @sparkcapital, NVentures (Venture capital arm of @nvidia), Salience Capital, A&E Investment, @HOFCapital, @walden_catalyst, @AMD, LDVP, WTT Fubon Family, @MediaTek, Vocal Ventures, @Sky9Capital and our angel investors @ibab, @LipBuTan1, Hock Tan, @johnschulman2, @soumithchintala, @lilianweng, @oliveur, @Thom_Wolf, @LiamFedus, @robertnishihara, @ericzelikman, @OfficialLoganK, and @multiply_matrix among others. Thanks for the exclusive interview with @MeghanBobrowsky at @WSJ about our vision.

English
1
0
7
533
slime
slime@slime_framework·
slime environment & CI update: - SGLang upgraded to 0.5.10.post1 - Megatron updated to dev commit 1dcf0dafa884, following the radixark/miles setup - Part of Qwen2.5/3 CI replaced with Qwen3.5/3.6 - Added CI for PD disaggregation - Added CI for GLM-4.7-Flash with transformers 5 This update improves compatibility with newer model and dependency stacks.
English
1
2
18
1.5K
slime retweetledi
Z.ai
Z.ai@Zai_org·
Scaling laws push model capability forward. But whether that capability becomes reliable in production depends on how we handle Scaling Pain. z.ai/blog/scaling-p… In our latest blog, we share how we debugged GLM-5 serving at scale: reproducing rare garbled outputs, repetition, and rare-character generation; tracing and eliminating KV Cache race conditions; fixing HiCache synchronization issues; and introducing LayerSplit for up to 132% throughput improvement. We hope these lessons help the community avoid similar pitfalls and build more robust inference infrastructure.
English
40
82
884
84.3K
slime
slime@slime_framework·
🎉🎉🎉
RadixArk@radixark

Training DeepSeek V4 @deepseek_ai at scale? SGLang + Miles is the Day 0 path. @lmsysorg Miles and SGLang enable full-parameter RL training for DSV4 with stability, efficiency, and broad hardware support. ✅ Verified stability - Rollout Routing Replay (R3) and indexer replay (experimental) - Tensor-level validation across the Miles & Megatron mixed-precision training stack - Step-0 train-inference diff: ~0.02–0.03 ✅ Efficient full-parameter RL - DP / TP / SP / EP / PP / CP support - Tilelang attention and indexer kernels - FP8/BF16 rollout and FP8/BF16 training support ✅ Broad hardware support - Verified training on NVIDIA Hopper and Grace Blackwell clusters - Ready for DeepSeek V4 RL from Day 0 This is the exclusive Day 0 path to scale DeepSeek V4 with rock-solid reliability. Full technical docs & setup guide below! 👇 #DeepSeekV4 #SGLang #RL

ART
0
0
19
1.3K
slime
slime@slime_framework·
As always, trained with slime.
Z.ai@Zai_org

Introducing GLM-5.1: The Next Level of Open Source - Top-Tier Performance: #1 in open source and #3 globally across SWE-Bench Pro, Terminal-Bench, and NL2Repo. - Built for Long-Horizon Tasks: Runs autonomously for 8 hours, refining strategies through thousands of iterations. Blog: z.ai/blog/glm-5.1 Weights: huggingface.co/zai-org/GLM-5.1 API: docs.z.ai/guides/llm/glm… Coding Plan: z.ai/subscribe Coming to chat.z.ai in the next few days.

English
0
2
21
1.8K
slime retweetledi
Chengxing Xie
Chengxing Xie@Chengxing_Xie·
Huge shoutout to slime-agentic, a community project built on the slime RL framework! It elegantly implements various Agentic RL paradigms—including Agentflow, Memagent, and ToolOrchestra—using slime's flexible custom interfaces without any messy codebase overrides. If you're into Agentic RL, check it out! 👇 🔗 github.com/LMIS-ORG/slime…
English
2
15
77
3.9K
slime
slime@slime_framework·
slime v0.2.4 is here! This release focuses on profiling and observability. Highlights: - rollout trace timeline viewer - real-time W&B reporting for ITL / TTFT and other runtime metrics - router stack unified on sgl-router We hope this makes rollout systems much easier to inspect and optimize. Release: github.com/THUDM/slime/re…
English
2
7
57
4.1K
slime
slime@slime_framework·
slime v0.2.3 is here! 🚀 The biggest update in this release is YAML-based --sglang-config. It enables much more flexible SGLang configuration for advanced rollout setups, including: - PD disaggregation with different parallelism for prefill / decode - EPD - serving multiple different models launching multiple routers in one deployment We hope v0.2.3 gives you much more freedom in building efficient rollout systems. Release: github.com/THUDM/slime/re…
English
2
3
20
1.6K
slime retweetledi
Ling Yang
Ling Yang@LingYang_PU·
What if your AI agent got better just by talking to you? Introducing OpenClaw-RL — a fully async RL framework that turns your everyday conversations into training signals. Your agent learns your habits, your workflows, your preferences. Privately. Continuously. #Clawdbot #openclaw 🔑 Two learning modes: • Binary RL — likes/dislikes become rewards • On-Policy Distillation — your textual feedback becomes token-level guidance Self-hosted. Zero API keys. Your data never leaves your machine. 👉 github.com/Gen-Verse/Open…
English
16
63
431
30.4K
slime
slime@slime_framework·
GLM-5 support just landed in slime — and the RL infra optimizations mentioned in the GLM-5 tech report are all here. Give it a try: github.com/THUDM/slime/pu…
Z.ai@Zai_org

Presenting the GLM-5 Technical Report! arxiv.org/abs/2602.15763 After the launch of GLM-5, we’re pulling back the curtain on how it was built. Key innovations include: - DSA Adoption: Significantly reduces training and inference costs while preserving long-context fidelity - Asynchronous RL Infrastructure: Drastically improves post-training efficiency by decoupling generation from training - Agent RL Algorithms: Enables the model to learn from complex, long-horizon interactions more effectively Through these innovations, GLM-5 achieves SOTA performance among open-source models, with particularly strong results in real-world software engineering tasks.

English
0
3
24
2.8K
slime
slime@slime_framework·
It’s finally here — GLM-5 just dropped! slime ended up doing a lot more heavy lifting for GLM-5 than before, and we’re super happy about it. We’ll add GLM-5 support in slime next… after a short recharge from the rush :p Happy Chinese New Year!
Z.ai@Zai_org

Introducing GLM-5: From Vibe Coding to Agentic Engineering GLM-5 is built for complex systems engineering and long-horizon agentic tasks. Compared to GLM-4.5, it scales from 355B params (32B active) to 744B (40B active), with pre-training data growing from 23T to 28.5T tokens. Try it now: chat.z.ai Weights: huggingface.co/zai-org/GLM-5 Tech Blog: z.ai/blog/glm-5 OpenRouter (Previously Pony Alpha): openrouter.ai/z-ai/glm-5 Rolling out from Coding Plan Max users: z.ai/subscribe

English
2
2
58
3.8K
slime
slime@slime_framework·
slime v0.2.2 is out! This release brings multiple memory + performance optimizations, plus major new capabilities: • Int4-QAT training • Full R3 (Rollout Routing Replay) support with DeepEP + MTP • Upgraded to SGLang v0.5.7 and Megatron dev branch Huge thanks to everyone who contributed! github.com/THUDM/slime/re…
English
1
6
42
3.2K