Prime Intellect (@PrimeIntellect) - Twitter Profile

Pinned Tweet

The next wave of AI will not be won by better prompts. It will be won by systems that learn from experience. Today, Prime Intellect Lab is out of beta, open for you to start training your own models. The era of self-improving agents is here.

83

204

2K

1.3M

Vibrant Labs@VibrantLabsAI·17h

1/n For browser agents, a major bottleneck in evaluation is truthful scoring on the live web. A task is only as good as your ability to confirm the agent actually did it, on a real site whose state keeps moving and that the agent can potentially misreport. So we took matters into our own hands. Today, we're releasing Ecom Bench on @PrimeIntellect: 40 shopping tasks on real Shopify storefronts, each run in a live @browserbase browser and graded by a deterministic verifier. vibrantlabs.com/research/ecom-…

7

9

26

4.7K

Prime Intellect@PrimeIntellect·16h

@VibrantLabsAI 🔥

0

6

461

Prime Intellect retweeted

Mika Senghaas@mikasenghaas·1d

this is a good one

Prime Intellect@PrimeIntellect

Today we're releasing prime-rl v0.6.0 — enabling RL at trillion-parameter MoE scale on agentic workloads at the highest efficiency. We've relentlessly optimized our RL infra. The result: GLM-5 on agentic SWE tasks at 131k context and sub-5-minute step time.

1

2

42

5.3K

Prime Intellect@PrimeIntellect·1d

prime-rl is fully open source, and we're hiring systems engineers to take it further. Read the full prime-rl performance deep dive: primeintellect.ai/blog/rl-at-1t-…

2

7

100

17.4K

Prime Intellect@PrimeIntellect·1d

Huge thanks to the @vllm_project team, and @robertshaw21 in particular, for all the help along the way. Also to the llm-d and Dynamo teams for the collaboration on routing and inference.

3

0

48

3.5K

Prime Intellect@PrimeIntellect·1d

The trainer is 3D-parallel (FSDP2 + CP + EP), built on TorchTitan. FSDP2 shards params, grads & optimizer state. EP keeps experts sharded and routes tokens with all2all instead of all-gathering ~80GB per layer. CP handles the 131k context and GLM-5's DSA attention.

2

1

51

3.2K

Prime Intellect@PrimeIntellect·1d

Over a long run the trainer and inference policies slowly drift apart, and that mismatch can kill your training. R3 (router replay) captures the routing decisions from the inference engine, replays them on the trainer - KL mismatch drops ~10x.

2

0

48

3.4K
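The router replay idea in the tweet above can be illustrated with a toy mixture-of-experts layer. This is a minimal sketch, not prime-rl's implementation: the function names, the scalar "experts", and the choice to recompute gate weights from the trainer's own logits over the replayed expert indices are all assumptions made for illustration. It shows how a tiny numerical difference in router logits can flip the top-k expert choice, and how replaying the inference engine's choice removes that flip.

```python
import math

def top_k(logits, k):
    # Indices of the k largest logits; ties go to the lower index.
    return sorted(range(len(logits)), key=lambda i: (-logits[i], i))[:k]

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def moe_forward(x, router_logits, experts, k=2, replay_indices=None):
    # Hypothetical toy layer. If replay_indices is given, those experts are
    # used instead of the local top-k; gates still come from the local logits.
    chosen = replay_indices if replay_indices is not None else top_k(router_logits, k)
    gates = softmax([router_logits[i] for i in chosen])
    y = sum(g * experts[i](x) for g, i in zip(gates, chosen))
    return y, chosen

# Three toy experts acting on a scalar token feature.
experts = [lambda x: x, lambda x: 2 * x, lambda x: -5 * x]

# Made-up logits: the same token, with a small numerical mismatch
# between the inference engine and the trainer.
inference_logits = [2.0, 1.01, 1.00]
trainer_logits = [2.0, 1.00, 1.01]

y_inf, chosen_inf = moe_forward(1.0, inference_logits, experts)
y_plain, chosen_plain = moe_forward(1.0, trainer_logits, experts)
y_replay, chosen_replay = moe_forward(
    1.0, trainer_logits, experts, replay_indices=chosen_inf
)
```

Without replay the trainer routes the token to a different expert than the one that generated it, so its output diverges sharply; with replay the two agree up to the small gate difference.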

Prime Intellect@PrimeIntellect·1d

One Mooncake store pools KV cache across all nodes, so any worker can reuse any prefix. The router picks workers by a score over load, queue depth, KV usage and prefix overlap. You get cross-replica cache hits with balanced routing across the whole deployment.

1

0

50

3.5K
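The routing rule in the tweet above (a score over load, queue depth, KV usage and prefix overlap) might look roughly like the sketch below. The tweet does not give the formula, so the linear form, the weights, the `Worker` fields, and the token-tuple stand-in for cached prefixes are all hypothetical.

```python
from dataclasses import dataclass

@dataclass
class Worker:
    name: str
    load: float            # fraction of compute in use, 0..1
    queue_depth: int       # requests waiting on this worker
    kv_usage: float        # fraction of KV cache in use, 0..1
    cached_prefix: tuple   # toy stand-in for a prefix this worker has cached

def prefix_overlap(prompt, cached):
    # Fraction of the prompt covered by the longest shared prefix.
    n = 0
    for a, b in zip(prompt, cached):
        if a != b:
            break
        n += 1
    return n / len(prompt) if prompt else 0.0

def score(worker, prompt, w_load=1.0, w_queue=0.1, w_kv=0.5, w_prefix=2.0):
    # Hypothetical weights: reward prefix reuse, penalize busy workers.
    return (
        w_prefix * prefix_overlap(prompt, worker.cached_prefix)
        - w_load * worker.load
        - w_queue * worker.queue_depth
        - w_kv * worker.kv_usage
    )

def pick_worker(workers, prompt):
    return max(workers, key=lambda w: score(w, prompt))

prompt = (1, 2, 3, 4)
idle = Worker("idle", load=0.1, queue_depth=0, kv_usage=0.1, cached_prefix=())
warm = Worker("warm", load=0.5, queue_depth=2, kv_usage=0.5, cached_prefix=(1, 2, 3, 4))
swamped = Worker("swamped", load=0.9, queue_depth=20, kv_usage=0.9, cached_prefix=(1, 2, 3, 4))

best = pick_worker([idle, warm, swamped], prompt)
fallback = pick_worker([idle, swamped], prompt)
```

With these made-up weights, a moderately busy worker holding the full prefix wins over an idle one, while a heavily queued worker loses even with a full cache hit, which is the balance between cache reuse and load the tweet describes.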

Prime Intellect@PrimeIntellect·1d

We disaggregate prefill and decode onto separate workers. A long prefill used to stall decode for everyone. Now it doesn't.

1

0

57

4.3K
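The stall described in the tweet above can be seen in a deliberately crude queueing model. This is only a sketch under stated assumptions: each worker runs jobs first-in, first-out, the costs are made-up time units, and the KV cache transfer from prefill workers to decode workers is ignored.

```python
def finish_times(jobs):
    # Run (name, cost) jobs FIFO on one worker; return name -> finish time.
    t = 0
    done = {}
    for name, cost in jobs:
        t += cost
        done[name] = t
    return done

# Hypothetical costs in arbitrary time units.
long_prefill = ("prefill_A", 100)
decode_steps = [("decode_B_step1", 1), ("decode_B_step2", 1)]

# Colocated: the long prefill sits ahead of the decode steps in one queue.
colocated = finish_times([long_prefill] + decode_steps)

# Disaggregated: prefill and decode each get their own worker and queue.
prefill_worker = finish_times([long_prefill])
decode_worker = finish_times(decode_steps)
```

In the colocated case the first decode step of request B finishes at t=101; with separate workers it finishes at t=1, while the prefill still takes 100 units on its own worker.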

Prime Intellect@PrimeIntellect·1d

In RL, inference is the bottleneck — we optimize for throughput, not latency. High concurrency, FP8 precision, and wide expert parallelism over 32+ GPUs. Every GPU holds its own slice of experts and acts as its own endpoint.

1

0

68

5.8K

Prime Intellect@PrimeIntellect·1d

Today we're releasing prime-rl v0.6.0 — enabling RL at trillion-parameter MoE scale on agentic workloads at the highest efficiency. We've relentlessly optimized our RL infra. The result: GLM-5 on agentic SWE tasks at 131k context and sub-5-minute step time.

36

86

915

264.3K

Prime Intellect retweeted

Johannes Hagemann@johannes_hage·Jun 16

awesome post by @kimbochen covering RL systems end-to-end, including a SWE training run on GLM-5 using our prime-rl framework.

SemiAnalysis@SemiAnalysis_

RL Systems Mind the Gap: Matching Trainer and Generator Throughput
RL Training Infrastructure, GRPO, PipelineRL, Async RL, Policy Staleness, RL Sandbox Infra, CPU Requirements, TCO Analysis, Thinking Machines Tinker
newsletter.semianalysis.com/p/rl-systems-m…

1

16

144

20.7K

Prime Intellect retweeted

elie@eliebakouch·Jun 16

nice blog by @kimbochen about the current RL ecosystem, goes into detail about the different settings and tradeoffs to consider when RLing open models

SemiAnalysis@SemiAnalysis_

RL Systems Mind the Gap: Matching Trainer and Generator Throughput
RL Training Infrastructure, GRPO, PipelineRL, Async RL, Policy Staleness, RL Sandbox Infra, CPU Requirements, TCO Analysis, Thinking Machines Tinker
newsletter.semianalysis.com/p/rl-systems-m…

1

9

85

13K

Prime Intellect retweeted

Vincent Weisser@vincentweisser·6d

Excited to support this epic Inference-time compute hackathon with Prime Intellect credits for post-training + compute

24hr hack on
> Agents: Multi-step systems that take a goal and execute. Tool use, planning, long horizons.
> Real-Time and Interactive: Sub-second loops, live multimodal.
> RL + Applied AI: Systems that judge capability of a person or a model. Autograders, preference ranking, rubrics, verified-skill signals, human-in-the-loop.

luma.com/hncudfxb

7

95

6.7K

Prime Intellect@PrimeIntellect·Jun 17

@_xjdr @garybasin @nisten 🙏

0

5

141

xjdr@_xjdr·Jun 17

@garybasin @nisten @PrimeIntellect for most of them now. they are the absolute best!

3

0

21

691

xjdr@_xjdr·Jun 17

Ok i did it. this is Noumena's version of your favorite ai code TUI.
I rebuilt the REPL engine to be event based
added a new rust markdown rendering stack that considerably increased performance and stability
i added a rust OpenAI compat websocket client
i added a handful of tools and exposed a handful of others that were hidden by default
this system is optimized for our internal inference system and models but i have recently tested it with a vanilla kimi k2.7 and it works quite well. over the next few days we will make the Noumena Inference Engine available and serve at least the k2.7 model. this project will support native noumena code (ncode) login and keys when it's available.
either way, i hope y'all enjoy and find this useful. this is the first of several things i hope to launch this week
github.com/Noumena-Networ…

xjdr@_xjdr

based on everything that has happened over the last week (and year really) and how good k2.7 is in this harness, it is getting tempting to make this available to y'all

25

33

392

55.4K

Prime Intellect retweeted

Vincent Weisser@vincentweisser·Jun 17

We are so back! Future looking bright to post-train, serve, and continuously improve your own model on top of models like GLM-5.2 using primeintellect.ai 🫡

Z.ai@Zai_org

Introducing GLM-5.2: Frontier Intelligence, Open Weights
- Significant improvements in coding and agentic tasks
- Strong long-horizon capabilities with a 1M context window
- Two levels of reasoning effort: GLM-5.2 (max) pushes the limits, while GLM-5.2 (high) strikes a strong balance between performance and token efficiency
- MIT-licensed open weights
- Same API pricing as GLM-5.1
Tech Blog: z.ai/blog/glm-5.2
Weights: huggingface.co/zai-org/GLM-5.2
API: docs.z.ai/guides/llm/glm…
Coding Plan: z.ai/subscribe
Chat: chat.z.ai

5

116

8.9K

Prime Intellect@PrimeIntellect·Jun 17

@SemiAnalysis_ 👀

0

7

2.7K

SemiAnalysis@SemiAnalysis_·Jun 16

RL Systems Mind the Gap: Matching Trainer and Generator Throughput
RL Training Infrastructure, GRPO, PipelineRL, Async RL, Policy Staleness, RL Sandbox Infra, CPU Requirements, TCO Analysis, Thinking Machines Tinker
newsletter.semianalysis.com/p/rl-systems-m…

8

53

414

191.9K

Prime Intellect retweeted

Vincent Weisser@vincentweisser·Jun 17

Great RL systems deep dive by @SemiAnalysis_
Scaling RL is as much of an infra problem as an algorithm one
SemiAnalysis ran experiments on our stack: Prime RL + Sandboxes.
System efficiency is ultimately queue health to match generator and trainer throughput

SemiAnalysis@SemiAnalysis_

RL Systems Mind the Gap: Matching Trainer and Generator Throughput
RL Training Infrastructure, GRPO, PipelineRL, Async RL, Policy Staleness, RL Sandbox Infra, CPU Requirements, TCO Analysis, Thinking Machines Tinker
newsletter.semianalysis.com/p/rl-systems-m…

4

15

119

26.6K

Prime Intellect retweeted

will brown@willccbb·Jun 15

been beating this drum since early 2025, seems like people are starting to see why it's so important :) RL works -> "train or get trained on" -> open models + post-training infra are the path to institutional flywheels + democratization of AI progress

MidCurveCapital@midwit_capital

The next big trade is infrastructure / RL environments that enable companies to turn their institutional knowledge / processes into continuously improving learning loops that they can own.

6

28

394

35.5K

Prime Intellect retweeted

Vincent Weisser@vincentweisser·Jun 15

Satya is perfectly describing the why and what behind @primeintellect since 2023 🫡
> AI needs to be open & sovereign
> Let every company create its own self-improving agents: and own their loop to make them better
> A rich open ai ecosystem creates far more abundance than a future locked down by a few closed labs
> Every company is becoming an ai company: so every company needs to own its own product <> model improvement loop

@primeintellect enables this today:
> Your own evals + rl envs for the outcomes you care about
> models self-improving in production from your real traces
> don't cede your moat to a handful of labs. This self-improvement loop is the IP and it compounds

Open self improving agents for everyone 🫡

Satya Nadella@satyanadella

x.com/i/article/2065…

13

33

227

24.7K
