Prime Intellect

2.8K posts

Prime Intellect banner
Prime Intellect

Prime Intellect

@PrimeIntellect

The Open Stack for Self-Improving Agents https://t.co/ZRZOsRRbwr

Katılım Haziran 2020
40 Takip Edilen67.1K Takipçiler
Sabitlenmiş Tweet
Prime Intellect
Prime Intellect@PrimeIntellect·
The next wave of AI will not be won by better prompts. It will be won by systems that learn from experience. Today, Prime Intellect Lab is out of beta, open for you to start training your own models. The era of self-improving agents is here.
English
83
204
2K
1.3M
Vibrant Labs
Vibrant Labs@VibrantLabsAI·
1/n For browser agents, a major bottleneck in evaluation is truthful scoring on the live web. A task is only as good as your ability to confirm the agent actually did it, on a real site whose state keeps moving and that the agent can potentially misreport. So we took matters into our own hands. Today, we're releasing Ecom Bench on @PrimeIntellect: 40 shopping tasks on real Shopify storefronts, each run in a live @browserbase browser and graded by a deterministic verifier. vibrantlabs.com/research/ecom-…
Vibrant Labs tweet media
English
7
9
26
4.7K
Prime Intellect
Prime Intellect@PrimeIntellect·
Huge thanks to the @vllm_project team, and @robertshaw21 in particular, for all the help along the way. Also to the llm-d and Dynamo teams for the collaboration on routing and inference.
English
3
0
48
3.5K
Prime Intellect
Prime Intellect@PrimeIntellect·
The trainer is 3D-parallel (FSDP2 + CP + EP), built on TorchTitan. FSDP2 shards params, grads & optimizer state. EP keeps experts sharded and routes tokens with all2all instead of all-gathering ~80GB per layer. CP handles the 131k context and GLM-5's DSA attention.
Prime Intellect tweet media
English
2
1
51
3.2K
Prime Intellect
Prime Intellect@PrimeIntellect·
Over a long run the trainer and inference policies slowly drift apart, and that mismatch can kill your training. R3 (router replay) captures the routing decisions from the inference engine, replays them on the trainer - KL mismatch drops ~10x.
Prime Intellect tweet media
English
2
0
48
3.4K
Prime Intellect
Prime Intellect@PrimeIntellect·
One Mooncake store pools KV cache across all nodes, so any worker can reuse any prefix. The router picks workers by a score over load, queue depth, KV usage and prefix overlap. You get cross-replica cache hits with balanced routing across the whole deployment.
Prime Intellect tweet media
English
1
0
50
3.5K
Prime Intellect
Prime Intellect@PrimeIntellect·
We disaggregate prefill and decode onto separate workers. A long prefill used to stall decode for everyone. Now it doesn't.
Prime Intellect tweet media
English
1
0
57
4.3K
Prime Intellect
Prime Intellect@PrimeIntellect·
In RL, inference is the bottleneck — we optimize for throughput, not latency. High concurrency, FP8 precision, and wide expert parallelism over 32+ GPUs. Every GPU holds its own slice of experts and acts as its own endpoint.
Prime Intellect tweet media
English
1
0
68
5.8K
Prime Intellect
Prime Intellect@PrimeIntellect·
Today we're releasing prime-rl v0.6.0 — enabling RL at trillion-parameter MoE scale on agentic workloads at the highest efficiency. We've relentlessly optimized our RL infra. The result: GLM-5 on agentic SWE tasks at 131k context and sub-5-minute step time.
Prime Intellect tweet media
English
36
86
915
264.3K
Prime Intellect retweetledi
Prime Intellect retweetledi
Prime Intellect retweetledi
Vincent Weisser
Vincent Weisser@vincentweisser·
Excited to support this epic Inference-time compute hackathon with Prime Intellect credits for post-training + compute 24hr hack on > Agents: Multi-step systems that take a goal and execute. Tool use, planning, long horizons. > Real-Time and Interactive: Sub-second loops, live multimodal. > ​RL + Applied AI: Systems that judge capability of a person or a model. Autograders, preference ranking, rubrics, verified-skill signals, human-in-the-loop. luma.com/hncudfxb
English
7
7
95
6.7K
xjdr
xjdr@_xjdr·
Ok i did it . this is Noumena's version of your favorite, ai code TUI . I rebuilt the REPL engine to be event based added a new rust markdown rendering stack that considerably increased performance and stability i added a rust OpenAI compat websocket client i added a handful of tools and exposed a handfull of others that were hidden by default this system is optimized for our internal inference system and models but i have recently tested it with a vanilla kimi k2.7 and it works quite well. over the next few days we will make the Noumena Inference Engine available and serve at least the k2.7 model . this project will support native noumena code (ncode) login and keys when its available. either way, i hope y'all enjoy and find this useful. this is the first of several things i hope to launch this week github.com/Noumena-Networ…
xjdr@_xjdr

based on everything that has happened over the last week (and year really) and how good k2.7 is in this harness, it is getting tempting to make this available to y'all

English
25
33
392
55.4K
Prime Intellect retweetledi
SemiAnalysis
SemiAnalysis@SemiAnalysis_·
RL Systems Mind the Gap: Matching Trainer and Generator Throughput RL Training Infrastructure, GRPO, PipelineRL, Async RL, Policy Staleness, RL Sandbox Infra, CPU Requirements, TCO Analysis, Thinking Machines Tinker newsletter.semianalysis.com/p/rl-systems-m…
English
8
53
414
191.9K
Prime Intellect retweetledi
Vincent Weisser
Vincent Weisser@vincentweisser·
Great RL systems deep dive by @SemiAnalysis_ Scaling RL is as much of an infra problem as an algorithm one SemiAnalysis ran experiments on our stack: Prime RL + Sandboxes. System efficiency is ultimately queue health to match generator and trainer throughput
Vincent Weisser tweet media
SemiAnalysis@SemiAnalysis_

RL Systems Mind the Gap: Matching Trainer and Generator Throughput RL Training Infrastructure, GRPO, PipelineRL, Async RL, Policy Staleness, RL Sandbox Infra, CPU Requirements, TCO Analysis, Thinking Machines Tinker newsletter.semianalysis.com/p/rl-systems-m…

English
4
15
119
26.6K
Prime Intellect retweetledi
will brown
will brown@willccbb·
been beating this drum since early 2025, seems like people are starting to see why it's so important :) RL works -> "train or get trained on" -> open models + post-training infra are the path to institutional flywheels + democratization of AI progress
will brown tweet mediawill brown tweet media
MidCurveCapital@midwit_capital

The next big trade is infrastructure / RL environments that enable companies to turn their institutional knowledge / processes into continuously improving learning loops that they can own.

English
6
28
394
35.5K
Prime Intellect retweetledi
Vincent Weisser
Vincent Weisser@vincentweisser·
Satya is perfectly describing the why and what behind @primeintellect since 2023 🫡 > AI needs to be open & sovereign > Let every company create its own self-improving agents: and own their loop to make them better > A rich open ai ecosystem creates far more abundance than a future locked down by a few closed labs > Every company is becoming an ai company: so every company needs to own its own product <> model improvement loop @primeintellect enables this today: > Your own evals + rl envs for the outcomes you care about > models self-improving in production from your real traces > don't cede your moat to a handful of labs. This self-improvement loop is the IP and it compounds Open self improving agents for everyone 🫡
Vincent Weisser tweet media
Satya Nadella@satyanadella

x.com/i/article/2065…

English
13
33
227
24.7K