Prime Intellect

2.5K posts

@PrimeIntellect

open superintelligence stack https://t.co/ZRZOsRQDGT

Joined June 2020
34 Following · 61.7K Followers
Pinned Tweet
Prime Intellect @PrimeIntellect
Introducing Lab: A full-stack platform for training your own agentic models. Build, evaluate and train on your own environments at scale without managing the underlying infrastructure. Giving everyone their own frontier AI lab.
133
291
2.5K
744.9K
Prime Intellect retweeted
Vincent Weisser @vincentweisser
If you want to meet at NVIDIA GTC - come through tonight luma.com/vpzfkrzr
3
6
50
5.1K
Prime Intellect retweeted
Damian Barabonkov @iamdamianb
I had a ton of fun collaborating with NVIDIA last week to benchmark their new Vera CPUs on realistic sandbox workloads. For more info, check out the blog! 👇
Prime Intellect@PrimeIntellect

Today, we’re sharing how our collaboration with @nvidia helps power the open superintelligence stack. The next frontier of AI infrastructure is building systems for agentic models that can reason for hours, use tools, execute code, and learn from outcomes at scale. primeintellect.ai/blog/nvidia-co…

1
3
38
7K
Prime Intellect @PrimeIntellect
This collaboration also extends into research:
- Nemotron models available through Lab
- NVIDIA NeMo Gym environments in verifiers/prime-rl
- NVIDIA NeMo RL stack integrated with verifiers
Read more on our blog: primeintellect.ai/blog/nvidia-co…
1
1
58
3.5K
Prime Intellect @PrimeIntellect
Inference is becoming central to both RL rollouts and production agents. We chose NVIDIA Dynamo because agentic inference at scale means handling global deployments, long-context reasoning, multi-turn trajectories, sparse MoEs, and large fleets of adapters.
1
1
66
18.1K
Prime Intellect @PrimeIntellect
Today, we’re sharing how our collaboration with @nvidia helps power the open superintelligence stack. The next frontier of AI infrastructure is building systems for agentic models that can reason for hours, use tools, execute code, and learn from outcomes at scale. primeintellect.ai/blog/nvidia-co…
13
45
361
28.2K
Prime Intellect retweeted
Lily Zhang @lily_gpupoor
This is going to be huge if true @a1zhang fascinating talk
San Francisco, CA 🇺🇸
16
74
1.6K
191.4K
stochi @stochi0
Stitched up smth fun over the weekend, prototype of an autoresearch RLM environment inspired by @karpathy, using @PrimeIntellect infra. Haven’t run full evals yet, but the setup looks like this:
The model can:
- modify training file
- run experiments inside a sandbox
- parse logs for the metric (val_bpb)
- iterate to improve the score
So the model does the full research loop: code, experiment, logs, hypothesis, patch, repeat
Essentially turning autoresearch loop into an RLM training environment, producing trajectories of autonomous research behavior.
The interesting bit would be generalizing this to:
- any repo
- any metric
- any experiment harness
- envs where model can optimize on specific pieces in a big codebase.
Most importantly, this produces trajectories of autonomous research behavior. From those we can identify failure modes and iteratively improve the environment itself.👀🧋🎋
Github: github.com/stochi0/athena…
Environments Hub: app.primeintellect.ai/dashboard/envi…
2
6
40
4.5K
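For readers who want a concrete picture of the loop stochi describes (modify the training file, run it in a sandbox, parse the log for val_bpb, keep what improves), here is a minimal sketch. It is not taken from the linked repo. The log format, the function names `parse_val_bpb` and `research_loop`, and the two callbacks are all hypothetical, and it assumes val_bpb is a validation bits-per-byte metric where lower is better.

```python
import re
from typing import Callable, List, Optional, Tuple

# Assumed (hypothetical) log format: lines such as "step 100 | val_bpb 1.2345".
VAL_BPB_RE = re.compile(r"val_bpb[\s:=]+([0-9]*\.?[0-9]+)")


def parse_val_bpb(log_text: str) -> Optional[float]:
    """Return the last val_bpb value found in the log, or None if absent."""
    matches = VAL_BPB_RE.findall(log_text)
    return float(matches[-1]) if matches else None


def research_loop(
    initial_source: str,
    propose_patch: Callable[[str, Optional[float]], str],
    run_experiment: Callable[[str], str],
    max_iters: int = 5,
) -> Tuple[str, Optional[float], List[Tuple[str, Optional[float]]]]:
    """Greedy loop: patch, run, parse the metric, keep the candidate if it improved.

    propose_patch stands in for the model editing the training file;
    run_experiment stands in for a sandboxed run that returns log text.
    Both are placeholders supplied by the caller.
    """
    best_source = initial_source
    best_score = parse_val_bpb(run_experiment(best_source))
    # The trajectory records every (source, score) pair, baseline included.
    trajectory = [(best_source, best_score)]

    for _ in range(max_iters):
        candidate = propose_patch(best_source, best_score)
        score = parse_val_bpb(run_experiment(candidate))
        trajectory.append((candidate, score))
        # Assumption: lower val_bpb is better.
        if score is not None and (best_score is None or score < best_score):
            best_source, best_score = candidate, score

    return best_source, best_score, trajectory
```

The returned trajectory is the part the tweet cares about most: a record of every attempt, including failed ones, that can be inspected for failure modes.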
Prime Intellect retweeted
Beff (e/acc) @beffjezos
Who knew the singularity would be this beautiful
23
45
327
11.2K
Prime Intellect retweeted
Beff (e/acc) @beffjezos
The @PrimeIntellect event is underway and the vibes are immaculate. This feels like a Church of the Singularity
31
30
492
31.1K
Prime Intellect retweeted
Beff (e/acc) @beffjezos
The one and only @willccbb walking the crowd through the @PrimeIntellect stack and continual learning concepts
16
26
397
31.7K
Prime Intellect retweeted
Benjamin Bratton @bratton
Thanks to @vincentweisser and @PrimeIntellect for the invitation to give the opening sermon at the church of the singularity (that isn't singular) at their stellar event today. Also... their recursive language models release is going to be 🤯
Beff (e/acc)@beffjezos

The @PrimeIntellect event is underway and the vibes are immaculate. This feels like a Church of the Singularity

7
2
57
8.7K
Prime Intellect retweeted
Vivek @vivek_2332
introducing autoresearch-rl, autonomous research for rl post-training.
inspired by @karpathy autoresearch, and i think rl post-training is honestly one of the places where this idea fits perfectly. there are at least 50+ hyperparameters to tweak: learning rate, batch size, rollouts, clipping ratios, kl penalties, schedulers, the list goes on. instead of sitting there for hours turning knobs one at a time, just let the model figure out the right starting config on its own.
some things worth mentioning:
-> built on @PrimeIntellect prime-rl (my favourite rl post-training framework) and @willccbb verifiers for reward verification.
-> ran qwen2.5-0.5b-instruct on gsm8k across 60+ autonomous experiments. eval score went from 0.475 to 0.550 and the agent actually found a way to do it in fewer steps (20 instead of 30). less compute, better results
-> the whole thing was surprisingly smooth to set up and run. point the agent at the config, go to sleep, wake up to a full experiment log. i really wish i could try this on a bigger model but gpu poor for now lol
-> the agent discovers things you wouldn't think to try. like how rollouts = 4 beats rollouts = 8, or how a constant lr schedule outperforms cosine. it just methodically tests everything
i think the real value here is that rl training is so fragile and noisy that having an agent patiently run experiment after experiment is genuinely more effective than a human doing it manually.
check it out: github.com/vivekvkashyap/…
22
53
749
78.4K
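To make the "methodically tests everything" idea in Vivek's tweet concrete, here is a small sketch of a one-knob-at-a-time config search that keeps a full experiment log. It is a plain greedy search used as a stand-in, not the agent from autoresearch-rl, and it does not call prime-rl or verifiers. The function name `greedy_config_search`, the config keys, and the `evaluate` callback are hypothetical; in practice `evaluate` would launch a training run and return an eval score, where higher is assumed to be better.

```python
from typing import Any, Callable, Dict, List, Tuple

Config = Dict[str, Any]


def greedy_config_search(
    base: Config,
    search_space: Dict[str, List[Any]],
    evaluate: Callable[[Config], float],
) -> Tuple[Config, float, List[Tuple[Config, float]]]:
    """Try each candidate value for each knob, keeping any change that helps.

    evaluate is a caller-supplied placeholder for "run the experiment and
    return the eval score" (assumed: higher is better).
    """
    best = dict(base)
    best_score = evaluate(best)
    # The log records every config that was evaluated, baseline included.
    log: List[Tuple[Config, float]] = [(dict(best), best_score)]

    for key, candidates in search_space.items():
        for value in candidates:
            if value == best.get(key):
                continue  # Already evaluated as part of the current best.
            trial = {**best, key: value}
            score = evaluate(trial)
            log.append((trial, score))
            if score > best_score:
                best, best_score = trial, score

    return best, best_score, log
```

A usage example with a toy scoring function (the scores are invented for illustration and are not the results reported in the tweet):

```python
def toy_eval(cfg):
    return int(cfg["rollouts"] == 4) + int(cfg["lr_schedule"] == "constant")

best, score, log = greedy_config_search(
    {"rollouts": 8, "lr_schedule": "cosine"},
    {"rollouts": [4, 8, 16], "lr_schedule": ["cosine", "constant"]},
    toy_eval,
)
```

Because RL training is noisy, a real version would likely repeat each trial with several seeds before accepting a change; this sketch evaluates each config once.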