NP (@np_hard) - Twitter Profili | Zamantika Mersobahis Locabet

Sabitlenmiş Tweet

NP@np_hard·15h

As part of @PrimeIntellect's RL residency program, I've been exploring how to do multi-agent RL using their current stack (from verifiers + prime-rl to lab experiments with hosted training /evals) and thinking about how it could be extended to support these abstractions natively. I've summarized my findings the blogpost below and I'll leave a few comments here, too...

English

7

45

355

43.3K

NP retweetledi

will brown@willccbb·15h

veeery cool writeup digging into nuances of training, experimentation, and infra for multi-agent RL :)

NP@np_hard

As part of @PrimeIntellect's RL residency program, I've been exploring how to do multi-agent RL using their current stack (from verifiers + prime-rl to lab experiments with hosted training /evals) and thinking about how it could be extended to support these abstractions natively. I've summarized my findings the blogpost below and I'll leave a few comments here, too...

English

6

16

260

26.8K

NP@np_hard·15h

I would like to thank @PrimeIntellect for the support with this work, specifically @willccbb, @johannes_hage , @omouamoua , @hallerite , @GottliebEli and @creet_z . Also thanks for the feedback @myainotez / @BillyHoy1_ / @_djdumpling !

English

0

1

15

719

NP@np_hard·15h

I discuss some more details in the blogpost (nphard.io/2026/02/23/han…). I'm very excited to see what comes out of this, and related work in the residency, like @BillyHoy1_'s stuff - hopefully it will spark more work on open multi-agent RL!

English

1

3

14

757

NP@np_hard·15h

As part of @PrimeIntellect's RL residency program, I've been exploring how to do multi-agent RL using their current stack (from verifiers + prime-rl to lab experiments with hosted training /evals) and thinking about how it could be extended to support these abstractions natively. I've summarized my findings the blogpost below and I'll leave a few comments here, too...

English

7

45

355

43.3K

NP@np_hard·12 Mar

computer use

English

0

2

123

NP@np_hard·27 Şub

@parafactual x.com/np_hard/status…

NP@np_hard

also 23 fav number. 2, 3, 23, 2+3 ~ prime

QME

0

2

61

You@parafactual·27 Şub

!!!!!!! #newface #Newlife

2

0

32

704

NP@np_hard·21 Şub

@voooooogel it is known

English

0

1

124

thebes@voooooogel·21 Şub

bad idea. give your claw the day off

Crémieux@cremieuxrecueil

Reminder: Claude is a digital shabbos goy and may do work for you when you cannot.

English

6

10

192

10.7K

NP retweetledi

Alex Wa@_djdumpling·18 Şub

new blog! What methodologies do labs use to train frontier models? The blog distills 7 open-weight model reports from frontier labs, covering architecture, stability, optimizers, data curation, pre/mid/post-training + RL, and behaviors/safety djdumpling.github.io/2026/01/31/fro…

English

34

286

2K

280.1K

NP@np_hard·11 Şub

@willccbb @FalduFenil12 @Nitish_singla_ compounding!

English

0

1

121

will brown@willccbb·11 Şub

are you feeling it @FalduFenil12 @np_hard @Nitish_singla_ cited

kenneth@local0ptimist

@willccbb @manveerxyz already loving this. the brainstorm skill gave me a clear path to translate my local work onto prime... really slick!

English

5

2

37

6.8K

NP@np_hard·11 Şub

this is the way

Prime Intellect@PrimeIntellect

Introducing Lab: A full-stack platform for training your own agentic models Build, evaluate and train on your own environments at scale without managing the underlying infrastructure. Giving everyone their own frontier AI lab.

English

0

5

340

NP retweetledi

Prime Intellect@PrimeIntellect·11 Şub

Introducing Lab: A full-stack platform for training your own agentic models Build, evaluate and train on your own environments at scale without managing the underlying infrastructure. Giving everyone their own frontier AI lab.

English

133

289

2.5K

749.9K

NP@np_hard·7 Şub

@torchcompiled palette

Italiano

0

1

27

Ethan@torchcompiled·7 Şub

Ethan@torchcompiled

Wandb Art

ZXX

2

1

13

1.1K

NP@np_hard·24 Oca

@leothecurious @BerenMillidge @kparikh2001 Yeah this one

Beren Millidge@BerenMillidge

Was super fun giving this talk and thanks to @DavidDuvenaud for inviting me. I'll also be writing more about these topics at beren.io.

English

0

1

26

davinci@leothecurious·24 Oca

@np_hard @BerenMillidge @kparikh2001 the TED talk?

English

1

0

1

92

davinci@leothecurious·24 Oca

i love @BerenMillidge's work and remember seeing his name on many of the most interesting works around PC, FEP, and active inference when i was deep down that rabbit hole. but then i came across the following surprising text in one of his blogs (thanks to @kparikh2001 for reminding me of it) and it just feels like a very premature pivot. it's probably the first thing i'd question him about if i ever got the chance.

English

5

0

27

3.5K

NP

Keşfet