Joseph Suarez 🐡

8.9K posts

Joseph Suarez 🐡 banner
Joseph Suarez 🐡

Joseph Suarez 🐡

@jsuarez

I build sane open-source RL tools. MIT PhD, creator of Neural MMO and founder of PufferAI. DM for business: non-LLM sim engineering, RL R&D, infra & support.

Katılım Mart 2019
116 Takip Edilen27.5K Takipçiler
Sabitlenmiş Tweet
Joseph Suarez 🐡
Joseph Suarez 🐡@jsuarez·
PufferLib 3.0: We trained reinforcement learning agents on 1 Petabyte / 12,000 years of data with 1 server. Now you can, too! Our latest release includes algorithmic breakthroughs, massively faster training, and 10 new environments. Live demos on our site. Volume on for trailer!
English
34
109
1K
188.3K
Joseph Suarez 🐡
Joseph Suarez 🐡@jsuarez·
@redtachyon if only there were an entire world of RL outside of LLMs that you can run on a toaster with a certain library you don't like the design of
English
0
0
4
362
Ariel
Ariel@redtachyon·
For the next few months I'm finding myself temporarily unemployed. The good part is that I can do anything OSS and not worry about IP issues. The bad part is that my GPU resources are very limited. Anyways, I'm planning to expand gyllm a bit - I think the core API is solid, but it could use some more features, and there's only so much I can do on a single Spark. Sooooo anyone got any GPU credits for some good OSS RL?
English
6
0
54
3.7K
Santi
Santi@__selewaut__·
@jsuarez When will Mars be purchased?
English
1
0
1
777
Joseph Suarez 🐡
Joseph Suarez 🐡@jsuarez·
Don't worry, Sam is very grateful to programmers. OpenAI won't hurt uv. (they bought the less laggy WandB and shut it down)
Joseph Suarez 🐡 tweet media
English
11
7
356
38.6K
Joseph Suarez 🐡
Joseph Suarez 🐡@jsuarez·
@LukasHozda You can also use a $250 gpu to train a 150k param model to solve that game in a couple of minutes
English
1
0
37
1.1K
Joseph Suarez 🐡
Joseph Suarez 🐡@jsuarez·
@altryne @wandb We do use it still! And it got less laggy. But mainly because the cap on points per graph was reduced from 100 to 50. Why can't it handle 10k+ easily?
English
0
0
15
2.2K
Alex Volkov
Alex Volkov@altryne·
@jsuarez btw @wandb has improved a LOT since then, and is NOT shut down, doing great even post acquisition! 👀 We just lunched an IOS app 👏
English
1
0
5
2.7K
Joseph Suarez 🐡
Joseph Suarez 🐡@jsuarez·
@LebekMeas if they raise enough money, they'll just buy Earth and convert your atoms into blackwell chips
English
0
1
39
5.7K
Joseph Suarez 🐡
Joseph Suarez 🐡@jsuarez·
@yacineMTB tbh it would be 1 week out, but we have several client projects rn, so probably 2-3
English
0
0
3
211
kache
kache@yacineMTB·
Just use a small rnn
kache tweet media
English
9
2
94
5.4K
Lucas Beyer (bl16)
Lucas Beyer (bl16)@giffmana·
@jsuarez I actually long for a new project in just c and makefiles every now and then.
English
3
0
9
1.4K
Charlie Marsh
Charlie Marsh@charliermarsh·
@jarredsumner Thank you so much! And for all your help and support over the past few years 💪
English
1
0
22
1.8K
Joseph Suarez 🐡
Joseph Suarez 🐡@jsuarez·
@quiveron_x Someone would have to pay me enough that I go work on their thing for 2-3 years and then have unlimited money to go run my own open-source lab forever
English
2
0
19
903
Aditya Rathore
Aditya Rathore@adityarathore05·
@jsuarez @yacineMTB You do understand that it comes with 128 GB “vram” on a consumer PC. 200$ GPU can’t compete with that. Been using m3 max, able to do smooth inference of almost all LLMs/VLMs (up to 32B) and play cyberpunk too. This is going to be the future.
English
1
0
0
67
kache
kache@yacineMTB·
can i run cuda on a macbook
English
65
2
208
34.7K
kache
kache@yacineMTB·
Is there a tangible intuitive statistical explanation for why the Pareto distribution turns up literally everywhere I look?
English
126
5
463
41.8K
Joseph Suarez 🐡
Joseph Suarez 🐡@jsuarez·
@paolini Your writing was a major influence on me in my youth. I listened to the whole series again last summer while training for my first 50k and it was every bit as good as I remembered. Thank you
English
0
0
11
4.4K
Joseph Suarez 🐡
Joseph Suarez 🐡@jsuarez·
@shaedapk People don't hallucinate typos. This was a new form of laziness that wouldn't happen without LLMs
English
0
0
0
76
c
c@shaedapk·
@jsuarez Human slop, no?
English
1
0
0
98
Joseph Suarez 🐡
Joseph Suarez 🐡@jsuarez·
AI slop reviews do real damage to science. This was my RLC 2024 TR for PufferLib. Rejected. Wow, I should have proofread my work... except all these typos were hallucinated. PufferLib received a best paper award the next year. This delayed adoption.
Joseph Suarez 🐡 tweet media
English
8
13
252
18.2K
c
c@shaedapk·
@jsuarez I was more so suggesting that the "AI review" was probably an old/cheap model. Several non-existent hallucinations on a very short document seems very strange for current models, which is unfair to bundle as "AI review"
English
1
0
0
135