Joseph Suarez 🐡
8.9K posts

Joseph Suarez 🐡
@jsuarez
I build sane open-source RL tools. MIT PhD, creator of Neural MMO and founder of PufferAI. DM for business: non-LLM sim engineering, RL R&D, infra & support.
Katılım Mart 2019
116 Takip Edilen27.5K Takipçiler
Sabitlenmiş Tweet

@redtachyon if only there were an entire world of RL outside of LLMs that you can run on a toaster with a certain library you don't like the design of
English

For the next few months I'm finding myself temporarily unemployed.
The good part is that I can do anything OSS and not worry about IP issues.
The bad part is that my GPU resources are very limited.
Anyways, I'm planning to expand gyllm a bit - I think the core API is solid, but it could use some more features, and there's only so much I can do on a single Spark.
Sooooo anyone got any GPU credits for some good OSS RL?
English

@__selewaut__ you can just claim that one. First to land there gets it
English

@LukasHozda You can also use a $250 gpu to train a 150k param model to solve that game in a couple of minutes
English


@LebekMeas if they raise enough money, they'll just buy Earth and convert your atoms into blackwell chips
English

didn’t know planets were for sale tbh
Joseph Suarez 🐡@jsuarez
Don't worry, Sam is very grateful to programmers. OpenAI won't hurt uv. (they bought the less laggy WandB and shut it down)
English

@yacineMTB tbh it would be 1 week out, but we have several client projects rn, so probably 2-3
English

@jsuarez I actually long for a new project in just c and makefiles every now and then.
English

The hottest new language in 2026 is C99
OpenAI Newsroom@OpenAINewsroom
We've reached an agreement to acquire Astral. After we close, OpenAI plans for @astral_sh to join our Codex team, with a continued focus on building great tools and advancing the shared mission of making developers more productive. openai.com/index/openai-t…
English

@jarredsumner Thank you so much! And for all your help and support over the past few years 💪
English


Reinforcement Learning dev with Joseph Suarez x.com/i/broadcasts/1…
English

@quiveron_x Someone would have to pay me enough that I go work on their thing for 2-3 years and then have unlimited money to go run my own open-source lab forever
English

@adityarathore05 @yacineMTB For RL, our biggest runs use 10GB and it's not memory optimized
English

@jsuarez @yacineMTB You do understand that it comes with 128 GB “vram” on a consumer PC. 200$ GPU can’t compete with that. Been using m3 max, able to do smooth inference of almost all LLMs/VLMs (up to 32B) and play cyberpunk too. This is going to be the future.
English

@yacineMTB Pareto frontiers show up everywhere in RL because sensitivity analysis go brrrr
English

@adavya_sharma @yacineMTB whoops. Contributor got *metal* 3m sps, ~7x faster than torch mps. Still crap though. Our 5070s are at least 3x faster
English

@jsuarez @yacineMTB the apple equivalent is metal, not MPS.
English

@paolini Your writing was a major influence on me in my youth. I listened to the whole series again last summer while training for my first 50k and it was every bit as good as I remembered. Thank you
English

@shaedapk People don't hallucinate typos. This was a new form of laziness that wouldn't happen without LLMs
English

@shaedapk The reviewer didn't check it and sabotaged my work as a result
English








