Pengming Wang
@PengmingWang

328 posts

Founding team @poolsideai | prev @DeepMind, PhD @Cambridge_Uni, FunSearch co-author

London, England · Joined July 2021
253 Following · 355 Followers
Pengming Wang@PengmingWang·
This is simple, but effective. Cool to see the ~13% speed gain in the backward pass just by offloading activations to host memory, thanks to NVLink-C2C
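The technique the tweet describes can be sketched with PyTorch's saved-tensor hooks: tensors stashed for the backward pass are parked in host memory during forward and copied back on demand during backward. This is a minimal CPU-runnable sketch using the real `torch.autograd.graph.save_on_cpu` API; the ~13% speedup in the tweet additionally depends on NVLink-C2C hardware bandwidth, which this toy example does not reproduce.

```python
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(64, 256), nn.ReLU(), nn.Linear(256, 64))
x = torch.randn(8, 64, requires_grad=True)

# Every tensor saved for backward is moved to host memory as soon as it is
# produced, and transferred back only when backward needs it. On GPU, pass
# pin_memory=True so the host<->device copies can overlap with compute.
with torch.autograd.graph.save_on_cpu():
    loss = model(x).pow(2).mean()

loss.backward()  # gradients match the non-offloaded computation exactly
```

Offloading is exact (unlike recomputation, nothing is re-run); the cost is transfer time, which fast host links like NVLink-C2C largely hide.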
Pengming Wang retweeted
poolside@poolsideai·
Training AI models requires storing temporary data mid-process. That data sits in GPU memory taking up space until it's needed. The standard fix has always been to delete it and redo the work later. It works, but it's wasteful.
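The "delete it and redo the work later" fix the post describes is activation (gradient) checkpointing. A minimal sketch with PyTorch's `torch.utils.checkpoint`: the wrapped block frees its intermediate activations after the forward pass and re-runs its forward during backward to rebuild them, trading extra compute for memory.

```python
import torch
import torch.nn as nn
from torch.utils.checkpoint import checkpoint

block = nn.Sequential(nn.Linear(128, 512), nn.ReLU(), nn.Linear(512, 128))
x = torch.randn(4, 128, requires_grad=True)

# Intermediate activations inside `block` are not kept in memory;
# use_reentrant=False selects the recommended modern implementation.
y = checkpoint(block, x, use_reentrant=False)

# During backward, `block`'s forward runs a second time to regenerate the
# activations needed for its gradients.
y.sum().backward()
```

The gradients are identical to an uncheckpointed run; only the memory/compute trade-off changes, which is why offloading (as in the post above) can be an attractive alternative when a fast host link is available.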
Pengming Wang@PengmingWang·
Corollary: There is an optimal amount of drama you want, and it's >0
Pengming Wang@PengmingWang·
LR is a slider for how much drama you want
Pengming Wang@PengmingWang·
I'll be at NeurIPS this year. Keen to meet old and new friends, looking forward to catching up!
Elana Simon@ElanaPearl·
elanapearl.github.io/blog/2025/the-… It's a debugging detective story where you follow along the reasoning behind each step and solve it as we go. It also explains ML & PyTorch concepts as they become necessary to understand what's breaking, why, and how to fix it 🔎
Elana Simon@ElanaPearl·
New blog post: the bug that taught me more about PyTorch than years of using it. It started with a simple training loss plateau... and ended with digging through optimizer states, memory layouts, kernel dispatch, and finally understanding how PyTorch works!
Pengming Wang retweeted
Eiso Kant@eisokant·
We believe that to compete at the frontier, you have to own the full stack: from dirt to intelligence. Today we’re announcing two major unlocks for our mission to AGI:
1. We're partnering with @CoreWeave and have 40,000+ NVIDIA GB300s secured. First capacity comes online starting Dec ’25.
2. Project Horizon: Poolside is developing a vertically integrated 2GW AI campus in West Texas to secure our medium-term scale. On this site @CoreWeave will be our anchor tenant for the first 250MW phase.
Learn more about it on our blog: poolside.ai/blog/announcin… Or on WSJ: wsj.com/tech/ai/a-gian…
Pengming Wang@PengmingWang·
In the limit, evaluations are the ~only thing that matters. When models are self-improving, and every metric can be hill climbed, picking the metric becomes the most important thing. Evals will shift from being "writing unit tests" for research to being the *main thing*
Pengming Wang@PengmingWang·
We've not been very public about our progress on model building, but I fully believe poolside will be the next lab joining the frontier. We're now sharing a bit more about how we're doing this, and about the systems-first approach we're taking with our model factory.
Pengming Wang@PengmingWang·
We've spent quite some time at poolside thinking about this, and recently put down some words on how we're approaching this: poolside.ai/vision/research
Pengming Wang@PengmingWang·
Test-time compute is powerful, but in its current form there is a lack of "harmony" with pre-training. Models feel split-brained: they're either deeply overthinking, with no trust in their own "common sense", or they latch onto the nearest neighbour of meaning without deliberation