Sergei Lebedev

2.6K posts

Sergei Lebedev

Sergei Lebedev

@superbobry

Adding ∇ for a living.

London, England เข้าร่วม Nisan 2009
165 กำลังติดตาม287 ผู้ติดตาม
Sergei Lebedev รีทวีตแล้ว
Adam Paszke
Adam Paszke@apaszke·
Want to improve GPU compute/comms overlap? We just published a new short tutorial for you! A few small changes to the Pallas:MGPU matmul kernel is all it takes to turn it into an all-gather collective matmul that overlaps NVLINK comms with local compute: docs.jax.dev/en/latest/pall…
English
8
45
303
32.9K
Wcabca
Wcabca@WCelhen·
@apaszke 🤣 Was looking at tpu sparsecore with jax and couldn't find any tutorial for it (apart from a repo with no docs🥹)
English
2
0
0
213
Sergei Lebedev รีทวีตแล้ว
Adam Paszke
Adam Paszke@apaszke·
Curious how to write SOTA performance Blackwell matmul kernels using MGPU? We just published a short step-by-step tutorial: docs.jax.dev/en/latest/pall… At each step, we show exactly what (small) changes are necessary to refine the kernel and the final kernel is just under 150 lines.
English
4
67
418
54.6K
Sergei Lebedev
Sergei Lebedev@superbobry·
@AliesTaha Thanks for sharing, Ali! Can you link to the kernel and benchmarking code, please?
English
0
1
4
336
AT
AT@AliesTaha·
We beat Nvidia’s cuBLAS kernels on B200s in 170 LOC. Using zero CUDA. Just pure Mojo. Here’s exactly how we went from 1% to 106% of Nvidia benchmark perf from scratch (with code) 👇🧵
AT tweet media
English
44
139
1.2K
120.2K
xjdr
xjdr@_xjdr·
the performance of jax + GPU is abysmal compared with an equivalent pytorch implementation. This is sad but unsurprising. jax might be relegated to TPU only (for me) for a little while longer. that said, there is nothing that touches jax + TPU for large scale perf (405B bby)
English
22
7
382
33.9K
Sergei Lebedev รีทวีตแล้ว
Dan F-M
Dan F-M@exoplaneteer·
I've finally landed my first proper JAX feature since joining the team: a supported "foreign function interface", which makes it easier to call into external libraries from within JAX code. Check it out: jax.readthedocs.io/en/latest/ffi.…
English
2
14
98
13.3K
Detective Griszka
Detective Griszka@youlovegriszka·
посмотрел на календарь и выяснил, что между событиями из твитов выше прошло ровно два года
Русский
2
0
6
103
Detective Griszka
Detective Griszka@youlovegriszka·
уже сутки не могу перестать угорать с того факта, что в воскресенье вечером сам себе по лицу съездил, причём сочно так, прям кулаком в глаз
Русский
2
0
4
0
Sergei Lebedev รีทวีตแล้ว
Adam Paszke
Adam Paszke@apaszke·
Many of you are excited about H100 attention, so it’s a good time to show you Mosaic GPU: a Python DSL for H100s. The attention example matches FA3 performance, while being only ~200 lines of Python: #L146-L354" target="_blank" rel="nofollow noopener">github.com/google/jax/blo… It's easy to install too! Latest JAX packages have it.
English
14
109
660
116.5K
Sergei Lebedev รีทวีตแล้ว
Ivan Lobov
Ivan Lobov@ilobov·
Our team is looking for a strong research engineer, hardware background is *not* obligatory. Please, share and recommend someone! boards.greenhouse.io/deepmind/jobs/…
English
1
4
7
733
Sergei Lebedev
Sergei Lebedev@superbobry·
@DrLangstrand Whenever there is a LaTeX alignment/spacing problem, the answer is usually minipage and its friends hspace and vspace :)
English
0
0
0
56
Ld. Vincent Hanquez 牛角包
@ChShersh The honeymoon phase will pass for you. Best to reserve judgment until you actually use the thing for many years, and see all the crufts. (Source : done many years of OCaml personally and professionally)
English
2
0
1
392
Dmitrii Kovanikov
Dmitrii Kovanikov@ChShersh·
Choose 👏 OCaml 👏 today 👏 The hype might end. But the amazing language will stay with you forever.
English
8
2
78
22.3K
Sergei Lebedev
Sergei Lebedev@superbobry·
@mitsuhiko @robsmallshire That's sadly "by design". PEP-484 decided that int should be a subtype of float, even though it isn't at runtime...
English
0
0
0
116
Armin Ronacher ⇌
Armin Ronacher ⇌@mitsuhiko·
On the topic of Python typing: One of my biggest qualms with the system is that once you are past having everything typed, it lets you do stuff like this and does not even complain.
Armin Ronacher ⇌ tweet media
English
30
8
164
42.9K
Sergei Lebedev
Sergei Lebedev@superbobry·
@mitsuhiko I think a lot of the ergonomics were sacrificed to the idea of annotations being used for things other than types. So, most typing features are designed to abuse existing syntax to the benefit of nobody.
English
0
0
0
532
Armin Ronacher ⇌
Armin Ronacher ⇌@mitsuhiko·
Python typing makes my blood boil. How the hell does anyone enjoy that experience?
English
169
69
1.2K
271.1K
Sergei Lebedev รีทวีตแล้ว
ëugene kharitonov 🏴‍☠️
Are you a PhD student interested in the interface between generative AI, LLMs and audio? Our team at Google behind AudioLM, MusicLM and AudioPaLM is looking for a talented student researcher! See details and apply google.com/about/careers/… and send your cv to bmcw@google.com.
English
1
5
43
5.6K
Sergei Lebedev
Sergei Lebedev@superbobry·
@yminsky @ezyang You can use it in CI as well, though, right? You don't have to use mypy there.
English
1
0
1
275
Yaron (Ron) Minsky
Yaron (Ron) Minsky@yminsky·
@ezyang Have you considered pyright? It's supposed to be faster, but people mostly talk about it as something to use with vscode. It seems weird to me to use a different checker in the editor than in CI, but this seems like normal behavior in Python-world.
English
5
0
9
2.5K
Yaron (Ron) Minsky
Yaron (Ron) Minsky@yminsky·
So...does anyone have advice for picking between the various and sundry Python type systems? mypy, pyright, pyre, pytype...how do you pick?
English
30
7
86
44.4K
Sergei Lebedev
Sergei Lebedev@superbobry·
@yminsky The Python type system has no spec, so I can guarantee there will be plenty of things they disagree about. Pyright is also usually faster to adopt new type system features, so it's possible to get type errors simply because mypy doesn't fully support some feature (yet).
English
0
0
2
329
Yaron (Ron) Minsky
Yaron (Ron) Minsky@yminsky·
Sounds like lots of people use mypy for CI, and pyright for the editor (because, VSCode, and mypy isn't zippy enough). But...is using two linters really the right thing? Does anyone have experience using pyright for CI as well as the editor?
English
6
0
17
6.2K
Sergei Lebedev
Sergei Lebedev@superbobry·
@yminsky Is there a way to learn more about what Jane Street is doing in this space without applying? :)
English
1
0
0
191