Sergei Lebedev

2.6K posts

Sergei Lebedev

Sergei Lebedev

@superbobry

Adding ∇ for a living.

London, England 가입일 Nisan 2009
165 팔로잉287 팔로워
Sergei Lebedev 리트윗함
Adam Paszke
Adam Paszke@apaszke·
Want to improve GPU compute/comms overlap? We just published a new short tutorial for you! A few small changes to the Pallas:MGPU matmul kernel is all it takes to turn it into an all-gather collective matmul that overlaps NVLINK comms with local compute: docs.jax.dev/en/latest/pall…
English
8
45
303
32.9K
Wcabca
Wcabca@WCelhen·
@apaszke 🤣 Was looking at tpu sparsecore with jax and couldn't find any tutorial for it (apart from a repo with no docs🥹)
English
2
0
0
213
Sergei Lebedev 리트윗함
Adam Paszke
Adam Paszke@apaszke·
Curious how to write SOTA performance Blackwell matmul kernels using MGPU? We just published a short step-by-step tutorial: docs.jax.dev/en/latest/pall… At each step, we show exactly what (small) changes are necessary to refine the kernel and the final kernel is just under 150 lines.
English
4
67
418
54.6K
Sergei Lebedev
Sergei Lebedev@superbobry·
@AliesTaha Thanks for sharing, Ali! Can you link to the kernel and benchmarking code, please?
English
0
1
4
336
AT
AT@AliesTaha·
We beat Nvidia’s cuBLAS kernels on B200s in 170 LOC. Using zero CUDA. Just pure Mojo. Here’s exactly how we went from 1% to 106% of Nvidia benchmark perf from scratch (with code) 👇🧵
AT tweet media
English
44
139
1.2K
120.2K
xjdr
xjdr@_xjdr·
the performance of jax + GPU is abysmal compared with an equivalent pytorch implementation. This is sad but unsurprising. jax might be relegated to TPU only (for me) for a little while longer. that said, there is nothing that touches jax + TPU for large scale perf (405B bby)
English
22
7
382
33.9K
Sergei Lebedev 리트윗함
Dan F-M
Dan F-M@exoplaneteer·
I've finally landed my first proper JAX feature since joining the team: a supported "foreign function interface", which makes it easier to call into external libraries from within JAX code. Check it out: jax.readthedocs.io/en/latest/ffi.…
English
2
14
98
13.3K
Detective Griszka
Detective Griszka@youlovegriszka·
посмотрел на календарь и выяснил, что между событиями из твитов выше прошло ровно два года
Русский
2
0
6
103
Detective Griszka
Detective Griszka@youlovegriszka·
уже сутки не могу перестать угорать с того факта, что в воскресенье вечером сам себе по лицу съездил, причём сочно так, прям кулаком в глаз
Русский
2
0
4
0
Sergei Lebedev 리트윗함
Adam Paszke
Adam Paszke@apaszke·
Many of you are excited about H100 attention, so it’s a good time to show you Mosaic GPU: a Python DSL for H100s. The attention example matches FA3 performance, while being only ~200 lines of Python: #L146-L354" target="_blank" rel="nofollow noopener">github.com/google/jax/blo… It's easy to install too! Latest JAX packages have it.
English
14
109
660
116.5K
Sergei Lebedev
Sergei Lebedev@superbobry·
@DrLangstrand Whenever there is a LaTeX alignment/spacing problem, the answer is usually minipage and its friends hspace and vspace :)
English
0
0
0
56
Ld. Vincent Hanquez 牛角包
@ChShersh The honeymoon phase will pass for you. Best to reserve judgment until you actually use the thing for many years, and see all the crufts. (Source : done many years of OCaml personally and professionally)
English
2
0
1
392
Dmitrii Kovanikov
Dmitrii Kovanikov@ChShersh·
Choose 👏 OCaml 👏 today 👏 The hype might end. But the amazing language will stay with you forever.
English
8
2
78
22.3K
Sergei Lebedev
Sergei Lebedev@superbobry·
@mitsuhiko @robsmallshire That's sadly "by design". PEP-484 decided that int should be a subtype of float, even though it isn't at runtime...
English
0
0
0
116
Armin Ronacher ⇌
Armin Ronacher ⇌@mitsuhiko·
On the topic of Python typing: One of my biggest qualms with the system is that once you are past having everything typed, it lets you do stuff like this and does not even complain.
Armin Ronacher ⇌ tweet media
English
30
9
165
42.9K
Sergei Lebedev
Sergei Lebedev@superbobry·
@mitsuhiko I think a lot of the ergonomics were sacrificed to the idea of annotations being used for things other than types. So, most typing features are designed to abuse existing syntax to the benefit of nobody.
English
0
0
0
532
Armin Ronacher ⇌
Armin Ronacher ⇌@mitsuhiko·
Python typing makes my blood boil. How the hell does anyone enjoy that experience?
English
169
69
1.2K
271.1K
Sergei Lebedev 리트윗함
ëugene kharitonov 🏴‍☠️
Are you a PhD student interested in the interface between generative AI, LLMs and audio? Our team at Google behind AudioLM, MusicLM and AudioPaLM is looking for a talented student researcher! See details and apply google.com/about/careers/… and send your cv to bmcw@google.com.
English
1
5
43
5.6K
Yaron (Ron) Minsky
Yaron (Ron) Minsky@yminsky·
@ezyang Have you considered pyright? It's supposed to be faster, but people mostly talk about it as something to use with vscode. It seems weird to me to use a different checker in the editor than in CI, but this seems like normal behavior in Python-world.
English
5
0
9
2.5K
Yaron (Ron) Minsky
Yaron (Ron) Minsky@yminsky·
So...does anyone have advice for picking between the various and sundry Python type systems? mypy, pyright, pyre, pytype...how do you pick?
English
30
7
86
44.4K
Sergei Lebedev
Sergei Lebedev@superbobry·
@yminsky The Python type system has no spec, so I can guarantee there will be plenty of things they disagree about. Pyright is also usually faster to adopt new type system features, so it's possible to get type errors simply because mypy doesn't fully support some feature (yet).
English
0
0
2
329
Yaron (Ron) Minsky
Yaron (Ron) Minsky@yminsky·
Sounds like lots of people use mypy for CI, and pyright for the editor (because, VSCode, and mypy isn't zippy enough). But...is using two linters really the right thing? Does anyone have experience using pyright for CI as well as the editor?
English
6
0
17
6.2K
Sergei Lebedev
Sergei Lebedev@superbobry·
@yminsky Is there a way to learn more about what Jane Street is doing in this space without applying? :)
English
1
0
0
191