Paul

3 posts

Paul banner
Paul

Paul

@paulWilliamChan

FLOP-head

PTX mines Katılım Ekim 2024
59 Takip Edilen204 Takipçiler
Paul
Paul@paulWilliamChan·
@clattner_llvm Would you consider something like cluster launch control to be micromanaging?
English
0
0
1
245
Chris Lattner
Chris Lattner@clattner_llvm·
Pipelining AI kernels is required to get full perf/utilization out of modern chips. However, no one has been able to crack "full control over the hardware" without "having to micromanage it". Let's crack this open: kernel authors deserve a powerful scheduler they can control. 💪
Modular@Modular

FA4 on Blackwell: 14 ops, 5 hardware units, 28 dependency edges. One wrong sync = a race condition sanitizers won't catch. We built a constraint solver that derives the pipeline schedule automatically, in Mojo 🔥 Part 1 of our series is out → modular.com/blog/software-…

English
9
17
259
24.9K