Chris Kitching

2 posts

Chris Kitching

Chris Kitching

@ChrisKitching17

Katılım Mart 2022
2 Takip Edilen2 Takipçiler
Hot Aisle
Hot Aisle@HotAisle·
@SpectralCom If only the source code was available so that it could be analyzed for why it allows that.
English
2
0
2
115
Spectral Compute
Spectral Compute@SpectralCom·
NVCC's parser is funny. When closing many template arguments at once, you can introduce a redundant comma after every third one with no effect:
Spectral Compute tweet media
English
1
1
7
333
Chris Kitching
Chris Kitching@ChrisKitching17·
@apaszke @clattner_llvm @metaai I think at least part of it is that they seem to have compared against cuBLAS instead of cuBLASLt. The latter is able to optimise for the specific input sizes more than the former, which makes it a fairer comparison with tools like mojo/Triton/etc.
English
0
0
3
466
Adam Paszke
Adam Paszke@apaszke·
@clattner_llvm @metaai How can Mojo be faster than CUDA? Isn’t it really just PTX vs the DSL abstractions? It’s also quite important to consider productivity in addition to perf, although it is harder to quantify
English
3
2
46
11.7K
Chris Lattner
Chris Lattner@clattner_llvm·
Thank you to folks at @metaai for publishing their independent perf analysis comparing CUDA and Mojo against Triton and TileLang DSLs, showing Mojo meeting and beating CUDA, and leaving DSLs in the dust.
Chris Lattner tweet media
English
27
78
688
138.1K