Less Wright

10 posts

@lessw2020

@PyTorch, Large Scale Distributed AI Training, Object Detection, Optimizers, Stock Indexes

Joined November 2016
17 Following · 183 Followers
Less Wright @lessw2020
@SemiAnalysis_ ymmv, but I've run 2K-scale H200 and B200 runs with a 70B model, up to 3D parallel, with regional torch.compile with no issues. Compile is not distributed-aware, so the better method imo is regional compile of the transformer blocks, not full-model compile: github.com/pytorch/torcht…#L345
0 replies · 0 reposts · 3 likes · 247 views
SemiAnalysis @SemiAnalysis_
Torch compile is great for single node but horrible for any serious multi-node production training. It is a giant footgun with 100+ mini footguns waiting to be stepped on.
9 replies · 5 reposts · 169 likes · 99.2K views
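A minimal sketch of the regional-compile idea from the reply above, assuming a model that keeps its transformer blocks in a `layers` container. `Block`, `TinyModel`, and `compile_blocks` are hypothetical names used only for illustration; this is not the code behind the truncated link, and any distributed wrapping (FSDP, tensor or pipeline parallel) is left out. The demo passes `backend="eager"` only so it runs without a compiler toolchain.

```python
import torch
import torch.nn as nn


class Block(nn.Module):
    """Toy stand-in for a transformer block (hypothetical, illustration only)."""

    def __init__(self, dim: int) -> None:
        super().__init__()
        self.norm = nn.LayerNorm(dim)
        self.mlp = nn.Sequential(
            nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Pre-norm residual MLP; attention is omitted to keep the sketch short.
        return x + self.mlp(self.norm(x))


class TinyModel(nn.Module):
    """Toy model whose repeated blocks live in `self.layers` (an assumption)."""

    def __init__(self, dim: int = 16, n_layers: int = 2) -> None:
        super().__init__()
        self.layers = nn.ModuleList([Block(dim) for _ in range(n_layers)])

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        for layer in self.layers:
            x = layer(x)
        return x


def compile_blocks(model: nn.Module, **compile_kwargs) -> nn.Module:
    """Compile each block in `model.layers` on its own, not the whole model.

    Each block is wrapped by torch.compile separately and re-registered
    under its original name, so the outer model's forward stays uncompiled.
    """
    for name, block in list(model.layers.named_children()):
        model.layers.register_module(name, torch.compile(block, **compile_kwargs))
    return model


if __name__ == "__main__":
    model = compile_blocks(TinyModel(), backend="eager")
    out = model(torch.randn(4, 16))
    print(out.shape)
```

Dropping `backend="eager"` uses torch.compile's default backend. Full-model compile would instead be a single `torch.compile(model)` call around the whole module.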
Less Wright @lessw2020
Our TorchTitan paper has been accepted to ICLR 2025! (lnkd.in/gw5gfVVA) From the paper chair: "I recommend Accept ...: (a) This is a production-grade framework that covers a wide range of parallelism method and .... is likely to have significant impact"
0 replies · 2 reposts · 6 likes · 705 views
Less Wright retweeted
George E. Dahl @GeorgeEDahl
We've just released the first version of our Deep Learning Tuning Playbook! This is our attempt to distill our process for actually getting good results with deep learning. We emphasize hyperparameter tuning since it has been a large pain point. github.com/google-researc…
44 replies · 795 reposts · 3.6K likes · 670.7K views
Less Wright @lessw2020
testing my new handle
0 replies · 0 reposts · 0 likes · 0 views
Less Wright @lessw2020
Excellent article on vector processing on CPU and GPU. If you want to dive deeper into what's happening internally and the tradeoffs involved, this covers SIMD, SIMT, blocks, warps, and more: erik-engheim.medium.com/vector-process…
0 replies · 0 reposts · 1 like · 0 views
Less Wright @lessw2020
@IOHK_Charles This is a badge of honor. Bitcoin went through the very same thing with Wikipedia in 2010 or so (for real!). Basically, imo this anoints ADA as the successor to BTC. That said, I'll go update the page tomorrow with some links/news to make sure it stays.
0 replies · 0 reposts · 2 likes · 0 views