
Edward Z. Yang

@ezyang
I work on PyTorch at Meta. Chatty alt at @difficultyang.




Pro-tip: using CUDA graphs and annoyed that all the kernels have no labels in your profiles? Get a nightly that has the mark_kernels context manager: github.com/pytorch/pytorc… (thanks Natalia and Shangdi for implementing!) You need a 13.1 driver, but the user mode driver is enough.







Distributed training is hard. We adopted DTensor at Runway to prevent silent gradient bugs and it delivered. But we traded performance for correctness, hitting dispatch overhead, recompilation storms, and MFU drops. Wrote up what we learned and how we work around it. runwayml.com/news/dtensor-d…


Introducing Zero: the programming language for agents. I wanted a systems language that was faster, smaller, and easier for agents to use and repair. Explicit capabilities. JSON diagnostics. Typed safe fixes. Made for agents on day zero.





