Victor Guerra

1.6K posts

Victor Guerra banner
Victor Guerra

Victor Guerra

@vguerra

Software engineer @CriteoAILab

Paris Katılım Ağustos 2008
2K Takip Edilen522 Takipçiler
Victor Guerra
Victor Guerra@vguerra·
@marksaroufim @vitransformer Witnessing what you did with the CUD…, 😮‍💨 sorry, GPU mode community was great, wishing you all the best on what comes next!
English
0
0
0
47
Mark Saroufim
Mark Saroufim@marksaroufim·
After 5 amazing years, I’m leaving the PyTorch team at Meta. I did my best work there and got to work with some of the smartest, most OSS pilled engineers in the industry. More soon on what’s next: still systems, still OSS (but not everything), a smaller team with a lot of GPUs
Mark Saroufim tweet media
English
100
28
1.3K
70.8K
Victor Guerra retweetledi
Matej Sirovatka
Matej Sirovatka@m_sirovatka·
What’s the best model you can train in a day if someone hands you a pile of Blackwell GPUs? You can try out yourself On April 9 in Paris, @GPU_MODE + @verdacloud + @sestercegroup are hosting a GPU hackathon with a bunch of GPUs to run on and even more of them for the winners.
English
12
8
160
8.7K
Victor Guerra retweetledi
Modular
Modular@Modular·
Mojo 🔥 just added a powerful new reflection capability that can synthesize trait conformances for you. In other languages, this is only available to compiler authors. Maxim Zaks breaks down how Mojo makes metaprogramming explicit and accessible: mzaks.medium.com/when-magic-bec…
English
1
12
78
15.9K
Victor Guerra retweetledi
Vincent Abbott
Vincent Abbott@vtabbott_·
Released pyncd/tsncd today. We now have a way to manage deep learning algorithms algebraically! The code on the left renders into the diagram on the right. Mismatched domain/codomains are automatically managed under the hood.
Vincent Abbott tweet mediaVincent Abbott tweet media
English
9
47
421
19K
Victor Guerra retweetledi
Aart Bik
Aart Bik@AartBik·
The journey that started a long time ago as sparse compilation, enriched with new ideas by so many others along the way, continues with the introduction of the Universal Sparse Tensor (UST) type. developer.nvidia.com/blog/establish…
English
0
1
9
309
Victor Guerra retweetledi
tetsuo.cpp (no slop)
tetsuo.cpp (no slop)@tetsuo_cpp·
If you work with MLIR and like programming with agents, try the Context7 MCP. LLMs have historically been abysmal at MLIR (probably due to lack of training data), but Context7 helps the agent discover APIs and use them correctly. The same goes for any relatively niche library.
English
3
3
37
3.4K
Victor Guerra retweetledi
tetsuo.cpp (no slop)
tetsuo.cpp (no slop)@tetsuo_cpp·
Oh, you're writing CUDA kernels? Everyone's on Triton now. Just kidding, we're all on Mojo. We're using cuTile. We're using ROCm. We have an in-house DSL compiler targeting the NVGPU MLIR dialect but wait, Tile IR just dropped so we're going to target that instead. Our PM is on TileLang. The team lead was on CuTe but now she's back to handwriting PTX. If you're not on Pallas, you're ngmi. Our intern is building on TT-Metalium for our Wormholes. Our CFO approved an order for some big chungus wafer-scale chips so now we're porting our kernels to CSL. Our CTO is working on a kernel-less graph compiler so we won't need to write kernels anymore. Our CEO thinks we're talking about the Linux kernel. We're building Claude for dogs.
English
67
179
2.8K
184K
Victor Guerra retweetledi
Modular
Modular@Modular·
From an idea to powering world-leading AI models and cutting-edge accelerators — Mojo has come a long way. Now, Mojo 1.0 is on the horizon: stability, open-source plans, and new tooling for developers everywhere. Read our latest blog post to explore the road to 1.0 ⬇️
English
2
17
82
12.9K
Zach Mueller
Zach Mueller@TheZachMueller·
After almost 4 years at @huggingface, it's time to move on. When I joined, I had only ever trained on Google Colab and was shellshocked and in awe at the master-class of people I'd be working with, @GuggerSylvain, @StasBekman, @mervenoyann, and so many more. 4 years later and accelerate is now the backbone of the entire Hugging Face ecosystem, and it's been a gateway to allow for anyone to do distributed training. I couldn't be more proud of my time, and my team, as we tackled the fun problems of raising up the entire community into the modern age of distributed training. I won't say what's next for me yet, but I won't be going too far in the world of AI, and especially the wonderful communities I've been able to be a part of over the nearly decade of being in OSS.
Zach Mueller tweet mediaZach Mueller tweet media
English
39
2
301
23.6K
Victor Guerra retweetledi
Chris Lattner
Chris Lattner@clattner_llvm·
We know that one of the biggest barriers to programming GPUs is access to hardware: "Code you’ve written for NVIDIA or AMD GPUs should now mostly just work on an Apple🍎 Silicon GPU, assuming no device-specific features were being used." Preview here:👇 forum.modular.com/t/apple-silico…
English
17
71
782
86K
Victor Guerra retweetledi
Zach Mueller
Zach Mueller@TheZachMueller·
In order to celebrate the release of the print version for the Ultra-Scale Playbook (of which I have no affiliation with and love deeply), I'm going to be giving away 5 copies! To enter, simply like + retweet this tweet. Winners will be selected at random 10AM EST on the 13th
Zach Mueller tweet media
English
11
149
310
31.4K
Victor Guerra retweetledi
Modular
Modular@Modular·
Learn hands-on GPU programming in Mojo🔥 by solving puzzles🧩! Start here: builds.modular.com/puzzles
English
2
20
137
21.9K