Steven G

3 posts

@ste_veng

@uwcse

Joined February 2026
80 Following, 7 Followers
Steven G retweeted
Keisuke Kamahori (@KeisukeKamahori)
New multimodal model architectures shouldn't require new serving systems. Introducing our work, M* (M-Star): a universal serving system for multimodal models that separates what a model computes - a dataflow graph - from how it runs: placement, scheduling, batching, and transport. Joint work across @uwcse, @StanfordAILab, and @CMU_ECE with Atindra Jha, Naomi Sagan, Irmak Sivgin, Rohan Sanda, @ste_veng, Mark Horowitz, @LukeZettlemoyer, Olivia Hsu, @jure, @bariskasikci, and @thepadawang.
Steven G retweeted
Stanford AI Lab (@StanfordAILab)
Modern multimodal models aren't a single decode loop anymore; they're composite. M* is one runtime that serves them all, and it matches or beats every specialized system: up to 2.7× on omni TTS, 12.5× on world-model rollouts. Learn more here: ai.stanford.edu/blog/mstar/
Steven G retweeted
UW SyFi (@UWSyFi)
New distributed training strategies should not require new distributed runtimes. Introducing Piper: a programmable PyTorch training system for deploying complex training strategies by separating model placement and GPU scheduling from model code. 📄 arxiv.org/abs/2606.11169