UW SyFi (@UWSyFi) - Twitter Profile

UW SyFi@UWSyFi·2d

📄 Paper: tracelab.cs.washington.edu/paper.pdf
✍️ Blog: syfi.cs.washington.edu/blog/2026-06-2…
🛠️ Trace and code: github.com/uw-syfi/TraceL…
🌐 Live demo: tracelab.cs.washington.edu, where you can ask our chatbot questions about the trace, as well as analyze and explore your own traces.

This work was led by @KanZhu854772, @mat_jacob1002, Chenxi Ma, @conlesspan, @thepadawang, @arvind_uw, and @bariskasikci! (8/n)


UW SyFi@UWSyFi·2d

🛠️ Our release includes trace examples, the collector/sanitizer, the full anonymized trace with analysis scripts and a chatbot, and a replay client for serving engines such as vLLM and SGLang.

🔍 This trace is an early look at real coding-agent traffic: self-driven loops, long-context short-output rounds, long-tailed tool execution, and imperfect prefix caching. It is also biased toward our own projects and habits, which is why we are releasing the full pipeline.

🤝 If you use Claude Code, Codex, or another coding agent, try TraceLab on your own logs, share a sanitized trace if you are comfortable, and help turn this first data point into a shared community resource. (7/n)


UW SyFi@UWSyFi·2d

🔥 Coding agents have become one of the hottest LLM workloads. But serving them looks nothing like serving a chatbot: 294× more input than output, hundreds of thousands of tool calls, and extremely long-tailed latency.

🚀 We are releasing the SyFI Coding Trace: ~4,300 real-world coding-agent sessions from our daily use, plus TraceLab, an open-source pipeline to collect, sanitize, analyze, and replay your own traces. More in the thread below 🧵👇 (1/n)


UW SyFi retweeted

Stephanie Wang@thepadawang·23 Jun

M* is a new system for multimodal inference from our lab @uwsyfi. The system captures multimodal models as dataflow graphs, implements a generic engine for those graphs, and achieves SOTA inference throughput and latency. Learn more here! m-star.org

Keisuke Kamahori@KeisukeKamahori

New multimodal model architectures shouldn't require new serving systems. Introducing our work, M* (M-Star): a universal serving system for multimodal models that separates what a model computes - a dataflow graph - from how it runs: placement, scheduling, batching, and transport. Joint work across @uwcse, @StanfordAILab, and @CMU_ECE with Atindra Jha, Naomi Sagan, Irmak Sivgin, Rohan Sanda, @ste_veng, Mark Horowitz, @LukeZettlemoyer, Olivia Hsu, @jure, @bariskasikci, and @thepadawang.


UW SyFi retweeted

Keisuke Kamahori@KeisukeKamahori·24 Jun

Excited to share that I’ll be interning at @nvidia this summer at the Santa Clara HQ, working on GPU architecture! If you’re in the Bay Area, I’d love to grab coffee! Always happy to chat about agents, ML systems, GPU architecture, or anything in between. #NVIDIALife


UW SyFi retweeted

Keisuke Kamahori@KeisukeKamahori·22 Jun

New multimodal model architectures shouldn't require new serving systems. Introducing our work, M* (M-Star): a universal serving system for multimodal models that separates what a model computes - a dataflow graph - from how it runs: placement, scheduling, batching, and transport. Joint work across @uwcse, @StanfordAILab, and @CMU_ECE with Atindra Jha, Naomi Sagan, Irmak Sivgin, Rohan Sanda, @ste_veng, Mark Horowitz, @LukeZettlemoyer, Olivia Hsu, @jure, @bariskasikci, and @thepadawang.


UW SyFi@UWSyFi·10 Jun

Joint work between Megan Frisella, Shubham Tiwari, Andy Ruan, Yi Pan, Parker Gustafson, Mat Jacob, Gilbert Bernstein, Stephanie Wang at UW SyFI. Check out our paper! 📄 arxiv.org/abs/2606.11169 📝 syfi.cs.washington.edu/blog/2026-06-0… 💻 github.com/uw-syfi/piper


UW SyFi@UWSyFi·10 Jun

Unlike current frameworks, Piper correctly composes pipeline parallelism with ZeRO-2 and ZeRO-3 memory optimizations. In our experiments on Qwen3 9B, Piper encodes correct sharding semantics and supports larger batch sizes where Megatron, DeepSpeed, and TorchTitan fall short.


UW SyFi@UWSyFi·10 Jun

New distributed training strategies should not require new distributed runtimes. Introducing Piper: a programmable PyTorch training system for deploying complex training strategies by separating model placement and GPU scheduling from model code. 📄 arxiv.org/abs/2606.11169


UW SyFi retweeted

Mathew Jacob@mat_jacob1002·22 May

If there’s one thing you should do to learn how to build performant systems in this AI era, it is following @KeisukeKamahori @sudopowr and @MichaelGu341332!

Baris Kasikci@bariskasikci

Super stoked that UW SyFI (syfi.cs.washington.edu) members won a number of prizes at the MLSys'26 competition, NVIDIA Track. Huge congrats to @KeisukeKamahori, @sudopowr, Yile Gu, Wei Shen, Steven Gao! Thanks to @nvidia, @modal, and the FlashInfer team for the support.
1st place in the GDN Track - Full-Agent Approach
2nd place in the GDN Track - Agent-Assisted Approach
3rd place in the DSA Track - Full-Agent Approach


UW SyFi retweeted

Baris Kasikci@bariskasikci·22 May

Super stoked that UW SyFI (syfi.cs.washington.edu) members won a number of prizes at the MLSys'26 competition, NVIDIA Track. Huge congrats to @KeisukeKamahori, @sudopowr, Yile Gu, Wei Shen, Steven Gao! Thanks to @nvidia, @modal, and the FlashInfer team for the support.
1st place in the GDN Track - Full-Agent Approach
2nd place in the GDN Track - Agent-Assisted Approach
3rd place in the DSA Track - Full-Agent Approach


UW SyFi retweeted

Keisuke Kamahori@KeisukeKamahori·23 May

Very excited to share that our team at @UWSyFi won multiple prizes at the FlashInfer AI Kernel Generation Contest in #MLSys2026! Huge thanks for organizing an amazing contest @ye_combinator @yi_xin_dong @charles_irl


