David Wang

80 posts

@_dcw02

gpu perf @modal

Joined August 2024
276 Following · 303 Followers
Eric Zhang (@ekzhang1)
I keep getting sick recently. this sucks. I wish my immune system wasn't so terrible. maybe I should replace it with an AI
12 replies · 0 reposts · 60 likes · 6.9K views
David Wang reposted
Zhijian Liu (@zhijianliu_)
DFlash⚡ meets OpenClaw🦞 = FlashClaw Same Claw. >4X faster or cheaper. DFlash support for Qwen3.5 is live — outperforming native MTP by up to 2.3X. More to come! 🔥
12 replies · 38 reposts · 196 likes · 19.7K views
vik (@vikhyatk)
@realmcore_ also has zero vision capabilities
3 replies · 0 reposts · 22 likes · 1.8K views
akira (@realmcore_)
Codex 5.3 appears to be a developer's developer. A swe's swe. A model for the people. Although incapable of talking to the people.
24 replies · 8 reposts · 308 likes · 19.1K views
David Wang reposted
Robert Clausecker (@FUZxxl)
@lisperati Just write assembly code. That has always been allowed.
0 replies · 1 repost · 9 likes · 2.7K views
David Wang reposted
Tianyin Xu (@tianyin_xu)
Anyone has a job that needs strong OS/System research and engineering skills? My postdoc Jongyul Kim is looking for a research oriented job. He does great work on Storage and Memory Systems. You can find his work at: yulistic.github.io His CV can be found at: yulistic.github.io/files/CV_jongy…
3 replies · 14 reposts · 99 likes · 10.6K views
David Wang reposted
Zhijian Liu (@zhijianliu_)
⚡ Speed of flash. Just 2 days after launch, DFlash is already running in SGLang (@sgl_project). With serving-engine support, we can now unlock speedup with higher concurrency, and we’ve quickly worked on a new demo based on it. We'll be cooking up more and better draft models over the next few weeks.🔥 Stay tuned!
Quoting Akshat Bubna (@akshat_b):
Two days since DFlash was released, and @_dcw02 (on @modal research) already shipped support for it in SGLang. Why are we so excited about this? Diffusion speculators let us get *way* higher tok/s than auto-regressive models. E.g. we're seeing a 4.73x boost with H200s + FA3 already — with still more improvements to come! Reach out to us if we can help you get this in prod today, and huge thanks to @zhijianliu_ and team for coming up with this technique.

7 replies · 26 reposts · 230 likes · 21.1K views
David Wang (@_dcw02)
@_alyxya @akshat_b @modal yes, this is similar to EAGLE3, where you train a separate draft model. the graph numbers are for batch size 1. we’re working to continue optimizing performance!
1 reply · 0 reposts · 1 like · 50 views
alyxya (@_alyxya)
@akshat_b @_dcw02 @modal Is this a separate trained model for speculative decoding? Is this for a single batch size? I like the idea of diffusion speculators and expect there to be a ton of possible optimizations.
1 reply · 0 reposts · 0 likes · 658 views
Akshat Bubna (@akshat_b)
Two days since DFlash was released, and @_dcw02 (on @modal research) already shipped support for it in SGLang. Why are we so excited about this? Diffusion speculators let us get *way* higher tok/s than auto-regressive models. E.g. we're seeing a 4.73x boost with H200s + FA3 already — with still more improvements to come! Reach out to us if we can help you get this in prod today, and huge thanks to @zhijianliu_ and team for coming up with this technique.
9 replies · 23 reposts · 203 likes · 54.7K views
David Wang (@_dcw02)
friendship ended with EAGLE3 now DFlash is my best friend
1 reply · 2 reposts · 12 likes · 2.1K views
vik (@vikhyatk)
somehow ended up with three mechanical keyboards in the office. creeps up on you
6 replies · 0 reposts · 33 likes · 3.3K views
Nathan Lambert (@natolambert)
I'm a fan of a lot of folks going to @humansand. Unfortunately, it'll be referred to as human-sand and not humans-and. Kind of ominous. Can't unsee it. It's your viral marketing strategy.
8 replies · 0 reposts · 94 likes · 16.2K views