David Wang (@_dcw02) - Twitter Profile

David Wang@_dcw02·21 Mar

@ekzhang1 hope you feel better soon!

0

1

169

Eric Zhang@ekzhang1·21 Mar

I keep getting sick recently. this sucks. I wish my immune system wasn't so terrible. maybe I should replace it with an AI

12

0

60

6.9K

David Wang retweeted

Zhijian Liu@zhijianliu_·16 Mar

DFlash⚡ meets OpenClaw🦞 = FlashClaw Same Claw. >4X faster or cheaper. DFlash support for Qwen3.5 is live — outperforming native MTP by up to 2.3X. More to come! 🔥

12

38

196

19.7K

David Wang@_dcw02·22 Feb

@vikhyatk @realmcore_ blind (codex) leading the blind (me)

1

0

1

28

vik@vikhyatk·22 Feb

@realmcore_ also has zero vision capabilities

3

0

22

1.8K

akira@realmcore_·22 Feb

Codex 5.3 appears to be a developer's developer. A swe's swe. A model for the people. Although incapable of talking to the people.

24

8

308

19.1K

David Wang@_dcw02·18 Feb

@Dorialexander @AmpCode we have instructions to set up opencode with our glm5 endpoint here: modal.com/blog/try-glm-5 :)

0

2

183

Alexander Doria@Dorialexander·18 Feb

So I went with @AmpCode and, unfortunately… (and no, most of my students did not create accounts in time). What is the best recommended free(mium) alternative now?

Alexander Doria@Dorialexander

What is the recommended free/freemium alternative to claude code? GLM on OpenCode? Codex works with free gpt? (for my students, so that they can start without subscription).

13

1

37

10.6K

David Wang retweeted

Robert Clausecker@FUZxxl·16 Feb

@lisperati Just write assembly code. That has always been allowed.

0

1

9

2.7K

David Wang@_dcw02·12 Feb

GIF

Charles 🎉 Frye@charles_irl

The GLM models by @Zai_org have been a gamechanger for me. I was reluctant to embrace coding agents before I could run the models myself. Now, with GLM-5, I have a top-quality self-hosted intelligence endpoint tightly integrated into my engineering work. github.com/modal-projects…

0

1

6

1.5K

David Wang@_dcw02·11 Feb

powered in part by b300s 👀

Modal@modal

GLM-5, the latest frontier open model from @Zai_org, is available now on Modal. We partnered with Z.ai to release an endpoint that will be free for a limited time.

2

0

13

1.3K

David Wang retweeted

Jian Chen@jianchen1799·7 Feb

Check out our paper and code! Follow our Hugging Face — more draft models for popular LLMs coming soon 🚀 HF: huggingface.co/collections/z-… Code: github.com/z-lab/dflash Paper: arxiv.org/abs/2602.06036

Zhijian Liu@zhijianliu_

The paper is now available: huggingface.co/papers/2602.06… More updates coming soon!

0

4

9

711

David Wang retweeted

Zhijian Liu@zhijianliu_·7 Feb

The paper is now available: huggingface.co/papers/2602.06… More updates coming soon!

Zhijian Liu@zhijianliu_

Holiday cooking finally ready to serve! 🥳 Introducing DFlash — speculative decoding with block diffusion. 🚀 6.2× lossless speedup on Qwen3-8B ⚡ 2.5× faster than EAGLE-3 Diffusion vs AR doesn’t have to be a fight. At today’s stage: • dLLMs = fast, highly parallel, but lossy • AR LLMs = accurate, sequential, but slow DFlash = diffusion drafts, AR verifies.

6

42

305

39.3K
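
The "diffusion drafts, AR verifies" loop in the DFlash announcement above can be illustrated with a toy greedy speculative-decoding sketch. This is not the DFlash implementation: `target_next`, `draft_next`, `draft_block`, `verify`, `speculative_decode`, and `greedy_decode` are all hypothetical names, the "models" are small arithmetic functions, and the drafter here is a sequential stand-in for a block-diffusion drafter that would propose the whole block in parallel. The sketch only shows why verification keeps the output identical to plain target decoding.

```python
from typing import List, Tuple


def target_next(ctx: List[int]) -> int:
    # Toy stand-in for the AR target model: deterministic greedy next token.
    return (sum(ctx) + len(ctx)) % 7


def draft_next(ctx: List[int]) -> int:
    # Toy drafter (assumption): agrees with the target except when len(ctx) % 4 == 3.
    tok = target_next(ctx)
    return (tok + 1) % 7 if len(ctx) % 4 == 3 else tok


def draft_block(ctx: List[int], k: int) -> List[int]:
    # Propose k tokens. A diffusion drafter would emit these in parallel;
    # this stand-in simply loops.
    block: List[int] = []
    for _ in range(k):
        block.append(draft_next(ctx + block))
    return block


def verify(ctx: List[int], block: List[int]) -> List[int]:
    # Accept drafted tokens while they match the target's greedy choice.
    # A real engine scores all positions in one target forward pass;
    # the per-position calls here are only for readability.
    accepted: List[int] = []
    for tok in block:
        expected = target_next(ctx + accepted)
        if tok != expected:
            accepted.append(expected)  # replace the first mismatch and stop
            return accepted
        accepted.append(tok)
    accepted.append(target_next(ctx + accepted))  # bonus token if all drafts match
    return accepted


def speculative_decode(prompt: List[int], n_new: int, k: int = 4) -> Tuple[List[int], int]:
    # Returns the generated sequence and the number of verification rounds.
    out = list(prompt)
    rounds = 0
    while len(out) - len(prompt) < n_new:
        out += verify(out, draft_block(out, k))
        rounds += 1
    return out[: len(prompt) + n_new], rounds


def greedy_decode(prompt: List[int], n_new: int) -> List[int]:
    # Baseline: one target call per generated token.
    out = list(prompt)
    for _ in range(n_new):
        out.append(target_next(out))
    return out
```

Calling `speculative_decode([1, 2, 3], 20)` returns the same tokens as `greedy_decode([1, 2, 3], 20)` while using fewer verification rounds than generated tokens. Every emitted token is the target's own greedy choice, which is the sense in which the speedup is lossless. The actual speedup depends on how often drafts are accepted.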

David Wang@_dcw02·6 Feb

@tianyin_xu happy to refer him to Modal

0

4

606

Tianyin Xu@tianyin_xu·6 Feb

Anyone has a job that needs strong OS/System research and engineering skills? My postdoc Jongyul Kim is looking for a research oriented job. He does great work on Storage and Memory Systems. You can find his work at: yulistic.github.io His CV can be found at: yulistic.github.io/files/CV_jongy…

3

14

99

10.6K

David Wang@_dcw02·24 Jan

@charles_irl you can leave gpu poor, but gpu poor never leaves you

1

0

2

137

Charles 🎉 Frye@charles_irl·24 Jan

just tripled my money (throughput per dollar) by predicting multiple tokens at once. amazing alpha (acceptance rate) here

Charles 🎉 Frye@charles_irl

I went into AI, not crypto, but I still ended up speculating on tokens.

1

0

52

3.9K

David Wang@_dcw02·24 Jan

docker pull modalresearch/sglang:v0.5.7-fa4-preview

Erik Bernhardsson@bernhardsson

It’s ironic that Blackwells have been out since 2024 but people still prefer Hoppers because the kernels aren’t Blackwell-optimized yet, and now the Hopper prices are going up.

1

11

2.9K

David Wang retweeted

sona dolasia@teenychairs·17 Jan

🌟

Sam Hogan 🇺🇸@samhogan

All the homies love @modal

3

2

39

3.9K

David Wang retweeted

Zhijian Liu@zhijianliu_·10 Jan

⚡ Speed of flash. Just 2 days after launch, DFlash is already running in SGLang (@sgl_project). With serving-engine support, we can now unlock speedup with higher concurrency, and we’ve quickly worked on a new demo based on it. We'll be cooking up more and better draft models over the next few weeks.🔥 Stay tuned!

Akshat Bubna@akshat_b

Two days since DFlash was released, and @_dcw02 (on @modal research) already shipped support for it in SGLang. Why are we so excited about this? Diffusion speculators let us get *way* higher tok/s than auto-regressive models. E.g. we're seeing a 4.73x boost with H200s + FA3 already — with still more improvements to come! Reach out to us if we can help you get this in prod today, and huge thanks to @zhijianliu_ and team for coming up with this technique.

7

26

230

21.1K

David Wang@_dcw02·10 Jan

@_alyxya @akshat_b @modal yes this is similar to EAGLE3 where you train a separate draft model. the graph numbers are for batch size 1. we’re working to continue optimizing performance!

1

0

1

50

alyxya@_alyxya·10 Jan

@akshat_b @_dcw02 @modal Is this a separate trained model for speculative decoding? Is this for a single batch size? I like the idea of diffusion speculators and expect there to be a ton of possible optimizations.

1

0

658

Akshat Bubna@akshat_b·10 Jan

Two days since DFlash was released, and @_dcw02 (on @modal research) already shipped support for it in SGLang. Why are we so excited about this? Diffusion speculators let us get *way* higher tok/s than auto-regressive models. E.g. we're seeing a 4.73x boost with H200s + FA3 already — with still more improvements to come! Reach out to us if we can help you get this in prod today, and huge thanks to @zhijianliu_ and team for coming up with this technique.

9

23

203

54.7K

David Wang@_dcw02·10 Jan

@zhijianliu_ and @jianchen1799 are really amazing, they have the Mandate of Heaven

0

2

267

David Wang@_dcw02·10 Jan

friendship ended with EAGLE3 now DFlash is my best friend

Akshat Bubna@akshat_b

Two days since DFlash was released, and @_dcw02 (on @modal research) already shipped support for it in SGLang. Why are we so excited about this? Diffusion speculators let us get *way* higher tok/s than auto-regressive models. E.g. we're seeing a 4.73x boost with H200s + FA3 already — with still more improvements to come! Reach out to us if we can help you get this in prod today, and huge thanks to @zhijianliu_ and team for coming up with this technique.

1

2

12

2.1K

David Wang retweeted

Erik Bernhardsson@bernhardsson·8 Jan

We want to make B200s competitive with H100s with open source inference engines. You're welcome, Jensen! * github.com/Dao-AILab/flas… * github.com/Dao-AILab/flas… * github.com/Dao-AILab/flas…

4

14

175

31.9K

David Wang@_dcw02·3 Jan

@tenderizzation @fujikanaeda @vikhyatk grave escape?

0

64

tender@tenderizzation·2 Jan

@fujikanaeda @vikhyatk sorry got distracted swapping out some leaps

2

0

13

5.1K

vik@vikhyatk·2 Jan

somehow ended up with three mechanical keyboards in the office. creeps up on you

6

0

33

3.3K

David Wang@_dcw02·5 Dec

@natolambert @humansand human sand would be a metal name for a chip making company

0

5

484

Nathan Lambert@natolambert·5 Dec

I'm a fan of a lot of folks going to @humansand unfortunately it'll be referred to as human-sand and not humans-and. Kind of ominous. Can't unsee it. It's your viral marketing strategy.

8

0

94

16.2K
