jason

2.8K posts

jason banner
jason

jason

@jvmncs

❤️s/RTs are randomized and differentially private.

Brooklyn Bergabung Şubat 2010
1.3K Mengikuti1.1K Pengikut
🎭
🎭@deepfates·
We know about "load-bearing" "smoke test" "genuinely" "explicitly" etc.. What other llmisms have you found?
English
398
110
986
121.2K
jason me-retweet
Connor
Connor@cnnradams·
light work
Connor tweet media
English
3
8
40
18.3K
François Fleuret
François Fleuret@francoisfleuret·
Speculative decoding is the closest you can get to a free lunch method. It is beautiful and astounding. I am surprised that it does not play a greater role in "AI".
English
23
15
440
48.6K
jason
jason@jvmncs·
buried in this post is a note about our from-scratch framework for draft model training by @_dcw02 I’ve used it to chase down a few research ideas, and damn does that thing rip. such a joy to use
Charles 🎉 Frye@charles_irl

Speculation Is All You Need. In this blog post, we announce the co-release (w/ Z Lab) of six more state-of-the-art DFlash speculators for @Alibaba_Qwen 3.x. Over 1k output tps for 3.5 122B-A10B on a B200. Read the blog for why we're all-in on spec dec. modal.com/blog/spec-is-a…

English
3
3
29
8.1K
jason me-retweet
Charles 🎉 Frye
Charles 🎉 Frye@charles_irl·
Speculation Is All You Need. In this blog post, we announce the co-release (w/ Z Lab) of six more state-of-the-art DFlash speculators for @Alibaba_Qwen 3.x. Over 1k output tps for 3.5 122B-A10B on a B200. Read the blog for why we're all-in on spec dec. modal.com/blog/spec-is-a…
English
35
100
697
185.3K
jason
jason@jvmncs·
current status
jason tweet media
English
0
0
16
1.2K
jason me-retweet
Cedar You
Cedar You@our_decay·
Watched a cute animal video that I knew to be AI all the way through
Cedar You tweet media
English
108
1.8K
27.1K
656.3K
jason me-retweet
David Wang
David Wang@_dcw02·
9+ accept lengths on coding workloads generic drafter btw qwen 397b 4x faster repro btw dflash go brrr
Modal@modal

We worked with @lmsysorg and z-lab.ai to - integrate DFlash spec into @sgl_project - make it faster with overlap - train a DFlash drafter for @Alibaba_Qwen 397B-A17B The result: up to 4.3x greater throughput over baseline and 1.5x over native MTP.

English
2
6
46
9.8K
🎭
🎭@deepfates·
yep. That looks like an event designed by AIs all right
🎭 tweet media
English
12
5
229
15.5K
Federico Cassano
Federico Cassano@ellev3n11·
interesting project: training codebase that gets high utilization for training small small models. big runs are cool, but lots of small runs is also cool.
English
4
1
70
7.6K
Andrew Hinh
Andrew Hinh@ajhinh·
After graduating this weekend, I'll be joining @modal as a Developer Relations Engineer! I want to describe how I got here, as my path was rather unconventional. My first "connection" to Modal was back in 2022: after graduating high school, I took @charles_irl's Full Stack Deep Learning course, where I created admirer, a flavor of a VLM powered by AWS Lambda and GPT-3 (for those who remember!). I suppose my age and being a one-person team left an impression on him, and we continued to stay in touch. When he discovered Modal, I quickly became a user and found it just so delightful and easy to use. Plus, the free $30/month was more than enough for personal projects and experimentation, and I was always telling others to try it out. In fact, during a summer internship at an edtech startup, I helped secure a $5000 grant that allowed us to switch from to Modal for our fine-tuning and deployment jobs. Last summer, Charles unexpectedly offered an internship on the growth team, where I was initially uncertain how I'd use my full-stack ML experience at an infrastructure company. As it turned out, quite nicely: while contributing to the wide-ranging (and actually helpful!) set of examples (modal.com/docs/examples), I quickly saw that a sufficiently useful and captivating example empowered devs to take the next step. Soon after, I was tasked with showing how to mesh together RL, LLMs, and Modal Sandboxes. After a weekend or two of experimentation, I came up with a web demo of Street Fighter III where you could play against an RL-trained Qwen 3-8B (btw, you can try it out here: andrewhinh--sf3.modal.run). The most fun part for me, besides getting it to work as well as it did, was seeing the joy and excitement from the team. What makes me so excited to rejoin is that, really, I'm just continuing where I left off last summer to spread the good word about Modal. I can't thank Charles, @bernhardsson, @akshat_b, and the team at Modal enough for the opportunity to do so. Stay tuned for more!
Andrew Hinh tweet media
English
10
5
100
12.9K
jason me-retweet
Leon
Leon@iamleonli·
How far can we compress the discrete tokens in an LLM's context into compact latent vectors? With the right training recipe at large scale, our Latent Context Language Models (LCLMs) compress context up to 16× and land on a new Pareto frontier for long-context inference. 🧵(1/n)
Leon tweet media
English
1
23
68
8.3K
jason me-retweet
joy liu
joy liu@qjoyliu·
The future of training is open source. Super excited to announce that we've joined forces with HuggingFace, Nvidia, Meta, Mercor and other leading companies to support OpenEnv :)
Ben Burtenshaw@ben_burtenshaw

So excited to be opening up OpenEnv to the whole community. It will now be owned by @huggingface , Meta-PyTorch, @reflection_ai , @UnslothAI , @modal, @PrimeIntellect , @NVIDIAAI , @mercor_ai , and @fleet_ai . the reason is: frontier labs train the model and the harness together, so the model is fitted to its harness. that coupling is a chunk of why claude code and codex feel so good. open source can't do that. you bring whatever harness, whatever model, whatever env, whatever trainer. which is the whole point of open source and also the problem for training. openenv is the socket in between all of this. in short: it's a protocol layer, not a reward framework. it does not have opinions about your rewards or your training loop. those live in the libs that are actually good at them. read more in the blog post. it's early, come break it.

English
1
5
43
5.7K
jason me-retweet
Modal
Modal@modal·
Reinforcement learning has exploded on Modal, and we've been cooking. Here's a review of lessons learned helping teams train at scale, the patterns we kept seeing, and an open-source library to get started with RL on Modal quickly.
English
3
28
272
100K