Jeff Willette

35 posts

Jeff Willette

Jeff Willette

@TheOneJeffrey

Katılım Mayıs 2012
41 Takip Edilen27 Takipçiler
Jeff Willette retweetledi
Jeff Willette
Jeff Willette@TheOneJeffrey·
We propose to first perform query-sparse attention and combine the output with a generic key-sparse attn. method. This drastically boosts the performance of sparse attention while adding minimal overhead, and its dead simple to implement on pretrained models.
English
0
0
0
44
Jeff Willette
Jeff Willette@TheOneJeffrey·
Most sparse attention methods focus on key-sparse attention. We found that key-sparse attention causes a distributional shift in outputs. Therefore, even if you switch to dense attention during decode, the queries and the keys might no longer match...
English
1
0
0
56
Jeff Willette
Jeff Willette@TheOneJeffrey·
Noice! Our paper "Delta Attention: Fast and Accurate Sparse Attention Inference by Delta Correction" has been accepted to NeurIPS 2025! See you in San Diego (See part 2 of post for breakdown of our work) arxiv.org/pdf/2505.11254
Jeff Willette tweet media
English
2
6
16
1.2K
Jeff Willette retweetledi
Jinheon Baek
Jinheon Baek@jinheonbaek·
So excited to share that five papers have been accepted to #ACL2025 🎉 Huge thanks to all my amazing collaborators. I am especially grateful that all the internship projects (which I have worked on during my PhD journey) have all found their way to publication. Its rewarding. 😊
Jinheon Baek tweet media
English
1
5
70
3K
Jeff Willette
Jeff Willette@TheOneJeffrey·
There is a HUGE problem with sparse attention! We found it causes a misalignment of queries and keys, so even if you add a dense decode/generation phase, the sparsely encoded context can be forgotten. But wait... We can fix it with a simple correction! huggingface.co/papers/2505.11…
English
0
3
11
650
Jeff Willette
Jeff Willette@TheOneJeffrey·
@netflix I applied for an internship and got an email asking me to fill out a form on lever.co. the link in the email is broken. Your support and lever.co support both said they can't help. What else can I do? There is no one else to contact.
English
0
0
0
8
Jeff Willette
Jeff Willette@TheOneJeffrey·
@mhartington Ok Thanks! I've got a hard deadline of March 1st to submit my materials, so if you have some time before then I would be forever grateful.
English
0
0
0
0
Jeff Willette
Jeff Willette@TheOneJeffrey·
@mhartington can you check your DM's? I sent an inquiry related to nvim typescript last week.
English
1
0
0
0
Jeff Willette
Jeff Willette@TheOneJeffrey·
@gatsbyjs I have a bit of an unusual (but simple) request related to my contributions to the project, any chance I can speak to someone about it via DM or email?
English
1
0
1
0
Jeff Willette
Jeff Willette@TheOneJeffrey·
@mhartington Cool. Yeah I see all the files in question being watched but still no fixes. Ever seen anything like that before?
English
1
0
0
0
Mike Hartington
Mike Hartington@mhartington·
@delta_skelta You _can_ use `echo TSGetProjectInfoFunc()` or something similar to output all the files tsserver watches, which in theory would provide the symbols.
English
2
0
0
0
Jeff Willette
Jeff Willette@TheOneJeffrey·
@mhartington I am debugging nvim_typescript and why :TSImport is giving me no candidates. I'm down to sending raw requests to the TSServer. Is there a better way to debug this? TSServer isn't doc'd well...
English
1
0
0
0
Jeff Willette
Jeff Willette@TheOneJeffrey·
@mhartington Cool. Yeah I see the files in question are being watched but still no imports. Ever seen anything like that before?
English
0
0
0
0
Jeff Willette
Jeff Willette@TheOneJeffrey·
@mhartington Yeah im already logging it all from that wiki. Dobyou know if any of the commands to the server can dump all exported symbols?
English
1
0
0
0
Mike Hartington
Mike Hartington@mhartington·
@delta_skelta TSImport isn't always reliable, but `TSGetCodeFix` is a bit better. I'd checkout github.com/Microsoft/Type… for some details on how to log it. Most of the time it means that the project doesnt export proper types, or that tsserver cant resolve a symbol. This rarely happens though
English
1
0
0
0
Jeff Willette
Jeff Willette@TheOneJeffrey·
@mwitkow does grpc-proxy allow regular gRPC calls as well as grpc-web? or do they have to be on different servers?
English
0
0
0
0