David Stap

131 posts

@davidstap

ai research @nx_ai_com | phd from @UvA_Amsterdam | prev @Amazon

Amsterdam, The Netherlands · Joined April 2009
1K Following · 296 Followers
David Stap retweeted
Sepp Hochreiter @HochreiterSepp
xLSTM Distillation: arxiv.org/abs/2603.15590 Near-lossless distillation of quadratic Transformer LLMs into linear xLSTM architectures enables cost- and energy-efficient alternatives without sacrificing performance. xLSTM variants of instruction-tuned Llama, Qwen, & Olmo models.
David Stap retweeted
Catherine Arnett @linguist_cat
The call for papers is out for the 5th edition of the Workshop on Multilingual Representation Learning, which will take place in Suzhou, China, co-located with EMNLP 2025! See details below!
David Stap @davidstap
8/8 📋 Key takeaway: Fine-tune with diverse language directions even when optimizing for specific translation pairs. But identify an optimal diversity threshold: too many languages can diminish performance for well-supported pairs while still benefiting less-represented ones.
David Stap @davidstap
7/8 But there's a sweet spot! When scaling beyond 132 directions to 272 directions, we found benefits plateau or even slightly decrease for well-represented language pairs, while still helping underrepresented languages.
David Stap @davidstap
🔍 How does language diversity affect LLM fine-tuning for translation? We fine-tuned LLMs and found that MORE diversity consistently improves performance, even for language pairs that less diverse models were specifically trained to handle! arxiv.org/abs/2505.13090
David Stap retweeted
Navalism @NavalismHQ
With my desire to improve everything, I destroy the moment. @naval
David Stap retweeted
Seth Aycock @sethjsa
@JeffDean x.com/sethjsa/status… Actually, we find LLMs learn most or all of their translation ability from the parallel sentences in the book, not the grammar. And we can predict translation performance just from the prompts' test-set vocabulary coverage! But we do find that grammar can help *linguistic* tasks.
Seth Aycock @sethjsa

Our work “Can LLMs Really Learn to Translate a Low-Resource Language from One Grammar Book?” is now on arXiv! arxiv.org/abs/2409.19151 - in collaboration with @davidstap, @diwuNLP, @c_monz , and Khalil Sima'an from @illc_amsterdam and @ltl_uva 🧵

David Stap retweeted
tobi lutke @tobi
What overregulation feels like. AI progress is now skipping Europe.
David Stap @davidstap
1/4 #ACL2024 Excited to share our new paper on the impact of fine-tuning on the qualitative advantages of LLMs in machine translation! 🤖 Our work highlights the importance of preserving LLM capabilities during fine-tuning. arxiv.org/abs/2405.20089
David Stap retweeted
David Ifeoluwa Adelani 🇳🇬
Are you working in the area of multilingual NLP with some reviewing experience (*ACL, EMNLP, NeurIPS, ICLR, COLING, LREC)? We are looking for reviewers for the Multilingual Representation Learning Workshop co-located with EMNLP. Please register in the form 👇
MRL @mrl2024_emnlp

We are looking for reviewers to join our program committee and help prepare high quality reviews for paper submissions. If you're interested please fill out this form: forms.gle/VVYbYnjKBJrAGb…
