Colin Cherry

136 posts

Colin Cherry

Colin Cherry

@ColinCherry

NLP Researcher; Twitter lurker

Montréal, Québec Katılım Kasım 2009
187 Takip Edilen516 Takipçiler
Colin Cherry retweetledi
Bryan Li
Bryan Li@bryanlics·
Externally retrieving knowledge empowers LLMs for domain-adapted MT ⚖️🩺. But how is knowledge best represented, and how viable is generating it from an LLM itself? Our @GoogleAI paper investigates these questions through a careful experimental setup 📜. arxiv.org/abs/2503.05010
English
1
2
6
442
Colin Cherry retweetledi
NAACL HLT 2027
NAACL HLT 2027@naaclmeeting·
The call for Diversity and Inclusion Subsidies is out: #NAACL2025" target="_blank" rel="nofollow noopener">2025.naacl.org/calls/dei_subs…
English
0
7
10
2.1K
Colin Cherry retweetledi
Mara Finkelstein
Mara Finkelstein@marafinkels·
LLMs are typically evaluated w/ automatic metrics on standard test sets, but metrics + test sets are developed independently. This raises a crucial question: Can we design automatic metrics specifically to excel on the test sets we prioritize? Answer: Yes! arxiv.org/abs/2411.15387
Mara Finkelstein tweet media
English
4
11
49
11.7K
Colin Cherry retweetledi
NAACL
NAACL@naacl·
Thank you to those who participated in our recent all-member vote regarding our name change. The change is happening! We are: The Nations of the Americas Chapter of the Association for Computational Linguistics! Announcement 👉 naacl.org/posts/2024-10-…
English
1
13
64
3.2K
Colin Cherry retweetledi
Dan Deutsch
Dan Deutsch@_danieldeutsch·
Interested in doing research on Google Translate and Gemini? Good news! I’m hiring for full-time roles on the Google Translate Research Team! Apply here: google.com/about/careers/…
English
3
82
248
38.3K
Colin Cherry retweetledi
Eleftheria Briakou
Eleftheria Briakou@ebriakou·
[1/5] Are verbose #LLM translations skewing evaluation results? TLDR: Yes! Our recent work dives into the prevalence and impact of LLM verbosity in automatic and human evaluations. 📎 Paper: arxiv.org/pdf/2410.00863
Eleftheria Briakou tweet media
English
1
10
48
4.5K
Colin Cherry retweetledi
Eleftheria Briakou
Eleftheria Briakou@ebriakou·
Translation is a complex task involving pre-translation research and post-translation stages. Can #LLMs handle this process step-by-step, relying solely on their internal knowledge? ✨We show that decomposing the translation process significantly improves #Gemini translation quality of long-form texts across all #WMT24 languages! 📜arxiv.org/pdf/2409.06790
Eleftheria Briakou tweet media
English
2
13
69
6.6K
Colin Cherry retweetledi
Mara Finkelstein
Mara Finkelstein@marafinkels·
🥳 LLMs are changing the game, even for datasets! NewsPaLM, a publicly released LLM-generated dataset, outperforms larger web-crawled corpora for MT. It includes sentence & paragraph-level, MBR-decoded data. See paper for more, incl. LLM self-distillation. arxiv.org/abs/2408.06537
English
1
8
36
3.5K
Colin Cherry retweetledi
Rishabh Agarwal
Rishabh Agarwal@agarwl_·
[New paper] If you are sampling multiple outputs from a teacher LLM (e.g., Gemini 1.5 GPT), ranking them, and fine-tuning the student on the best output, you can do better. Simple idea: Fine-tune / Distill on the top-k outputs instead. Consistent gains on machine translation.
Rishabh Agarwal tweet media
English
4
26
186
20.7K