Dan Deutsch

92 posts

@_danieldeutsch

Research Scientist at Google Translate working on text generation evaluation

San Francisco · Joined September 2012

91 Following · 600 Followers

Pinned Tweet

Dan Deutsch @_danieldeutsch · 10 Dec

Excited to receive an Outstanding Paper award for this work at @emnlpmeeting! Thanks to my co-authors George Foster and @markuseful! Updated version available here: aclanthology.org/2023.emnlp-mai…

Dan Deutsch @_danieldeutsch

LLM-based metrics like GEMBA predict many ties, but the way that ties should be handled in Kendall’s tau for meta-evaluating metrics has been a longstanding issue. We propose an update to the meta-evaluation methodology to handle ties. arxiv.org/pdf/2305.14324…

11.9K

Dan Deutsch retweeted

Vilém Zouhar @ EACL @zouharvi · 12 Mar

Machine translation is tough to evaluate, partly because most of what you throw at it is too easy. That doesn't at all mean that translation is solved; we're just not doing a good job finding interesting inputs.

609

Dan Deutsch retweeted

John Hewitt @johnhewtt · 19 Nov

Come do a PhD with me at Columbia! My lab tackles basic problems in alignment, interpretability, safety, and capabilities of language systems. If you love adventuring in model internals and behaviors---to understand and improve---let's do it together! pic: a run in central park

127

952

77K

Dan Deutsch retweeted

Eleftheria Briakou @ebriakou · 31 Oct

🗺️ Are we making our #LLMs multilingual, or anglocentric? Much work brings languages closer to English, but that comes at the cost of crucial #cultural nuance. @h__j___han tackles this trade-off with surgical steering, adapting LLMs to cultural contexts at inference time.

HyoJung Han @h__j___han

Lots of work on cross-lingual alignment encourages multilingual LLMs to generalize knowledge across languages. But this push for uniformity creates a tension: what happens to knowledge that should remain local? We look into this trade-off of transfer and cultural erasure:🧵

8.9K

Dan Deutsch retweeted

Markus Freitag @markuseful · 27 Jul

Our Google Translate team is bringing a strong presence to #ACL2025 in Vienna this week! 🇦🇹 My group is excited to present several of our latest papers. 👇 Don't miss them!

3.2K

Dan Deutsch retweeted

Markus Freitag @markuseful · 19 Feb

Two new datasets from Google Translate targeting high and low resource languages!
WMT24++: 46 new en->xx languages to WMT24, bringing the total to 55
SMOL: 6M tokens for 115 very low-resource languages
WMT24++: huggingface.co/datasets/googl…
SMOL: huggingface.co/datasets/googl…

15.6K

Dan Deutsch retweeted

iseeaswell꩜bʂky @iseeaswell · 19 Feb

😼 SMOL DATA ALERT! 😼 Announcing SMOL, a professionally-translated dataset for 115 very low-resource languages! Paper: arxiv.org/pdf/2502.12301 Huggingface: huggingface.co/datasets/googl…

4.1K

Dan Deutsch @_danieldeutsch · 19 Feb

@shrutirij @prk_riley @esalesk @FirasTr88060642 Stephanie Winkler @BZhangGo @markuseful #nlproc #nlp #ai

239

Dan Deutsch @_danieldeutsch · 19 Feb

This project was a highly collaborative effort with many people contributing translations, evaluations, analyses, etc., so I want to thank all of my co-authors! @ebriakou @iseeaswell @marafinkels Rebecca Galor @JurikJuraska @gezakovacs Alison Lui @RicardoRei7 @jasonriesa

217

Dan Deutsch @_danieldeutsch · 19 Feb

🚨 New machine translation dataset alert! 🚨 We expanded the language coverage of WMT24 from 9 to 55 en->xx language pairs by collecting new reference translations for 46 languages in a dataset called WMT24++. Paper: arxiv.org/abs/2502.12404… Data: huggingface.co/datasets/googl…

6.8K

Dan Deutsch retweeted

Yusuf Kocyigit @mykocyigit · 7 Feb

Thrilled to share our latest findings on data contamination, from my internship at @Google! We trained almost 90 models at the 1B and 8B scales with various contamination types, using machine translation as our task, and analyzed the impact of contamination. arxiv.org/abs/2501.18771

12K

Dan Deutsch @_danieldeutsch · 14 Jan

@srush_nlp Sent you an email about tennis!

678

Sasha Rush @srush_nlp · 14 Jan

Open coffee schedule in downtown SF on Thursday. Interested in chatting about reasoning, cheap scaling, academia/teaching, webgpu/slang. Also trying to play some tennis... calendly.com/srush-research…

13.2K

Dan Deutsch retweeted

Jurik Juraska @JurikJuraska · 12 Dec

🚀 We have just released bfloat16 variants of all 3 MetricX-24 models, offering nearly identical performance to their float32 counterparts, but with a 50% smaller memory footprint. ✨ We hope this makes the XL and XXL models more accessible! 🔗 GitHub: github.com/google-researc…

Jurik Juraska @JurikJuraska

🌐 Meet MetricX-24, our SOTA machine translation evaluation metric and a successor to the successful MetricX-23. 🚀 Now open-source in PyTorch/Transformers! 🎉 Ready to take this top performer in the WMT24 Metrics Shared Task for a spin? 🔗 Code: github.com/google-researc…

353

Dan Deutsch retweeted

Jurik Juraska @JurikJuraska · 3 Dec

2.3K

Dan Deutsch @_danieldeutsch · 26 Nov

Super simple and effective way of significantly increasing the performance of your evaluation metric!

Mara Finkelstein @marafinkels

LLMs are typically evaluated w/ automatic metrics on standard test sets, but metrics + test sets are developed independently. This raises a crucial question: Can we design automatic metrics specifically to excel on the test sets we prioritize? Answer: Yes! arxiv.org/abs/2411.15387

892

Dan Deutsch @_danieldeutsch · 20 Nov

@psingh522 Unfortunately this role requires that you are enrolled in a PhD program. But there are plenty of roles at Google for Master's students that you can find on the Google Careers page buildyourfuture.withgoogle.com/internships

234

Prabhav Singh @psingh522 · 19 Nov

@_danieldeutsch Hi Dan! Are you open to thesis Master's students applying to the internship?

637

Dan Deutsch @_danieldeutsch · 12 Nov

New application link! google.com/about/careers/… I am at EMNLP/WMT this week. Please come find me if you want to learn more about this role!

Dan Deutsch @_danieldeutsch

Interested in doing research on Google Translate and Gemini? Good news! I’m hiring for full-time roles on the Google Translate Research Team! Apply here: google.com/about/careers/…

5.5K
