Dan Deutsch

92 posts

@_danieldeutsch

Research Scientist at Google Translate working on text generation evaluation

San Francisco · Joined September 2012
91 Following · 600 Followers
Pinned Tweet
Dan Deutsch retweeted
Vilém Zouhar @ EACL (@zouharvi)
Machine translation is tough to evaluate, partly because most of what you throw at it is too easy. That doesn't at all mean that translation is solved; we're just not doing a good job finding interesting inputs.
Replies: 1 · Retweets: 2 · Likes: 16 · Views: 609
Dan Deutsch retweeted
John Hewitt (@johnhewtt)
Come do a PhD with me at Columbia! My lab tackles basic problems in alignment, interpretability, safety, and capabilities of language systems. If you love adventuring in model internals and behaviors---to understand and improve---let's do it together! pic: a run in central park
Replies: 13 · Retweets: 127 · Likes: 952 · Views: 77K
Dan Deutsch retweeted
Eleftheria Briakou (@ebriakou)
🗺️ Are we making our #LLMs multilingual, or anglocentric? Much work brings languages closer to English, but that comes at the cost of crucial #cultural nuance. @h__j___han tackles this trade-off with surgical steering, adapting LLMs to cultural contexts at inference time.
HyoJung Han (@h__j___han)

Lots of work on cross-lingual alignment encourages multilingual LLMs to generalize knowledge across languages. But this push for uniformity creates a tension: what happens to knowledge that should remain local? We look into this trade-off of transfer and cultural erasure:🧵

Replies: 0 · Retweets: 10 · Likes: 51 · Views: 8.9K
Dan Deutsch retweeted
Markus Freitag (@markuseful)
Our Google Translate team is bringing a strong presence to #ACL2025 in Vienna this week! 🇦🇹 My group is excited to present several of our latest papers. 👇 Don't miss them!
Replies: 1 · Retweets: 5 · Likes: 53 · Views: 3.2K
Dan Deutsch retweeted
Yusuf Kocyigit (@mykocyigit)
Thrilled to share our latest findings on data contamination, from my internship at @Google! We trained almost 90 models at the 1B and 8B scales with various contamination types, using machine translation as our task, and analyzed the impact of contamination. arxiv.org/abs/2501.18771
Replies: 3 · Retweets: 18 · Likes: 85 · Views: 12K
Sasha Rush (@srush_nlp)
Open coffee schedule in downtown SF on Thursday. Interested in chatting about reasoning, cheap scaling, academia/teaching, webgpu/slang. Also trying to play some tennis... calendly.com/srush-research…
Replies: 3 · Retweets: 5 · Likes: 75 · Views: 13.2K
Dan Deutsch retweeted
Jurik Juraska (@JurikJuraska)
🚀 We have just released bfloat16 variants of all 3 MetricX-24 models, offering nearly identical performance to their float32 counterparts, but with a 50% smaller memory footprint. ✨ We hope this makes the XL and XXL models more accessible! 🔗 GitHub: github.com/google-researc…
Jurik Juraska (@JurikJuraska)

🌐 Meet MetricX-24, our SOTA machine translation evaluation metric and a successor to the successful MetricX-23. 🚀 Now open-source in PyTorch/Transformers! 🎉 Ready to take this top performer in the WMT24 Metrics Shared Task for a spin? 🔗 Code: github.com/google-researc…

Replies: 0 · Retweets: 2 · Likes: 2 · Views: 353
Dan Deutsch retweeted
Jurik Juraska (@JurikJuraska)
🌐 Meet MetricX-24, our SOTA machine translation evaluation metric and a successor to the successful MetricX-23. 🚀 Now open-source in PyTorch/Transformers! 🎉 Ready to take this top performer in the WMT24 Metrics Shared Task for a spin? 🔗 Code: github.com/google-researc…
Replies: 1 · Retweets: 6 · Likes: 18 · Views: 2.3K