Dan Deutsch

92 posts

@_danieldeutsch

Research Scientist at Google Translate working on text generation evaluation

San Francisco · Joined September 2012
91 Following · 600 Followers
Pinned Tweet
Dan Deutsch retweeted
Vilém Zouhar @ EACL @zouharvi
Machine translation is tough to evaluate, partly because most of what you throw at it is too easy. That doesn't at all mean that translation is solved; we're just not doing a good job finding interesting inputs.
1 reply · 2 reposts · 16 likes · 609 views
Dan Deutsch retweeted
John Hewitt @johnhewtt
Come do a PhD with me at Columbia! My lab tackles basic problems in alignment, interpretability, safety, and capabilities of language systems. If you love adventuring in model internals and behaviors---to understand and improve---let's do it together! pic: a run in central park
13 replies · 127 reposts · 952 likes · 77K views
Dan Deutsch retweeted
Eleftheria Briakou @ebriakou
🗺️ Are we making our #LLMs multilingual, or anglocentric? Much work brings languages closer to English, but that comes at the cost of crucial #cultural nuance. @h__j___han tackles this trade-off with surgical steering, adapting LLMs to cultural contexts at inference time.
HyoJung Han @h__j___han

Lots of work on cross-lingual alignment encourages multilingual LLMs to generalize knowledge across languages. But this push for uniformity creates a tension: what happens to knowledge that should remain local? We look into this trade-off of transfer and cultural erasure:🧵

0 replies · 10 reposts · 51 likes · 8.9K views
Dan Deutsch retweeted
Markus Freitag @markuseful
Our Google Translate team is bringing a strong presence to #ACL2025 in Vienna this week! 🇦🇹 My group is excited to present several of our latest papers. 👇 Don't miss them!
1 reply · 5 reposts · 53 likes · 3.2K views
Dan Deutsch retweeted
Yusuf Kocyigit @mykocyigit
Thrilled to share our latest findings on data contamination, from my internship at @Google! We trained almost 90 models at 1B and 8B scales with various contamination types, using machine translation as our task, and analyzed the impact of contamination. arxiv.org/abs/2501.18771
3 replies · 18 reposts · 85 likes · 12K views
Sasha Rush @srush_nlp
Open coffee schedule in downtown SF on Thursday. Interested in chatting about reasoning, cheap scaling, academia/teaching, webgpu/slang. Also trying to play some tennis... calendly.com/srush-research…
3 replies · 5 reposts · 75 likes · 13.2K views
Dan Deutsch retweeted
Jurik Juraska @JurikJuraska
🚀 We have just released bfloat16 variants of all 3 MetricX-24 models, offering nearly identical performance to their float32 counterparts, but with a 50% smaller memory footprint. ✨ We hope this makes the XL and XXL models more accessible! 🔗 GitHub: github.com/google-researc…
Jurik Juraska @JurikJuraska

🌐 Meet MetricX-24, our SOTA machine translation evaluation metric and a successor to the successful MetricX-23. 🚀 Now open-source in PyTorch/Transformers! 🎉 Ready to take this top performer in the WMT24 Metrics Shared Task for a spin? 🔗 Code: github.com/google-researc…

0 replies · 2 reposts · 2 likes · 353 views
Dan Deutsch retweeted
Jurik Juraska @JurikJuraska
🌐 Meet MetricX-24, our SOTA machine translation evaluation metric and a successor to the successful MetricX-23. 🚀 Now open-source in PyTorch/Transformers! 🎉 Ready to take this top performer in the WMT24 Metrics Shared Task for a spin? 🔗 Code: github.com/google-researc…
1 reply · 6 reposts · 18 likes · 2.3K views