Barry Haddow

581 posts

@bazril

Researcher in Informatics at University of Edinburgh. Mainly working on machine translation.

Edinburgh, Scotland · Joined April 2010
658 Following · 1.2K Followers
Barry Haddow retweeted
Weixuan Wang (@WeixuanWang66)
📣 Excited to share our latest research: "Demystifying Multilingual Chain-of-Thought in Process Reward Modeling" where we explore process reward models beyond English to improve multi-step reasoning in 11 languages! Link: arxiv.org/abs/2502.12663 Code: github.com/weixuan-wang12…
Barry Haddow retweeted
HPLT (@hplt_eu)
New paper on the HPLT v2 dataset making-of:
- pipeline documentation and code
- extensive analysis of the quality and characteristics
- evaluation of the performance of language models and machine translation systems trained on it
🤓 Happy reading! arxiv.org/pdf/2503.10267
Barry Haddow retweeted
HPLT (@hplt_eu)
We are happy to announce the second release of HPLT bilingual datasets:
- 50 English-centric language pairs = 380M parallel sentences (HPLT) 🤩
- 1,275 non-English-centric language pairs = 16.7B parallel sentences (MultiHPLT) 😮
Available at the HPLT dataset catalogue and OPUS.
Barry Haddow (@bazril)
MT Summit 2025 - deadline extended! The deadline for all papers (technical/user/translator/products/projects) has been extended to February 10th. MT Summit will be in Geneva, June 23--27. mtsummit2025.unige.ch/index.html
Barry Haddow (@bazril)
EAMT best thesis award - closes on January 31st. Completed an MT-related PhD in 2024 in Europe, Africa or the Middle East? Then why not submit your thesis? eamt.org/2024/11/28/the…
Barry Haddow (@bazril)
EAMT Best thesis award - now open! Have you defended an MT-related thesis in 2024, in EMEA? Then why not submit to the prestigious EAMT BTA? eamt.org/2024/11/28/the… . Deadline: 2025-01-31
Barry Haddow retweeted
HPLT (@hplt_eu)
Join us on a new edition of the Winter School! "Pretraining Data Quality 🧐 and Multilingual Evaluation of LLMs👀" 🪂Feb. 3–5, 2025, Norway More info and registration: wiki.nlpl.eu/Community/trai… Jointly organised by @hplt_eu and the Nordic Language Processing Laboratory (NLPL)
Barry Haddow retweeted
Helsinki-NLP (@HelsinkiNLP)
The 18th MT marathon will be organized in beautiful Helsinki in the end of August, 2025. We invite you to a week-long gathering of researchers, developers and students with lectures, labs and hacking projects. More information will come - stay tuned!
Barry Haddow retweeted
Vilém Zouhar @ EACL (@zouharvi)
Have you recently used COMET for MT evaluation? ☄️
- Did you report the specific model? ≥12% of papers don't!
- Did you report the package version? Makes a difference.
- `pip install sacrecomet` generates a nice version+model signature.
Not too late for WMT/EMNLP camera-ready!
Barry Haddow retweeted
Simon Yu (@simon_ycl)
❗Are We Truly Achieving Multilingualism in LLMs or Just Relying on Translation?❗ Need multilingual instruction data and benchmarks? Just translate from English. LLM multilingualism can be easily solved! If you agree, check out our #EMNLP 2024 paper which says this is sub-optimal. arxiv.org/abs/2406.12822 🧵Below
Barry Haddow retweeted
Pedro Martins (@PedroHenMartins)
Today we release the first EuroLLM paper and models: EuroLLM-1.7B and EuroLLM-1.7B-Instruct! The EuroLLM project will develop open-weight multilingual LLMs that understand and generate text in all official EU languages. Stay tuned for the bigger and stronger EuroLLMs (9B, 22B)!
Barry Haddow retweeted
Vivek Iyer (@remorax98)
We know LLMs are poor at MT in low-resource languages (LRLs): curious how to adapt them to perform better? 🚀 Our new paper explores the interplay between scale (of MT data) and diversity (of tasks/langs) in instruction tuning in determining LLM-MT performance for LRLs💡 arxiv.org/abs/2408.12780