Huda Khayrallah

842 posts

@HudaKhay

Machine Translation/#NLProc/ML Researcher at Microsoft. Past: @UCBerkeley CS ugrad; @LiltHQ research intern; @jhuCLSP/@jhuCompSci PhD

rarely on here; email me
Joined July 2012
852 Following · 1K Followers
Huda Khayrallah retweeted
HyoJung Han@h__j___han·
Lots of work on cross-lingual alignment encourages multilingual LLMs to generalize knowledge across languages. But this push for uniformity creates a tension: what happens to knowledge that should remain local? We look into this trade-off of transfer and cultural erasure:🧵
Huda Khayrallah retweeted
Eleftheria Briakou@ebriakou·
🗺️ Are we making our #LLMs multilingual, or anglocentric? Much work brings languages closer to English, but that comes at the cost of crucial #cultural nuance. @h__j___han tackles this trade-off with surgical steering, adapting LLMs to cultural contexts at inference time.
HyoJung Han@h__j___han

Lots of work on cross-lingual alignment encourages multilingual LLMs to generalize knowledge across languages. But this push for uniformity creates a tension: what happens to knowledge that should remain local? We look into this trade-off of transfer and cultural erasure:🧵

Huda Khayrallah retweeted
Suzanna Sia@suzyahyah·
Large model inference efficiency can be tackled from many angles: mixture of experts, efficient self-attention, quantisation, distillation, hardware acceleration... But what if we could completely avoid redundant computational processing over the context window?

In our NeurIPS'24 paper "Where does in-context (task-location) learning happen?", we find three distinct regions of LLM inference-time processing:
1️⃣ [Task Location]: the LLM discovers the task by reading the instructions and examples.
2️⃣ [Task Processing]: after task location, the model no longer requires any self-attention over the prompts.
3️⃣ [Task Completion]: the final layers of processing, where the model no longer requires self-attention over the query.

===> Implications for industry:
✅ ~50% computational savings (theoretical) if we avoided redundant context processing in the later layers of the model.
✅ Very sample-efficient adaptation of LLMs to task-specific models. Contrary to common wisdom on fine-tuning, LoRA layers are most effective at earlier layers of the model compared to the later ones.

===> Implications for academia:
* A new interpretability technique that progressively masks out all self-attention to the context.
* The Task Location layer is not affected by the number of prompt examples provided to the model.
* Related work with similar findings: Task Vectors (@RoeeHendel et al.) and Function Vectors (@ericwtodd et al.), providing additional supporting evidence for this phenomenon.

💻 Paper: lnkd.in/gduKF27X
Github: lnkd.in/g_q6T2kE
Models: Llama3.1-8B, Llama3.1-8B-Instruct, Starcoder2-7B, GPTN2.7B, Bloom3B
Tasks: Machine Translation (en-fr, fr-en, en-pt), Code Generation (en-py)
Huda Khayrallah retweeted
Barry Haddow@bazril·
EAMT best thesis award - closes on January 31st. Completed an MT-related PhD in 2024, in Europe, Africa, or the Middle East? Then why not submit your thesis: eamt.org/2024/11/28/the…
Huda Khayrallah retweeted
Eleftheria Briakou@ebriakou·
I’m super thrilled to have won the AMTA Best Thesis Award!! A huge thanks to the AMTA organizers for this recognition ☺️ See you all in Chicago amtaweb.org
Huda Khayrallah retweeted
Akiko I. Eriguchi@akikoe_·
On behalf of the AMTA Board of Directors, I am pleased to announce the winner of the first-ever AMTA Best Thesis Award: Dr. Eleftheria Briakou (@ebriakou) for her thesis “Detecting Fine-Grained Semantic Divergences to Improve Translation Understanding Across Languages”. [1/n]
Eleftheria Briakou@ebriakou

I’m super thrilled to have won the AMTA Best Thesis Award!! A huge thanks to the AMTA organizers for this recognition ☺️ See you all in Chicago amtaweb.org

Huda Khayrallah retweeted
Jordan Boyd-Graber@boydgraber·
I'm bummed that family obligations prevented me from presenting this epic paper. This work represented a long journey for me. I first began working on the language of Diplomacy in 2015, and I struggled for years to get funding to build a bot that could play it ...
Joy Wongkamjan@joywwong

1⃣Meta’s Cicero by Bakhtin et al. was the talk of the town when it was released in November 2022. Even major journals reported that Cicero had achieved human-level Diplomacy in strategy and negotiation. Well, had it!? Our paper has the answer: arxiv.org/pdf/2406.04643

Huda Khayrallah retweeted
Sarah Jabbour@SarahJabbour_·
My mom wants to come out of retirement. She was a software validation engineer working on human machine interfaces. She (and I) have no idea where to look. She just wants to spend time testing the things that people build. Does anyone know where she could look??
Huda Khayrallah retweeted
HyoJung Han@h__j___han·
✨XLAVS-R will be presented during today’s (August 13th) #ACL2024 poster session 4, starting at 10:30 AM. Looking forward to talking with people interested in our work!
HyoJung Han@h__j___han

🗣️XLAVS-R is accepted at #ACL2024 main! 🚀🚀 We present XLAVS-R, a cross-lingual audio-visual model for noise-robust speech perception in over 100 languages. Very happy to present our work done during my @AIatMeta internship with @ChanghanWang. arxiv.org/abs/2403.14402

Huda Khayrallah retweeted
Naomi Saphra@nsaphra·
NAACL is this week and that means you should read our "history" paper! And if you're in Mexico City then say hi to Eve Fleisig, who is presenting it!
Naomi Saphra@nsaphra

It's not the first time! A dream team of @enfleisig (human eval expert), Adam Lopez (remembers the Stat MT era), @kchonyc (helped end it), and me (pun in title) are here to teach you the history of scale crises and what lessons we can take from them. 🧵arxiv.org/abs/2311.05020

Huda Khayrallah retweeted
Armita R. Manafzadeh@armanafzadeh·
Three postdocs were too tired to go to the party on the last night of SICB this year, so we decided to order pizza to the hotel and write a paper together instead. Out in @ICB_journal now! academic.oup.com/icb/advance-ar…
Andrew K. Schulz, Ph.D.@SchulzScience_

Today in @ICB_journal - @armanafzadeh, Janneke Schwaner, and I co-led a brief article on Strategies for Organizing Interdisciplinary Events. @SICB_ @SICB_DCB_DVM If you are interested in hosting an interdisciplinary event, we hope it is helpful: doi.org/10.1093/icb/ic…

Huda Khayrallah retweeted
Elias Stengel-Eskin@EliasEskin·
🚨 Excited to share our new work on **confidence calibration** in LLMs! LLMs are often badly calibrated & overconfident, explicitly (eg. "I'm 100% sure") and implicitly, eg. giving details/authoritative tone. We address both w/ a pragmatic speaker-listener multi-agent method 🧵
Huda Khayrallah@HudaKhay·
Deadline is 6/6 for the AMTA thesis award. Apply if you finished a PhD in MT in the Americas in the last year! amtaweb.org/amta-2024-cfp-… Questions? Reach out to mtresearchers@amtaweb.org (Rebecca Knowles and Akiko Eriguchi).
Akiko I. Eriguchi@akikoe_

🏆 Thrilled to share the launch of the AMTA Best Thesis Award, which aims to highlight the achievements of a recent PhD graduate at an institution in the Americas whose thesis has focused on topics related to machine translation. [1/2]
