Jennifer Hsia

21 posts

Jennifer Hsia

@jen_hsia

PhD student @mldcmu | Prev. @PrincetonCS

Pittsburgh, PA Katılım Aralık 2021

180 Takip Edilen181 Takipçiler

Sabitlenmiş Tweet

Jennifer Hsia@jen_hsia·16 Tem

1/6 Retrieval is supposed to improve generation in RAG systems. But in practice, adding more documents can hurt performance, even when relevant ones are retrieved. We introduce RAGGED, a framework to measure and diagnose when retrieval helps and when it hurts.

English

105

10.2K

Jennifer Hsia retweetledi

Fahim Tajwar@FahimTajwar10·5 Şub

Are we done with new RL algorithms? Turns out we might have been optimizing the wrong objective. Introducing MaxRL, a framework to bring maximum likelihood optimization to RL settings. Paper + code + project website: zanette-labs.github.io/MaxRL/ 🧵 1/n

English

161

806

205.8K

Jennifer Hsia retweetledi

Yuda Song @ ICLR 2026@yus167·3 Şub

RL on LLMs inefficiently uses one scalar per rollout. But users regularly give much richer feedback: "make it formal," "step 3 is wrong." Can we train LLMs on this human-AI interaction? We introduce RL from Text Feedback, with 1) Self-Distillation; 2) Feedback Modeling (1/n) 🧵

English

101

601

106.5K

Jennifer Hsia@jen_hsia·16 Tem

Excited to share our work at #ICML2025! 📍 East Exhibition Hall A-B E-1707 🗓️ Wed July 16, 11am–1:30pm 📄 github.com/neulab/ragged

Jennifer Hsia@jen_hsia

English

801

Jennifer Hsia@jen_hsia·16 Tem

6/6 Thankful for the collaboration w/ Afreen Shaikh, @ZhiruoW, @gneubig! 🔗Paper: arxiv.org/abs/2403.09040 🔗Project page: github.com/neulab/ragged

English

331

Jennifer Hsia@jen_hsia·16 Tem

5/6 Use RAGGED to analyze RAG systems with confidence: ✅ Detect fragile readers and unstable retrieval depths ✅ Compare models and setups using consistent, quantitative signals ✅ Guide training, evaluation, and design of more robust readers

English

340

Jennifer Hsia@jen_hsia·16 Tem

English

105

10.2K

Jennifer Hsia retweetledi

Jacob Yeung@JacobYeung·12 Haz

1/6 🚀 Excited to share that BrainNRDS has been accepted as an oral at #CVPR2025! We decode motion from fMRI activity and use it to generate realistic reconstructions of videos people watched, outperforming strong existing baselines like MindVideo and Stable Video Diffusion.🧠🎥

English

6.8K

Jennifer Hsia retweetledi

Daniel P Jeong@danielpjeong·14 Kas

🧵 Are "medical" LLMs/VLMs *adapted* from general-domain models, always better at answering medical questions than the original models? In our oral presentation at #EMNLP2024 today (2:30pm in Tuttle), we'll show that surprisingly, the answer is "no". arxiv.org/abs/2411.04118

English

104

24.1K

Jennifer Hsia retweetledi

Emily Byun@yewonbyun_·30 Nis

Estimating notions of unfairness/inequity is hard as it requires that data captures all features that influenced decision-making. But what if it doesn't? In our work (arxiv.org/abs/2403.14713), we answer this question w/ @dylanjsam @MichaelOberst @zacharylipton @brwilder

English

15.9K

Jennifer Hsia retweetledi

Pratyush Maini@pratyushmaini·25 Nis

1/What does it mean for an LLM to “memorize” a doc? Exactly regurgitating a NYT article? Of course. Just training on NYT?Harder to say We take big strides in this discourse w/*Adversarial Compression* w/@A_v_i__S @zhilifeng @zacharylipton @zicokolter 🌐:locuslab.github.io/acr-memorizati…🧵

English

150

48.5K

Jennifer Hsia retweetledi

Pratyush Maini@pratyushmaini·12 Nis

1/ 🥁Scaling Laws for Data Filtering 🥁 TLDR: Data Curation *cannot* be compute agnostic! In our #CVPR2024 paper, we develop the first scaling laws for heterogeneous & limited web data. w/@goyalsachin007 @zacharylipton @AdtRaghunathan @zicokolter 📝:arxiv.org/abs/2404.07177

English

321

87.4K

Jennifer Hsia retweetledi

Zora Wang@ZhiruoW·20 Mar

Tools can empower LMs to solve many tasks. But what are tools anyway? github.com/zorazrw/awesom… Our survey studies tools for LLM agents w/ –A formal def. of tools –Methods/scenarios to use&make tools –Issues in testbeds and eval metrics –Empirical analysis of cost-gain trade-off

English

212

68.7K

Jennifer Hsia retweetledi

Sri Vardhamanan@SVardhamanan·17 Mar

Quite an Interesting analysis by Jennifer Hsia, Afreen Shaikh, @ZhiruoW & @gneubig : arxiv.org/abs/2403.09040 Awaiting the RAGGED repo to be published: github.com/neulab/ragged

English

6.1K

Jennifer Hsia@jen_hsia·18 Mar

5/5 With RAGGED, you can easily optimize your RAG systems, analyze data slices with common features, and more. Try out our RAGGED framework and let us know what you think! GitHub: github.com/neulab/ragged Joint work w/ Afreen Shaikh, @ZhiruoW, @gneubig

English

668

Jennifer Hsia@jen_hsia·18 Mar

4/5 Finding #2: Retriever-Reader synergy 🤝 The synergy between retriever and reader models can make or break your RAG system. Its effectiveness depends on the domain, question type, and reader's sensitivity to retrieval quality. RAGGED helps you pinpoint the best pairings.

English

710

Jennifer Hsia@jen_hsia·18 Mar

1/5 Unleash the full power of RAG systems! 🔥 Introducing RAGGED, a framework for finding the optimal RAG configurations and bypassing common pitfalls. Dive deep into our findings: arxiv.org/pdf/2403.09040…

English

243

33.1K

Keşfet

@ZhiruoW @gneubig @dylanjsam @MichaelOberst @zacharylipton @brwilder @A_v_i__S @zhilifeng