Jennifer Hsia

21 posts

Jennifer Hsia

Jennifer Hsia

@jen_hsia

PhD student @mldcmu | Prev. @PrincetonCS

Pittsburgh, PA Katılım Aralık 2021
180 Takip Edilen181 Takipçiler
Sabitlenmiş Tweet
Jennifer Hsia
Jennifer Hsia@jen_hsia·
1/6 Retrieval is supposed to improve generation in RAG systems. But in practice, adding more documents can hurt performance, even when relevant ones are retrieved. We introduce RAGGED, a framework to measure and diagnose when retrieval helps and when it hurts.
Jennifer Hsia tweet media
English
1
23
105
10.2K
Jennifer Hsia retweetledi
Fahim Tajwar
Fahim Tajwar@FahimTajwar10·
Are we done with new RL algorithms? Turns out we might have been optimizing the wrong objective. Introducing MaxRL, a framework to bring maximum likelihood optimization to RL settings. Paper + code + project website: zanette-labs.github.io/MaxRL/ 🧵 1/n
English
14
161
806
205.8K
Jennifer Hsia retweetledi
Yuda Song @ ICLR 2026
Yuda Song @ ICLR 2026@yus167·
RL on LLMs inefficiently uses one scalar per rollout. But users regularly give much richer feedback: "make it formal," "step 3 is wrong." Can we train LLMs on this human-AI interaction? We introduce RL from Text Feedback, with 1) Self-Distillation; 2) Feedback Modeling (1/n) 🧵
Yuda Song @ ICLR 2026 tweet media
English
14
101
601
106.5K
Jennifer Hsia
Jennifer Hsia@jen_hsia·
5/6 Use RAGGED to analyze RAG systems with confidence: ✅ Detect fragile readers and unstable retrieval depths ✅ Compare models and setups using consistent, quantitative signals ✅ Guide training, evaluation, and design of more robust readers
English
1
0
4
340
Jennifer Hsia
Jennifer Hsia@jen_hsia·
1/6 Retrieval is supposed to improve generation in RAG systems. But in practice, adding more documents can hurt performance, even when relevant ones are retrieved. We introduce RAGGED, a framework to measure and diagnose when retrieval helps and when it hurts.
Jennifer Hsia tweet media
English
1
23
105
10.2K
Jennifer Hsia retweetledi
Jacob Yeung
Jacob Yeung@JacobYeung·
1/6 🚀 Excited to share that BrainNRDS has been accepted as an oral at #CVPR2025! We decode motion from fMRI activity and use it to generate realistic reconstructions of videos people watched, outperforming strong existing baselines like MindVideo and Stable Video Diffusion.🧠🎥
English
2
12
37
6.8K
Jennifer Hsia retweetledi
Daniel P Jeong
Daniel P Jeong@danielpjeong·
🧵 Are "medical" LLMs/VLMs *adapted* from general-domain models, always better at answering medical questions than the original models? In our oral presentation at #EMNLP2024 today (2:30pm in Tuttle), we'll show that surprisingly, the answer is "no". arxiv.org/abs/2411.04118
English
2
34
104
24.1K
Jennifer Hsia retweetledi
Zora Wang
Zora Wang@ZhiruoW·
Tools can empower LMs to solve many tasks. But what are tools anyway? github.com/zorazrw/awesom… Our survey studies tools for LLM agents w/ –A formal def. of tools –Methods/scenarios to use&make tools –Issues in testbeds and eval metrics –Empirical analysis of cost-gain trade-off
Zora Wang tweet media
English
2
62
212
68.7K
Jennifer Hsia
Jennifer Hsia@jen_hsia·
5/5 With RAGGED, you can easily optimize your RAG systems, analyze data slices with common features, and more. Try out our RAGGED framework and let us know what you think! GitHub: github.com/neulab/ragged Joint work w/ Afreen Shaikh, @ZhiruoW, @gneubig
English
0
0
10
668
Jennifer Hsia
Jennifer Hsia@jen_hsia·
4/5 Finding #2: Retriever-Reader synergy 🤝 The synergy between retriever and reader models can make or break your RAG system. Its effectiveness depends on the domain, question type, and reader's sensitivity to retrieval quality. RAGGED helps you pinpoint the best pairings.
English
1
0
6
710
Jennifer Hsia
Jennifer Hsia@jen_hsia·
1/5 Unleash the full power of RAG systems! 🔥 Introducing RAGGED, a framework for finding the optimal RAG configurations and bypassing common pitfalls. Dive deep into our findings: arxiv.org/pdf/2403.09040…
Jennifer Hsia tweet media
English
2
46
243
33.1K