Yining Lu

138 posts

Yining Lu

@Yining__Lu

Second year CS PhD student @NotreDame | Prev: @amazon @JHUCLSP 🦋: https://t.co/gPXvdPuesy

South Bend, IN Katılım Aralık 2019

591 Takip Edilen232 Takipçiler

Sabitlenmiş Tweet

Yining Lu@Yining__Lu·11 Nis

Thrilled to share that I'll start my Ph.D. at @ND_CSE this fall, working with @Meng_CS. I am so grateful for the sincere guidance from my current advisor, @DanielKhashabi, and for the unconditional support I received from my family, friends, and collaborators over the past years!

English

7.3K

Yining Lu retweetledi

Daniel Khashabi 🕊️@DanielKhashabi·12 Şub

LLMs continue to struggle with long-context tasks—such as needle-in-a-haystack problems—because of “positional bias.” What can we do if we only have 𝘣𝘭𝘢𝘤𝘬-𝘣𝘰𝘹 access to the model? (i.e., we can’t modify the model weights or attention patterns, as is often the case with API models.) We introduce ⭐𝐆𝐨𝐥𝐝-𝐏𝐚𝐧𝐧𝐢𝐧𝐠⭐, a black-box Bayesian framework that, at inference time, strategically and iteratively shuffles documents to overcome positional bias. Specifically, it searches over long contexts by (i) reordering documents to concentrate high-belief items in highly “diagnostic” positions, and (ii) updating beliefs about document relevance from model outputs. We show that GP provably identifies a target among N documents in O(log N) rounds, ensuring scalability to many-document settings. More in the paper: arxiv.org/pdf/2510.09770

English

6.3K

Yining Lu@Yining__Lu·9 Şub

Our code is available at github.com/yining610/ctwa. Huge thanks to my advisor @Meng_CS for his support on this project.

English

Yining Lu@Yining__Lu·9 Şub

Our method, CTWA, effectively mitigates cross-objective interference compared to others. Competing methods either quickly sacrifice accuracy to achieve superficially high conciseness and clarity (e.g., GradNorm in 1a, Linear and Dynamic weighting in 1b), or trying to maintain high accuracy while overlooking the improvement of others (e.g., Lagrangian in 1a and PAMA in 1b). In contrast, CTWA achieves strong, balanced performance across all three objectives.

English

Yining Lu@Yining__Lu·9 Şub

Why does improving one objective in multi-objective RL sometimes hurt others, even when they shouldn't conflict from the MOO and MTL sense? And why only on certain models? 🤔 This isn't Pareto tradeoffs. It's 𝐜𝐫𝐨𝐬𝐬-𝐨𝐛𝐣𝐞𝐜𝐭𝐢𝐯𝐞 𝐢𝐧𝐭𝐞𝐫𝐟𝐞𝐫𝐞𝐧𝐜𝐞 in LLM alignment. We uncover this in arxiv.org/abs/2602.06869 by - answering why and when this model-dependent interference actually happens. - conducting the first systematic benchmark of classic MOO/MTL methods for LLM alignment. - proposing CTWA (Covariance-Targeted Weight Adaptation), our solution that actually works across the board. We hope this work could provide actionable insights for multi-objective alignment for LLMs. Details in the thread below 👇

English

658

Yining Lu retweetledi

Zilong Wang@zlwang_cs·14 Oca

Awesome work! 🧠 GDPO solves the signal collapse issue beautifully. Our Dynamic Reward Weighting tackles the optimization side: since objectives vary in difficulty, static weights often waste training effort or get stuck. Both findings highlight that simple linear scalarization is just not enough for complex alignment. arxiv.org/pdf/2509.11452

Shizhe Diao@shizhediao

RLVR is powerful — but how do you train with multiple rewards effectively? 🤔 🎯GDPO (not GRPO) is coming. We introduce Group reward-Decoupled Normalization Policy Optimization (GDPO), a new multi-reward RL algorithm that consistently improves per-reward convergence over GRPO across a wide range of tasks. (1/n)

English

9.3K

Yining Lu@Yining__Lu·14 Kas

Thanks for the question! In real-time queries, blockchain interactions are limited to batched reads, which are similar to querying data from a conventional service. As a result, they have minimal impact on the RAG process when generating responses. For time-consuming blockchain updates, we plan to handle them asynchronously in batches to improve the user experience.

English

Karan Jagtiani@karanjagtiani04·12 Kas

@Yining__Lu Nice to see blockchain used for data reliability in RAG. How do you handle latency impacts in real-time queries?

English

Yining Lu@Yining__Lu·12 Kas

📣 My first system paper 📣 We built a decentralized RAG system that solves data reliability challenges in real-world settings. The sources provided by each data owner will be securely managed and scored on the blockchain. 🚀 one-line command to deploy: github.com/yining610/Reli…

Meng Jiang@Meng_CS

Decentralized RAG allows your database to benefit all LLM clients. On the other side, not all data sources are reliable. Managing source reliability on blockchain can avoid third-party manipulation. Introducing dRAG + Blockchain + Truth Discovery: arxiv.org/abs/2511.07577

English

542

Yining Lu@Yining__Lu·12 Kas

Paper link: arxiv.org/abs/2511.07577. Huge thanks to all my amazing coauthors: Wenyi Tang, Max Johnson, Prof. Taeho Jung, and Prof. @Meng_CS.

English

187

Yining Lu retweetledi

Meng Jiang@Meng_CS·12 Kas

English

1.3K

Yining Lu@Yining__Lu·24 Eki

@jackjingyuzhang @AmazonScience Congrats Jack!

English

Jack Jingyu Zhang@jackjingyuzhang·23 Eki

I’m super thrilled and honored to be named an Amazon AI PhD Fellow 💫 Huge thanks to @AmazonScience for generously supporting our research at JHU! We’ll be advancing AI alignment in collaboration with folks at Amazon.

Rohit Prasad@RohitPrasadAI

Excited to announce @amazon's new AI PhD Fellowship Program supporting 100+ students across 9 universities like Carnegie Mellon, MIT & Stanford. Fellows will be paired with senior scientists working in related fields, plus receive financial support and AWS credits for research. Learn more: amazon.science/news/amazon-la…

English

11.7K

Yining Lu@Yining__Lu·15 Eki

Very interesting approach to recursive decomposition for LLM inputs with python code! It reminds me of our early experiments for aclanthology.org/2025.acl-long.…, where we used AMR to understand long context and guide the recursive decomposition. looking forward to seeing more details and results from your future works!

alex zhang@a1zhang

We implement this as a Jupyter-like REPL environment: The key idea is to put the user’s prompt in a Python variable and give the LLM a REPL loop where it can try to understand the prompt, without directly reading the whole content. The “root” LM interacts with the environment by writing code and looking at the outputs of each cell, and can recursively call LM calls inside this REPL environment to navigate its context. This is far more general than any “chunking” strategy. We argue that you should let the LLM itself decide how to best poke around, decompose, and recursively process the long prompt.

English

764

Yining Lu retweetledi

Zhengping Jiang@zhengping_jiang·10 Eki

🚀 Excited to share our COLM25 paper, “Always Tell Me The Odds”! You can train an LLM to be a generic subjective probability estimator — e.g., producing fine-grained probabilities for everyday situations that go far beyond coarse confidence scores.

Liaoyaqi Wang@LiaoyaqiW

Title: Always Tell Me The Odds: Fine-grained Conditional Probability Estimation 📄Paper: openreview.net/pdf?id=xhDcG8q… 🤗Models: huggingface.co/Zhengping/cond… 💻Code: github.com/zipJiang/decod… 🤝Coauthor: @zhengping_jiang @anqi_liu33 @ben_vandurme

English

494

Yining Lu retweetledi

Arda Uzunoğlu@aardauzunoglu·1 Eki

🛑 What's the Flaw of Averages? 📄: arxiv.org/abs/2509.25671 We’re in an evaluation crisis. Benchmarks are saturating, creating a false sense that tasks are solved. As training/eval chase these sets, plateaued averages hide shortcutting and distributional skew. 🧵1/7

English

12.4K

Yining Lu@Yining__Lu·1 Eki

Very insightful blog! Some claims like "correct trade-off between the conflicting objectives at different states is precisely given by the distribution" were also found in our recent work (arxiv.org/pdf/2509.11452, limitation section) that different pre-trained models exhibit distinct tolerance to trade-offs between conflicting objectives.

Nan Jiang@nanjiang_cs

My 3rd blogpost on PG, the topic I am least familiar with but get asked a lot, so I thought I'd just put together the very limited stuff I know on this topic. Somehow the post gets cynical from time to time🙃 nanjiang.cs.illinois.edu/2025/09/29/pg.…

English

1.1K

Keşfet

@Meng_CS @jackjingyuzhang @AmazonScience @elonmusk @BarackObama @taylorswift13 @cristiano @BillGates