Shenglai Zeng@NeurIPS-2025

28 posts

Shenglai Zeng@NeurIPS-2025 banner
Shenglai Zeng@NeurIPS-2025

Shenglai Zeng@NeurIPS-2025

@snowzeng2

PhD student in MSU DSE lab @dse_msu. Intern at @AmazonScience Research interests: RAG, Agentic AI, LLM privacy/safety

East Lansing, MI Katılım Ağustos 2022
318 Takip Edilen165 Takipçiler
Shenglai Zeng@NeurIPS-2025
🎉 Heading to NeurIPS 2025 in San Diego (Dec 2-7)! Looking forward to great discussions and exploring collaboration & internship opportunities related to LLMs, RAG(Agentic) systems, and trustworthy AI! See you in San Diego! 🌊
English
1
0
3
111
Shenglai Zeng@NeurIPS-2025
🎯 Stage 1: Attribute-based extraction to preserve key contextual information 🤖 Stage 2: Agent-based iterative refinement to enhance privacy protection ✅ Results: Comparable performance to original data while substantially reducing privacy risks arxiv.org/abs/2406.14773
English
0
0
0
27
Shenglai Zeng@NeurIPS-2025
Paper 2: "Mitigating Privacy Issues in RAG via Pure Synthetic Data" 🛡️ Privacy-preserving solution: SAGE - a two-stage synthetic data generation framework for RAG
English
1
0
0
35
Shenglai Zeng@NeurIPS-2025
Shenglai Zeng@NeurIPS-2025@snowzeng2·
Heading to #NAACL2025 in Albuquerque! 🎉 Presenting our paper on knowledge checking in RAG systems on May 2, 9:00-10:30 AM in Mesilla. Stop by to chat about how representation can help LLMs better integrate external knowledge! Coffee chats welcome! ☕ arxiv.org/abs/2411.14572
Shenglai Zeng@NeurIPS-2025@snowzeng2

🎯 Detect & Filter RAG Contexts with LLM Representations Excited to share our work on Representation-based knowledge checking in #RAG! arxiv.org/abs/2411.14572 We show how LLM representations detect & filter misleading/unhelpful knowledge and improve performance.

English
0
0
2
208
Shenglai Zeng@NeurIPS-2025
2. Representation vs Traditional Methods: Traditional methods (e.g., answer-based or probability-based) struggle with these tasks, while representation-based approaches (e.g., rep-PCA and rep-Con) achieve superior performance by leveraging distinct patterns in representations.
English
1
0
0
87
Shenglai Zeng@NeurIPS-2025
🎯 Detect & Filter RAG Contexts with LLM Representations Excited to share our work on Representation-based knowledge checking in #RAG! arxiv.org/abs/2411.14572 We show how LLM representations detect & filter misleading/unhelpful knowledge and improve performance.
Shenglai Zeng@NeurIPS-2025 tweet mediaShenglai Zeng@NeurIPS-2025 tweet media
English
1
2
2
534
Shenglai Zeng@NeurIPS-2025 retweetledi
Yuping Lin
Yuping Lin@yuplin2333·
✨ Excited to share our new preprint "Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis"! arxiv.org/abs/2406.10794 🔍 We delve into why some jailbreak attacks succeed by exploring harmful and harmless prompts in the LLM's representation space.
English
1
6
5
907
Shenglai Zeng@NeurIPS-2025
Our results show that SAGE achieves comparable performance to using original data while significantly reducing privacy risks! 📊✨
English
0
0
0
133
Shenglai Zeng@NeurIPS-2025
SAGE works in two steps: 1️⃣Attribute-based extraction and generation: Identifies and generates synthetic data based on key attributes. 2️⃣Agent-based refinement: Ensures privacy through iterative assessment and refinement by privacy and rewriting agents.
English
1
0
0
175
Shenglai Zeng@NeurIPS-2025
Shenglai Zeng@NeurIPS-2025@snowzeng2·
2️⃣ Mitigation Efforts: We've explored naive defenses such as summarization and retrieval thresholds. These methods help mitigate risks but don't completely resolve the issue, indicating the gravity of privacy risks in RAG.
English
1
0
1
309
Shenglai Zeng@NeurIPS-2025
Shenglai Zeng@NeurIPS-2025@snowzeng2·
🔒💡 Excited to share our latest #RAG #Privacy research! We've uncovered two pivotal aspects: 1️⃣ Privacy challenges within RAG's own data 2️⃣ RAG's potential to safeguard training data 🔍 Discover the dual-edged sword of RAG technology in our paper arxiv.org/pdf/2402.16893
Shenglai Zeng@NeurIPS-2025 tweet mediaShenglai Zeng@NeurIPS-2025 tweet media
English
1
6
6
1.8K