DSE Lab @ MSU

223 posts

DSE Lab @ MSU banner
DSE Lab @ MSU

DSE Lab @ MSU

@dse_msu

Data Science and Engineering Lab Director: Dr. Jiliang Tang (@tangjiliang)

East Lansing, MI, USA Katılım Kasım 2016
215 Takip Edilen485 Takipçiler
DSE Lab @ MSU
DSE Lab @ MSU@dse_msu·
🚀 Excited to announce our new survey on Retrieval-Augmented Generation with Graphs (GraphRAG)! How do graphs transform Retrieval-Augmented Generation(RAG)? By encoding relational and domain knowledge, GraphRAG is unlocking new possibilities across real-world applications! 🌐
English
1
3
4
618
DSE Lab @ MSU retweetledi
Shenglai Zeng@NeurIPS-2025
🎯 Detect & Filter RAG Contexts with LLM Representations Excited to share our work on Representation-based knowledge checking in #RAG! arxiv.org/abs/2411.14572 We show how LLM representations detect & filter misleading/unhelpful knowledge and improve performance.
Shenglai Zeng@NeurIPS-2025 tweet mediaShenglai Zeng@NeurIPS-2025 tweet media
English
1
2
2
534
DSE Lab @ MSU retweetledi
Yuping Lin
Yuping Lin@yuplin2333·
✨ Excited to share our new preprint "Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis"! arxiv.org/abs/2406.10794 🔍 We delve into why some jailbreak attacks succeed by exploring harmful and harmless prompts in the LLM's representation space.
English
1
6
5
907
DSE Lab @ MSU retweetledi
Wenzhuo Tang
Wenzhuo Tang@WenzhuoTang·
🙋‍♂️What is the wildest dream for graph foundation models? 🎯Graph across domains → a single model → all the downstream 🙋‍♀️Can we achieve that? ✅Yes! UniAug: Cross-Domain Graph Data Scaling with Diffusion Models 📃arxiv.org/pdf/2406.01899
Wenzhuo Tang tweet media
English
1
6
7
1.8K
DSE Lab @ MSU retweetledi
Haitao Mao
Haitao Mao@haitao_mao_·
arxiv.org/pdf/2402.02212… "A Data Generation Perspective to the Mechanism of In-Context Learning". We investigate the mechanism of In-context learning which helps ground debate on whether LLM can achieve intelligence to whether LLM can learn new data generation function in context
English
2
5
23
4.7K
DSE Lab @ MSU retweetledi
Jie Ren
Jie Ren@rjthuer·
Exciting News! Our new paper on memorization in text-to-image diffusion is now available. We delve into the understanding of memorization via attention, and throw a light on the internal model behavior when memorization happens. Please find our paper at arxiv.org/abs/2403.11052
English
0
5
4
1.3K
DSE Lab @ MSU retweetledi
Jie Ren
Jie Ren@rjthuer·
Our paper for LLM watermark is accepeted by NAACL findings! We proposed a new method to strengthen the robustness of watermark agains paraphrase using the semantics. This is very meaningful factor for the practical application! Please find the paper at openreview.net/forum?id=hbMR7…
English
0
4
3
851
DSE Lab @ MSU retweetledi
Jiayuan Ding
Jiayuan Ding@JiayuanDing·
Exciting News! Our DANCE version 1, "DANCE: a deep learning library and benchmark platform for single-cell analysis" is now finally published in Genome Biology (@GenomeBiology ) 🎉 !!! DANCE has impacted the field, and got 290+ GitHub stars 🌟 before its official publication!
Jiayuan Ding tweet media
English
2
13
28
7.8K
DSE Lab @ MSU retweetledi
evo-devo
evo-devo@Xiaojie_Qiu·
With the imaging-based spatial transcriptomics such as MERFISH, seqFISH, CosMx SMI, Xenium and others, have you ever wondered how we can leverage their subcellular spatial information? Check out our latest preprint on Focus by Qiaolin and Jiayuan @JiayuanDing , two talent students, to find out how we approach this problem: biorxiv.org/content/10.110…. Focus is a state-of-the-art graph contrastive learning based approach to properly model RNA subcellular spatial distribution that dramatically improves cell type annotation and reveals critical molecular pathways that were not possible before. Specifically, Focus first constructs gene neighborhood networks based on the subcellular colocalization relationship of RNA transcripts. Next the subcellular graph of each cell can be augmented by adding important edges and nodes or removing trivial edges and nodes. Focus then aims to maximize the similarity between positive pairs from two augmented views of the same cell and minimize the similarity between negative pairs from different cells within a common batch. Guided by a limited amount of labeled data, Focus is capable of assigning cell type identities and revealing intricate cell type-specific subcellular spatial gene patterns and providing interpretable subcellular gene analysis, such as defining the gene importance score. Focus is still in its prototype stage but we are excited about this direction and will continue to improve Focus and extend it to many other settings. In the meantime, please let us know if you may have any comments or suggestions! As I just started my lab at Stanford, we are excited about many collaboration opportunities from the Bay area and others as well!
evo-devo tweet media
English
4
19
99
15.1K
DSE Lab @ MSU
DSE Lab @ MSU@dse_msu·
1/2🚀Excited to share our latest #RAG #privacy research! We’ve uncovered two pivotal aspects: 1⃣Privacy challenges within RAG’s own data- up to 50% retrieval data could be leaked 2⃣RAG’s potential to safeguard training data-reducing the tendency of LLM outputing memorized data
English
1
0
5
327
DSE Lab @ MSU retweetledi
Shenglai Zeng@NeurIPS-2025
Shenglai Zeng@NeurIPS-2025@snowzeng2·
🔒💡 Excited to share our latest #RAG #Privacy research! We've uncovered two pivotal aspects: 1️⃣ Privacy challenges within RAG's own data 2️⃣ RAG's potential to safeguard training data 🔍 Discover the dual-edged sword of RAG technology in our paper arxiv.org/pdf/2402.16893
Shenglai Zeng@NeurIPS-2025 tweet mediaShenglai Zeng@NeurIPS-2025 tweet media
English
1
6
6
1.8K
DSE Lab @ MSU retweetledi
Haitao Mao
Haitao Mao@haitao_mao_·
See our new repo github.com/CurryTang/Towa… including (1) theoretical guidance (2) existing benchmark datasets and (3) existing GFM summarization. The new seminar focusing on GFM will be on board soon!!!
English
4
13
36
4.7K
DSE Lab @ MSU
DSE Lab @ MSU@dse_msu·
2/2: We examine it from two distinct viewpoints: the copyrights pertaining to the source data held by the data owners and those of the generative models maintained by the model builders. 📚 Dive in now: arxiv.org/abs/2402.02333
English
0
0
1
122
DSE Lab @ MSU
DSE Lab @ MSU@dse_msu·
1/2: 🚀Exciting release our latest preprint, “Copyright Protection in Generative AI: A Technical Perspective,” tackles the pressing issue of copyright in Generative AI. This work provides a comprehensive overview of copyright protection from a technical perspective.
English
1
2
3
1K
DSE Lab @ MSU
DSE Lab @ MSU@dse_msu·
You may know gene language models, but what about cell language models? We just published a blog on Valence Portal about our ICLR 2024 paper CellPLM: Pre-training of Cell Language Model Beyond Single Cells. Check it out📷: portal.valencelabs.com/blogs/post/cel…
English
0
5
10
2.4K
DSE Lab @ MSU
DSE Lab @ MSU@dse_msu·
🌟 Exciting News! 🎉 Huge congratulations to our lab's esteemed alum, Wei Jin, for being honored at the KAUST Rising Stars in AI Symposium 2024 AND for being featured in AAAI-2024 New Faculty Highlights!🤖💡 Congratulations, Wei! 🏆
English
0
0
7
335