Data Mining Group@UIUC

72 posts

@dmguiuc

led by Prof. Jiawei Han. Data Mining, AI, ML, NLP

Urbana, IL · Joined June 2023
88 Following · 679 Followers
Data Mining Group@UIUC retweeted
Ke Yang @EmpathYang
📰New preprint: How can we build a task-agnostic plug-and-play memory module for LLM agents that supports multiple memory types? We present PlugMem🔌🧠, a plugin memory module that works across tasks by turning heterogeneous experience into knowledge. Evaluated unchanged on long-term dialogue🗣️, multi-hop QA🕵️, and web agents🕸️🤖, PlugMem improves performance while using far fewer memory tokens. 📜Paper: empathyang.github.io/files/PlugMem.… 🔨Code: github.com/TIMAN-group/Pl…
Data Mining Group@UIUC retweeted
Ruozhen Yang @Seattleyrz
Maintaining agent performance over long horizons remains challenging—largely because memory systems fail to associate latent context with intent. 🎉 Introducing our paper: Grounding Agent Memory in Contextual Intent. STITCH achieves 35.6% gains on our new CAME-Bench.
Data Mining Group@UIUC retweeted
Patrick (Pengcheng) Jiang
📣 Excited to share #DeepRetrieval - our novel approach using reinforcement learning for query augmentation in information retrieval! 🚀
Our preliminary results (we got on Feb 16) CRUSH previous SOTA:
60.8% vs 24.7% recall on PubMed search engine
70.8% vs 32.1% recall on ClinicalTrial search engine
with a SMALLER model (3B vs 7B)
💡NO supervision data: - [no💰] vs [💰💰💰💰...] on creating augmented queries from ChatGPT/Claude!
💻 Github: github.com/pat-jj/DeepRet…
📝 Preliminary Technical Report: pat-jj.github.io/assets/pdf/dee…
🔬 Currently testing on general IR datasets and with dense retrieval methods
📝 Full paper with more results will be released soon.
Just created this X account to share this breakthrough - follow for more NLP+IR research! #NLP #IR #MachineLearning #LLM #AAAI2025
Data Mining Group@UIUC retweeted
Bowen Jin @BowenJin13
🚀 Introducing 𝗦𝗲𝗮𝗿𝗰𝗵-𝗥𝟭 – the first 𝗿𝗲𝗽𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻 𝗼𝗳 𝗗𝗲𝗲𝗽𝘀𝗲𝗲𝗸-𝗥𝟭 (𝘇𝗲𝗿𝗼) for training reasoning and search-augmented LLM agents with reinforcement learning! This is a step towards training an 𝗼𝗽𝗲𝗻-𝘀𝗼𝘂𝗿𝗰𝗲 𝗢𝗽𝗲𝗻𝗔𝗜 “𝗗𝗲𝗲𝗽 𝗿𝗲𝘀𝗲𝗮𝗿𝗰𝗵” via RL. Our 𝟯𝗕 𝗯𝗮𝘀𝗲 𝗟𝗟𝗠𝘀—including not just 𝗤𝘄𝗲𝗻 𝟮.𝟱 but also 𝗟𝗹𝗮𝗺𝗮 𝟯.𝟮—learn to 𝗿𝗲𝗮𝘀𝗼𝗻 and 𝗰𝗮𝗹𝗹 𝘀𝗲𝗮𝗿𝗰𝗵 𝗲𝗻𝗴𝗶𝗻𝗲𝘀 all on their own! Everything will be 𝗳𝘂𝗹𝗹𝘆 𝗼𝗽𝗲𝗻 𝘀𝗼𝘂𝗿𝗰𝗲. Stay tuned! Code: github.com/PeterGriffinJi… Experimental logs: wandb.ai/peterjin/Searc… #R1 #deepresearch #deepseek
Data Mining Group@UIUC retweeted
Bowen Jin @BowenJin13
🚀Excited to share "InstructG2I: Synthesizing Images from Multimodal Attributed Graphs" has been accepted by @NeurIPSConf 2024! instructg2i.github.io We propose a graph-conditioned stable diffusion model for image generation. GO and PLAY with it! #graph #diffusion #neurips
Data Mining Group@UIUC retweeted
Yu Zhang @yuz9yuz
🎓Successfully defended my Ph.D. thesis! 🎉My deepest gratitude goes to my thesis committee members: Prof. Jiawei Han @dmguiuc, Prof. Tarek Abdelzaher, Prof. Hanghang Tong, Prof. Wei Wang @WeiWang1973, and Dr. Iris Shen!
Data Mining Group@UIUC retweeted
Priyanka Kargupta @priyanka_karg
Happy to announce that TreeInstruct got accepted to EMNLP'24! Excited to discuss the work alongside @wonderingishika as part of a joint collaboration between @dmguiuc and @convai_uiuc. See you all in Miami! #EMNLP2024
Quoting Priyanka Kargupta @priyanka_karg:

Can LLMs make us critical thinkers? TreeInstruct reorients LLMs to be instructors that guide students socratically to solve problems, instead of assistants that provide direct answers. Check out arxiv.org/abs/2406.11709 (w/ @wonderingishika) to learn more!

Data Mining Group@UIUC @dmguiuc
🚀 Join our tutorial at #KDD2024, Automated Mining of Structured Knowledge from Text with Large Language Models! 👤Presented by @YunyiZhang10, @Siru_Ouyang, Professor Jiawei Han. 📅Aug 25, 10 AM - 1 PM CEST 📍 Room 129-130
Data Mining Group@UIUC retweeted
Yu Zhang @yuz9yuz
📢We have finally turned our "awesome" GitHub repository (290+ stars already) into a survey of 𝐒𝐜𝐢𝐞𝐧𝐭𝐢𝐟𝐢𝐜 𝐋𝐋𝐌𝐬 and their applications in 𝐒𝐜𝐢𝐞𝐧𝐭𝐢𝐟𝐢𝐜 𝐃𝐢𝐬𝐜𝐨𝐯𝐞𝐫𝐲! #LLM #AI4Science Paper: arxiv.org/abs/2406.10833 GitHub: github.com/yuzhimanhua/Aw…
Data Mining Group@UIUC retweeted
Bowen Jin @BowenJin13
🚀Excited to share "Language Models as Semantic Indexers" is accepted to ICML 2024! ⭐️We propose to learn document semantic IDs with large language models in a self-supervised fashion. ⭐️The learned semantic IDs can benefit LLM generative recommendation and retrieval. #LLM #IR
Data Mining Group@UIUC @dmguiuc
🎉 Our KV Cache paper got #ICLR2024 Outstanding Paper Honorable Mention! @iclr_conf 🎉 blog.iclr.cc/2024/05/06/icl… 🔊 Big congrats to all the authors @ge_suyu, Yunan Zhang, @LiyuanLucas, Minjia Zhang, Jiawei Han, and @JianfengGao0217!
Quoting Data Mining Group@UIUC @dmguiuc:

Our group has 3 papers accepted to #ICLR2024. They are led by @ge_suyu, @yumeng0818, and @MingZhong_ , respectively, in collaboration with @MSFTResearch and @metaai. In particular, @ge_suyu's paper has been selected for Oral Presentation (top 1.2%)! See you in Vienna! @iclr_conf

Data Mining Group@UIUC retweeted
Siru Ouyang @Siru_Ouyang
StructChem is accepted to #ICML2024! 🎉 Structuring your mind makes better chemistry reasoning. 🔬💡 We are also thrilled to share that StructChem is accessible at LLM Reasoners, a popular library for advanced reasoning. Try it out for scientific reasoning in this great playground 🌟: github.com/maitrix-org/ll…
Quoting Siru Ouyang @Siru_Ouyang:

🚀Announcing StructChem: A simple yet effective prompting strategy, unlocking the power of LLMs for complex chemistry reasoning. This task requires:
- Extensive domain knowledge
- Precise scientific computing
- Compositional step-by-step reasoning
Paper: arxiv.org/abs/2311.09656
Website: ozyyshr.github.io/StructChem

Data Mining Group@UIUC retweeted
Data Mining Group@UIUC @dmguiuc
Excited to share that we have 5 papers accepted to #ICML2024! Three of them are led by PhD students @BowenJin13, Yanru Qu, & @Siru_Ouyang, respectively; one is led by Prof. Hanghang Tong's group; one is a position paper on LLM trustworthiness by many great researchers. @icmlconf