

Data Mining Group@UIUC
72 posts

@dmguiuc
led by Prof. Jiawei Han. Data Mining, AI, ML, NLP






📢We have finally turned our "awesome" GitHub repository (290+ stars already) into a survey of 𝐒𝐜𝐢𝐞𝐧𝐭𝐢𝐟𝐢𝐜 𝐋𝐋𝐌𝐬 and their applications in 𝐒𝐜𝐢𝐞𝐧𝐭𝐢𝐟𝐢𝐜 𝐃𝐢𝐬𝐜𝐨𝐯𝐞𝐫𝐲! #LLM #AI4Science Paper: arxiv.org/abs/2406.10833 GitHub: github.com/yuzhimanhua/Aw…





Can LLMs make us critical thinkers? TreeInstruct reorients LLMs to be instructors that guide students socratically to solve problems, instead of assistants that provide direct answers. Check out arxiv.org/abs/2406.11709 (w/ @wonderingishika) to learn more!















Our group has 3 papers accepted to #ICLR2024. They are led by @ge_suyu, @yumeng0818, and @MingZhong_ , respectively, in collaboration with @MSFTResearch and @metaai. In particular, @ge_suyu's paper has been selected for Oral Presentation (top 1.2%)! See you in Vienna! @iclr_conf


🚀Announcing StructChem: A simple yet effective prompting strategy, unlocking the power of LLMs for complex chemistry reasoning. This task requires: - Extensive domain knowledge - Precise scientific computing - Compositional step-by-step reasoning Paper: arxiv.org/abs/2311.09656 Website: ozyyshr.github.io/StructChem

📢Excited to share our new paper "Investigating Data Contamination for Pre-training Language Models"! We analyze the effects of data contamination in the pre-training stage of LMs by pre-training & studying GPT-2 models🚀. Paper: arxiv.org/abs/2401.06059

