

Zonglin Yang
602 posts

@Yang_zy223
Research Scientist @miromind_ai | LLMs for Scientific Discovery | Creator of the MOOSE series (Latest: MOOSE-Star 🌟)





Can we actually TRAIN LLMs for scientific discovery — or only prompt them to brainstorm? 🧬✨ 🎉 MOOSE-Star → #ICML2026 Most work on LLMs for hypothesis discovery focuses on inference-time agents or feedback-driven refinement. The core generative process — P(hypothesis | research background), or P(h|b) — has been largely sidestepped: directly training it remains an open problem. We show why: a combinatorial complexity barrier makes naive end-to-end training mathematically intractable. First scalable recipe for training P(h|b), with clean scaling laws on both training data and test-time compute. 📄 Paper: arxiv.org/abs/2603.03756 💻 GitHub: github.com/ZonglinY/MOOSE… 🤗 HF: huggingface.co/collections/Zo… 🧵👇

Can we actually TRAIN LLMs for scientific discovery — or only prompt them to brainstorm? 🧬✨ 🎉 MOOSE-Star → #ICML2026 Most work on LLMs for hypothesis discovery focuses on inference-time agents or feedback-driven refinement. The core generative process — P(hypothesis | research background), or P(h|b) — has been largely sidestepped: directly training it remains an open problem. We show why: a combinatorial complexity barrier makes naive end-to-end training mathematically intractable. First scalable recipe for training P(h|b), with clean scaling laws on both training data and test-time compute. 📄 Paper: arxiv.org/abs/2603.03756 💻 GitHub: github.com/ZonglinY/MOOSE… 🤗 HF: huggingface.co/collections/Zo… 🧵👇

🚨 LLM-based scientific hypothesis discovery now has a scalable training recipe. MOOSE-Star, accepted at ICML 2026, enables scalable training for hypothesis generation, with more scalable test-time scaling. By our researchers— x.com/Yang_zy223/sta…

(1/3) Enterprise RDBs rarely change their structure, but meet new ML tasks every day. The RDB foundation model (FM) fits this position well because no task-specific training is needed. Our latest work uses intra-column encoding and tabular FMs, achieving SOTA performance.

Can we actually TRAIN LLMs for scientific discovery — or only prompt them to brainstorm? 🧬✨ 🎉 MOOSE-Star → #ICML2026 Most work on LLMs for hypothesis discovery focuses on inference-time agents or feedback-driven refinement. The core generative process — P(hypothesis | research background), or P(h|b) — has been largely sidestepped: directly training it remains an open problem. We show why: a combinatorial complexity barrier makes naive end-to-end training mathematically intractable. First scalable recipe for training P(h|b), with clean scaling laws on both training data and test-time compute. 📄 Paper: arxiv.org/abs/2603.03756 💻 GitHub: github.com/ZonglinY/MOOSE… 🤗 HF: huggingface.co/collections/Zo… 🧵👇













Such a great evening to start a brand new research for NeurIPS in 3.5 days.🧘♂️ Day 1: planning. Night 1: running experiments and sending the abstract. Day 2: reading results fighting with Claude, and sending again. Night 2: sleep (optional). Day 3: opening Codex, and finally, write the pape in parallel. Night 3: resolving the “beef” with Claude (temporary peace) and going to sleep. Day 4: final reading, last-minute fixes, submission then some relaxation, maybe a beach walk. I’ll keep you posted on the results. This will be my only single-author paper, so I can’t hide behind other submissions if it gets rejected 😅

For AI PhDs aiming for industry, paper count matters, but only up to a point. In China, 2 to 3 (co)first author CCF-A papers is often the borderline for a Top Talent offer. Beyond that, the marginal gain drops fast. When you apply as a fresh grad, what matters more is whether you have matched experience in a big tech foundation model team. As a PhD, papers can feel like a huge part of the world. After graduation, people see it differently. And for CS PhDs, AI and LLMs are only a small slice. Many groups do not even send students to industry internships the way LLM teams do, and industry itself is much bigger than LLMs. Paper is only one part of you. Your experience matters more. The LLM boom is a winner takes all arena shaped by extreme competition, where only the hardest driving survivors make it to the top. LLM 就是卷生卷死卷出来的幸存者盛世啊




