Jure Leskovec

1.5K posts

Jure Leskovec

@jure

Professor of #computerscience @Stanford; Co-founder at https://t.co/hhm1j5wP0f #machinelearning #graphs.

Stanford, CA Katılım Ağustos 2007

424 Takip Edilen44.9K Takipçiler

Jure Leskovec@jure·1d

Paper: nature.com/articles/s4158… Code: github.com/snap-stanford/… Thanks so much to the team @YanayRosen @yusufroohani @StephenQuake!

English

889

Jure Leskovec@jure·1d

UCE connects molecular and cellular scales of biology. Genes are more than just columns in an expression matrix: in UCE, they are encoded according to the proteins they produce, using ESM, embedding novel species not seen during training, across 100Ms of years of evolution.

English

1.5K

Jure Leskovec@jure·1d

Excited to share that our Universal Cell Embedding (UCE) paper is published in @Nature ! Single-cell RNA sequencing data gives us an unprecedented look into the diversity of cell biology, but analysis has often been limited to the specific dataset or atlas that was collected. nature.com/articles/s4158…

English

213

14.1K

Jure Leskovec retweetledi

Kexin Huang@KexinHuang5·5d

Today, we're excited to share that Biomni is published in @ScienceMagazine. Biomedical research is still fragmented, manual, and difficult to scale. In this work, we introduce Biomni - the first general-purpose biomedical AI agent with an integrated biology environment that can reason, plan, and execute end-to-end scientific workflows. We show that, with the right environment and harness, AI can automate large-scale omics analyses, orchestrate laboratory robotics, optimize molecular properties, and even train new AI models for biology. We also introduce a reinforcement learning recipe for continually improving biomedical AI agents, enabling open-source models to achieve frontier-level performance. It's surreal to look back. We started the Biomni project in early 2024, when agentic AI was still nascent. It is exciting to see tens of thousands of biologists collaborating with agents every day to accelerate science. Try Biomni: biomni.phylo.bio Read more: science.org/doi/10.1126/sc… This work is not possible without this truly inter-disciplinary team: @serena2z @hcwww_ @YuanhaoQ Minta Lu, Ryan Li, @yusufroohani Lin Qiu @shiyi_c98 Gavin Junze Di @rickwierenga @kavi_deniz Sherry @TianweiShe Shruti Jennefer Xin Zhou @MWheelerMD Jon Bernstein @MengdiWang10 @PengHeAtlas @zhou_jingtian @SnyderShot @lecong Aviv Regev @jure @StanfordAILab @genentech @phylo_bio @arcinstitute @UW @berkeley_ai @RetroBio_ @tamarindbio @Princeton @UCSF

English

184

856

129.8K

Jure Leskovec@jure·6 Tem

Modern multimodal models aren't a single decode loop anymore; they're composite. M* is one runtime that serves them all, and it matches or beats every specialized system: up to 2.7× on omni TTS, 12.5× on world-model rollouts. Learn more here: ai.stanford.edu/blog/mstar/

English

3.8K

Jure Leskovec retweetledi

Moritz Schäfer@muronglizi·6 Tem

Cancer diagnosis is informed by cellular annotations of histopathology, but assays are expensive or rely on manual annotations. At #ICML2026, we present SpatialWhisperer, a trimodal model that zero-shot annotates cell types in histopathology images. 🧵👇

English

8.2K

Jure Leskovec retweetledi

Vignesh Kothapalli@kvignesh1420·6 Tem

🎉🎉 PluRel will be presented at #ICML2026 🎉🎉 Date: Tue, Jul 7, 2026 • 10:30 AM – 12:15 PM KST Location: HALL A #2715 by co-author: @_rishabhranjan_ ! Also checkout our latest website for interactive visualizations, access to code, models and data: star-project.stanford.edu/plurel/

Vignesh Kothapalli@kvignesh1420

Relational Foundation Models face a scaling problem: diverse training datasets are rarely public due to privacy constraints 🔒. 🚀 We are excited to introduce "PluRel": a framework that synthesizes diverse multi-table relational databases from scratch, unlocking scaling laws for RFMs. 🧵 Kudos to the amazing collaborators at @StanfordAILab @Kumo_ai_team , and @SAP : @_rishabhranjan_ @VHudovernik @vijaypradwi @johanneshoffart @guestrin @jure

English

3.5K

Jure Leskovec retweetledi

Kexin Huang@KexinHuang5·30 Haz

Excited to share a research collaboration with @ScaleAILabs - we rigorously evaluate bio agents on different models across 82 drug discovery tasks - interesting findings include: (1) know-how/environment >>> models (2) different LLMs have different strength - highlighting a need for model-routing for biology agents:

Yuanhao Qu@YuanhaoQ

We get this question a lot: "Which model is best for drug discovery?" Our new benchmark announced today with @ScaleAILabs, DrugDiscoveryBench (82 tasks from working drug discovery scientists, run on Biomni Open Source Environment), has a clear answer: the model matters far less than what you build around it. 🧵3 key takeaways →

English

4.9K

Jure Leskovec retweetledi

Phylo@phylo_bio·25 Haz

Biology doesn't happen in one place. We're bringing Biomni Lab from your browser to your phone, your desktop, and your agent of choice via MCP. Biomni comes with you wherever your work happens. Sign up for the closed beta (Mobile and Desktop): forms.gle/JbB4vV4GdaZcaL… Use Biomni MCP today: mcp.phylo.bio/mcp Blog: phylo.bio/blog/biomni-ev…

English

24.6K

Jure Leskovec retweetledi

Engram@EngramLab·23 Haz

x.com/i/article/2069…

ZXX

181

242

1.9K

1.8M

Jure Leskovec retweetledi

Vignesh Kothapalli@kvignesh1420·3 Haz

Can reasoning models become overly reliant on chain-of-thought examples? 🤔 Our #ACL2026 work shows excessive CoT supervision is not always beneficial, and gives a recipe for tuning the CoT fraction to improve novel-task accuracy. 🧵 Website: kvignesh1420.github.io/cot-icl-lab

GIF

English

3.3K

Jure Leskovec@jure·19 May

A fascinating reality check for AI coding agents. The new NanoGPT-Bench reveals that current agents (e.g., Claude Code and Codex) only recover 9.3% of human progress on AI R&D tasks.

Intology@intology

Can coding agents do research? We release NanoGPT-Bench, an internal eval we’ve used to test agents on an AI R&D problem with months of human progress Codex, Claude Code, Autoresearch recover only 9.3% of human progress, mostly tuning hyperparams & ignoring algorithmic research NanoGPT-Bench is built on the NanoGPT Speedrun, a popular LLM pretraining competition to minimize the training time of a GPT-2 style model. Existing human submissions constitute nearly 2 years of work. To control for dependencies and contamination in frontier models, we standardize evaluation to a 5-month window of world records. Evaluation is fully autonomous and end-to-end, with no human intervention or internet access. 🧵

English

7.4K

Jure Leskovec retweetledi

Fang Wu@WUFang40615703·15 May

Proteo-R1 (ICML 2026), the first reasoning protein foundation model for protein design, is out! 🚀🧬 Most protein design models generate structures without ever *reasoning* about which residues matter. We think that's backwards. Human protein engineers👩‍🔧 don't work this way. They identify critical interaction residues first — charged anchors, hydrophobic hotspots, specificity-determining motifs — and only then optimize geometry around those decisions. ━━━━━━━━━━━━━━━━ 🔬 THE CORE IDEA ━━━━━━━━━━━━━━━━ A dual-expert architecture that explicitly decouples molecular understanding from geometric generation: → ⚡A multimodal LLM (understanding expert) analyzes protein sequences, structures, and text to identify key functional residues governing binding and specificity → ⚡A diffusion model (generation expert) then co-designs sequence + structure — but with those residues locked in as hard constraints ━━━━━━━━━━━━━━━━ 📐 HOW IT'S TRAINED ━━━━━━━━━━━━━━━━ Three-stage curriculum: ① Multimodal Alignment — freeze the LLM, train projections to bridge ESM-2 + AF3-style structural features into language space ② Structural Reasoning Mid-Training — unfreeze the LLM, teach it residue grounding → pairwise geometry → interface localization → hotspot prediction ③ Joint Reasoning-Guided Design — end-to-end on antibody-antigen complexes. Gradients from the diffusion objective flow back through the reasoning expert. ━━━━━━━━━━━━━━━━ 📊 RESULTS ━━━━━━━━━━━━━━━━ Evaluated on simultaneous multi-CDR redesign and the RAbD CDR-H3 benchmark: ✅ Best RMSD & DockQ on RAbD — redesigned H3 loops are geometrically accurate *and* docked well ✅ Lowest backbone dihedral divergence (JSDbb) among all baselines ✅ Reduced intra- and inter-chain steric clashes ✅ Generated sequences score lower perplexity than native antibodies under IgLM & AbLang ✅ Plug-and-play: swapping the diffusion backend to UniMoMo still improves RMSD and IMP ━━━━━━━━━━━━━━━━ 💡 WHY IT MATTERS ━━━━━━━━━━━━━━━━ Proteo-R1 isn't just a better antibody design model. It's a blueprint for coupling deliberative LLM reasoning with any physical generative process — interpretable, modular, and backend-agnostic. 📄 Paper: arxiv.org/abs/2605.02937 💻 Code: github.com/smiles724/Prot… 🌐 Demo: smiles724.github.io/r1/ Great thanks to my wonderful collaborators Weihao Xuan, Heli Qi, @Hanqun_CAO, Heng-Jui Chang, @KKuanPang @XiangruTang Zehong Wang, @hcwww_ , @KejunYing @lupantech Chiho Im, Seungju Han, @richardxp888 @tikgiau. Also appreciate the guidance from advisors @YejinChoinka @jure @erranlli Naoto Yokoya, Masashi Sugiyama.

English

289

133.3K

Jure Leskovec retweetledi

Kexin Huang@KexinHuang5·7 May

We’re working with world-class experts to encode the latest research techniques and best practices into reusable skills that any scientist can use. First up: rare-variant gene burden analysis, demonstrated on UK Biobank data for obesity. Read the case study here:

Phylo@phylo_bio

Introducing the Genetic Target Hypothesis skill, co-developed with Prof. Manuel Rivas at Stanford, which transforms rare-variant burden statistics from UK Biobank into ranked therapeutic target hypotheses in minutes. In a BMI/obesity case study, the system rediscovered canonical biology such as MC4R with the correct inhibit/activate direction, while surfacing novel candidate targets for follow-up. Learn more in the blog post: phylo.bio/blog/turning-u…

English

111

17K

Jure Leskovec@jure·30 Nis

@KexinHuang5 Super useful feature. Congrats on 🚀 progress!

English

Kexin Huang@KexinHuang5·30 Nis

Introducing agent-managed sandboxes: AI agents to autonomously orchestrate fleets of sandboxes to handle massive workloads. This unlocks adaptive scaling, from small tasks to terabyte-scale processing, while minimizing unnecessary cost. With parallel sandboxes, throughput multiplies, and agents can explore multiple ideas simultaneously. Checkout our new technical report of this sandbox pattern:

Phylo@phylo_bio

x.com/i/article/2049…

English

312

35.6K

Jure Leskovec@jure·30 Nis

What if building production-ready predictive models was as simple as asking a question in plain English? Today, we’re launching Kumo Coding Agent Skills, an open-source library that turns coding agents like Claude Code and OpenAI Codex into experts at building advanced predictive models with the Kumo SDK. kumo.ai/company/news/i…

English

2.6K

Jure Leskovec@jure·21 Nis

Thrilled that Biomni-AD won the $1M Alzheimer's Insights AI Prize at the AD/PD Conference in Copenhagen 🏆 Most AI tools answer a single question. Biomni-AD is a co-scientist agent. It explores hypotheses, integrates evidence across genetics, proteomics, neuroimaging & clinical data, and explains its reasoning so scientists can interrogate and build on it. Alzheimer's will affect 152M people by 2050. No single researcher can synthesize all that data at once. That's exactly where AI agents change the equation. Proud of the whole team. And it'll be freely available to researchers worldwide 🙏 stanforddaily.com/2026/04/15/bio…

English

160

20.8K

Jure Leskovec@jure·20 Nis

Join me tomorrow to see KumoRFM-2 live! 🚀 The first foundation model to outperform supervised ML on enterprise data, scaling to 500B+ rows. Register here: events.zoom.us/ev/AhSNbvHBKR5…

English

3.2K

Jure Leskovec@jure·17 Nis

KumoRFM-2 just became the first foundation model to outperform fully supervised machine learning on enterprise data. Scaling to 500B+ rows. We're doing a free live session to show you how it works. In this session, we'll: - Break down the innovations behind KumoRFM-2 - Demo real workflows end-to-end - Showcase use cases across sales, marketing, and fraud Speakers: - Jure Leskovec - Chief Scientist & Co-founder, Professor at Stanford - Disha Dubey - Data Science Lead - Vid Kocijan - ML Engineer Date: Tuesday, April 21, 2026 Time: 10:00 AM PDT Where: Online, free to attend Register here: events.zoom.us/ev/AhSNbvHBKR5…

English

2.2K

Keşfet

@YanayRosen @yusufroohani @StephenQuake @Nature @ScienceMagazine @serena2z @hcwww_ @YuanhaoQ