Pan Lu

🔥Introducing #AgentFlow, a new trainable agentic system where a team of agents learns to plan and use tools in the flow of a task.
🌐agentflow.stanford.edu
📄huggingface.co/papers/2510.05…
AgentFlow unlocks full potential of LLMs w/ tool-use. (And yes, our 3/7B model beats GPT-4o)👇
🧩A team of four specialized agents coordinates via shared memory:
Planner: plan reasoning & tool calls 🧭
Executor: invoke tools & actions 🛠
Verifier: check memory status ✅
Generator: produce final results ✍️
💡The Magic: 🌀💫 AgentFlow directly optimizes its Planner agent live, inside the system, using our new method, Flow-GRPO (Flow-based Group Refined Policy Optimization). This is "in-the-flow" reinforcement learning.
📊The Results: AgentFlow (7B backbone) outperforms top baselines on 10 benchmarks, with average gains of:
+14.9% on search 🔍
+14.0% on agentic 🤖
+14.5% on math ➗
+4.1% on science 🔬
🏆It even surpasses larger-scale models like Llama-3.1-405B and GPT-4o (~200B).
Try it yourself!
🛠️Code: github.com/lupantech/Agen…
🚀Demo: huggingface.co/spaces/AgentFl…
🤖Model: huggingface.co/AgentFlow/mode…
📊Visual: agentflow.stanford.edu/#visualization
💬Join our Slack: join.slack.com/t/agentflow-co…
#agentic #llms #RL #tooluse
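To make the four-role layout above concrete, here is a minimal toy sketch of a planner/executor/verifier/generator loop around a shared memory. It is not the AgentFlow code: the `Memory` class, the function signatures, the stopping rule, and the `add_calculator` tool are all hypothetical names invented for illustration, and the real system uses LLM-backed agents rather than these hard-coded rules.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Memory:
    """Shared memory that every agent reads from and appends to (hypothetical)."""
    query: str
    records: List[dict] = field(default_factory=list)


def planner(memory: Memory) -> dict:
    # Toy policy: call the calculator once, then stop planning.
    if not memory.records:
        return {"tool": "calculator", "args": memory.query}
    return {"tool": None, "args": None}


def executor(step: dict, tools: Dict[str, Callable[[str], str]]) -> str:
    # Invoke the tool chosen by the planner.
    return tools[step["tool"]](step["args"])


def verifier(memory: Memory) -> bool:
    # Toy check: memory is considered sufficient once it holds one result.
    return len(memory.records) >= 1


def generator(memory: Memory) -> str:
    # Produce the final answer from the last recorded result.
    return memory.records[-1]["result"] if memory.records else ""


def run(query: str, tools: Dict[str, Callable[[str], str]], max_turns: int = 5) -> str:
    memory = Memory(query=query)
    for _ in range(max_turns):
        step = planner(memory)
        if step["tool"] is None:
            break
        result = executor(step, tools)
        memory.records.append({"step": step, "result": result})
        if verifier(memory):
            break
    return generator(memory)


def add_calculator(expr: str) -> str:
    # Toy tool: sums integers separated by "+".
    return str(sum(int(part) for part in expr.split("+")))


TOOLS = {"calculator": add_calculator}
answer = run("2+3", TOOLS)
print(answer)
```

The point of the sketch is only the control flow: each role is a separate function, and the only state they share is the `Memory` object.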

6

20

129

16.8K

Pan Lu retweeted

Xinran.Z@Xander_zzzzz·1d

Have been thinking a lot about agentic systems where the harness is thin but the skills and multi-agent patterns are learned rather than hand-engineered. 🤔🤔Work like AgentFlow makes it much more realistic to imagine agent teams that can actually scale with problem complexity, instead of relying on brittle orchestration graphs.

Excited to share that AgentFlow has been selected as an ICLR 2026 Oral 🎉 agentflow.stanford.edu Since launch, AgentFlow has also grown to 1.7K GitHub stars. Thank you so much for the support. AgentFlow is a trainable multi-agent system where specialized agents learn to plan and use tools in the flow of a task. We are excited to present it at ICLR. 🛠️ Code: github.com/lupantech/Agen… 🤖 Models: huggingface.co/AgentFlow/mode… 🚀 Demo: huggingface.co/spaces/AgentFl… 🎥 Video: youtube.com/watch?v=kIQbCQ… Huge shoutout to the amazing team behind this work: 🌟 @zhuofengli96475, @GhxIsaac, @SeungjuHan3, @ShengLiu_, @jianwen_xie, @yuz9yuz, @YejinChoinka, @james_y_zou And thank you to our supporters: 📷 @LambdaAPI, @RenPhilanthropy, @StanfordHAI, @StanfordAILab, @kaist_ai. See you at ICLR 2026! #ICLR2026 #AgentFlow #AgenticAI #LLM #RL #ToolUse

1

5

550

Pan Lu retweeted

DeepSeek@deepseek_ai·4d

🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length. 🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models. 🔹 DeepSeek-V4-Flash: 284B total / 13B active params. Your fast, efficient, and economical choice. Try it now at chat.deepseek.com via Expert Mode / Instant Mode. API is updated & available today! 📄 Tech Report: huggingface.co/deepseek-ai/De… 🤗 Open Weights: huggingface.co/collections/de… 1/n

1.6K

7.6K

44.6K

9.2M

Pan Lu retweeted

Stanford AI Lab@StanfordAILab·5d

Are you at ICLR 2026 in Rio? Check out the full list of papers from Stanford AI Lab - covering LLM reasoning, agentic systems, AI safety, robotics, spatial intelligence, video generation, and more. See you there! 🇧🇷 ai.stanford.edu/blog/iclr-2026

12

80

14.9K

Pan Lu retweeted

Jared Duker Lichtman@jdlichtman·17 Apr

Very excited to share: I'm co-organizing the Future of Mathematics Symposium at Stanford, held on May 1-2! We have a remarkable line-up of speakers, with Keynotes from Fields Medalists Terry Tao, Maryna Viazovska, and Michael Freedman. Free pre-registration link below

12

59

392

54.1K

Pan Lu@lupantech·7 Apr

Excited to share that OctoTools has been accepted to ACL 2026. 🐙 OctoTools is our training-free, extensible framework for tool-using agents on complex reasoning tasks. Grateful to the broader community for the support. Our GitHub repo has now reached 1.4K stars. 📣 Huge thanks to our amazing team: @chenbowen118, @ShengLiu_, @connect_thapa, and Joseph Boen. Special thanks to @james_y_zou. Code: github.com/octotools/octo… Project: octotools.github.io See you in San Diego!🏖️🌴 @aclmeeting #OctoTools #ACL2026

🐙 Introducing OctoTools: an agentic framework with extensible tools for complex reasoning! 🚀 🧵
🔗 Explore now: octotools.github.io
OctoTools tackles challenges in complex reasoning, including visual understanding, domain knowledge retrieval, numerical reasoning, and multistep problem-solving. It introduces:
🔹 Standardized tool cards to encapsulate tool functionality
🔹 A planner for structured high-level & low-level planning
🔹 An executor to carry out tool usage
Featured Highlights 💡
✅ Standardized tool cards for seamless integration of new tools, with no framework changes needed (🔎 examples: octotools.github.io/#tool-cards)
✅ Planner + Executor for structured high-level & low-level decision-making
✅ Diverse tools: visual perception, math, web search, specialized tools & more
✅ Long CoT reasoning with test-time optimization: planning, tool use, verification, re-evaluation & beyond (🔎 examples: octotools.github.io/#visualization)
✅ Training-free & LLM-friendly: easily extend with the latest models
✅ Task-specific toolset optimization: select an optimized subset of tools for better performance
📊 Performance: OctoTools achieves generalizable gains across 16 tasks, outperforming:
📈 GPT-4o (+9.3%)
📈 AutoGen (+10.6%)
📈 GPT-4o Functions (+7.5%)
📈 LangChain (+7.3%)
🤗 Try the live demo (supported by @huggingface @_akhaliq): huggingface.co/spaces/octotoo…
🐙 OctoTools in action on diverse real-world examples:
✅ How many r letters are in the word strawberry?
✅ What's up with the upcoming Apple Launch? Any rumors? (credit: @karpathy)
✅ Which is bigger, 9.11 or 9.9?
✅ Solve game of 24 with [1,1,6,9]
✅ Research trends in tool agents with LLMs for scientific discovery from ArXiv, PubMed, and Nature
✅ How many baseballs are there? (visual perception, GPT-4o ❌)
✅ What is the organ on the left side of this image? (radiology, GPT-4o ❌)
✅ What are the cell types in this image? (pathology, GPT-4o ❌)
... and more!
Dive deep into OctoTools:
📄 Read our 89-page paper: arxiv.org/abs/2502.11271
💻 Explore the codebase: github.com/octotools/octo…
Huge thanks to our amazing team: @chenbowen118, @ShengLiu_, @connect_thapa, Joseph Boen! Special thanks to @james_y_zou, @StanfordHAI, @ChanZuckerberg for the support! 🙌
#Agent #LLMs #ToolUse #Reasoning #OctoTools
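The "tool card" idea in the thread above can be illustrated with a small sketch: each tool is wrapped in a uniform record, so adding a tool means registering one more card rather than changing the framework. This is not the OctoTools API. `ToolCard`, `REGISTRY`, `register`, `execute`, and the `"letter|word"` input convention are all hypothetical names chosen for this example.

```python
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass(frozen=True)
class ToolCard:
    """Uniform wrapper around a tool (hypothetical structure)."""
    name: str
    description: str
    run: Callable[[str], str]


REGISTRY: Dict[str, ToolCard] = {}


def register(card: ToolCard) -> None:
    # Adding a tool only touches the registry, not the executor.
    REGISTRY[card.name] = card


def count_letter(text: str) -> str:
    # Input format "letter|word" is a made-up convention for this sketch.
    letter, word = text.split("|", 1)
    return str(word.lower().count(letter.lower()))


register(ToolCard(
    name="letter_counter",
    description="Count occurrences of a letter in a word.",
    run=count_letter,
))


def execute(tool_name: str, tool_input: str) -> str:
    # The executor looks up the card by name and calls its run function.
    return REGISTRY[tool_name].run(tool_input)


result = execute("letter_counter", "r|strawberry")
print(result)
```

In this sketch a planner would only need the card names and descriptions to choose a tool, which is one plausible reason for keeping the metadata and the callable together.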

13

99

10.2K

Pan Lu retweeted

James Zou@james_y_zou·6 Apr

Training multi-agent teams is hard. #AgentFlow comes to the rescue. We introduce Flow-GRPO, an efficient method to train multi-agent teams. Improves planning and tool use. Selected as an #ICLR2026 Oral (top 1%)🚀
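The post names Flow-GRPO without giving details, so the following is only a generic sketch of the group-normalized advantage used in GRPO-style methods, not the authors' implementation. Copying one trajectory-level advantage to every turn is an assumption made here for illustration, and `group_advantages` and `broadcast_to_turns` are hypothetical helper names.

```python
from statistics import mean, pstdev
from typing import List


def group_advantages(rewards: List[float], eps: float = 1e-8) -> List[float]:
    """Normalize each rollout's reward against its group: (r - mean) / (std + eps)."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


def broadcast_to_turns(advantage: float, num_turns: int) -> List[float]:
    # Assumption: every turn of a rollout shares the same outcome-level advantage.
    return [advantage] * num_turns


# Four rollouts for the same query, with 0/1 outcome rewards (made-up numbers).
rewards = [1.0, 0.0, 0.0, 1.0]
turns = [3, 2, 4, 1]  # number of planner turns in each rollout

advs = group_advantages(rewards)
per_turn = [broadcast_to_turns(a, n) for a, n in zip(advs, turns)]
print(per_turn)
```

Successful rollouts get a positive advantage and failed ones a negative advantage of equal magnitude, so in this toy group the advantages sum to roughly zero.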

43

201

27.6K

Pan Lu retweeted

Kuan-Hao Huang@kuanhaoh_·5 Apr

The first-ever Texas NLP Symposium wrapped up yesterday! 🎉🎉🎉 Huge thanks to all the speakers and attendees for making it a huge success. I hope everyone had a great time. Stay tuned for info on next year! #TexasNLP Check photos and highlights here: photos.app.goo.gl/AhuqKfKDXHyUQw…

11

47

5.2K

Pan Lu@lupantech·2 Apr

@percyliang @Diyi_Yang Congratulations!!

633

Percy Liang@percyliang·1 Apr

Academic titles are funny. After 14 years, I finally have the official title that people might have always assumed I had.

93

22

1.3K

115.1K

Pan Lu retweeted

Haotian Ye✈️ICLR26@haotian_yeee·30 Mar

Finally getting to share one of my favorite projects. ICLR Oral! 🏆 It’s so strange how rigid video tokenization is. Think about it: why should a still landscape cost the same number of tokens as a busy street? We built InfoTok. We went back to basics with Shannon’s information theory to make tokens "adaptive" in a principled way. Its 2.3x better compression and 11x faster inference demonstrate the magic of the old-school theory ✨ Check it out: research.nvidia.com/labs/dir/infot…

10

43

294

48.2K

Pan Lu retweeted

Association for Computing Machinery@TheOfficialACM·18 Mar

Congratulations to Charles H. Bennett (@IBMResearch) and Gilles Brassard (@UMontreal) on receiving the 2025 ACM A.M. Turing Award! 🔗: awards.acm.org/turing

14

237

782

118.6K

Pan Lu retweeted

Sebastian Raschka@rasbt·15 Mar

I (finally) put together a new LLM Architecture Gallery that collects the architecture figures all in one place! sebastianraschka.com/llm-architectu…

202

1.5K

8.2K

723.9K

Pan Lu@lupantech·14 Mar

@hbXNov @kaiwei_chang @adityagrover_ @VioletNPeng @AnthropicAI Congrats, Hritik! 🎉

1

277

Hritik Bansal@hbXNov·14 Mar

Finally defended my Ph.D. thesis! 🥳 A very warm thank you to my family, friends, and advisors — @kaiwei_chang, @adityagrover_, @VioletNPeng, and Hongjing Lu. Next, I will be joining @AnthropicAI as a Member of Technical Staff. My defense slides ⬇️

41

4

290

24K

Pan Lu@lupantech·14 Mar

Totally agree, that loop is where many AI-for-bio systems break down. Our approach in Eubiota is to optimize for hypothesis quality before handoff: mechanistic grounding, structured evidence retrieval, and prioritization by experimental feasibility. So the key is not magically making the wet lab faster, but making each wet-lab iteration much more worth running.

11

Manol T.@manol_ai·13 Mar

@lupantech @stanfordnlp @StanfordAILab @ChanZuckerberg @czi @StanfordHAI The wet-lab validation loop is where most AI bio projects stall. Curious how Eubiota handles the turnaround time between hypothesis and experimental confirmation.

1

0

1

32

Pan Lu@lupantech·4 Mar

Bridging generative AI with rigorous wet-lab validation requires bold support. 🧬🤖 We are deeply grateful to our research homes and organizations for empowering this innovation: 🏫 @stanfordnlp @StanfordAILab 🙏 @ChanZuckerberg @czi 🏛️ @StanfordHAI (Seed Research Grants) 📐 @RenPhilanthropy (AI for Math Fund Grant) ⚕️ @NIH @NIHFunding Thank you for accelerating the future of AI for Science! 🌍✨

Introducing Eubiota: A multi-agent AI framework for autonomous discovery in the human microbiome. 🧬🤖🧫 👇 Explore the platform: eubiota.ai Eubiota doesn’t just chat: it plans, uses tools, verifies evidence, and drives end-to-end discovery—from hypothesis to wet-lab validation. Eubiota achieved 87.7% accuracy on mechanistic reasoning (vs. GPT-5.1 77.3%). But we went further. We used Eubiota to drive 4 discoveries with experimental validation: ✅ Gene Discovery: Identified the uvr-ruv stress axis by screening 1,945 genes & 10K papers in hours (on 2 GPUs) 🧬⚡ ✅ Therapeutics: Designed a microbial therapy that reduced colitis inflammation 🦠💊 ✅ Antibiotics: Engineered a cocktail that kills pathogens but spares commensals 🎯🛡️ ✅ Metabolites: Discovered novel anti-inflammatory molecules from large human data 🥗🧪 📄 Paper: biorxiv.org/content/10.648… 💻 Code: github.com/lupantech/Eubi… Try the live app to start your own discovery: 🎮 App: app.eubiota.ai Huge thanks to fantastic co-lead @YifanGao15, our incredible PIs @james_y_zou @LabSonnenburg, stellar advisors Kerwyn Casey Huang, @YejinChoinka, and the excellent team! 👏 #Eubiota #Microbiome #Inflammation #AI4Sci #AgenticAI #Agent

3

4

20

7.2K

Pan Lu retweeted

TianqiaoChen@tianqiao_chen·11 Mar

Conversation is easy for AI. Solving real problems is not. Real problems — in science, finance, and engineering — require something very different: •long reasoning chains •structured exploration •verification at every step That’s the motivation behind MiroThinker. Today we’re releasing the next generation of our research agent models: MiroThinker-1.7 and MiroThinker-H1. Instead of scaling conversations, we focused on scaling effective reasoning — improving both reasoning depth and step-level accuracy. Some highlights: 🧠 Heavy-duty reasoning for long-horizon tasks 🔎 Verification-centric architecture with both local and global checks 🌐 Strong performance on BrowseComp, BrowseComp-ZH, GAIA, and Seal-0 📊 Leading results across scientific and financial evaluation benchmarks Our long-term goal is simple: build agents that can reason, verify, and solve real problems — not just generate answers. Proud of the team for pushing this forward. Explore MiroThinker: Hugging Face lnkd.in/gkEAh88G GitHub lnkd.in/eJyH4xEM The MiroMind app integration will roll out in the coming days.

10

11

75

56.2K

Pan Lu retweeted

Open Life Science AI@OpenlifesciAI·8 Mar

🚨 Medical AI Research Alert! 🚨 Can AI autonomously design experiments and discover mechanisms in the complex gut microbiome? @Stanford presents Eubiota: A modular agentic framework for end-to-end biological discovery. By @lupantech, @YifanGao15, @harrison_zhang, @GutBugs2, @elektra_robi, @bingxuan_l, @ghxisaac, @Kunlun_Zhu, @james_y_zou and team from Stanford University. Now you can watch and listen to the latest Medical AI papers daily on our YouTube and Spotify channels! YouTube: youtu.be/O8CIpTc50TE?si… Here's why it's exciting: 👇🧵 1/10 #MedicalAI #Healthcare #Microbiome #AIinScience

5

20

2.4K

Pan Lu@lupantech·3 Mar

While Eubiota simulates an AI scientist, our human collaboration was the real magic behind it! ✨Extremely grateful to @YifanGao15 for 12+ months of intense, efficient, and unfiltered teamwork. We pushed each other to make this framework as rigorous as possible. Couldn't have asked for a better co-pilot to bring #Eubiota to life! 🚀

Yifan Gao@YifanGao15

Can AI move beyond chatting to actually making biological discoveries? What if AI could think like a microbiologist? 🧪 📣 Excited to introduce Eubiota: a modular agentic AI framework for autonomous discovery in the gut microbiome. 🧬🤖🦠 👉Explore: eubiota.ai The gut microbiome is complex; manual discovery is slow. 🐌 We built Eubiota to think like a researcher: planning experiments, verifying data, and synthesizing results. It’s an open-source system designed to empower the entire community. 🌐 Eubiota demonstrated its capability across end-to-end discovery tasks, with results validated in the lab: ✅ Identified the uvr-ruv DNA repair axis for fitness under inflammatory stress by screening ~2,000 genes 🧬 ✅ Designed a microbial consortium therapy to reduce inflammation in mouse colitis models. 🐁 ✅ Uncovered diet-associated anti-inflammatory metabolites & engineered antibiotic cocktails. 💊 Eubiota is the "scientific copilot" the field has been waiting for. Let AI be your partner to accelerate discovery! 🚀 Incredible teamwork with @lupantech and all our co-authors! I'm deeply grateful for the guidance from amazing advisors @james_y_zou, @LabSonnenburg, and brilliant mentors KC Huang, @YejinChoinka. Read more here: 🔗biorxiv.org/content/10.648… Code: github.com/lupantech/Eubi… Try out our system in your browser! 🤖app.eubiota.ai/chat #Eubiota #AI4Science #Gut #Microbiome #AgenticAI

13

2.1K

Pan Lu@lupantech·3 Mar

A major step forward for AI in biology: Eubiota is here. 🦠💊 Thrilled to see our AI co-scientist successfully validate new discoveries spanning the stress axis, antibiotics & metabolites. Big thanks to @james_y_zou for leading this visionary project! Project: eubiota.ai App: app.eubiota.ai 🚀

James Zou@james_y_zou

Thrilled to introduce #Eubiota: new AI co-scientist for microbiome research! Eubiota discovered 💊new microbial therapy reducing colitis inflammation 💊new anti-inflammation metabolites and more! All experimentally validated. Eubiota is trained w/ our multi-agent RL >> GPT5. Use it for free eubiota.ai Great job led by @lupantech @YifanGao15 and fantastic collaboration w/ @LabSonnenburg 🚀
