Xudong Guo

15 posts

Xudong Guo

@_traceur__

Researcher @Alibaba_Qwen | Ph.D. @Tsinghua_Uni | Prev. @Microsoft @UN @Princeton

Beigetreten Kasım 2023

145 Folgt37 Follower

Xudong Guo@_traceur__·30 Oca

Try evaluate your model here!

English

Xudong Guo@_traceur__·30 Oca

Planning + Reflection + Memory -> Long-Horizon Decision-Making

Qwen@Alibaba_Qwen

🚀 Introducing DeepPlanning — a new benchmark for long-horizon agent planning in real-world scenarios. Unlike step-by-step reasoning tasks, we focus on verifiable global constraints: time budgets, cost limits, and combinatorial optimization that must hold across the entire plan. ✈️ Multi-day travel w/ minute-level scheduling + hard time/budget caps 🛒 Complex shopping w/ coupon stacking & item bundling 🧠 Requires active info gathering, local constraint satisfaction & global optimality Even GPT-5.2, Claude 4.5, Gemini & Qwen3 struggle significantly. Perfect for evaluating Agent Planning / Tool Use / Long-Horizon Reasoning. Paper: arxiv.org/pdf/2601.18137 Leaderboard: qwenlm.github.io/Qwen-Agent/en/… Hugging Face Dataset: huggingface.co/datasets/Qwen/… ModelScope Dataset: modelscope.cn/datasets/Qwen/…

English

Xudong Guo@_traceur__·30 Oca

@Shawnhhhhh There is no separate memory module, but we introduce the memory capacity to both the new model Max and the Qwen-Chat platform.

English

v587@Shawnhhhhh·27 Oca

@_traceur__ 记忆模块是集成在模型内部还是平台级别？

中文

Xudong Guo@_traceur__·27 Oca

Try agentic memory here: chat.qwen.ai

Qwen@Alibaba_Qwen

🚀 Introducing Qwen3-Max-Thinking, our most capable reasoning model yet. Trained with massive scale and advanced RL, it delivers strong performance across reasoning, knowledge, tool use, and agent capabilities. ✨ Key innovations: ✅ Adaptive tool-use: intelligently leverages Search, Memory & Code Interpreter without manual selection ✅ Test-time scaling: multi-round self-reflection beats Gemini 3 Pro on reasoning ✅ From complex math (98.0 on HMMT Feb) to agentic search (49.8 on HLE)—it just thinks better. 🧠 Think deeper. Solve harder. Try the adaptive reasoning experience now: chat.qwen.ai Completions API: modelstudio.console.alibabacloud.com/ap-southeast-1… Responses API: alibabacloud.com/help/en/model-… blog: qwen.ai/blog?id=qwen3-…

English

124

Xudong Guo retweetet

Qwen@Alibaba_Qwen·25 Ara

Merry Qwristmas! 🎄🎁 Huge thanks for all the love and support this year. Get ready for Qwen’s New Year surprises! 🎆✨ See you next year — with even more Qwen magic. ✨

English

609

32.1K

Xudong Guo retweetet

Junyang Lin@JustinLin610·23 Ara

thanks for the great eval that helps our research!

Lightwheel@LightwheelAI

RoboFinals is Lightwheel's industrial grade evaluation platform for measuring Embodied AI model capabilities beyond academic benchmarks. It enables faster iteration, bottleneck diagnosis, and reliable measurement of real capability gains as models move toward real-world deployment. We’re excited to have @Alibaba_Qwen using RoboFinals for high-throughput, industry-aligned evaluation of its frontier embodied AI models. RoboFinals enables Qwen to rapidly iterate, diagnose bottlenecks, and measure real capability gains beyond academic benchmarks. Besides, Qwen plays a partner in stress-testing RoboFinals and shaping its evolution into an industry standard benchmark for evaluating robotics foundation models. #Lightwheel #Qwen #RoboFinals #EmbodiedAI #Robotics #Simulation #Evaluation

English

7.8K

Xudong Guo retweetet

Qwen@Alibaba_Qwen·27 Kas

🏆 We are incredibly honored to announce that our paper, "Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free" has received the NeurIPS 2025 Best Paper Award! A huge congratulations to our dedicated research team for pushing the boundaries of AI. Read more: blog.neurips.cc/2025/11/26/ann…

English

384

2.9K

482.9K

Xudong Guo@_traceur__·17 Kas

Not just a chat app, but an agent with more exciting features on the way...

Qwen@Alibaba_Qwen

10,000,000 users creating with Qwen Chat — and we’re just getting started. From here, let’s begin — chat.qwen.ai 🚀

English

Xudong Guo@_traceur__·19 Eki

@Pokee_AI Thx for the reminder! It works now.

English

Pokee AI@Pokee_AI·19 Eki

@_traceur__ Hi Xudong! We tried to message you with the early access information, but unfortunately your DMs are closed to the public. Do you mind opening those up so that we can get you the early access details? Thanks so much!

English

Pokee AI@Pokee_AI·17 Eki

🚨 We’re reaching out to a few of you in DMs with early access to something huge. This is one of the first major open-source drops of its kind in the U.S., and it’s almost here. If you’ve already heard from us, you’re in. If not, no worries! we still have a few spots left before launch next week. 👇 Drop “POKEE” in the comments to lock in early access before we go live!

English

1.1K

950

166.2K

Xudong Guo retweetet

Qwen@Alibaba_Qwen·15 Eki

🧠 Meet Your AI Memory Unlock richer, more personal experiences—Qwen Chat Memory uses your context and history to tailor every interaction, so everything feels made just for you. 🔖 Stores meaningful and important memories about you 🔍 Recalls past interaction relevant to the current context ✨ Transforms your history into deeply personalized experiences Your past, remembered. Your future, tailored. 🎯 Try it now：chat.qwen.ai

English

804

82.8K

Xudong Guo retweetet

Junyang Lin@JustinLin610·8 Eki

in case u don't know, i set up a small team for robotics and embodied ai inside qwen. multimodal foundation models are now being transformed to foundation agents that can leverage tools and memory to perform long-horizon reasoning thanks to reinforcement learning. they should definitely step from virtual world to physical world!

English

952

104.2K

Xudong Guo retweetet

GosuCoder@GosuCoder·24 Eyl

Qwen 3 Max is no joke Seriously I used it all day today in RooCode and opencode and it is really really good. It does well at: 1. refactoring tasks 2. finding and fixing bugs 3. 0 - 1 new things 4. Decent at design, much better than the preview version 5. Tool calling, one of the highest scores i've gotten yet. Excited about putting this video together

English

723

64.9K

Xudong Guo retweetet

Kaixuan Huang@KaixuanHuang1·10 Tem

Introducing SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths 📢 (ES-FoMo @ ICML2024) An improved version of Speculative Decoding that further boosts your speedup! 📈 arxiv.org/abs/2405.19715 🧵 [1/n]

English

103

26.5K

Xudong Guo retweetet

Mengdi Wang@MengdiWang10·14 Mar

How to capitalize #GenerativeAI and #diffusion models for modeling complex data and structured optimization? From images to proteins? Check my talk "Diffusion models for Generative Optimization" at @broadinstitute , Harvard, MIT last week. Youtube: youtube.com/watch?v=hDRDx5…

YouTube

English

246

26K

Entdecken

@Shawnhhhhh @Pokee_AI @broadinstitute @elonmusk @BarackObama @taylorswift13 @cristiano @BillGates