Zach Xu

31 posts

@nehzux

PhD @UChicagoCS @dsi_uchicago Working on Language Model @togethercompute

San Francisco · Joined July 2015
1.3K Following · 153 Followers
Zach Xu retweeted
Together AI @togethercompute
Introducing DeepSeek V4 Pro, a long-context model with hybrid attention, three reasoning modes, and SOTA coding performance. AI natives can now use DeepSeek V4 Pro on Together AI and benefit from reliable inference for long-horizon coding and agentic workflows.
Zach Xu retweeted
James Zou @james_y_zou
Excited to share our new papers at #ICLR2026 on (multi-)agents, efficient reasoning, long context, better tokenizers and scientific applications🚀 My awesome students and collaborators will be presenting them at the main conference this week; check it out! 👇
Zach Xu @nehzux
Excited to share our #ICLR2026 work! One thing I learned from this project is that long-context reasoning does not fail for just one reason. Some failures come from the model getting “foggy” as context grows, some from breaking apart information that really needs to stay connected, and some from the final aggregation step. That distinction turned out to be surprisingly useful in practice: in the right regime, a carefully planned divide-and-conquer system can outperform a much stronger model reading everything in one shot. Scaling context length alone is not enough. We also need better ways to harness long context. Huge thanks to my great collaborators and mentors: @ShangZhu18 @JueWANG26088228 @JunlinWang3 @ben_athi @Chi_Wang_ @james_y_zou @ce_zhang
Together AI @togethercompute

New from Together Research: a smaller model using divide & conquer can match or beat GPT-4o single-shot on long context tasks. Paper accepted at ICLR 2026. Read more in the 🧵

Zach Xu retweeted
James Zou @james_y_zou
Super excited to release our platform for AI agents to solve open science problems! einsteinarena.com Send your agents to compete and collaborate w/ our Einstein agent, Feynman agent and more! Just ask your agent to read einsteinarena.com/skill.md and that's it
James Zou @james_y_zou

We created AI agents based on scientists' personas (eg Einstein, Feynman) and built a Kaggle-like platform for them to freely post ideas, compete and collaborate. In 30 mins, agents discovered the best new solution to the Erdos min overlap problem. Great job by @federicobianchy @ykwon_0407! The solution is here github.com/togethercomput…

Zach Xu retweeted
Tao Feng (Attending NeurIPS 2025 in San Diego)
We release OpenClaw Router: Production-Ready LLM Routing 🚀
Code: github.com/ulab-uiuc/LLMR…
📦 PyPI: pypi.org/project/llmrou…

🔥 Meet OpenClaw Router
Deploy LLMRouter as an OpenAI-compatible API with one command. Seamlessly integrate with Slack, Discord, WhatsApp via OpenClaw. Support multimodal understanding: route based on images, audio, video, not just text.
pip install llmrouter-lib && llmrouter serve

💸 Why OpenClaw Router?
Why pay GPT-5 prices for "What's the weather?" Smart routing = Significant Token Savings. Simple query → Cheap model. Complex query → Powerful model. Save 30-50% on inference costs without sacrificing quality.

🧠 Train Your Own Router
OpenClaw Router isn't just a server; it's a learning system. Train personalized routers on your own data, tailored to your domain. Every user feedback, every usage pattern feeds back into router training, so you can continuously iterate toward a more user-friendly, cost-efficient OpenClaw.
✅ Routing Memory: RAG-powered decisions that learn from history
✅ Personalized Routing: Adapts to individual user preferences
✅ Feedback Loop: User interactions improve routing over time
✅ 16+ Strategies: KNN, SVM, MLP, BERT, Graph, RL, Agentic (switch with one flag)

Route smarter. Train your own. Save more. 🚀
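The "simple query → cheap model, complex query → powerful model" idea in the post above can be illustrated with a toy sketch. This is not LLMRouter's API or any of its listed strategies; it is a hand-written keyword-and-length heuristic, and the names `route_query`, `Route`, `HARD_MARKERS`, `cheap-model`, and `powerful-model` are all hypothetical placeholders chosen for illustration.

```python
from dataclasses import dataclass

# Hypothetical keywords that we assume signal a harder task.
HARD_MARKERS = {"prove", "debug", "refactor", "derive"}


@dataclass(frozen=True)
class Route:
    model: str   # placeholder model name, not a real endpoint
    reason: str  # why this route was chosen


def route_query(query: str, max_cheap_words: int = 12) -> Route:
    """Pick a model tier using a crude keyword and length heuristic."""
    # Normalize tokens so "Debug," and "debug" match the same marker.
    words = [w.strip(".,?!").lower() for w in query.split()]
    if any(w in HARD_MARKERS for w in words):
        return Route("powerful-model", "hard-task keyword")
    if len(words) > max_cheap_words:
        return Route("powerful-model", "long query")
    return Route("cheap-model", "short, simple query")
```

A learned router would replace the heuristic with a trained classifier; the sketch only shows where that routing decision sits in the request path.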
Zach Xu retweeted
Together AI @togethercompute
Most AI models solve data science tasks without looking at the data; they're just recalling training patterns. DSGym is a unified framework for evaluating and training agents on real data analysis and ML pipeline tasks. 85-96% of scientific task failures = domain knowledge gaps. But a 4B model trained with DSGym on 2K examples beats models 60x larger. Read more and find the paper: together.ai/blog/dsgym
Zach Xu retweeted
Kaitlyn Zhou @KaitlynZhou
Internship opportunity! Please share! 📣 I'm looking to hire an intern in human-centered NLP for the agents team @togethercompute. Come work on frontier AI systems that tackle complex agentic tasks! The research direction is open, and we are looking to publish in NLP and HCI venues.
Zach Xu retweeted
Gautam Kamath @thegautamkamath
Isabelle Guyon (Google DeepMind) is giving an invited talk at #AAAI2026 on Saturday, January 24 at 8:30 AM, titled "AI in science and technology: the future in our hands." CC @RealAAAI @hadi_hoss
Zach Xu retweeted
Chi Wang @Chi_Wang_
Claude Cowork proves the interest in AI coworkers. Orion (meetorion.app) takes the autonomous working experience to another level:
• Works with and enhances your experience with your favorite apps (messages, docs, emails, sheets, files, calendars…) and AI tools (ChatGPT, Claude Code, Gemini, Grok…)
• Supervises workflows when you are away, from vibe coding to media creation
• Coordinates AI agents across all your devices (macOS, Windows, VM), each operating one device independently
Your personal AI superpower. New surprising ways of vibe working every day - I just tried using Orion to add subtitles to the video and it worked! Watch it handle multiple tasks autonomously from a single cmd+k.
Zach Xu retweeted
Arian Khorasani 🦅 @Arian_Khorasani
📢 Our @NewInML workshop is happening today in Room Upper 31ABC, San Diego Convention Center, starting at 12 PM! If you're at @NeurIPSConf, it's a great opportunity to join us! We also have amazing speakers! We're looking forward to welcoming you! #NeurIPS2025
Arian Khorasani 🦅 @Arian_Khorasani

🚨 New in ML Workshop at @NeurIPSConf We're so excited to invite you to the New In ML Workshop (@NewInML), taking place on Tuesday, December 2nd, 2025, at the San Diego Convention Center! Great opportunity, specifically for people who are new in machine learning! Details🧵

Haozhi Qi @HaozhiQ
I will join UChicago CS @UChicagoCS as an Assistant Professor in late 2026, and I’m recruiting PhD students in this cycle (2025 - 2026). My research focuses on AI & Robotics, including dexterous manipulation, humanoids, tactile sensing, learning from human videos, robot systems, and anything needed to make robots truly work and improve everyday life. I also place strong emphasis on open source. Check my homepage to learn more: haozhi.io. Please reach out if you are interested! The deadline is Dec 11th. Link: tinyurl.com/uchiapp.
Zach Xu retweeted
Arian Khorasani 🦅 @Arian_Khorasani
🚨 New in ML Workshop at @NeurIPSConf We're so excited to invite you to the New In ML Workshop (@NewInML), taking place on Tuesday, December 2nd, 2025, at the San Diego Convention Center! Great opportunity, specifically for people who are new in machine learning! Details🧵
Zach Xu retweeted
NeurIPS Conference @NeurIPSConf
NeurIPS 2025 is excited to announce this year’s Affinity Events — workshops and socials that connect communities, spark discussion, and highlight new ideas across AI and ML. Events will take place throughout the week in San Diego, with a joint gathering in Mexico City. For more details check out our latest blog post! 👉 blog.neurips.cc/2025/10/08/ann…
Zach Xu retweeted
Together AI @togethercompute
🛡️ VirtueGuard is LIVE on Together AI 🚀 AI security and safety model that screens input and output for harmful content:
⚡ Under 10ms response
🎯 89% accuracy vs 76% (AWS Bedrock)
🧠 Context-aware - adapts to your policies, not just keywords
👇
Zach Xu retweeted
Chi Wang @Chi_Wang_
🚀 Meet MassGen! 🛠️ An open-source project for multi-agent scaling. Inspired by @grok Heavy & Gemini DeepThink. Enable parallel intelligence sharing, iterative refinement & consensus across agents. @GoogleAI @OpenAI @xai MVP out now—star & feedback! 👇 github.com/Leezekun/MassG…
Zach Xu retweeted
James Zou @james_y_zou
📢 New conference where AI is the primary author and reviewer! agents4science.stanford.edu
Current venues don't allow AI-written papers, so it's hard to assess the +/- of such works 🤔 #Agents4Science solicits papers where AI is the main author w/ human advisors.
💡 Initial reviews by LLM reviewers w/ final assessment + selection by human experts.
💡 Submissions are asked to clearly document AI contribution.
💡 All submissions/reviews will be public to enable transparent study of the strengths and limitations of AI as researcher and reviewer.
We expect AI will make mistakes and it will be instructive to study these in the open! Many thanks to the fantastic co-organizers and expert advisory board! Please see the website for more information.
Zach Xu @nehzux
Bottom line: "Divide and Conquer" isn't a silver bullet, but with a principled strategy, it's a powerful pathway to handling massive contexts. Our framework tells you when and why. Dive into the details in our new paper! Link: arxiv.org/abs/2506.16411
Zach Xu @nehzux
🤝 Task Noise: cross-chunk dependencies that can't be handled by processing each segment in isolation.
🤯 Model Noise: the model's performance degradation as the input length increases.
🧩 Aggregator Noise: incorrect combination of partial results from each chunk.
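To make the task-noise idea concrete, here is a minimal sketch of a chunk-then-aggregate pipeline. It is not the paper's framework: the "worker" is a plain Python function standing in for a model call, and the names `split_into_chunks`, `divide_and_conquer`, `count_word`, and `count_bigram` are hypothetical. Counting a single word has no cross-chunk dependency, so chunking is lossless; counting a two-word phrase can lose matches that straddle a chunk boundary.

```python
from typing import Callable, List


def split_into_chunks(text: str, chunk_size: int) -> List[str]:
    """Split text into consecutive chunks of at most chunk_size words."""
    words = text.split()
    return [" ".join(words[i:i + chunk_size])
            for i in range(0, len(words), chunk_size)]


def divide_and_conquer(text: str, chunk_size: int,
                       worker: Callable[[str], int],
                       aggregator: Callable[[List[int]], int]) -> int:
    """Run worker on each chunk in isolation, then combine the partial results."""
    partials = [worker(chunk) for chunk in split_into_chunks(text, chunk_size)]
    return aggregator(partials)


def count_word(chunk: str) -> int:
    """Chunk-local task: occurrences of the single word 'disk'."""
    return chunk.split().count("disk")


def count_bigram(chunk: str) -> int:
    """Task with adjacency dependencies: occurrences of the phrase 'disk full'."""
    words = chunk.split()
    return sum(1 for a, b in zip(words, words[1:]) if (a, b) == ("disk", "full"))


text = "x disk full y disk full"
word_total = divide_and_conquer(text, 2, count_word, sum)        # matches the full-text count
bigram_chunked = divide_and_conquer(text, 2, count_bigram, sum)  # one phrase is split across chunks
bigram_full = count_bigram(text)                                 # reference count on the whole text
```

With a chunk size of 2 the first "disk full" is cut in half, so the chunked phrase count falls short of the full-text count, while a chunk size of 3 happens to keep both phrases intact. Model noise and aggregator noise are not modeled here, since both the worker and the `sum` aggregator are exact.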