
ULIVZ
266 posts

@_ulivz
Agent. ByteDance Seed. Previously Web Infra, Alipay.




Another DeepSeek moment. This is the world’s first actual smart phone. It’s an engineering prototype of ZTE’s Nubia M153 running ByteDance’s Doubao AI agent fused into Android at the OS level. It has complete control over the phone. It can see the UI, choose/download apps, tap/type, call, and run multi-step task chains. Here I just say (in English) “find someone to wait in line for me” (something you can do in China), and it picks which app to open, configures the job, and hands me one confirm screen. I wouldn’t otherwise know how to do this, and here the phone just did it in a matter of seconds.




We can finally share UI-TARS-2 🥳🥳, a native GUI agent trained with multi-turn agent RL ⚡️⚡️
Key highlights (all-in-one model!):
💻 Computer Use: 47.5 OSWorld · 50.6 WindowsAgentArena
📱 Phone Use: 73.3 AndroidWorld
🛜 Browser Use: 88.2% Online-Mind2Web
🎮 Gameplay: ~60% human on 15 titles · strong on LMGame-Bench
🧑‍💻 Terminal Use: 68.7 SWE-Bench · 45.3 TerminalBench
🔨 Tool Use: 29.6 BrowseComp
Hybrid flows: GUI clicks + terminal cmds + API calls in one trace
Paper: arxiv.org/abs/2509.02544
Demo: seed-tars.com/showcase/ui-ta…



We have a new member in the Rstack family 🦀 Introducing Rslint - a TypeScript-first linter written in Go (powered by typescript-go, not Rust 🙃) Currently in experimental stage - check out the repo's README for more details: github.com/web-infra-dev/…


We have released Agent TARS. Machine Vision!

Comet is here. A web browser built for today’s internet.
