Agent TARS

27 posts

Agent TARS

@agent_tars

https://t.co/1YBpC3VG6T | An open-source multimodal AI agent Offering seamless integration with a wide range of real-world tools.

Singapore เข้าร่วม Nisan 2025

17 กำลังติดตาม427 ผู้ติดตาม

Agent TARS@agent_tars·9 Eyl

🎉 Just dropped v0.3.0-beta.11! 💡 Real-time Thinking Duration Protocol 🗂️ MongoDB Provider integration 🔧 Agent UI Builder & CLI support ✨ Many Web UI optimizations Release Note: github.com/bytedance/UI-T…

English

2.3K

Agent TARS@agent_tars·9 Eyl

🎉 UI-TARS-desktop just hit #1 on GitHub Trending for TypeScript for the first time! 🚀 Huge thanks to our amazing community for the support. Let’s keep building together! 💪

English

Agent TARS@agent_tars·8 Eyl

We are excited to announce that recent releases introduces support for UI-TARS-2! 🥳 - Paper arxiv.org/abs/2509.02544 - Demo seed-tars.com/showcase/ui-ta… github.com/bytedance/UI-T…

English

8.4K

Agent TARS รีทวีตแล้ว

ULIVZ@_ulivz·29 Ağu

Did you ever feel that the current "Chat: Workspace 1:1" layout is sometimes a bit stiff? we tried to introduce the "Narrow Chat" mode in @agent_tars .

English

623

Agent TARS รีทวีตแล้ว

Yujia Qin@TsingYoga·22 Ağu

TARS just made an official website for itself, beyond merely GUI operation, MCP tool-use and Terminal-Use is finally integrated into TARS!

English

4.1K

Agent TARS รีทวีตแล้ว

ULIVZ@_ulivz·13 Ağu

Yes, we’ve just rolled out the Beta version about a month ago, Check it out: agent-tars.com/beta And guess what? A major release is coming soon. Stay tuned! ⏳

Supreme Leader Wiggum@ScriptedAlchemy

We have released agent tars. Machine Vision!

English

1.2K

Agent TARS รีทวีตแล้ว

ULIVZ@_ulivz·29 Tem

🎉 Just launched the Agent TARS Showcase! 🚀 Explore examples like AI Browser, CodeAct, Coding, MCP, Research & more. 🌟 Check them out and Replay your favorite! 👉 agent-tars.com/showcase

English

1.5K

Agent TARS รีทวีตแล้ว

ULIVZ@_ulivz·25 Tem

中文社区的朋友可以关注我们最新的 Agent TARS 介绍文档：bytedance.larkoffice.com/docx/KEbydDtCS…

中文

485

Agent TARS@agent_tars·12 Tem

✨ The details of Agent TARS's operation are being revealed bit by bit. Stay tuned for more exciting content! 👀

ULIVZ@_ulivz

An @agent_tars that supports displaying tool call duration is on the way.

English

471

Agent TARS รีทวีตแล้ว

晓@codingzx·10 Tem

如果不想花 $200 刀，体验 Perplexity Comet 浏览器的能力，可以考虑一下用 @midscene_ai 和 @agent_tars ，除了他们自己的搜索数据源上和产品交互类型上更丰富一些其实不会比 Midscene 和 Agent_tars 做的好太多，因为现在 SOTA 的 GUI agent 模型还是 doubao1.5 VL!

Perplexity@perplexity_ai

Comet is here. A web browser built for today’s internet.

中文

1.9K

Agent TARS@agent_tars·3 Tem

"GUI of CLI" is such a fun phrase! 🌟 Agent TARS CLI reflects our architectural evolution: moving from Electron's deep coupling of Agent & UI to a more flexible CLI + Web UI. Lightweight, fast iteration & almost all Electron's perks—just less product-y. 🚀 #AgentTARS

Orange AI@oran_ge

@_ulivz GUI 的 CLI 👍

English

382

Agent TARS@agent_tars·2 Tem

This looks pretty cool✨ benefiting from the new architecture of Agent TARS, the CLI can run headless 🖥️🤖, but the data is already stored locally 💾 and you can always view it through the Web UI 🌐👀 🚀 Quick Start: agent-tars.com/guide/get-star… 💻 Github: github.com/bytedance/UI-T…

ULIVZ@_ulivz

Agent TARS CLI 也有类似的能力，但不同的是，你可以 CLI Headless 运行，但随时随地，你可以唤起一个 Web UI 来查看这次 Agent 的运行过程，欢迎大家尝试 Quick Start: agent-tars.com/guide/get-star… Github: github.com/bytedance/UI-T…

English

1.6K

Agent TARS@agent_tars·29 Haz

Thanks to all the early core committers of Agent TARS Desktop: @sanyuan0704 @ycjcl @_ulivz @skychx @mylich, and all the early contributors in the community represented by @le0zh @lynxlangya ❤️

English

462

Agent TARS@agent_tars·29 Haz

Since we have released a brand new new Agent TARS CLI based on Seed1.5-VL, see agent-tars.com/beta , we have to say goodbye to the old Agent TARS Desktop github.com/bytedance/UI-T…

English

4.1K

Agent TARS@agent_tars·27 Haz

Thanks to the community for sharing. A good Context Engineering, like an environment, can inspire a smart LLM student to continuously create breakthroughs.

meng shao@shao__meng

Agent TARS 团队的设计原则关键认知 - 团队在构建这款多模态 AI Agent 时遵循三大设计原则：Context Engineering、Observable and Evaluable、Easy to Build Applications 三个设计原则概括来说： 1. 如何管理复杂的上下文：通过动态窗口、多级记忆和压缩策略，解决上下文溢出问题，确保长期运行 2. 如何让 Agent 可控：通过 Snapshot 和自动化评测，让 Agent 的运行透明且可验证 3. 如何降低开发门槛：通过 Agent Event Stream，让开发者能轻松构建应用，同时保持架构的灵活性 1. Context Engineering（上下文工程）核心理念：要让一个 AI Agent 能长时间稳定运行，上下文管理是关键。Agent TARS 的目标是打造一个能处理复杂、多轮任务的 Agent，尤其是在多模态场景（比如处理文本、图像、工具调用等）下，上下文可能会快速堆积，导致“内存溢出”。团队通过以下几个方面优化上下文管理： (1) 管理上下文窗口 LLM 的上下文窗口是有限的，比如常见的 128k token（大约是 128,000 个单词或字符片段）。在 Agent TARS 的运行中，每一轮任务都会生成新的上下文，包括用户的输入、Agent 的计划、工具调用结果等。如果不加优化，假设每次工具调用产生 5000 token 的数据，26 轮后上下文就会溢出。更别提多模态任务，比如处理一张高分辨率图片可能就占用 5000 token。为了解决这个问题，Agent TARS 采用了动态滑动窗口策略，根据不同模态（文本、图像等）分别管理上下文，优化数据的使用效率。比如，图片数据和文本数据可以有不同的保留策略，确保关键信息不丢失，同时避免溢出。 (2) MCP 的挑战 MCP 是 Agent TARS 用来连接 Agent 和外部工具的机制，类似于一个工具分发的“中间件”。它让 Agent 开发者可以专注于 Agent 逻辑，而工具开发者提供标准化的工具接口。但在实践中，MCP 带来了一些问题： · 参数不一致：有些工具的参数格式不标准，可能导致模型无法正确调用 · 结果过大：某些工具（比如从网页抓取 HTML 的工具）返回的数据量太大，一次调用就可能让上下文溢出。例如，调用 `browser_get_html` 可能返回几十万 token 的网页数据，直接“炸”掉上下文窗口为此，团队废弃了一些不高效的工具（比如 `browser_get_html`），转而使用更优的实现，比如基于 Readability 的 `browser_get_markdown`，通过分页等技术减少数据量。他们还得出一个结论：MCP 的真正价值在于**标准化的工具分发协议**，而不是让工具开发者随意注入复杂的上下文。未来，MCP 生态需要一个标准化的评估体系（Benchmark），来评判工具的质量，比如模型兼容性、上下文压缩效率等。 (3) 上下文压缩为了进一步优化，Agent TARS 设计了多级记忆机制，将上下文分为四层： · L0（永久记忆）：跨任务保留，比如用户初始输入和 Agent 的最终回答 · L1（会话记忆）：仅在当前任务中有效，比如任务计划 · L2（循环记忆）：仅在当前任务循环中有效，比如工具调用结果、屏幕截图 · L3（临时记忆）：临时的流式数据，比如实时生成的消息片段通过这种分层，Agent TARS 可以针对不同层级的上下文采取不同压缩策略，比如： · 选择性压缩：用智能算法减少不必要的消息内容 · 摘要生成：用大模型或小模型生成对话摘要，减少上下文体积此外，团队还在计划支持 Responses API，利用图像缓存等技术进一步提升多模态任务的性能。 2. Observable and Evaluable（可观测与可评估）核心理念：一个好的 Agent 框架需要让开发者清楚地看到 Agent 的运行过程（可观测），并能通过自动化测试评估其表现（可评估）。Agent TARS 在这方面做了两件事：Snapshot 机制和自动化评测。 (1) Snapshot 机制 Agent TARS 运行在一个复杂且多变的环境中，比如： · LLM 的输出随机性（同样的输入可能有不同回答） · 工具调用的不确定性（比如网络延迟或工具失败） · 环境初始化（比如启动浏览器或虚拟容器）的不稳定性这些因素让 Agent 的运行状态难以追踪。为了解决这个问题，Agent TARS 引入了 Snapshot 机制，可以在运行时保存 Agent 的环境状态（包括上下文、工具调用、模型输出等），然后通过“回放”来重现 Agent 的行为。这有点像程序员调试代码时保存的“快照”，可以用来检查问题、确保稳定性。 Snapshot 机制已经用在了 Agent TARS 的持续集成（CI）中，帮助团队发现了 10 多个潜在 bug，显著提升了开发效率。 (2) 自动化评测在 Alpha 版本中，Agent TARS 是一个 Electron 应用，只能靠人工测试，效率低下。Beta 版本引入了 Headless 模式，让自动化评测成为可能。团队参考了 OpenAI 的评测框架，开发了一个叫 browsecomp 的评测工具，可以通过 Python 和 TypeScript 跨进程调用，自动测试 Agent 的表现。此外，他们还在构建针对 MCP 工具的评测集，专门测试工具在不同模型上的兼容性和性能。未来，这些评测工具会整合成一个完整的 Benchmark 方案，供社区使用。 3. Easy to Build Applications（易于构建应用）核心理念：Agent TARS 要让开发者轻松构建自己的应用。为此，团队设计了 Agent Event Stream，一个基于事件流的机制，让 Agent 的运行过程可以被清晰地捕获和展示，方便开发者基于此构建 UI 或其他应用。 Agent Event Stream 是一个实时输出 Agent 运行状态的机制。比如，当你用 Agent TARS CLI 运行一个任务（比如查询“我的电脑型号”），你会看到一连串的事件流，包括： · Agent 的状态更新（比如“正在思考”） · 工具调用的细节（比如调用了 `getCurrentLocation` 工具） · 最终的回答（比如“你的电脑型号是 XXX”）这些事件流可以被用来构建灵活的 UI。开发者只需要基于这些事件流，编写一个渲染器（Renderer），就能创建自定义的界面。比如，你可以用 Web 技术实现一个炫酷的 Agent TARS Web UI，也可以替换成自己的 UI 风格。更厉害的是，Agent Event Stream 不仅用于 UI 渲染，还被用在 Context Engineering 中，帮助管理上下文（这点和前文提到的上下文工程无缝衔接）。这意味着，Agent TARS 的核心逻辑是围绕事件流构建的，开发者可以通过 SDK 轻松接入，感受这种设计的优雅。

English

830

Agent TARS รีทวีตแล้ว

ULIVZ@_ulivz·27 Haz

@karpathy +1 "contextual engineering" is the "art" It's hard to imagine that on the same day, the cognition we published on the Agent TARS release blog #cognition" target="_blank" rel="nofollow noopener">agent-tars.com/beta#cognition collided with the my LLM enlightenment teacher. Thank you for this era that allows us to connect with each other.

English

458

Agent TARS@agent_tars·26 Haz

Congrats, I have a new version

ULIVZ@_ulivz

Introducing Agent TARS Beta — a brand new and more powerful Agent TARS! - Agent TARS CLI - Browser Agent driven by Seed-1.5-VL - Native Streaming - Multimodal-friendly Web UI - Layered Agent architecture Blog: agent-tars.com/beta Quick start: agent-tars.com/quick-start

English

299

Agent TARS รีทวีตแล้ว

ULIVZ@_ulivz·25 Haz

We have a major release coming up

English

1.5K

Agent TARS รีทวีตแล้ว

WebAgentlab@webagentlab·21 Nis

🌟 Here are the top GUIAgent news of the Week - UI-TARS-1.5 - Cua MCP Server - FIRE-1 - Podcast w/ David Silver - Google PhD Fellowship Program - BrowseComp - 12x GUI Agent papers Here’s everything you need to know:

English

608

Agent TARS รีทวีตแล้ว

ULIVZ@_ulivz·21 Nis

The UI-TARS-Desktop v0.1.0 is designed for UI-TARS 1.5 ( seed-tars.com/1.5 ) and the new #BrowserUse mode is shipped 🌐 Checkout the release 👉 github.com/bytedance/UI-T…

Yujia Qin@TsingYoga

UI‑TARS Desktop v0.1.0 is now live! Kudos to all the web/app infra team! • Support for UI‑TARS‑1.5 model & browser automation • Integration with Volcengine Ark API service 🔽 Download: github.com/bytedance/UI-T… 🔽 Quick Start: github.com/bytedance/UI-T… 🔽 Model info: seed-tars.com/1.5/ 🔽 API service: console.volcengine.com/ark/region:ark…

English

1.6K

ค้นพบ

@midscene_ai @sanyuan0704 @ycjcl @_ulivz @skychx @mylich @le0zh @lynxlangya