Yuhao Yang

87 posts

@itsyuhao

CS PhD Student @HKUniversity @hkudatascience @TencentGlobal @rhymes_ai_

Hong Kong · Joined October 2022
534 Following · 500 Followers
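Yuhao Yang@itsyuhao·
@isdaonono @AI_jacksaku Actually, it isn't limited to desktop GUI applications. People in the community have already used cli-anything to turn their own custom agents into CLIs for OpenClaw, and others have converted the Web APIs of Zoom and AnyGen. There are a lot of ways to use it.
Chinese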
2
0
0
119
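daonono@isdaonono·
@AI_jacksaku Have you actually used it? This thing only supports desktop GUI applications. And here you are saying [as long as it's open-source software, as long as there's a codebase, you can open a backdoor for agents into all of it]
Chinese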
1
0
23
3.6K
Yuhao Yang reposted
阿川 | AI thinking@AI_jacksaku·
This is seriously impressive... As long as it's open-source software, as long as there's a codebase, you can open a backdoor for agents into all of it
Chao Huang@huang_chao4969

Introducing CLI-Anything🚀 Making ALL software agent-native with one command.

Today's software serves humans👨‍💻. Tomorrow's users will be agents🤖. CLI-Anything: bridging the gap between AI agents and the world's software. One command line to make any software agent-ready for OpenClaw, nanobot, Cursor, Claude Code, etc.

GitHub: github.com/HKUDS/CLI-Anyt…

🤔 Why CLI-Anything? CLI is the universal interface for both humans and AI agents:
- Structured & Composable - Text commands match LLM format and chain for complex workflows
- Lightweight & Universal - Minimal overhead, works across all systems without dependencies
- Self-Describing - --help flags provide automatic documentation agents can discover
- Proven Success - Claude Code runs thousands of real workflows through CLI daily
- Agent-First Design - Structured JSON output eliminates parsing complexity
- Deterministic & Reliable - Consistent results enable predictable agent behavior

💡 CLI-Anything's Vision: Building Agent-Native Software
- 🌐 Universal Access - Every software becomes instantly agent-controllable through structured CLI.
- 🔗 Seamless Integration - Agents control any application without APIs, GUI, rebuilding or complex wrappers.
- 🚀 Future-Ready Ecosystem - Transform human-designed software into agent-native tools with one command.

#CLIAnything #openclaw #nanobot #claudecode

Chinese
11
185
990
169.4K
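The "Self-Describing" and "Agent-First Design" points in the quoted announcement can be illustrated with a minimal sketch. Everything here is an assumption for illustration: `demo-cli`, its `resize` subcommand, and the `--json` flag are hypothetical and are not taken from CLI-Anything's generated output. The sketch only shows how a CLI built with Python's standard `argparse` gets `--help` text for free and can emit JSON for an agent to parse.

```python
import argparse
import json
from typing import List


def build_parser() -> argparse.ArgumentParser:
    # "demo-cli" and "resize" are hypothetical names used only for this sketch.
    parser = argparse.ArgumentParser(prog="demo-cli", description="Toy image tool")
    parser.add_argument("--json", action="store_true",
                        help="emit machine-readable JSON instead of prose")
    sub = parser.add_subparsers(dest="command", required=True)
    resize = sub.add_parser("resize", help="resize an image")
    resize.add_argument("path", help="input file")
    resize.add_argument("--width", type=int, required=True, help="target width")
    return parser


def run(argv: List[str]) -> str:
    # Parse the arguments and return the text the CLI would print.
    args = build_parser().parse_args(argv)
    result = {"command": args.command, "path": args.path,
              "width": args.width, "ok": True}
    if args.json:
        return json.dumps(result, sort_keys=True)
    return f"resized {args.path} to width {args.width}"


if __name__ == "__main__":
    print(build_parser().format_help())
    print(run(["--json", "resize", "a.png", "--width", "64"]))
```

An agent could read the help text once to discover the available subcommands, then rely on the JSON form for every later call.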
Yuhao Yang reposted
AnyGen@AnyGenIO·
AnyGen is now part of CLI-Anything ✨ Another amazing project from HKUDS. Auto-generates CLIs for any software, the universal interface for both humans and AI agents. 600+ stars in days.
→ Agents can now call AnyGen to generate slides, docs, websites, diagrams
→ One command: create, download, done
→ Real .pptx, .docx, live sites back
Glad to be a part of it! github.com/HKUDS/CLI-Anyt…
Chao Huang@huang_chao4969

Introducing CLI-Anything🚀 Making ALL software agent-native with one command. GitHub: github.com/HKUDS/CLI-Anyt…

English
3
2
9
2.1K
Yuhao Yang@itsyuhao·
@ysu_ChatData Thanks! As for your question, I believe a lot of professional software can run in modern sandboxes, so isolation mechanisms are definitely feasible. Yep, safety is so important.
English
0
0
0
42
Yongrui Su@ysu_ChatData·
@itsyuhao This is a fun direction. The killer feature is when the agent can drive the terminal plus real apps, not just call tools behind the scenes.

One question: how do you sandbox it so a prompt injection cannot run destructive commands, especially when it has real shell access?
English
1
0
1
55
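Yuhao Yang@itsyuhao·
Happy to introduce CLI-Anything: github.com/HKUDS/CLI-Anyt… If you've ever wished agents could use real software the way they use the terminal, that's what we're trying to push forward!

Has anyone noticed a pattern? More and more people hand their hardest tasks to OpenClaw, and when the Claws get to work, they all pick Claude Code: not Cursor, not VS Code, and certainly not some GUI agent.

We thought it over and concluded there is one core reason: CC's CLI is just that good. Stateful, self-describing, easy to use, and powerful. Agents don't have to "wrestle" with it; they pick it up and get to work.

That led us to a question: the vast majority of real professional software has no CLI like this. GIMP, Blender, LibreOffice, OBS: every one is a heavyweight tool, yet agents basically can't touch them. The ceiling is a screenshot-and-click GUI agent, and the results... if you know, you know 🥲.

So we built CLI-Anything. Drop in a codebase:
→ /cli-anything ./gimp and your agent gets a full CLI for GIMP
→ /cli-anything ./blender, same for Blender
→ LibreOffice, OBS Studio, Shotcut, Audacity, Inkscape, Kdenlive... all work

It auto-generates a complete CLI: REPL interactive mode, structured JSON output, undo/redo, and direct calls to the real software backend. Not a toy wrapper; it really calls GIMP / Blender / LibreOffice to render.

So far we have built demos for 8 applications, with all 1,298 tests passing. People in the community are already running /cli-anything on more practical and important software, and it generalizes better than we expected, which was honestly a pleasant surprise. Maybe the whole coding-based ecosystem flywheel is spinning even faster than we imagined?
Chinese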
3
3
27
2.1K
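The post above lists undo/redo as one feature of the generated CLIs but does not say how it is implemented. As a minimal sketch of the general technique only, here is the classic two-stack approach over state snapshots. The `History` class and its method names are hypothetical and are not CLI-Anything's code.

```python
class History:
    """Undo/redo over state snapshots using two stacks (illustrative only)."""

    def __init__(self, initial):
        self.state = initial
        self._undo = []  # states we can go back to
        self._redo = []  # states we can go forward to

    def apply(self, new_state):
        # Record the current state, then move to the new one.
        self._undo.append(self.state)
        self._redo.clear()  # a fresh edit invalidates the redo branch
        self.state = new_state

    def undo(self):
        if not self._undo:
            return False
        self._redo.append(self.state)
        self.state = self._undo.pop()
        return True

    def redo(self):
        if not self._redo:
            return False
        self._undo.append(self.state)
        self.state = self._redo.pop()
        return True
```

Keeping whole snapshots is the simplest design. A real editor backend would more likely store reversible commands to avoid copying large documents.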
Yongrui Su@ysu_ChatData·
@itsyuhao love seeing CLIs that treat the agent loop as a first class interface. do you log tool calls and outputs so runs are replayable and debuggable?
English
1
0
1
508
Samuel Ekpe@samuelekpe·
@huang_chao4969 Can't AI zero-shot this for every piece of software, since all the software specs and details are online?
English
1
0
0
723
Yuhao Yang reposted
Chao Huang@huang_chao4969·
Introducing CLI-Anything🚀 Making ALL software agent-native with one command. GitHub: github.com/HKUDS/CLI-Anyt…
English
48
171
892
237.9K
Yuhao Yang@itsyuhao·
@Chenyang_Lyu It's actually case-by-case. It depends on the app and on what percentage of its original functionality you want the CLI to cover. Based on my own use, a single Claude Pro session can handle two medium-sized applications like GIMP and Audacity 🤔
English
0
0
1
39
Chenyang Lyu 吕晨阳@Chenyang_Lyu·
how many tokens would be used for creating one *software*?
Chao Huang@huang_chao4969

Introducing CLI-Anything🚀 Making ALL software agent-native with one command. GitHub: github.com/HKUDS/CLI-Anyt…

English
2
0
0
123
Yuhao Yang@itsyuhao·
Excited to introduce CLI-Anything: github.com/HKUDS/CLI-Anyt…

Ever noticed how people keep throwing their hardest tasks to OpenClaw, and the Claws are calling Claude Code, not Cursor, not VS Code, not even any GUI agent? We think it comes down to one thing: CC nailed the CLI. Stateful, self-describing, and powerful. Agents don't fight with it; they just use it.

And that got us thinking... most real software doesn't have this. GIMP, Blender, LibreOffice, OBS: incredibly powerful tools, but agents basically can't touch them. The best you can do is screenshot-based GUI agents, which... yeah.

So we built CLI-Anything. You point it at a codebase:
→ /cli-anything ./gimp, and your agent gets a full CLI for GIMP
→ /cli-anything ./blender, same for Blender
→ Works on LibreOffice, OBS Studio, Shotcut, Audacity, Inkscape, Kdenlive...

It auto-generates a full CLI: REPL mode, JSON output, undo/redo, real software backend calls. Not a toy wrapper. It calls actual GIMP/Blender/LibreOffice to render.

We've done 8 apps as demos so far, 1,298 tests all green. Also seeing from the community that people are using our /cli-anything on more and more practical and important software codebases! Honestly surprised how well it generalizes.

If you've ever wished agents could just use real software the way they use the terminal, that's what we're trying to build.
English
2
4
20
1.9K
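To make the "JSON output" point concrete from the agent's side, here is a minimal sketch of how a caller might run a command-line tool and parse its result. The actual command names and flags of a generated CLI are not given in the post, so `FAKE_CLI` below is a stand-in: a Python one-liner that prints JSON. `call_tool` is likewise a hypothetical helper.

```python
import json
import subprocess
import sys


def call_tool(argv, timeout=30.0):
    """Run a command and parse its stdout as a single JSON document."""
    proc = subprocess.run(argv, capture_output=True, text=True, timeout=timeout)
    if proc.returncode != 0:
        # Surface stderr so the caller (or agent) can see why the call failed.
        raise RuntimeError(f"command failed ({proc.returncode}): {proc.stderr.strip()}")
    return json.loads(proc.stdout)


# Stand-in for a generated CLI (assumption): prints a small JSON object.
FAKE_CLI = [
    sys.executable,
    "-c",
    'import json; print(json.dumps({"layers": 3, "ok": True}))',
]

if __name__ == "__main__":
    print(call_tool(FAKE_CLI))
```

Because the result is already a dictionary, the caller needs no screen scraping or regex parsing, which is the advantage over screenshot-based GUI agents that the post describes.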
Yuhao Yang reposted
Chao Huang@huang_chao4969·
We just dropped the technical report of AI-Trader🚀 - the fully autonomous trading benchmark where AI agents make real financial decisions without any human help. Check out our technical report to see how we built this systematic framework and witness the battle between leading AI models in our live trading arena.🤖⚔️

💻 Open Source: github.com/HKUDS/AI-Trader
📚 Technical Report: arxiv.org/abs/2512.10971
📊 Live Trading Dashboard: ai4trade.ai

This isn't a typical static benchmark. We throw AI models into live financial markets with actual economic consequences across US stocks, A-shares, and Crypto. Agents get minimal starting info and must independently search for market news, analyze data, and execute trades using MCP-based tools.

AI-Trader operates on a "minimal information paradigm" that forces true autonomous behavior:
• 💰 Real Capital & Markets: $10,000 USD for NASDAQ 100 stocks, 100,000¥ CNY for SSE 50 Chinese blue-chips, or 50,000 USDT for major cryptocurrencies
• 🚫 Zero Guidance: No pre-packaged data or human intervention throughout the process
• ⏰ 24/7 Trading: Continuous crypto markets, regular hours for stocks
• 🔒 Clean Data: Historical replay with automatic future information filtering
English
25
110
602
67.5K
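The "Clean Data" bullet mentions historical replay with automatic future information filtering. The post does not describe how AI-Trader does this, so the following is only a sketch of the general idea: during a replay, the agent sees nothing published after the simulated clock. `NewsItem` and `visible_at` are hypothetical names.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class NewsItem:
    published: datetime
    headline: str


def visible_at(items, now):
    """Return only the items published at or before the simulated time `now`."""
    return [item for item in items if item.published <= now]


if __name__ == "__main__":
    feed = [
        NewsItem(datetime(2025, 1, 1, 9, 0), "earnings preview"),
        NewsItem(datetime(2025, 1, 2, 9, 0), "earnings results"),
    ]
    # Replaying midday on 1 Jan: the 2 Jan item must stay hidden.
    for item in visible_at(feed, datetime(2025, 1, 1, 12, 0)):
        print(item.headline)
```

Applying the same cutoff to every data source (prices, news, search results) keeps a backtest from leaking information the agent could not have had at that point in time.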
Jiaxuan You@youjiaxuan·
🚨 RL for LLMs is finally accessible. Introducing OpenTinker: The first community-driven, open-source framework designed to democratize Reinforcement Learning for LLMs. github.com/open-tinker/Op…

Inspired by @thinkymachines's amazing Tinker, we realized the biggest bottleneck in agentic LLM research isn't the math; it's the setup. Current RL pipelines are messy. Configuring VeRL for every single experiment is a productivity killer. OpenTinker fixes it.

🛠 How OpenTinker Works: Decoupled Design of Server and Client
- Setup Once, Run Forever: Configure the OpenTinker backend on your GPU cluster once.
- Develop Locally: Define your RL environments directly on your laptop.
- Train on the Cloud: Simply point your local client to the backend. The cluster handles the compute; you handle the science.

📉 The 10x Development Efficiency
Thanks to our elegant architectural decomposition, OpenTinker reduces the time to develop a new RL training pipeline by at least an order of magnitude.

⚡ Turn Idle GPU Compute into Gold
Small labs often have underutilized hardware. OpenTinker turns your idle GPUs into an internal/external API service for
- RL Training
- SFT
- Inference

🎯 Who needs OpenTinker?
- Researchers tired of infrastructure hell.
- Labs needing to standardize workflows.
- Teams wanting to maximize hardware ROI.

Thanks to my amazing PhD student @realagi25 for leading the project. We are building the future of open RL infra. Be the first to build with us.

👇 Start Building with OpenTinker Now
🚀 Repo: github.com/open-tinker/Op…
🌐 Blog: open-tinker.github.io/opentinker-pag…

If you believe RL should be accessible to everyone, give us a star, repost this 🔄 post, and let us know what agents you plan to build!
English
15
147
1.1K
57.5K
Yuhao Yang@itsyuhao·
@bdsqlsz Interesting. Thought DiT for actions would support relatively high fps
English
0
0
1
93
青龍聖者@bdsqlsz·
I quickly tested it and looked at the code, and the model is based on a pattern of pause → inference → unpause → action. It can't run on real-time games.🥲
English
23
7
441
53.1K
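Jeff Liang@LiangJeff95·
I think video production is where agents are needed most, including script writing, clip generation, editing, scoring, and so on. With LLM agents in full swing, are there any good video-agent papers or products? I'd like to learn from them!
Chinese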
2
0
5
647
Yuhao Yang@itsyuhao·
Lessons learnt: Diversify your positions 👍
Chao Huang@huang_chao4969

AI-Trader Just Dropped 🚀
5 AI models trading $10K each, fully autonomously
DeepSeek vs GPT vs Gemini vs Qwen vs Claude
Zero human input. Pure AI decisions. NASDAQ 100 battlefield. Who rules the market?
🔗 GitHub: github.com/HKUDS/AI-Trader

🎯 What's Inside
• 100% Autonomous Trading - AI makes every decision
• 5-Way AI Battle - Best models compete head-to-head
• Live Performance Tracking - Real-time P&L dashboards
• Pure Tool-Driven - No human programming allowed
• Historical Backtesting - Scientific replay capability
• Open Architecture - Add your own AI strategies

#AITrading

English
1
0
3
635