Yaowei Zheng

68 posts

Yaowei Zheng banner
Yaowei Zheng

Yaowei Zheng

@hiyouga_dev

Founder @llamafactory_ai, CTO @prismshadow_ai Building platform for next-generation AI. Opinions are my own.

Millennium Science School Katılım Temmuz 2020
86 Takip Edilen40 Takipçiler
JustJerry
JustJerry@JustJerry121·
@hiyouga_dev Do you usually keep the CSS inline/tiny, or is plain semantic HTML already enough for most outputs?
English
1
0
0
23
Yaowei Zheng
Yaowei Zheng@hiyouga_dev·
A small product thought from today: LLMs naturally default to Markdown, but it is not always the easiest format to read. Ask for a minimal one-page HTML instead, and with only 1.1~1.2x token cost, the output feels much cleaner.
Yaowei Zheng tweet mediaYaowei Zheng tweet media
English
2
1
2
106
fullstack
fullstack@DavidFSWD·
@teortaxesTex don't use openrouter for anything production use the actual original company, like deepseek. they cram scamming LLM providers that use really bad quants.
English
1
0
0
53
Eric Jang
Eric Jang@ericjang11·
.@tinkerapi by TML is a highly underrated product. It should be part of the “starter post training” infra for any newco
English
8
9
145
18.1K
Yaowei Zheng
Yaowei Zheng@hiyouga_dev·
We can basically think of an agent as a GitHub repo. The LLM is an executable inside it. The context is a set of text files. The harness is the code that manages how context and the LLM interact. Apply harness engineering to this repo, and you get a self-improving agent.
English
0
0
1
36
Yaowei Zheng
Yaowei Zheng@hiyouga_dev·
Some providers on OpenRouter cannot even decode tool calls correctly. I ran into this issue while testing the Qwen3.6-35B-A3B on AgentHub, so I had to block these providers ☹️ The services we commonly use are a huge black box. I truly trust nothing but the official provider.
Yaowei Zheng tweet mediaYaowei Zheng tweet media
English
0
0
1
52
Yaowei Zheng retweetledi
Bowen Wang
Bowen Wang@BowenWangNLP·
RLVR has become the recipe for agentic post-training. But for Computer-Use Agents, the bottleneck is not the algorithm, it is the data. 🐌 🚀 We introduce CUA-Gym: a scalable, lightweight synthesis engine that turns arbitrary task queries into verifiable RLVR data for computer-use agents. The largest open CUA RLVR dataset to date: 🎯 32,122 verifiable RLVR tasks with programmatic setup scripts + rewards 🌐 110 environments: 16 desktop apps + 94 synthesized mock web apps 🏆 Qwen3.5-based CUA models trained with GSPO reach 72.6% on OSWorld-Verified and 56.6% on WebArena 📄 Paper: huggingface.co/papers/2605.25… 🏠 Homepage: cua-gym.xlang.ai 🤗 Dataset: huggingface.co/datasets/xlang… 💻 Codebase: github.com/xlang-ai/CUA-G… 🧩 Environments: github.com/xlang-ai/CUA-G… 🧵[1/6]
English
15
82
441
81.7K
Yaowei Zheng retweetledi
Weiran Yao
Weiran Yao@iscreamnearby·
Introducing CHI-Bench on @huggingface: the world’s first long-horizon healthcare benchmark for AI agents. 75 real healthcare workflows + 20 apps + 200+ MCP tools + 1,290 skills + process / outcome rewards huggingface.co/datasets/actav… Any questions, lmk!
English
7
26
134
24.9K
JustJerry
JustJerry@JustJerry121·
When building a project, choosing the right technology stack is very important. I suggest that you first clarify the product requirements, and then repeatedly communicate with AI about the choice of the technology stack. If the technology stack is wrong, the subsequent workload will be unimaginably amplified.
English
1
0
3
121
Yaowei Zheng
Yaowei Zheng@hiyouga_dev·
@JustJerry121 If you are building an AI app for a specialized use case, you can use statistics from other users to improve the initial context.
English
0
0
1
23
JustJerry
JustJerry@JustJerry121·
做 AI 产品时,我现在会先看一个很朴素的问题: 用户开口前,产品已经知道了多少上下文? 如果每次都要用户重新讲背景,再聪明的模型也像第一天入职的同事。
中文
1
0
4
133
Yaowei Zheng
Yaowei Zheng@hiyouga_dev·
Biggest lesson from building an AI startup: If you're building a full agent system, don't build a web app. Local is the best interface. That's why Claude Code and Codex chose local. Web only makes sense when you're building one narrow, specialized feature.
English
0
0
2
256
Yaowei Zheng
Yaowei Zheng@hiyouga_dev·
Whale maxxing + deep drink 😆🍻 What a night at the Blue Whale event.
Yaowei Zheng tweet media
English
0
0
1
37
Yaowei Zheng retweetledi
vLLM
vLLM@vllm_project·
🎉 Congrats to the VeRL-Omni team on the pre-release of a general RL post-training framework for multimodal generative models. Built on verl + vllm-omni. vLLM-Omni handles the multimodal rollout with step-wise continuous batching and embedding caching; vLLM serves the VLM-as-judge / OCR reward model, overlapped with rollout and training. In the Qwen-Image OCR demo, moving the reward to its own GPU cuts per-step wall-clock by ~14%. Released: Qwen-Image with FlowGRPO / MixGRPO / GRPO-Guard. BAGEL and Qwen3-Omni-Thinker PR-ready. Excited to push multimodal generative RL forward together with VeRL-Omni and the broader community. 🙌 📖 vllm.ai/blog/2026-05-1… 🔗 github.com/verl-project/v…
vLLM tweet media
English
6
24
159
22.2K
Yaowei Zheng
Yaowei Zheng@hiyouga_dev·
When we say "meta harness" or "agentic harness engineering", we may miss what "harness" means. It is not just code assets for building agents. Harness is the top-level concept: leveraging intelligence inside a Ralph Loop until the human is satisfied. ghuntley.com/loop/
English
0
0
0
41