Cua
662 posts

Cua
@trycua
Open-source infrastructure for Computer-Use Agents // YC X25

Our ability to measure AI has been outpaced by our ability to develop it, and this evaluation gap is one of the most important problems in AI. Today we're launching Open Benchmarks Grants — a $3M commitment to fund open benchmarks for frontier AI and close the evaluation gap. Grateful to be partnering with @HuggingFace, @togethercompute, @PrimeIntellect, Factory HQ, @harborframework, and @PyTorch to back the teams building these benchmarks! 🚀


Trending on GitHub again in Python, right next to our friends at @Prince_Canuma - thank you for 12k ⭐ - your support means everything. Chef @trycua cuala will keep cooking releases this week👨🍳🐨


Solution to run multiple clawdbots on my Mac Studio found: lume! 🔥 Gonna try later today!

A bunch of you asked about our Remotion setup after the article. It's now open-source: github.com/trycua/launchp… • Video templates for product launches • Shared animation components • Works with Claude Code + Remotion skills • How we made the Cua-Bench video in 2 hours



We've been using Cua-Bench internally—and with customers—for the last few months to evaluate every computer-use agent we deploy. Today it's open-source. 15 public tasks, 40 variations, adapters for OSWorld and Windows Agent Arena. One CLI, self-hostable.

We've been using Cua-Bench internally—and with customers—for the last few months to evaluate every computer-use agent we deploy. Today it's open-source. 15 public tasks, 40 variations, adapters for OSWorld and Windows Agent Arena. One CLI, self-hostable.








