OpenAdaptAI

70 posts

@OpenAdaptAI

Open source AI that automates tasks in desktop apps by observing human demonstrations. Mac/Win compatible. https://t.co/7BQASEeo82

Joined May 2023
0 Following · 420 Followers
OpenAdaptAI retweeted
OpenAdaptAI (@OpenAdaptAI)
New and improved! More coming soon.
0
0
2
73
OpenAdaptAI retweeted
Xinyuan Wang (@xywang626)
We are super excited to release OpenCUA, the first from 0 to 1 computer-use agent foundation model framework and open-source SOTA model OpenCUA-32B, matching top proprietary models on OSWorld-Verified, with full infrastructure and data.

🔗 [Paper] arxiv.org/abs/2508.09123
📌 [Website] opencua.xlang.ai
🤖 [Models] huggingface.co/xlangai/OpenCU…
📊 [Data] huggingface.co/datasets/xlang…
💻 [Code] github.com/xlang-ai/OpenC…

🌟 OpenCUA: comprehensive open-source framework for computer-use agents, including:
📊 AgentNet: first large-scale CUA dataset (3 systems, 200+ apps & sites, 22.6K trajectories)
🏆 OpenCUA model: open-source SOTA on OSWorld-Verified (34.8% avg success, outperforms OpenAI CUA)
🖥 AgentNetTool: cross-system computer-use task annotation tool
🏁 AgentNetBench: offline CUA benchmark for fast, reproducible evaluation

💡 Why OpenCUA? Proprietary CUAs like Claude or OpenAI CUA are impressive 🤯, but there’s no large-scale open desktop agent dataset or transparent pipeline. OpenCUA changes that by offering the full open-source stack 🛠: scalable cross-system data collection, effective data formulation, model training strategy, and reproducible evaluation, powering top open-source models including OpenCUA-7B and OpenCUA-32B that excel in GUI planning & grounding.

Details of OpenCUA framework 👇
14
97
466
163.7K
OpenAdaptAI retweeted
Xinyuan Wang (@xywang626)
🙌 Acknowledgement: We thank @ysu_nlp, @CaimingXiong, and the anonymous reviewers for their insightful discussions and valuable feedback. We are grateful to Moonshot AI for providing training infrastructure and annotated data. We also sincerely appreciate Jin Zhang, Hao Yang, Zhengtao Wang, and Yanxu Chen from the Kimi Team for their strong infrastructure support and helpful guidance. The development of our tool is based on the open-source projects DuckTrack @arankomatsuzaki and @OpenAdaptAI; we are very grateful for their commitment to the open-source community. Finally, we extend our deepest thanks to all annotators for their tremendous effort and contributions to this project. ❤️
0
3
21
1.3K
OpenAdaptAI retweeted
Rico Pagliuca (@pagilgukey)
Anybody looking for a GUI+ICL-->MCP library should definitely check out OmniMCP, which puts Microsoft's OmniParser to use in generating GUI tool use APIs. Early days but pretty neat: omnimcp.openadapt.ai
1
2
5
261
OpenAdaptAI retweeted
Richard Abrich (@abrichr)
I prompted @openai's ChatGPT o3-mini-high and @DeepSeek's R1 to implement code for deploying @alibaba_qwen's Qwen2.5-VL. Both agree that R1's implementation is "more comprehensive" and better "for production systems".
1
1
7
738
OpenAdaptAI retweeted
Richard Abrich (@abrichr)
Another day, another breakthrough: Apply DCT to convert actions into frequency components, quantize them prioritizing low frequencies, then use autoregressive prediction in frequency order (low to high) to generate actions. From @physical_int. May generalize to @OpenAdaptAI.
1
1
7
331
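A minimal sketch of the pipeline described in the post above (DCT, then quantization that favors low frequencies, then tokens ordered low to high for autoregressive prediction). This is an illustration only, not the @physical_int implementation: the function names, `base_step`, and `growth` are hypothetical, and the geometric step-size schedule is an assumption chosen for simplicity.

```python
import math


def dct2(x):
    """Orthonormal DCT-II of a 1-D sequence of action values."""
    n = len(x)
    out = []
    for k in range(n):
        s = sum(x[t] * math.cos(math.pi * (t + 0.5) * k / n) for t in range(n))
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        out.append(scale * s)
    return out


def idct2(c):
    """Inverse of dct2 (orthonormal DCT-III)."""
    n = len(c)
    out = []
    for t in range(n):
        s = c[0] * math.sqrt(1.0 / n)
        for k in range(1, n):
            s += c[k] * math.sqrt(2.0 / n) * math.cos(math.pi * (t + 0.5) * k / n)
        out.append(s)
    return out


def tokenize(actions, base_step=0.05, growth=1.5):
    """Quantize DCT coefficients into integer tokens.

    The step size grows with the frequency index k (assumed schedule), so
    low frequencies keep more precision. Tokens come out ordered from low
    to high frequency, the order a sequence model would predict them in.
    """
    coeffs = dct2(actions)
    return [round(c / (base_step * growth ** k)) for k, c in enumerate(coeffs)]


def detokenize(tokens, base_step=0.05, growth=1.5):
    """Map integer tokens back to an approximate action sequence."""
    coeffs = [q * base_step * growth ** k for k, q in enumerate(tokens)]
    return idct2(coeffs)
```

For a smooth trajectory most high-frequency tokens round to zero, which is what makes the sequence short and easy to predict; for example, `tokenize([1.0, 1.0, 1.0, 1.0])` keeps only the first (constant) component.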
OpenAdaptAI retweeted
Richard Abrich (@abrichr)
@hwchase17 With @OpenAdaptAI you start and stop recording demonstrations of repetitive tasks via the tray icon. Show, don't tell. Perform, don't prompt.
0
1
3
244
OpenAdaptAI retweeted
Richard Abrich (@abrichr)
@OpenAdaptAI @julien_c @Microsoft @AWS @Docker
(venv) % python client.py http://34.206.53.77:7861 ~/Desktop/screenshot.png
Loaded as API: http://34.206.53.77:7861/ ✔
Parsed content: ...
2024-10-29 11:13:07.414 | INFO | __main__:predict:84 - Output image saved to: output_image.png
3
1
2
200