

Osama Romoh
4.5K posts

@romoh
Follow me for practical AI tips, reviews, and more.



New CursorBench results just dropped. Two big takeaways.

Composer 2.5 is way better than most people think. 63.2% score at $0.55 per task. Nearly matching Opus 4.7 Max and GPT 5.5 Extra High at 20x lower cost. This is insane value.

Gemini 3.5 Flash is #10 at 49.8%. Below GPT 5.5 Low. Below Opus 4.7 Low. Google's newest model can't even beat budget-tier competition.

Composer 2.5 is the sleeper. Gemini 3.5 Flash is the disappointment.




Anyway, I fixed it. I built a tiny native app that lets you open any Markdown file on Mac.
> Just press Spacebar → instant clean preview
> Watch your AI agent write in real time
> Open your favorite Markdown editor to edit
Install once and literally forget it exists.

New for financial services: ready-to-run Claude agent templates for building pitches, conducting valuation reviews, closing the books at month-end, and more. Install them as plugins in Cowork and Claude Code, or use our cookbooks to run them in production as Managed Agents.




Every major operating system should come with a default Markdown reader at this point

Just released the new stable version of Tolaria. It fixes basically all the bugs that have been reported! For today and the weekend the focus will be on dark mode and Windows support, plus merging some of the PRs (they are good!) Also woke up today to Tolaria on the front page of HN, double the GitHub stars, and just a ton of messages from everyone 🙏 Release page here for those interested: refactoringhq.github.io/tolaria/








Anthropic is rolling out "Epitaxy", an upgraded version of its Claude Code solution for desktop.
> Simultaneous work on multiple repositories
> Updated layout with terminal, plan, tasks, and preview sections
> Loads of hotkeys and more
Did you get it already? 👀

One piece of good news - T3 Code confirmed SAFE FOR CLAUDE SUBS 🫡 We FINALLY have explicit confirmation that tools wrapping Claude Code for local use are allowed