
LeGOAT
7.9K posts





Great read -- all it really takes is: - a harness - connectors to your data/tools - reliable, always-accessible agent(s) The models have reached the inflection point where it's not more complicated than this

Game 7 used to have real wars man






Introducing Claude Opus 4.8: it builds on Opus 4.7 with sharper judgment, more honesty about its own progress, and the ability to work independently for longer than its predecessors. Available today at the same price.





composer 2.5 is opus 4.7 class coding at 1/10 the cost. but it was cursor only. that just changed. i just shipped cursor as a hermes agent provider tonight. PR open upstream to nousresearch/hermes-agent, available from my fork right now while it merges. what this means: composer 2.5 + hermes memory + hermes skills + cron + acp subagents + multi-platform delivery, all in one harness. cheapest frontier coding model + deepest agent runtime. neither alone gets you here. the math: - composer 2.5: $0.50 input / $2.50 output per 1M - opus 4.7: $5.00 / $25.00 (10x cost) - gpt-5.5: $5.00 / $30.00 (12x cost) - gpt-5.5 pro: $30.00 / $180.00 (70x cost) same coding benchmark band (79.8% swe-bench multilingual vs opus 4.7's 80.5%, 63.2% cursorbench v3.1 vs 61.6%) at a fraction of the budget. PR: github.com/NousResearch/h… fork: github.com/sudoingX/herme… article with full receipts drops sat ~9pm ICT.




GPT 5.5 turned out a steaming pile overnight and wow is anyone actually good at this yet? This is starting to feel like programming again, that feeling it’s impossibly hard and painful










