
@Zaddyzaddy The difference between ~65% and 80%+ on Cybergym is harness. Look at the results for GPT 5.4 with Codex CLI vs “OpenAI Agent” harnesses.
MDASH is 100% harness magic too.

English
Philo Groves
1.5K posts

@PhiloGroves
Philo (fai·low). Data Architect & Software Engineer. Good vibes only. No politics. github@philo-groves








This is your bodies signal to sink 2000 hours into turn based strategy games


Cursor is building software that could replace core parts of GitHub, including repositories, security reviews and automated testing tools. The effort comes as GitHub faces outages and rising pressure from AI-native coding rivals. thein.fo/3RnVNcX






