Andrew Sullivan

26 posts

Andrew Sullivan banner
Andrew Sullivan

Andrew Sullivan

@licyeus

web1.0 junkie building @TeambookAI

🇦🇷 شامل ہوئے Mart 2007
698 فالونگ315 فالوورز
Andrew Sullivan
Andrew Sullivan@licyeus·
@bendersej @mattpocockuk Do you also keep planning/roadmap in-repo? I've been considering leaving Linear and textfile-ing everything. I'm monorepo+custom CLI for vision/decisions/docs, but haven't made jump yet. At some point, even Github serves little purpose beyond CI (ie, local review workflows).
English
1
0
1
124
Benjamin André-Micolon
@mattpocockuk It’s not groundbreaking, but seldom discussed: I run my entire company from a single mono-repository: my AI has access to my code, marketing, ICP, customer profiles, SEO changelogs, architecture diagrams, data logs, decision logs. + CLI for DB readonly access, Search Console…
English
7
0
56
8.2K
Matt Pocock
Matt Pocock@mattpocockuk·
What 'advanced' AI coding techniques are you using? I.e. what do you feel like you've discovered that no-one else knows about yet?
English
181
14
492
92.3K
Andrew Sullivan
Andrew Sullivan@licyeus·
How is this any different from a harness template? It's packaged as a bunch of harness behaviors (research-specific tools + dbs). If this is a tuned model, it doesn't seem much better: "outperforms GPT‑5.4 on 6 out of 11 tasks". Ie, worse on 5 out of 11? openai.com/index/introduc…
English
0
0
0
27
Andrew Sullivan
Andrew Sullivan@licyeus·
@_nakedeyes E.g. Codex pushing back on my hypothesis, showing changes from last "turn", etc. Also during planning iteration it shows *diff* of changes to plan instead of just reprinting the whole plan. Better DX overall IMO.
Andrew Sullivan tweet media
English
0
0
1
43
Andrew Sullivan
Andrew Sullivan@licyeus·
Uff yeah, "Claude Codex". Freudian slip 🤦‍♂️ I'm a little uneasy using OpenAI, but I've found Codex better out-of-the-box than Claude Code. And the desktop app feels like they're trending in the right direction (chat on left, diff on right). I'd love to go back to Pi or OpenCode but unsubsidized tokens are so expensive. 🫠
English
1
0
2
182
Andrew Sullivan
Andrew Sullivan@licyeus·
People want transparency+control over agentic tools/workflows. The Claude Codex fiasco underscores this.
English
1
0
1
43
Andrew Sullivan
Andrew Sullivan@licyeus·
@_nakedeyes Far as I can tell, they changed settings in the CC harness (default effort max -> medium, introduced larger context = higher token usage, esp. with cache misses) + reduced usage quotas. Lower quality output, people hitting limits sooner, no word from Anthropic. Everybody unhappy
English
1
0
1
20
Tim Rawcliffe
Tim Rawcliffe@_nakedeyes·
@licyeus What’s this fiasco? I’m devastated they cut off OpenCode support 😭 destroyed my workflow 🥀
English
1
0
0
21
Andrew Sullivan
Andrew Sullivan@licyeus·
Not sure if I buy the hype that Anthropic is making Opus 4.6 weaker, but they've certainly made lots of changes to Claude Code harness that make it *feel* worse. I could experiment with effort/context settings, but I'm trying out Codex CLI instead. First impressions are good: it's is a lot more hands-on and seems slower (w/ Plus plan), but quality is better.
English
0
0
0
72
Andrew Sullivan
Andrew Sullivan@licyeus·
If Mythos is what Anthropic is implying it is, they’re about to get a lesson in power from the US government. Hope they’ve thought that through.
English
0
0
0
37
Andrew Sullivan
Andrew Sullivan@licyeus·
This is Anthropic realizing models are commodities and trying to protect what they think can be a moat: the harness. I expect them to move more of the harness into the opaque API call for greater lock-in (alongside already-existing hosted tools: web search/fetch, code execution, etc).
Gergely Orosz@GergelyOrosz

I think we’re speed running understanding that when a company controls the model and the harness, and they are both closed: they not only CAN pull stuff like this, but WILL do so. I expect a renewed interest in open models, open source harnesses + self hosted models. This stuff is getting really disruptive and is just not acceptable as a paying customer!

English
0
0
0
80