Sabitlenmiş Tweet
zryph
10.9K posts

zryph
@FCB_Cartel
building AI agents · backend systems · occasionally yelling about football
Katılım Temmuz 2017
454 Takip Edilen308 Takipçiler

Codex in the last 3 days has been a nightmare. Hallucination after hallucination, writing so much unwanted code, creating things that are not at all intended, taking too much time and burning tokens unnecessarily. Especially
/goal.
@sama something is wrong. It burned 50% of my Pro plan tokens for the week in 1 night creating a PR of size 104k lines and when asked about the PR : it said it’s useless and cleared the whole PR and closed it.
English

@FCB_Cartel @sama experiencing the opposite lol, keep going bro
English

@ProfAdebay @sama Yes it is very good!
I use Codex&Claude together with cursor as IDE, Codex has been pretty good until it started behaving oddly since yesterday.
English

@FCB_Cartel @sama Codex is working well for me. Maybe it’s because I know my way around it. (I didn’t feed it any skill files)
English

@FCB_Cartel @sama I deleted 10,000 lines of code yesterday. Codex been totally ignoring Agent.md, very weird.
English

@FCB_Cartel @sama A bit more upfront effort can go a way in getting decent results. This iterative code design loop once before any request of medium+ complexity will drastically reduce unfavorable results. Also have an improved version I'll post up soon.
x.com/slopwareindy/s…
Slopware Engineer@slopwareindy
English

@sheathinkler @sama I used /goal for building an incredibly big project that had to go through like 15 repos and call 11 different MCPs to build flows. It went on and on for 3 full days and built 300 flows precisely. After this I got good confidence to use /goal
English

@FCB_Cartel @sama Write your own objective mode and recompile the app - keep the focal loop local and don’t get hit by whatever they are screwing with behind /goal. Theirs is omega broken
English

@FCB_Cartel @sama Got the same issue , deleted my whole bot trading folder and said sorry after 🫠
English
zryph retweetledi

@FCB_Cartel @sama This happened to me as well. I gave it a clean set of instructions got stuck in one part of my workflow, looped that that 30 times over the span of 24 hours, came back 24hrs later, nothing useful all garbage.
English

@FCB_Cartel @sama "when asked about the PR : it said it’s useless and cleared the whole PR and closed it."
Well that were best invested money! Think if that code was in prod :D
English

@cyganztrojkatu @sama Good for you! Do you run /goal? What do you mean advanced workflows? Do these workflows happen in one coded session? Or are you talking about multi-agent workflows?
English

@FCB_Cartel @sama Am I the only one not experiencing any issues? Can anyone relate? I have pretty advanced workflows and I felt no regression at all
English

Idk what the full picture and original user prompt looks like, but based on that reply, I’d imagine:
- insufficient/ineffective context engineering
- imperfect prompting skills
- and bad/no document architecture (for efficient & autonomous progressive disclosure)
I have never had an issue like this even once
Ever
And I’ve had Codex running 24/7 for last 3 days on 50 different things I’ve told it
English

@OliwierMako @sama Good idea! But this only works when the human is right there waiting for checkpoints, but i usually run /goal when i go to sleep to get some boring tasks done while i'm asleep.
English

@FCB_Cartel @sama This is why agent workflows need rollback points
Not just better models
Every 15 min it should leave a clean checkpoint you can inspect or nuke
English

@andrei_ot @sama I actually switched to Opus 4.7 today, and Opus's time for first byte was 4 mins.
English

@FCB_Cartel @sama getting the opus 4.7 experience I see, its coming full circle, can't wait for 4.8 to be great for 2-3 weeks
English

@FCB_Cartel @sama He's taking 1h10 to do something easily on fast mode, /goal burnt more than 30% of weekly usage on pro (x20) in 8h ... Everything is broken on it
English
zryph retweetledi
zryph retweetledi

@FCB_Cartel @sama I had so many issues yesterday that I ended up closing my laptop and stepping away for the entire day, which had never happened since I’ve been using codex. It’s definitely screwy right now
English







