
@AlextheYounga That last move is the tell. When the agent writes the assertion, it owns the goalpost, so it always finds the cheapest way to green. The only fix that holds is a check it didn't write and can't edit. Let it write the code. Never the condition that says the code is done.
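
One way to sketch "a check it didn't write and can't edit": fingerprint the human-written checks before the agent runs, then refuse any green result if those files changed. This is only an illustration under stated assumptions. The names `snapshot` and `tampered` are hypothetical, and it assumes the baseline is stored somewhere the agent has no write access to (for example, a separate CI job).

```python
import hashlib
from pathlib import Path


def file_digest(path: Path) -> str:
    """Return the SHA-256 hex digest of a file's bytes."""
    return hashlib.sha256(path.read_bytes()).hexdigest()


def snapshot(check_files: list[Path]) -> dict[str, str]:
    """Record digests of the human-owned checks before the agent runs.

    Hypothetical helper: keep the returned mapping outside the agent's reach.
    """
    return {str(p): file_digest(p) for p in check_files}


def tampered(baseline: dict[str, str]) -> list[str]:
    """List checks that were edited or deleted since the snapshot."""
    changed = []
    for name, digest in baseline.items():
        path = Path(name)
        if not path.exists() or file_digest(path) != digest:
            changed.append(name)
    return changed
```

Take the snapshot before handing over the task, and treat a passing run as valid only if `tampered(baseline)` comes back empty. Hashing catches edits after the fact; it does not replace real write protection on the check files.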

























