
@devayushrout @shapovalim Right, evidence is the cleaner signal. A tiny diff with no reasoning behind it is just lucky, not clean. We try to score whether a change is justified by what the agent observed before making it. Diff size on its own can be misleading without that link.









