
@petergostev @ericssunLeon What's the direction of causation though? Does reasoning make it worse, or does falling for the BS cause it to use more reasoning tokens (because it continues to think through the question rather than just giving an initial refusal)?

















