
@DamienTeney Qualitative insights about what LLMs as a group can and cannot do, and problems that arise, are very interesting and valuable. But detailed performance numbers (especially in "horse race" between models) are not interesting if models investigated are obsolete when paper read
English










