Bassem Al-Sady retweetledi

We also introduced a new award for broader evaluation:
🏆 Generalist Prize: Team @altos_labs ranked highest across 7 metrics. They showed the most reliable generalization—robust performance across diverse criteria vs. optimization for a single score.

English














