Bai Li
152 posts

Bai Li
@libai_94
ML Engineer & PhD in NLP



Early this year, we trained a 70B model optimized for reasoning and coding. This model roughly matches LLAMA 3 70B despite being trained on 7x less data. Today, we’re releasing a toolkit to help others do the same, including: • 11 sanitized and extended NLP reasoning benchmarks including ARC, GSM8K, HellaSwag, and Social IQa • An original code-focused reasoning benchmark • A new dataset of 450,000 human judgments about ambiguity in NLP questions • A hyperparameter optimizer for scaling small experiments to a 70B run • Infrastructure scripts for bringing a cluster from bare metal to robust high-utilization training …and more! Read more and access the toolkit here: imbue.com/research/70b-i…


We are thrilled that Isabel Papadimitriou (@isabelpapad) will be joining @UBCLinguistics as an Assistant Professor as of Sept 2025!















