Pinned Tweet

like everyone else i am hopping on the blog post trend
gene.ttic.edu/blog/incomplet…
Gene Li
6 posts



@geneli0 will be presenting our paper @iclr_conf 🇧🇷 (this Saturday), which has implications for SFT of LLMs. arxiv.org/abs/2510.15464

Announcing Olmo 3, a leading fully open LM suite built for reasoning, chat, & tool use, and an open model flow—not just the final weights, but the entire training journey. Best fully open 32B reasoning model & best 32B base model. 🧵

Very satisfied with some neat results on imitation learning. When distribution matching isn’t possible, what’s even the role of demonstrations? Cloning/log-loss minimization? We propose directly encoding reward structure—motivating new algorithmic ideas. arxiv.org/abs/2510.15464

