Gene Li

@geneli0

https://t.co/5Bpr5cdNBE

Katılım Şubat 2017

136 Takip Edilen94 Takipçiler

Sabitlenmiş Tweet

Gene Li@geneli0·21 Tem

like everyone else i am hopping on the blog post trend gene.ttic.edu/blog/incomplet…

English

179

19.2K

Gene Li@geneli0·23 Nis

Happy to chat about this paper, RL, deep learning, etc. Feel free to reach out! Poster Session 6 on Saturday!

Nirmit Joshi@nirmitj_

@geneli0 will be presenting our paper @iclr_conf 🇧🇷 (this Saturday), which has implications for SFT of LLMs. arxiv.org/abs/2510.15464

English

685

Gene Li retweetledi

Mayee Chen ✈️ ICML 🇰🇷@MayeeChen·13 Şub

Data mixing - determining ratios across your training datasets - matters a lot for model quality. While building Olmo 3, we learned it’s hard to set up a method that finds a strong mix, and hard to maintain that mix as datasets change throughout development. Introducing Olmix👇

English

274

58.8K

Gene Li retweetledi

Mayee Chen ✈️ ICML 🇰🇷@MayeeChen·20 Kas

Thrilled to have contributed to Olmo 3! The best fully open 32B model (data, training recipes, checkpoints and more!) As an intern at AI2 these last 8 months, I’ve grown to deeply appreciate the careful science, iteration, and collaboration that go into models like this and have learned so much from the team. I am more optimistic than ever about the future of open-source and data-centric research right now. My particular contribution was working on the Dolma 3 data mix 👩‍🍳 I was able to apply ideas from some of my earlier mixing work, explore new problem settings, and see firsthand the data challenges that arise when building datasets intended for real models at scale. More on this coming soon!

Ai2@allen_ai

Announcing Olmo 3, a leading fully open LM suite built for reasoning, chat, & tool use, and an open model flow—not just the final weights, but the entire training journey. Best fully open 32B reasoning model & best 32B base model. 🧵

English

271

69.7K

Gene Li@geneli0·20 Eki

Check this out! Some fun work with interesting implications for LLM training 🧐

Nirmit Joshi@nirmitj_

Very satisfied with some neat results on imitation learning. When distribution matching isn’t possible, what’s even the role of demonstrations? Cloning/log-loss minimization? We propose directly encoding reward structure—motivating new algorithmic ideas. arxiv.org/abs/2510.15464

English

1.7K

Gene Li retweetledi

Charlie Hou@hou_char·2 Eki

Gave a talk at @OpenAI on our work 🌸 POPri “Policy Optimization for Private Data”. POPri is a huge improvement in synthetic data generation under security+privacy constraints! Learn more:

English

2.4K

Keşfet

@OpenAI @elonmusk @BarackObama @taylorswift13 @cristiano @BillGates @NASA @nikifrancismediavine