Gene Li

6 posts

@geneli0

https://t.co/5Bpr5cdNBE

Joined February 2017
135 Following · 96 Followers
Gene Li retweeted
Mayee Chen @ ICLR 🇧🇷 (@MayeeChen)
Data mixing - determining ratios across your training datasets - matters a lot for model quality. While building Olmo 3, we learned it’s hard to set up a method that finds a strong mix, and hard to maintain that mix as datasets change throughout development. Introducing Olmix👇
Gene Li retweeted
Mayee Chen @ ICLR 🇧🇷 (@MayeeChen)
Thrilled to have contributed to Olmo 3! The best fully open 32B model (data, training recipes, checkpoints and more!) As an intern at AI2 these last 8 months, I’ve grown to deeply appreciate the careful science, iteration, and collaboration that go into models like this and have learned so much from the team. I am more optimistic than ever about the future of open-source and data-centric research right now. My particular contribution was working on the Dolma 3 data mix 👩‍🍳 I was able to apply ideas from some of my earlier mixing work, explore new problem settings, and see firsthand the data challenges that arise when building datasets intended for real models at scale. More on this coming soon!
Ai2 (@allen_ai)

Announcing Olmo 3, a leading fully open LM suite built for reasoning, chat, & tool use, and an open model flow—not just the final weights, but the entire training journey. Best fully open 32B reasoning model & best 32B base model. 🧵

Gene Li retweeted
Charlie Hou (@hou_char)
Gave a talk at @OpenAI on our work 🌸 POPri “Policy Optimization for Private Data”. POPri is a huge improvement in synthetic data generation under security+privacy constraints! Learn more: