Oren Melamud
227 posts

Oren Melamud
@orenmelamud
CTO at @getanyword Previously, NLP Research Scientist at IBM Research





Recent Advances in Language Model Fine-tuning New blog post that takes a closer look at fine-tuning, the most common way large pre-trained language models are used in practice. ruder.io/recent-advance…




Microsoft researchers and engineers release Zero Redundancy Optimizer (ZeRO) and DeepSpeed library, a system able to train 100-billion-parameter deep learning models. Learn about this breakthrough and how it led to Turing Natural Language Generation: aka.ms/AA79s5c






Does limited data also restrict your models with clinical texts? What if we learn to generate some? How good is it? For which tasks? Check out our paper with @orenmelamud at the Clinical NLP Workshop in NAACL 2019. Paper: arxiv.org/abs/1905.07002 Code: github.com/orenmel/synth-…








