Oren Melamud

227 posts

@orenmelamud

CTO at @getanyword. Previously, NLP Research Scientist at IBM Research.

New York, USA · Joined July 2015
123 Following · 231 Followers
Oren Melamud retweeted
Mike Lewis @ml_perception
New paper in Science today on playing the classic negotiation game "Diplomacy" at a human level, by connecting language models with strategic reasoning! Our agent engages in intense and lengthy dialogues to persuade other players to follow its plans. This was really hard! 1/5
English
74
728
3.7K
0
Oren Melamud retweeted
Shachar Mirkin @shacharmirkin
Not from Twitter, but I've also been part of large-scale layoffs, so I'm looking for a new machine learning / data science position, esp. in NLP. Lots of experience in both academic & industry settings. I live in France, so I'm after jobs that can be done from here. Please retweet.
English
4
57
94
0
Oren Melamud retweeted
Royi Rassin @RoyiRassin
Apparently DALL-E 2 couldn’t pick the appropriate word-sense for “bass”, and just settled on using both senses. I find it surprising #dalle2
English
9
17
253
0
Oren Melamud retweeted
Janelle Shane @JanelleCShane
Stunning transcript proving that GPT-3 may be secretly a squirrel. GPT-3 wrote the text in green, completely unedited!
English
56
675
3.8K
0
Oren Melamud retweeted
Sam Altman @sama
Have an idea for DALL·E? Reply with a caption. I'll generate 20 or so!
English
1.9K
1.3K
6.8K
0
Oren Melamud @orenmelamud
I wrote a blog post, trying to give some insights into the incredible recent advances in NLP with language models, and then share some practical tips, based on my industrial experience with the latest and (literally) greatest at @getanyword. anyword.com/blog/understan…
English
0
1
1
0
Oren Melamud @orenmelamud
Great overview of the latest and greatest in LM fine-tuning. I'm especially excited about the parameter-efficient methods. This seems to be super useful: adapterhub.ml. I also liked the work on Prefix-Tuning, which was missing from the summary: arxiv.org/pdf/2101.00190….
Sebastian Ruder @seb_ruder

Recent Advances in Language Model Fine-tuning: New blog post that takes a closer look at fine-tuning, the most common way large pre-trained language models are used in practice. ruder.io/recent-advance…

English
1
0
0
0
Gabriel Stanovsky @GabiStanovsky
I'm happy and excited to share that I'll join the CS faculty at the Hebrew University this fall! @CseHuji
English
28
7
144
0
Oren Melamud retweeted
Anna Rumshisky @arumshisky
Our much-anticipated BERTology primer is out on arxiv: arxiv.org/abs/2002.12327. Why and how does BERT work? What does it learn, and where is it stored? We review 40+ recent papers in search of answers and give our view on future directions (with #OlgaKovaleva & @annargrs).
English
1
52
201
0
Oren Melamud retweeted
Yann LeCun @ylecun
How to train a model with 10^11 parameters without running out of GPU memory? Use DeepSpeed from Microsoft Research! It's PyTorch compatible. It partitions the network onto multiple processors automatically and efficiently.
Microsoft Research @MSFTResearch

Microsoft researchers and engineers release Zero Redundancy Optimizer (ZeRO) and DeepSpeed library, a system able to train 100-billion-parameter deep learning models. Learn about this breakthrough and how it led to Turing Natural Language Generation: aka.ms/AA79s5c

English
5
413
1.4K
0
Oren Melamud retweeted
Chip Huyen @chipro
I analyzed compensation & level details of 19k tech workers to find answers to: 1. How long does it take for SWEs to reach a certain level? 2. Compensations across jobs/levels? 3. Do women get paid less than men in tech? 4. Is there a deadline for SWEs? huyenchip.com/2020/01/18/tec…
English
18
245
1K
0
Oren Melamud retweeted
Sebastian Ruder @seb_ruder
10 ML & NLP Research Highlights of 2019: New blog post on ten ML and NLP research directions that I found exciting and impactful in 2019. ruder.io/research-highl…
English
4
280
909
0
Oren Melamud retweeted
Anyword @getanyword
We've created a powerful product that uses AI to enhance social media posts. We're hosting a focus group for social media managers on November 22nd in NYC. We'll ask you a bit about your workflows and give you exclusive first access. Sign up: keywee.typeform.com/to/SKlCSp
English
0
1
1
0
Oren Melamud @orenmelamud
I'm hiring an Applied Scientist with experience in #nlp and #ml for my team at @GoKeywee (can be either NYC or TLV). If you want to make a difference in a small dynamic company, let me know! keywee.co/careers-new-yo…
English
0
1
5
0
Oren Melamud @orenmelamud
Work with @ekshoonyame on automatically generating synthetic clinical notes to overcome privacy considerations that limit our ability to publicly share this kind of data.
Chaitanya Shivade @ekshoonyame

Does limited data also restrict your models with clinical texts? What if we learn to generate some? How good is it? For which tasks? Check out our paper with @orenmelamud at the Clinical NLP Workshop in NAACL 2019. Paper: arxiv.org/abs/1905.07002 Code: github.com/orenmel/synth-…

English
0
0
4
0
Oren Melamud retweeted
Andrej Karpathy @karpathy
New blog post: "A Recipe for Training Neural Networks" karpathy.github.io/2019/04/25/rec… a collection of attempted advice for training neural nets with a focus on how to structure that process over time
English
72
1.6K
4.7K
0
Oren Melamud retweeted
Sebastian Ruder @seb_ruder
A new bigger, better language model by @OpenAI: - Scaled-up version of their Transformer (10x params) - Trained on 10x more curated data (40 GB of Reddit out links w/ >2 karma) - SOTA on many LM-like tasks - Discuss potential for malicious use blog.openai.com/better-languag…
English
6
89
322
0