Oren Melamud

227 posts

Oren Melamud

@orenmelamud

CTO at @getanyword Previously, NLP Research Scientist at IBM Research

New York, USA Katılım Temmuz 2015

123 Takip Edilen231 Takipçiler

Oren Melamud retweetledi

Mike Lewis@ml_perception·22 Kas

New paper in Science today on playing the classic negotiation game "Diplomacy" at a human level, by connecting language models with strategic reasoning! Our agent engages in intense and lengthy dialogues to persuade other players to follow its plans. This was really hard! 1/5

English

728

3.7K

Oren Melamud retweetledi

Shachar Mirkin@shacharmirkin·7 Kas

Not from Twitter, but I've also been part of large-scale layoffs, so I'm looking for a new machine learning / data science position, esp. in NLP Lots of experience in both academic & industry settings I live in France so I'm after jobs that can be done from here please retweet

English

Oren Melamud retweetledi

Royi Rassin@RoyiRassin·15 Tem

Apparently DALL-E 2 couldn’t pick the appropriate word-sense for “bass”, and just settled on using both senses. I find it surprising #dalle2

English

253

Oren Melamud retweetledi

Janelle Shane@JanelleCShane·12 Haz

Stunning transcript proving that GPT-3 may be secretly a squirrel. GPT-3 wrote the text in green, completly unedited!

English

675

3.8K

Oren Melamud retweetledi

Sam Altman@sama·6 Nis

Have an idea for DALL·E? Reply with a caption. I'll generate 20 or so!

English

1.9K

1.3K

6.8K

Oren Melamud@orenmelamud·31 Mar

I wrote a blog post, trying to give some insights into the incredible recent advances in NLP with language models, and then share some practical tips, based on my industrial experience with the latest and (literally) greatest at @getanyword. anyword.com/blog/understan…

English

Oren Melamud@orenmelamud·5 Mar

Great overview on the latest and greatest in LM fine-tuning. I'm especially excited about the parameter-efficient methods. This seems to be super useful adapterhub.ml. I also liked the work on Prefix-Tuning that was missing from the summary arxiv.org/pdf/2101.00190….

Sebastian Ruder@seb_ruder

Recent Advances in Language Model Fine-tuning New blog post that takes a closer look at fine-tuning, the most common way large pre-trained language models are used in practice. ruder.io/recent-advance…

English

Oren Melamud retweetledi

Greg Brockman@gdb·5 Oca

DALL-E — our new neural network for generating images from text: openai.com/blog/dall-e/

English

849

3.5K

Oren Melamud@orenmelamud·22 Nis

@GabiStanovsky @CseHuji מזל טוב!

עברית

Gabriel Stanovsky@GabiStanovsky·22 Nis

I'm happy and excited to share that I'll join the CS faculty at the Hebrew University this fall! @CseHuji

English

144

Oren Melamud retweetledi

Anna Rumshisky@arumshisky·28 Şub

Our much-anticipated BERTology primer is out on arxiv: arxiv.org/abs/2002.12327. Why and how does BERT work? What does it learn, and where is it stored? We review 40+ recent papers in search of answers and give our view on future directions (with #OlgaKovaleva & @annargrs).

English

201

Oren Melamud retweetledi

Yann LeCun@ylecun·10 Şub

How to train a model with 10^11 parameters without running out of GPU memory? Use DeepSpeed from Microsoft Research! It's PyTorch compatible. It partitions the network onto multiple processors automatically and efficiently.

Microsoft Research@MSFTResearch

Microsoft researchers and engineers release Zero Redundancy Optimizer (ZeRO) and DeepSpeed library, a system able to train 100-billion-parameter deep learning models. Learn about this breakthrough and how it led to Turing Natural Language Generation: aka.ms/AA79s5c

English

413

1.4K

Oren Melamud retweetledi

Chip Huyen@chipro·20 Oca

I analyzed compensation & level details of 19k tech workers to find answers to: 1. How long does it take for SWEs to reach a certain level? 2. Compensations across jobs/levels? 3. Do women get paid less than men in tech? 4. Is there a deadline for SWEs? huyenchip.com/2020/01/18/tec…

English

245

Oren Melamud retweetledi

Sebastian Ruder@seb_ruder·6 Oca

10 ML & NLP Research Highlights of 2019 New blog post on ten ML and NLP research directions that I found exciting and impactful in 2019. ruder.io/research-highl…

English

280

909

Oren Melamud retweetledi

Anyword@getanyword·12 Kas

We've created a powerful product that uses AI to enhance social media posts. We're hosting a focus group for social media managers on November 22nd in NYC. We'll ask you a bit about your workflows and give you exclusive first access. Sign up: keywee.typeform.com/to/SKlCSp

English

Oren Melamud@orenmelamud·4 Eki

I'm hiring an Applied Scientist with experience in #nlp and #ml for my team in @GoKeywee (can be either NYC or TLV). If you want to make a difference in a small dynamic company, let me know! #open" target="_blank" rel="nofollow noopener">keywee.co/careers-new-yo…

English

Oren Melamud@orenmelamud·4 Ağu

ynet.co.il/articles/0,734…

ZXX

Oren Melamud retweetledi

(((ل()(ل() 'yoav))))👾@yoavgo·21 May

Very excited and proud to officially announce the opening of AI2 Israel! allenai.org/ai2-israel/ many thanks to @etzioni and the entire team for making this happen.

English

453

Oren Melamud@orenmelamud·20 May

Work with @ekshoonyame on automatically generating synthetic clinical notes to overcome privacy considerations that limit our ability to publicly share this kind of data.

Chaitanya Shivade@ekshoonyame

Does limited data also restrict your models with clinical texts? What if we learn to generate some? How good is it? For which tasks? Check out our paper with @orenmelamud at the Clinical NLP Workshop in NAACL 2019. Paper: arxiv.org/abs/1905.07002 Code: github.com/orenmel/synth-…

English

Oren Melamud retweetledi

Andrej Karpathy@karpathy·25 Nis

New blog post: "A Recipe for Training Neural Networks" karpathy.github.io/2019/04/25/rec… a collection of attempted advice for training neural nets with a focus on how to structure that process over time

English

1.6K

4.7K

Oren Melamud retweetledi

Sebastian Ruder@seb_ruder·14 Şub

A new bigger, better language model by @OpenAI: - Scaled-up version of their Transformer (10x params) - Trained on 10x more curated data (40 GB of Reddit out links w/ >2 karma) - SOTA on many LM-like tasks - Discuss potential for malicious use blog.openai.com/better-languag…

English

322

Keşfet

@getanyword @GabiStanovsky @CseHuji @annargrs @GoKeywee @etzioni @ekshoonyame @elonmusk