MTL DATA

861 posts

MTL DATA

MTL DATA

@mtldata

Connecting nodes in the Montreal data community. Since 2014. #AIFest

Montreal, QC Beigetreten Mayıs 2014
461 Folgt530 Follower
MTL DATA retweetet
Sebastian Ruder
Sebastian Ruder@seb_ruder·
My PhD thesis Neural Transfer Learning for Natural Language Processing is now online. It includes a general review of transfer learning in NLP as well as new material that I hope will be useful to some. ruder.io/thesis/
Sebastian Ruder tweet media
English
24
558
2.3K
0
MTL DATA retweetet
Jeff Dean
Jeff Dean@JeffDean·
Hand-labeling training data for machine learning problems is effective, but very labor and time intensive. This work explores how to use algorithmic labeling systems relying on other sources of knowledge that can provide many more labels but which are noisy.
Google AI@GoogleAI

In collaboration with @Stanford and @BrownUniversity, we present Snorkel Drybell, a variant of the open source Snorkel framework, which explores how existing organizational knowledge can be used as weak supervision to quickly label large training datasets. goo.gl/K2hKxk

English
7
176
636
0
MTL DATA retweetet
Ian Goodfellow
Ian Goodfellow@goodfellow_ian·
I originally thought of GANs as an unsupervised learning algorithm, but so far, to create recognizable object categories, they've needed a supervision signal / labeled images. This new work shows how to get them to work well with few labels. twitter.com/MarioLucic_/st…
Ian Goodfellow tweet media
Mario Lucic@MarioLucic_

How to train SOTA high-fidelity conditional GANs usin 10x fewer labels? Using self-supervision and semi-supervision! Check out our latest work at goo.gl/idWNVs @GoogleAI @ETHZurich @TheMarvinRitter @mtschannen @XiaohuaZhai @OlivierBachem @sylvain_gelly

English
3
134
481
0
MTL DATA retweetet
François Chollet
François Chollet@fchollet·
The complaints "Python is slow" or "Python is unsafe" seem misguided. The point of Python isn't to be fast or safe, it's to be flexible and hackable, and to interface well with everything else. It has become successful by serving as a frontend from which to call other libraries.
English
26
284
1.4K
0
MTL DATA retweetet
Ryan Lowe 🥞
Ryan Lowe 🥞@ryan_t_lowe·
Last tweet for me on the OpenAI GPT-2 thing, but for those interested I think this video really elevates the discussion. Great work by all parties involved.
Sam Charrington@samcharrington

Really great discussion this eve on @OpenAI's recent language model release and the issues and considerations raised. Thanks @AmandaAskell @AnimaAnandkumar @Miles_Brundage @smerity @WWRob for participating. Those who missed it live can catch the replay at youtube.com/watch?v=LWDbAo…

English
0
7
24
0
MTL DATA retweetet
Jascha Sohl-Dickstein
Jascha Sohl-Dickstein@jaschasd·
Wide Neural Networks of Any Depth Evolve as Linear Models Under Gradient Descent arxiv.org/abs/1902.06720 <--- this should blow your mind a bit!! Also holds for convolutional networks, batch norm, ... Also, closed form for test predictions resulting from gradient descent training.
Jascha Sohl-Dickstein tweet media
English
5
153
490
0
MTL DATA retweetet
Jeremy Howard
Jeremy Howard@jeremyphoward·
This is literally the first time I've seen am NLP researcher say they want to focus on helping normal people solve normal problems. Hopefully the first of many :)
Lana Teplitsky@STeplitsky

Fascinating presentation by @yoavgo who is focusing his research on what people want: Turning text into structured data. “Information Extraction can be transformative to science” Excel for #NLP. #NaturalLanguageProcessing #AI @allen_ai BIU's NLP lab u.cs.biu.ac.il/~nlp/

English
4
79
412
0
MTL DATA retweetet
Kerem Turgutlu
Kerem Turgutlu@KeremTurgutlu·
@keremturgutlu/understanding-building-blocks-of-ulmfit-818d3775325b" target="_blank" rel="nofollow noopener">medium.com/@keremturgutlu… Here, I tried to explain building blocks of SOTA ULMFIT model. What is an AWD-LSTM? How Dropout is used everywhere? What is a QRNN and why might it be better? ...I also used excel spreadsheets to simplify things in a different way :)
English
2
61
245
0
MTL DATA retweetet
Gilles Louppe
Gilles Louppe@glouppe·
TIL when preparing my second deep learning lecture that the operators' manual for Rosenblatt's Mark 1 Perceptron machine was a classified document. It became unclassified only in 1977. The manual can now be found at apps.dtic.mil/dtic/tr/fullte…
English
0
19
61
0
MTL DATA retweetet
Thomas Wolf
Thomas Wolf@Thom_Wolf·
.@kaushal316 wrote a nice step-by-step tutorial on how to finetune BERT on a classification task (@kaggle Toxic Challenge) Covers everything from data processing to model modification Results are top-10% w. a very simple 30-lines-of-code single model 👇 medium.com/huggingface/mu…
English
1
102
356
0