Peter Schulam

609 posts

Peter Schulam

@pschulam

Scientist at Amazon Alexa AI. I’m interested in applied machine learning and building software systems. Views are my own.

El Segundo, CA Katılım Mayıs 2011

93 Takip Edilen350 Takipçiler

Peter Schulam retweetledi

Will Townes@will_townes·31 Ara

@WallaceUcsf Check out work by @david_sontag @ShalitUri in this area. Also @pschulam @suchisaria who use Gaussian processes instead of neural nets.

English

Peter Schulam retweetledi

Andrew Beam@AndrewLBeam·5 May

@MaxALittle @filippie509 Gotta love that sweet sweet irony

English

Peter Schulam@pschulam·4 May

In fact, ML can be especially impactful in situations like this. The heuristics make excellent features for a linear model. The result is often good enough (or is a strong baseline). Keeping this in mind gives me a nice “playbook” for kicking off work on a new project.

English

Peter Schulam@pschulam·4 May

For some classification problems, a first analysis usually uncovers several heuristics that would work ~50-75% of the time. My gut reaction to this is often: “Do we really need to use machine learning here?” After all, I don’t want to be the fool with a hammer looking for nails.

English

Sam Finlayson@IAmSamFin·3 May

@pschulam How bout that Twitter algorithm

English

Peter Schulam@pschulam·3 May

@IAmSamFin 😂

QME

Peter Schulam@pschulam·3 May

The majority of ML case studies floating around the internet are, unfortunately, fast food. I think this is a problem because we can’t share, learn from, and discuss our “recipes” as practitioners.

English

Peter Schulam@pschulam·2 May

I think this might follow from results reported by Jonathan Byrd and @zacharylipton in this great paper: arxiv.org/abs/1812.03372

English

Peter Schulam@pschulam·2 May

When we talk about covariate shift, the support of the train and test distributions may be the same but the frequency of seeing a given input may have changed. This is important when we use low-capacity models, but maybe less so with the richer classes we use today.

English

Peter Schulam@pschulam·2 May

There are lots of recent papers in the ML literature that look at how to detect when we can’t make reliable predictions. I often see this described as detecting “out of distribution” samples. This is unusual to me, though. The same value can come from two different distributions.

English

Peter Schulam@pschulam·2 May

@IAmSamFin Great write-up on this (unfortunately?) evergreen debate; thanks for sharing! I liked your point about avoiding the “who owns regression” question. The same tool can be used to accomplish different things.

English

Sam Finlayson@IAmSamFin·2 May

@pschulam Well put, and I think it makes a lot of sense in historical context of the fields. ML is about building computers that do stuff, stats is about understanding things (and sometimes the things are even computers that do stuff) sgfin.github.io/2020/01/31/Com…

English

Peter Schulam@pschulam·2 May

Very interesting paper from @laurence_ai: openreview.net/pdf?id=Rd138pW… Not the intended message, I think, but brought one thing into focus for me:

English

Peter Schulam@pschulam·2 May

@TimRadtke Thanks for the link! This looks interesting; I’ll check it out.

English

Tim Radtke@TimRadtke·2 May

@pschulam KDD's Applied Data Science track comes to mind: #ads-papers" target="_blank" rel="nofollow noopener">kdd.org/kdd2020/accept…

English

Peter Schulam@pschulam·2 May

Stats journals often have a separate “applications” track. Does something like this exist for machine learning? I’m looking for good write ups of the nitty gritty details behind successful ML applications.

English

Peter Schulam@pschulam·2 May

This reminds me of what @lawrennd calls “decomposition” in his three D’s of ML system design. inverseprobability.com/talks/notes/th…

English

Peter Schulam@pschulam·2 May

The thread below is the kind of thing that we need more of in “applied ML literature”. In this case, they didn’t really need a model, but I would love to read more about this kind of clever detective work.

Matt Lerner@matthlerner

After 17 years, we finally “cracked” a $100M churn problem at PayPal. Zero fancy tech. Just a spreadsheet, some simple SQL, and a physicist named Ben. 👇🏼

English

Peter Schulam retweetledi

Bayesian Health@bayesianhealth·3 Haz

We've heard lots about #MachineLearning in healthcare, but usually with few specifics. A new paper in @AnnalsofIM, featuring our very own @suchisaria & @pschulam, cuts through the buzz and discusses the real applications and real benefits of clinical #AI: acpjournals.org/doi/10.7326/M1…

English

Keşfet

@WallaceUcsf @david_sontag @ShalitUri @suchisaria @filippie509 @IAmSamFin @zacharylipton @laurence_ai