David Andrzejewski

1.1K posts


@davidandrzej

Software! Systems! Data! @SFMachineLearn Elsewhere: @[email protected] @davidandrzej.bsky.social

San Francisco · Joined September 2008
1.4K Following · 2.3K Followers
Pinned Tweet
David Andrzejewski@davidandrzej·
Proposal to re-brand "deep learning" as "stochastic regularized estimation of compositionally structured nonlinear functions".
David Andrzejewski retweeted
Jason Alan Fries@jasonafries·
🚀 Check out our ICLR 2024 Spotlight Poster #165 today (Session 3) 🩺 "MOTOR: A Time-to-Event Foundation Model For Structured Medical Records"
✨ Highlights:
- The first TTE foundation model for structured EHRs
- Improves SOTA TTE by 4.6% & boosts label efficiency up to 95%
- TTE pretraining scales to 16k tasks & reduces GPU memory usage by ~35%
- Model weights available for research use!
🔍 Tutorial: github.com/som-shahlab/mo…
🤖 Model: huggingface.co/StanfordShahLa…
#ICLR2024 #AI #Healthcare
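For readers new to time-to-event (TTE) modeling: unlike plain classification, each label is a (time, event-indicator) pair, and subjects whose event hasn't happened yet are right-censored. A minimal sketch of the textbook Kaplan-Meier survival estimator (just the standard TTE baseline, not MOTOR's method; all names here are illustrative):

```python
def kaplan_meier(times, events):
    """Kaplan-Meier survival curve for right-censored data.

    times:  observed time per subject (event or censoring time)
    events: 1 if the event occurred, 0 if the subject was censored
    Returns a list of (t, S(t)) at each distinct event time.
    """
    order = sorted(range(len(times)), key=lambda i: times[i])
    n_at_risk = len(times)
    surv, curve = 1.0, []
    i = 0
    while i < len(order):
        t = times[order[i]]
        d = 0  # events observed at time t
        c = 0  # subjects leaving the risk set at time t (events + censored)
        while i < len(order) and times[order[i]] == t:
            d += events[order[i]]
            c += 1
            i += 1
        if d:  # survival only drops at event times, not at censorings
            surv *= 1.0 - d / n_at_risk
            curve.append((t, surv))
        n_at_risk -= c
    return curve
```

Censored subjects still count toward the risk set until they drop out, which is what distinguishes this from naively discarding them.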
David Andrzejewski retweeted
Korl@Korl_co·
Korl is available for public beta! We’ve built a platform that auto-generates consumable product presentations in seconds, each optimized for a common use case and audience. Start with a 2-week free trial from our site: korl.co
David Andrzejewski@davidandrzej·
Fun stuff - "In-Context Learning Creates Task Vectors" (via @_akhaliq) isolates something (more or less) resembling the "program key" described below as an internal attention state, demonstrating with state substitution/patching experiments huggingface.co/papers/2310.15…
François Chollet@fchollet

My interpretation of prompt engineering is this:
1. An LLM is a repository of many (millions of) vector programs mined from human-generated data, learned implicitly as a by-product of language compression. A "vector program" is just a very non-linear function that maps part of the latent space onto itself.
2. When you're prompting, you're fetching one of these programs and running it on an input -- part of your prompt serves as a kind of "program key" (as in a database key) and part serves as program argument(s). Like, in "write this paragraph in the style of Shakespeare: {my paragraph}", the part "write this paragraph in the style of X: Y" is a program key, with arguments X=Shakespeare and Y={my paragraph}.
3. The program fetched by your key may or may not work well for the task at hand. There's no reason why it should be optimal. There are lots of related programs to choose from.
4. Prompt engineering is a search over many keys in order to find a program that is empirically more accurate for what you're trying to do. It's no different than trying different keywords when searching for a Python library.
5. Everything else is unnecessary anthropomorphism on the part of the prompter. You're not talking to a human who understands language the way you do. Stop pretending you are.
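The substitution/patching experiment can be caricatured in a few lines: factor a model into an "extract task state from demonstrations" stage and an "apply state to the query" stage, then inject the state captured from one prompt into a forward pass on another. A toy sketch under those assumptions (no transformer involved; `stage1`, `stage2`, and `forward` are invented names, not the paper's code):

```python
def stage1(demos):
    """Compress in-context demonstrations into a 'task state' (the program key)."""
    x, y = demos[0]  # infer the task from one (input, output) pair
    if y == x.upper():
        return "UPPER"
    if y == x[::-1]:
        return "REVERSE"
    return "IDENTITY"

def stage2(task_state, query):
    """Apply the extracted task to the query."""
    return {"UPPER": str.upper,
            "REVERSE": lambda s: s[::-1],
            "IDENTITY": lambda s: s}[task_state](query)

def forward(demos, query, patched_state=None):
    # Patching = overriding the intermediate state with one captured elsewhere.
    state = patched_state if patched_state is not None else stage1(demos)
    return stage2(state, query)

# Normal run: the demonstrations define the task.
print(forward([("abc", "ABC")], "hello"))  # HELLO

# Patching: capture the state from an uppercasing prompt and inject it into a
# run whose own demos say "reverse" -- the patched state determines behavior.
captured = stage1([("abc", "ABC")])
print(forward([("abc", "cba")], "hello", patched_state=captured))  # HELLO
```

The point of the real experiment is the same: if swapping only that internal state transfers the task, the state is acting as the "program key."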

David Andrzejewski@davidandrzej·
Frequent SF (Nob Hill) sighting recently: astonished & delighted tourists taking photos/videos of the self-driving cars.
David Andrzejewski@davidandrzej·
Paper itself is an absolute treat: great diagrams, abundant code samples, and as a nice bonus some historical context around the origins and development of the research ideas. Credit: saw it in @deliprao's "AI Research & Strategy" newsletter deliprao.substack.com
David Andrzejewski@davidandrzej·
...furthermore: outperforms XGBoost, does Lasso in one-pass, seems not to rely on nearest-neighbor. Future work: "...an intriguing possibility where we might be able to reverse engineer the Transformer to obtain better learning algorithms." (!)
David Andrzejewski@davidandrzej·
Easy to get "breakthrough fatigue" in ML recently, but in this work the NN learns *how to learn* linear regression, decision trees, 2-layer ReLU nets 😲 "...we train Transformer models to discover algorithms for different learning problems." arxiv.org/abs/2208.01066
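My reading of the setup, simplified: each training prompt is k in-context (x, y) pairs plus a query x, and the transformer's prediction is compared against what ordinary least squares fit on those k pairs would output. A sketch of the prompt construction and the OLS baseline (`make_task` and `lstsq_predict` are illustrative names, not the paper's code):

```python
import random

def make_task(dim=3, k=8, seed=0):
    """One in-context task: k (x, y) pairs from a hidden linear map, plus a query."""
    rng = random.Random(seed)
    w = [rng.gauss(0, 1) for _ in range(dim)]  # hidden task weights
    xs = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(k + 1)]
    ys = [sum(wi * xi for wi, xi in zip(w, x)) for x in xs]
    return xs[:k], ys[:k], xs[k], ys[k]  # prompt pairs, query x, target y

def lstsq_predict(xs, ys, x_query):
    """Least-squares baseline via the normal equations (naive elimination)."""
    d = len(xs[0])
    A = [[sum(x[i] * x[j] for x in xs) for j in range(d)] for i in range(d)]
    b = [sum(x[i] * y for x, y in zip(xs, ys)) for i in range(d)]
    for i in range(d):               # forward elimination
        for j in range(i + 1, d):
            f = A[j][i] / A[i][i]
            for c in range(d):
                A[j][c] -= f * A[i][c]
            b[j] -= f * b[i]
    w = [0.0] * d
    for i in reversed(range(d)):     # back substitution
        w[i] = (b[i] - sum(A[i][j] * w[j] for j in range(i + 1, d))) / A[i][i]
    return sum(wi * xi for wi, xi in zip(w, x_query))
```

With noiseless data and k > dim, OLS recovers the hidden weights exactly, so this baseline is the target the in-context-trained transformer is measured against.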
David Andrzejewski@davidandrzej·
That said, it worked! Big thanks to all.
David Andrzejewski@davidandrzej·
ML on the M1 has magically transported me back to my youth: a hellscape of Python toolchain chaos with insane fixes like “find a random .py file in your /lib and swap the order of two import statements” github.com/tensorflow/ten… (#issuecomment-975178763)
David Andrzejewski@davidandrzej·
@hkarthik @Carnage4Life is there an innocuous technical / data reason why (anecdotally) users overwhelmingly experience asymmetric “slippage” (i.e., longer waits) in the predictions?
Karthik Hariharan@hkarthik·
@Carnage4Life We have 7 data scientists and over 10 engineers working on predictions and dynamic pricing, the second of which is absolutely necessary to load balance the supply issues of delivery drivers.
David Carlton@davidcarlton·
Or why, in pop music, do we grudgingly accept remaking songs but make sure to label it as a cover, whereas in classical music it’s just the norm to play the same pieces over and over again?
David Carlton@davidcarlton·
It’s kind of weird which categories of art are ones where we embrace frequent remakes and which are ones where we don’t. Like, why are people suspicious of the concept of remaking a TV series but we perform the same plays over and over?
David Andrzejewski@davidandrzej·
Pondering the TBs of data crunched in an ultramodern cloud ML platform in order to power cutting-edge causal models targeting tightly business-aligned KPIs in one of the world’s most esteemed tech firms, all leading up to this email.