Keshi Dai

243 posts


@daikeshi

Building ML Tools @ Spotify, 🏔️✈️☕⚽🏃⛷️🦙

Joined May 2009
476 Following · 207 Followers
Keshi Dai retweeted
Yangqing Jia @jiayq
I probably have some credibility as a person who has worked on both @TensorFlow and @PyTorch (also Caffe / @onnxai / DistBelief / a few others that never saw the light), so here are my two cents:

(1) Speed doesn't really matter today as long as it is not particularly bad. Under the hood, many frameworks are just calling optimized CUDA code anyway.

(2) For ultimate speed, especially in the LLM field, many teams are writing their own runtime engines rather than using a vanilla framework. For example, vLLM, by @zhuohan123 @woosuk_k et al., is a great open-source solution. (Lepton builds its own too, and it is probably the fastest not-a-framework LLM engine right now.)

(3) TensorFlow and PyTorch are successful on their own terms. For me, TF taught me how to think like a system architect, and PyTorch taught me how to prioritize users' needs over optimization.

(4) Simply listing a performance table is relatively useless, because every use case comes with its own needs.

(4.1) For example, if you are running ads and feed recommendation systems, then 1% performance matters, and you pretty much need to write even your own FusedGemmAndReluAndMyWeirdOpsAfterThatOp to make things fast.

(4.2) For research, though, flexibility is of ultimate importance, and losing 20-30%, sometimes even 100%, of speed is fine - you gain it back by reducing people hours.

(4.3) I want to thank @soumithchintala and the FAIR team, because we had long-lasting arguments back at Meta: I was serving all ads/feed needs, so 4.1 was my only priority, and those arguments were a window into the 4.2 land. In retrospect, those were some of the most rewarding experiences of my career.

(5) People hours are important for a simple reason: salaries increase every year, and NVIDIA is reducing the dollar cost per TFLOP very quickly.

(6) A side note: the Keras PyTorch wrapper exhibits a universal 2x overhead over native PyTorch. This is... not good. A wrapper abstraction shouldn't bring such a big overhead.

(7) Ah, the good old framework wars. I feel nostalgic to have been part of them, and I am excited to move beyond frameworks and build a truly AI-native cloud at @LeptonAI.
Keshi Dai retweeted
Keshi Dai @daikeshi
@omnific9 Oh nice! I’ll be there too! Haven’t seen you for years!
Keshi Dai retweeted
Robert Nishihara @robertnishihara
We've built a ton of #LLM applications recently. Reasoning about performance & feasibility is painful without reference points. Here are the reference points we use to anchor our intuition (inspired by @JeffDean's "Numbers every engineer should know"). github.com/ray-project/ll…
Keshi Dai @daikeshi
Llama, Alpaca, Vicuña, or Guanaco
Keshi Dai @daikeshi
If Brighton is a “no-frills” ski resort, Deer Valley is its complete opposite, with all the amazing services. The vibes are so different, but with the Ikon pass, you can access both!
Keshi Dai retweeted
NASA Webb Telescope @NASAWebb
Stars: always making a dramatic exit! 🌟 Webb’s powerful infrared eye has captured never-before-seen detail of Cassiopeia A (Cas A). 11,000 light-years away, it is the remnant of a massive star that exploded about 340 years ago: go.nasa.gov/3ZJnk72
Keshi Dai @daikeshi
The 4DX movie experience was too much. Why did I get pushed in the back when two guys were fighting on the screen? I was just watching them.
Keshi Dai retweeted
Jay Hack @mathemagic1an
My thoughts on Toolformer - IMO the most important paper of the past few weeks: arxiv.org/abs/2302.04761. It teaches an LLM to use tools, like a calculator or a search engine, in a *self-supervised manner* - an interesting hack that resolves many blind spots of current LLMs. Here's how 👇
Keshi Dai retweeted
François Chollet @fchollet
Trust is your most important asset. It's easy to destroy, hard to rebuild. In periods of fast change and extreme uncertainty, focus on preserving trust.
Keshi Dai retweeted
Vala Afshar @ValaAfshar
An octopus changing colors in her sleep may be an indication she is dreaming
Keshi Dai @daikeshi
I had to explicitly ask VS Code to go down 5 levels to fix my imports: "python.analysis.packageIndexDepths": [["", 5]]
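For context, that setting belongs to the Pylance extension and goes in VS Code's settings.json (which accepts comments). A minimal sketch, assuming the tuple form quoted in the tweet; newer Pylance releases document an equivalent object form with "name" and "depth" fields:

```jsonc
{
  // Index packages 5 levels deep so auto-import and import suggestions
  // can see deeply nested modules. The empty string "" applies the depth
  // to all packages; use a package name to scope it instead.
  "python.analysis.packageIndexDepths": [["", 5]]
}
```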
Keshi Dai retweeted
Josh Baer @jbx____
We're a few years into our ML Platform journey at Spotify, building infrastructure powering the Machine Learning that makes Spotify "magic" for many of our users. And I'm thrilled to have a Product Manager opening on one of our cornerstone teams focusing o… lnkd.in/eauKcuem
Keshi Dai @daikeshi
Food hall in a church? Brilliant idea!