Michael Burkov

133 posts

Michael Burkov

@xmikebur

Sales Engineer, AI Inference Platform @nebiustf

New York, NY Katılım Eylül 2023

44 Takip Edilen32 Takipçiler

Michael Burkov@xmikebur·5d

@madiator is there a way to trace where search requests are going to?

English

19.2K

Mahesh Sathiamoorthy@madiator·5d

When CC or Codex do web search, what are they using underneath?

English

571

210.9K

Michael Burkov@xmikebur·5d

@stalkermustang @xeophon @madiator yes, relying on 3rd party

English

822

Igor Kotenkov@stalkermustang·5d

@xeophon @madiator > Codex Bing + Google U sure? I was under the impression they've moved on to their in-house index.

English

3.1K

Michael Burkov@xmikebur·21 Kas

@viggy28 Respect !

English

Vignesh Ravichandran@viggy28·19 Kas

I am getting really comfortable with cold calling. Should I be proud… or deeply concerned? 🤔

English

154

Michael Burkov@xmikebur·15 Kas

@viggy28 Rebellion 😆

English

Vignesh Ravichandran@viggy28·13 Kas

Bro, it's a small world.

English

172

Michael Burkov@xmikebur·14 Kas

RL workloads are bursty by nature. Plan capacity to avoid over/under‑provisioning, and make autoscaling a first‑class feature.

English

Michael Burkov@xmikebur·14 Kas

Batch processing is everywhere. Classic map reduce is a popular approach for working with AI data. Map operation gives you flexibility to run custom code on your data to prepare datasets for training and RL. Cursor team talked a lot about it.

English

Michael Burkov@xmikebur·14 Kas

Came back from #raysummit by @anyscalecompute . I spoke w/ and learned from AI operators at Notion, Uber, xAI and the like. AI infra trends affecting engineers working in the space:

English

Michael Burkov@xmikebur·14 Kas

Reliability across nodes matters most. In distributed jobs, hardware fails (not if but when). Building a system that restarts failed jobs in the multi node environment is a complicated engineering task. I heard this from many people.

English

Michael Burkov@xmikebur·14 Kas

AI native compute is here. In the context of ML, we are used to data processing running on CPUs, model training and serving on GPUs. This now is consolidating around CPU/GPU compute hardware capable of processing various workloads. Ray leads the way here.

English

Michael Burkov@xmikebur·25 Eki

@viggy28 @erica_wenger @a16z 100%

Vignesh Ravichandran@viggy28·25 Eki

@erica_wenger Isn't @a16z a media company first?

English

erica wenger🏕️@erica_wenger·24 Eki

“The next great VC firms will look like media companies”

Seb Johnson@SebJohnsonUK

Last week @HarryStebbings did a post on LinkedIn for a company which directly led to $3.5m in ARR for them. That is WILD. I don't how other VCs can compete with that. And it doesn't seem that most VCs are even trying to. @a16z are building this level of media distribution with @eriktorenberg at the helm, but most other Tier 1s are just relying on their brand to maintain their lead. I just don't see how that works out long term.

English

14.8K

Michael Burkov@xmikebur·24 Eki

@jackson_stokes Niche focused apps is the way to go

English

Jackson Stokes@jackson_stokes·7 Eki

OpenAIs cannibalism of the horizontal startups that build on it will be studied in HBS and GSB for decades

Jerry Liu@jerryjliu0

Take: I don’t think n8n is dead OpenAI’s AgentKit is nice but imo it takes a big company’s worth of resources to maintain a 'real' low-code builder with every integration, enterprise controls, customizability, etc. etc. Just look at any RPA company out there. Could OpenAI build this? Yes. Should they though? There’s so many other things to own.

English

615

Michael Burkov@xmikebur·23 Eki

@viggy28 makes more sense. I "consume" both of those services - substack and podcasts

English

Vignesh Ravichandran@viggy28·23 Eki

@xmikebur Good point, but a lot of initial ICP aren't using those. We see Substack, podcasts, etc.

English

Vignesh Ravichandran@viggy28·23 Eki

What are the two more knowledge libraries that we should integrate before GA?

English

180

Michael Burkov@xmikebur·23 Eki

They are here! Little robot fella making deliveries.

English

Michael Burkov@xmikebur·22 Eki

@robertnishihara @raydistributed @anyscalecompute Felix Heide's twitter links to a different person :)

English

Robert Nishihara@robertnishihara·16 Eki

Ray Summit is going to be excellent. Can't wait to hear from xAI, Perplexity, Cursor, Thinking Machines, Physical Intelligence, Applied Intuition, Prime Intellect, vLLM, and so many others. Some major themes this year: - Reinforcement learning infra - Multimodal data (lots of video) - Distributed inference - Scalable agent infrastructure - Robotics / autonomy

English

243

47.6K

Michael Burkov@xmikebur·20 Eki

@codingyash @modal While you wait run serverless GPU workloads on us. We aren't sending them to aws like Modal but process in our own data center. Dynamic scaling, pay as you go. tracto.ai/pricing - 30% cheaper than Modal playground cluster with samples at playground.tracto.ai

English

Yash | OSS@codingyash·20 Eki

@modal is down, having an incident time to create a webhook

English

Michael Burkov@xmikebur·20 Eki

@madhavsinghal_ While you wait run serverless GPU workloads on us. We aren't sending them to aws like Modal but process in our own data center. Dynamic scaling, pay as you go. tracto.ai/pricing - 30% cheaper than Modal playground cluster with samples at playground.tracto.ai

English

230

Madhav Singhal@madhavsinghal_·20 Eki

modal is down and having an incident first time ever tbh for me, but still

English

6.2K

Michael Burkov@xmikebur·20 Eki

@ivanleomk Run serverless GPU workloads here tracto.ai . We aren't sending them to aws but process in our own data center. Dynamic scaling, pay as you go. 30% cheaper than modal. playground cluster with samples at playground.tracto.ai

English

Ivan Leo@ivanleomk·20 Eki

damn @modal being down, no wonder my endpoints were 400-ing

English

498

Michael Burkov@xmikebur·15 Eki

@mlejva @e2b Scary gpu bills are afoot lol

English

Vasek Mlejnsky@mlejva·14 Eki

In the next 3-6 months, we'll start seeing first AI agents running for days and not just hours

English

7.6K

Michael Burkov@xmikebur·14 Eki

As foundational models converge in capability, the true edge for AI products comes down to 2 things: 1. How fast you can iterate. 2. How well you can use data to improve model behavior. Learn from Cursor cursor.com/blog/tab-rl

English

Keşfet

@madiator @stalkermustang @xeophon @viggy28 @anyscalecompute @erica_wenger @a16z @jackson_stokes