Michael Burkov

133 posts

Michael Burkov banner
Michael Burkov

Michael Burkov

@xmikebur

Sales Engineer, AI Inference Platform @nebiustf

New York, NY Katılım Eylül 2023
44 Takip Edilen32 Takipçiler
Mahesh Sathiamoorthy
When CC or Codex do web search, what are they using underneath?
English
52
5
571
210.9K
Igor Kotenkov
Igor Kotenkov@stalkermustang·
@xeophon @madiator > Codex Bing + Google U sure? I was under the impression they've moved on to their in-house index.
English
2
0
4
3.1K
Vignesh Ravichandran
Vignesh Ravichandran@viggy28·
I am getting really comfortable with cold calling. Should I be proud… or deeply concerned? 🤔
English
1
0
4
154
Michael Burkov
Michael Burkov@xmikebur·
RL workloads are bursty by nature. Plan capacity to avoid over/under‑provisioning, and make autoscaling a first‑class feature.
English
0
0
0
26
Michael Burkov
Michael Burkov@xmikebur·
Batch processing is everywhere. Classic map reduce is a popular approach for working with AI data. Map operation gives you flexibility to run custom code on your data to prepare datasets for training and RL. Cursor team talked a lot about it.
English
1
0
0
88
Michael Burkov
Michael Burkov@xmikebur·
Came back from #raysummit by @anyscalecompute . I spoke w/ and learned from AI operators at Notion, Uber, xAI and the like. AI infra trends affecting engineers working in the space:
English
4
0
0
46
Michael Burkov
Michael Burkov@xmikebur·
Reliability across nodes matters most. In distributed jobs, hardware fails (not if but when). Building a system that restarts failed jobs in the multi node environment is a complicated engineering task. I heard this from many people.
English
0
0
0
16
Michael Burkov
Michael Burkov@xmikebur·
AI native compute is here. In the context of ML, we are used to data processing running on CPUs, model training and serving on GPUs. This now is consolidating around CPU/GPU compute hardware capable of processing various workloads. Ray leads the way here.
English
0
0
0
17
erica wenger🏕️
erica wenger🏕️@erica_wenger·
“The next great VC firms will look like media companies”
Seb Johnson@SebJohnsonUK

Last week @HarryStebbings did a post on LinkedIn for a company which directly led to $3.5m in ARR for them. That is WILD. I don't how other VCs can compete with that. And it doesn't seem that most VCs are even trying to. @a16z are building this level of media distribution with @eriktorenberg at the helm, but most other Tier 1s are just relying on their brand to maintain their lead. I just don't see how that works out long term.

English
6
3
55
14.8K
Michael Burkov
Michael Burkov@xmikebur·
@viggy28 makes more sense. I "consume" both of those services - substack and podcasts
English
0
0
1
11
Vignesh Ravichandran
Vignesh Ravichandran@viggy28·
@xmikebur Good point, but a lot of initial ICP aren't using those. We see Substack, podcasts, etc.
English
1
0
0
44
Vignesh Ravichandran
Vignesh Ravichandran@viggy28·
What are the two more knowledge libraries that we should integrate before GA?
Vignesh Ravichandran tweet media
English
1
0
3
180
Michael Burkov
Michael Burkov@xmikebur·
They are here! Little robot fella making deliveries.
Michael Burkov tweet mediaMichael Burkov tweet media
English
0
0
0
87
Robert Nishihara
Robert Nishihara@robertnishihara·
Ray Summit is going to be excellent. Can't wait to hear from xAI, Perplexity, Cursor, Thinking Machines, Physical Intelligence, Applied Intuition, Prime Intellect, vLLM, and so many others. Some major themes this year: - Reinforcement learning infra - Multimodal data (lots of video) - Distributed inference - Scalable agent infrastructure - Robotics / autonomy
Robert Nishihara tweet media
English
8
26
243
47.6K
Yash | OSS
Yash | OSS@codingyash·
@modal is down, having an incident time to create a webhook
English
2
0
0
36
Madhav Singhal
Madhav Singhal@madhavsinghal_·
modal is down and having an incident first time ever tbh for me, but still
English
2
1
21
6.2K
Michael Burkov
Michael Burkov@xmikebur·
@ivanleomk Run serverless GPU workloads here tracto.ai . We aren't sending them to aws but process in our own data center. Dynamic scaling, pay as you go. 30% cheaper than modal. playground cluster with samples at playground.tracto.ai
English
0
0
0
96
Ivan Leo
Ivan Leo@ivanleomk·
damn @modal being down, no wonder my endpoints were 400-ing
Ivan Leo tweet media
English
2
0
4
498
Vasek Mlejnsky
Vasek Mlejnsky@mlejva·
In the next 3-6 months, we'll start seeing first AI agents running for days and not just hours
English
6
1
38
7.6K
Michael Burkov
Michael Burkov@xmikebur·
As foundational models converge in capability, the true edge for AI products comes down to 2 things: 1. How fast you can iterate. 2. How well you can use data to improve model behavior. Learn from Cursor cursor.com/blog/tab-rl
Michael Burkov tweet media
English
1
0
1
88