Mike Bilodeau

2.1K posts

@mj_bilodeau

Time and Tide | Marketing @Basetenco

SF · Joined March 2015
606 Following · 757 Followers
AT @AliesTaha
@philipkiely what is inference? how does it work? @philipkiely can i come to learn (and also maybe get ice-cream)?
Philip Kiely @philipkiely
Ice cream and books were a hit yesterday. ICYMI we're doing another, this time at the Ferry Building. Thursday 4/2 from 2-4 PM: luma.com/khxc93ju
Mike Bilodeau @mj_bilodeau
the AI sdr urge to start off every email by congratulating someone for the exact dollar amount and valuation of their last fundraise
Mike Bilodeau reposted
Madison Kanna @Madisonkanna
What is AI inference engineering, why is it such an in-demand skill, and how do you break into the field? With the author of Inference Engineering, @philipkiely, and head of training at Baseten, @oneill_c.
0:00 What is inference?
2:47 History of inference
4:59 Downstream effects of AI research on inference
13:54 What you'll learn from Inference Engineering
16:14 Advice for engineers transitioning into AI
19:00 Open source models driving inference growth
20:55 Specialization vs. frontier closed models
23:51 "Big Token" and the importance of open source AI
27:18 Where to get Inference Engineering
Monica L @mon__lim
addicted to @baseten ice cream. hope they make this a staple. inference has never tasted so good
Evan Moore @evancharles
Would love to know which genius in growth at @baseten is responsible for sponsoring an ice cream flavor
Zed @zeddotdev
Your AI code completions in Zed show up in ~200ms. That's Zeta, our Edit Prediction model, running on @baseten. We love partnering with companies who keep the bar high — Baseten is one of them.
Rachel Rapp @rapprach
Had a little too much caffeine this morning. Come say hi at KubeCon! Booth 585 💚
Mike Bilodeau reposted
Baseten @baseten
We are thrilled to welcome Sameer Paranjpye to lead our engineering organization. Welcome, Sameer! baseten.co/blog/welcome-s…
Austen Allred @Austen
New logo wall for our website. What do you think?
Lan @ad0rnai
finally met my favorite tech egirl @netcapgirl
Mike Bilodeau @mj_bilodeau
@oneill_c dang and here i thought karpathy just oneshotted you guys out of a job
Charlie O'Neill @oneill_c
Thoughts on what makes autoresearch work, and where you shouldn't expect magic.

It works once you have a clearly defined metric and a way to normalize experiments, i.e. usually wall-clock time; you can't use steps or FLOPs or tokens or whatever. This matters because, no matter how the model decides to try and reward hack, every experiment is directly comparable. For example, if you fix the steps and our autoresearch agent tries increasing the size of the model, you get fewer gradient updates per second; same with tokens. So you need to know what to fix, i.e. with this hardware and this constraint, what's the best result we can get? (This is also why Karpathy chooses bits per byte instead of cross-entropy loss, as you can change CE just by changing the vocab size.)

It then seems like everything else is a degree of freedom, but really you've fixed the hardest part of research: not the steps, but the eval. 98% of good research is coming up with the right questions to ask in the first place.

Of course, hill-climbing a metric/eval is useful, but autoresearch in its current form is to me a more general hyperparameter optimizer, where you implicitly define hyperparameters that include things like architecture and design decisions, not just what you can specify a grid search over in ints or floats. The way I personally use these sorts of loops is to keep a running list of ideas I want to try or hypotheses to test. The former are optimizing for a certain metric; the latter are often trying to figure out the contours of the problem I'm working on. Models tend to degenerate/collapse into really small niche changes without intervention from a good human researcher, as seen even in Karpathy's results, and they lack the creativity to continually drive new ideas forward based on previous results. It's like their value function is too vanilla.

I also think you basically need to constrain the agent to a single file; it gets confused and creates a lot of mess if you don't. This is part of the reason why having truss train push (our Baseten training product) as a constraint, even though it seems trivially the same as SSHing into a node, is important: it creates focus for the agent.

Finally, most people I know who have been taking advantage of LLMs in their work and research already run some sort of autoresearch loop, and have been doing so for ages. Things tend to go viral when Karpathy posts them, and he has figured out the minimal abstractions to run this, but it also needs to not be overhyped, and should be interpreted in the context of previous prompt-based optimization loops like GEPA.
Andrej Karpathy @karpathy

I packaged up the "autoresearch" project into a new self-contained minimal repo if people would like to play over the weekend. It's basically the nanochat LLM training core stripped down to a single-GPU, one-file version of ~630 lines of code, then:
- the human iterates on the prompt (.md)
- the AI agent iterates on the training code (.py)
The goal is to engineer your agents to make the fastest research progress indefinitely and without any of your own involvement. In the image, every dot is a complete LLM training run that lasts exactly 5 minutes. The agent works in an autonomous loop on a git feature branch and accumulates git commits to the training script as it finds better settings (lower validation loss by the end) of the neural network architecture, the optimizer, all the hyperparameters, etc. You can imagine comparing the research progress of different prompts, different agents, etc. github.com/karpathy/autor… Part code, part sci-fi, and a pinch of psychosis :)
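The loop both tweets describe, fix a hard per-run budget, let an agent mutate the training setup, and keep a change only when the fixed eval improves, can be sketched roughly as below. Every name here is a hypothetical stand-in, not the actual autoresearch code: `propose_edit` would be an LLM agent editing the training script, and `train_and_eval` would be a real fixed-wall-clock training run returning validation loss (a toy quadratic objective is used in its place).

```python
import random

def propose_edit(config: dict) -> dict:
    """Stand-in for the LLM agent: perturb one hyperparameter."""
    candidate = dict(config)
    key = random.choice(list(candidate))
    candidate[key] *= random.choice([0.5, 2.0])
    return candidate

def train_and_eval(config: dict) -> float:
    """Stand-in for one fixed-budget training run; returns val loss.
    Toy bowl-shaped objective with its optimum at lr=0.01, width=256."""
    return (config["lr"] - 0.01) ** 2 + ((config["width"] - 256) / 256) ** 2

def autoresearch(budget_runs: int = 50) -> tuple:
    """Each iteration is one complete run; accept only improvements,
    analogous to committing to the feature branch on a better val loss."""
    config = {"lr": 0.1, "width": 64.0}
    best_loss = train_and_eval(config)
    for _ in range(budget_runs):
        candidate = propose_edit(config)
        loss = train_and_eval(candidate)
        if loss < best_loss:  # "git commit" only when the eval improves
            config, best_loss = candidate, loss
    return config, best_loss
```

Because the run budget (not steps or tokens) is what's fixed, every candidate is directly comparable and a "bigger model, fewer updates" edit can't game the metric; the acceptance test makes the best loss monotonically non-increasing.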

Mike Bilodeau @mj_bilodeau
the creativity required to get this type of gain is what has always made me love infra (and our infra team). end-users of applications will never see it or interact with it, they'll just feel it when the products they love get better.
Rachel Rapp @rapprach

x.com/i/article/2034…
