Lukas Biewald
3.8K posts

Lukas Biewald
@l2k
Cofounder/CEO of @wandb - tools for AI developers acquired by @coreweave
ÜT: 37.762205,-122.420359 Katılım Şubat 2008
4.3K Takip Edilen24.6K Takipçiler

I love letting my six year old daughter create crazy cursor prompts and seeing what it spits out.
If you find this summary intriguing, you can try the game at otto-game-browser.netlify.app - it did an impressive job with a tough spec

English
Lukas Biewald retweetledi

@yanatweets This is really cool! I’ve been doing a lot of CAD experiments in other domains. My experience sounds similar that the LLMs are terrible at generating it directly but can generate python code to generate good CAD files
English

Both me and my wife uploaded the same photo of a cut on my son's head and asked GPT whether or not to take him to the hospital.
GPT told me it was fine to watch it and wait and told my wife he definitely needed to see a doctor immediately - in both cases confirming our prior bias.
It's possible that this was just a borderline case and we randomly got different results, or that this was caused by a slight variation in our prompts. But it does make more suspicious that some artifact of the post training is causing these models to tell users what they want to hear. As the models know more and more about us, and there's more pressure to grab marketshare, this is only going to get worse. Sadly it's way more fun to have your biases confirmed.
English

@l2k A lot of these look like straightforward improvements.
I'm surprised it didn't just game the benchmark here and win here by just increasing concurrent users, at the cost of tanking per user throughput. Looks like there is one batch size incrrease.
English
Lukas Biewald retweetledi

AI coding tools are great at helping new developers ramp faster.
Mike Cannon-Brookes talks about why that’s a good thing, but also why it doesn’t remove the need for experienced engineers.
Someone still has to look at the code, understand it, and stand behind it. It’s a useful way to think about how teams can scale with these tools.
English
Lukas Biewald retweetledi
Lukas Biewald retweetledi

Introducing W&B Skills for coding agents!
Watch us install the skill, point it at a live fine-tuning project with thousands of traces and RL training runs, then query it all from the terminal in seconds.
Also works with @weave_wb to pull in your agent traces and spot failures.
English
Lukas Biewald retweetledi

I'm claiming my AI agent "coldclawlukas" on @moltbook 🦞
Verification: coast-5GXV
English














