Will Kurt @willkurt · 4.5K posts

Ferment my own alcohol, run my own LLMs, that's just the kinda guy I am.

Seattle, WA · Joined April 2007
830 Following · 6.9K Followers

Pinned Tweet
Will Kurt @willkurt
🥳 Check out: Token-Explorer! 🤖 Interact with and explore LLM token generation! Features:
- Step through token selection
- Remove tokens to explore alternate paths
- Fork a prompt and quickly switch between forks
- Visualize all token probabilities and entropy!
- OSS (github in replies)
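Those features all sit on top of the raw next-token distribution the model exposes at every step. A minimal sketch of that core loop, using Hugging Face transformers with gpt2 as a stand-in model (Token-Explorer's actual implementation may differ, and top_k here is an illustrative choice):

# Inspect the model's full next-token distribution and its entropy
# before committing to a token -- the core of a token-stepping tool.
# Model choice (gpt2) and top_k are illustrative assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

def step(prompt: str, top_k: int = 5) -> None:
    """Print the top-k next tokens, their probabilities, and the entropy."""
    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits[0, -1]           # next-token logits
    probs = torch.softmax(logits, dim=-1)
    entropy = -(probs * torch.log(probs + 1e-12)).sum()  # in nats
    top = torch.topk(probs, top_k)
    for p, idx in zip(top.values, top.indices):
        print(f"{tokenizer.decode(idx.item())!r:>12}  p={p:.3f}")
    print(f"entropy: {entropy:.2f} nats")

step("The capital of France is")

Stepping forward is just appending the chosen token to the prompt and calling this again; exploring an alternate path is re-running with a different token appended instead.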

David Curran @iamreddave
@willkurt Smullyan has great covers in the early editions. What cover does that one have?
[image]

Will Kurt @willkurt
Reading two wildly different books (Žižek’s “Too Late to Awaken” and Smullyan’s “To Mock a Mockingbird”) and each makes reference to the exact same joke (with different names/attributions)!
[image] [image]

Will Kurt @willkurt
@oprydai How many GPUs do you have and how many requests do you expect to be serving concurrently? If the answer to both is roughly 1 then llama.cpp is a good place to start (esp if you have < 1 GPU). Otherwise you'll probably get more value out of vLLM
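A handy detail behind that advice: both backends speak the same protocol. llama.cpp's llama-server and vLLM each expose an OpenAI-compatible HTTP endpoint, so client code can stay identical while you swap the server underneath. A minimal sketch (the ports and model name are illustrative, not from the thread):

# Start one of (flags/ports illustrative):
#   llama.cpp: llama-server -m model.gguf --port 8080
#   vLLM:      vllm serve Qwen/Qwen2.5-7B-Instruct --port 8000
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8080/v1", api_key="unused")
resp = client.chat.completions.create(
    model="local",  # llama-server ignores this; vLLM expects the served model name
    messages=[{"role": "user", "content": "Why pick vLLM over llama.cpp?"}],
)
print(resp.choices[0].message.content)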

Mustafa @oprydai
A question for the hardcore LLM folks: vLLM vs llama.cpp vs Ollama etc.? Which one? The use case is hosting it locally on the lab's machine, for tool-calling agent-based apps, and also experimenting with LLMs for different analyses. My priority is reliability + all new advances on the architecture, e.g. turboquant etc.

Will Kurt @willkurt
It's pretty funny how often on both X and LinkedIn I see posts of the form "You CAN'T do X with Y!!!" while I am actively Xing with Y.

Will Kurt @willkurt
@TheAhmadOsman Heat and electricity bills? I run all my image models through my RTX 4090, but I keep any long-running LLMs on my M3 Max MBP. Ultimately it boils down to bandwidth vs wattage. That said, I've not seen a compelling argument in favor of the DGX.
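The bandwidth-vs-wattage trade-off reduces to back-of-the-envelope decode math: single-stream generation speed is roughly memory bandwidth divided by the bytes read per token (about the model's size). The hardware specs below are approximate public figures, and the model size is a hypothetical ~18 GB 4-bit quant:

# tokens/s ≈ memory bandwidth / bytes read per token (≈ model size).
# Hardware specs are approximate; the 18 GB model is an assumption.
model_gb = 18.0

hardware = {
    # name: (memory bandwidth in GB/s, rough inference power draw in watts)
    "RTX 4090":   (1008, 450),
    "M3 Max MBP": (400,  60),
}

for name, (bw_gbs, watts) in hardware.items():
    tps = bw_gbs / model_gb
    print(f"{name:12} ~{tps:5.1f} tok/s  ~{tps / watts:.3f} tok/s per watt")

The 4090 wins raw speed; the laptop wins tokens per joule, which is the "heat and electricity bills" point.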

Ahmad @TheAhmadOsman
Please stop saying that the DGX Spark or any unified-memory machine competes with GPUs in any meaningful way. It's misleading. Speak of the pros and cons of each, but stop misinforming your audience, for whatever reason you're doing it.

Will Kurt @willkurt
Much of the difference in success with agents boils down to whether or not you're solving a problem with a directly measurable outcome. I also believe people will increasingly question why anyone is spending significant time writing software without a measurable outcome.

Will Kurt @willkurt
Weirdly starting to enjoy X again; you just have to recognize that there are only a handful of real/interesting people left. A few likes from the right people mean what hundreds from randos did a few years ago.

Will Kurt @willkurt
@jun_song @LottoLabs People really underestimate how much everything boils down to memory bandwidth and power consumption. 128GB of memory isn’t much better than 24GB if your bandwidth is still ~250 GB/s, and your local model isn’t really “free” if you need 1200 watts to do inference.
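Both halves of that claim are quick to sanity-check with the same rule of thumb (tokens/s ≈ bandwidth / model size) plus joules per token. The model sizes below are illustrative 4-bit quants; the 250 GB/s and 1200 W figures come from the tweet:

# More memory lets you *fit* a bigger model, but at fixed bandwidth
# bigger only means slower. Model sizes are illustrative assumptions.
bandwidth_gbs = 250
for model_gb in (12, 35, 70):
    print(f"{model_gb:3d} GB model: ~{bandwidth_gbs / model_gb:4.1f} tok/s")

# And a 1200 W rig producing 50 tok/s (illustrative) pays 24 J per token:
watts, tps = 1200, 50
print(f"energy per token: {watts / tps:.0f} J")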

Lotto @LottoLabs
Why would anyone get an MBP over a GB10?

Will Kurt @willkurt
@nptacek Seriously, the last chapter of my soon-to-be-released book on Stable Diffusion is all about using proprietary models to improve base SD. As long as new information can be added, a model can be improved.
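One standard shape for "use proprietary models to improve base SD" is synthetic data: have a strong proprietary vision model write detailed captions for your images, then fine-tune SD on the resulting (image, caption) pairs. A hedged sketch of the captioning half only; this is not necessarily the book's method, and the model name and paths are illustrative:

# Build (image, caption) pairs with a proprietary vision model, as a
# dataset for later Stable Diffusion fine-tuning. Assumes OPENAI_API_KEY.
import base64, json, pathlib
from openai import OpenAI

client = OpenAI()

def caption(path: pathlib.Path) -> str:
    img_b64 = base64.b64encode(path.read_bytes()).decode()
    resp = client.chat.completions.create(
        model="gpt-4o-mini",  # illustrative choice of proprietary model
        messages=[{
            "role": "user",
            "content": [
                {"type": "text",
                 "text": "Write a detailed one-sentence caption for this image."},
                {"type": "image_url",
                 "image_url": {"url": f"data:image/png;base64,{img_b64}"}},
            ],
        }],
    )
    return resp.choices[0].message.content

pairs = [{"image": str(p), "caption": caption(p)}
         for p in sorted(pathlib.Path("images").glob("*.png"))]
pathlib.Path("captions.jsonl").write_text(
    "\n".join(json.dumps(r) for r in pairs) + "\n")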

CuddlySalmon @nptacek
I have never understood the synthetic data skeptics. It's like they haven't played around with models at all, just stuck to whatever the prevailing view was.

Will Kurt @willkurt
@jun_song So apparently cats do this because they are trying to imitate you, so if you have an old, small laptop for your cat it *might* use that instead!

송준 Jun Song @jun_song
What's an effective way to keep my cat from getting on top of my MacBook? I'm having a serious problem right now.

Will Kurt @willkurt
Yes you should host your own LLM, and yes you should host your own private git server, but you should also ferment your own booze! Local apples, home-fermented cider!
[image]

Will Kurt retweeted
Ahmad @TheAhmadOsman
Qwen 3.6 27B means the permanent underclass thing has been canceled btw

Will Kurt @willkurt
@LottoLabs Exactly. I don’t need local models to be better than proprietary SotA *today*, I just need them to be as good as proprietary models were when I first started being able to reliably trust agents to do their thing.

Lotto @LottoLabs
Assume Qwen 3.6 27B isn’t actually Opus-level or even Sonnet-level in knowledge. It’s still much better than Sonnet 3.5 level. And that was SotA not long ago, a loved model. We’re in crazy times.

Will Kurt @willkurt
This is honestly the most exciting time in computing I've seen. It's not just "AI is amazing!", it's that we're starting to think in systems again.

My current homelab is ridiculous. My old computers don't just sit there; they're all performing some role, all communicating with each other. I have one box serving an LLM, another a ComfyUI server, yet another running hermes-agent. I've got a backend communicating with custom, hyper-specific Chrome extensions. For the first time since the early 2000s I remember which ports I'm using!
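A setup like that stays manageable with a tiny service registry plus a port check. Only the shape of the homelab comes from the tweet; every hostname and port below (except ComfyUI's well-known default, 8188) is a made-up assumption:

# Probe each homelab service's TCP port to see what's up.
# Hostnames and most ports are hypothetical.
import socket

SERVICES = {
    "llm server":   ("llmbox.local",   8080),
    "comfyui":      ("imagebox.local", 8188),  # ComfyUI's default port
    "hermes-agent": ("agentbox.local", 9000),
    "backend api":  ("backend.local",  3000),
}

for name, (host, port) in SERVICES.items():
    try:
        with socket.create_connection((host, port), timeout=1):
            status = "up"
    except OSError:
        status = "down"
    print(f"{name:13} {host}:{port:<5} {status}")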

Will Kurt retweeted
Taelin @VictorTaelin
Kimi 2.6 solved the HVM hard debug prompt!!? Took 3 attempts, but it did!! For context, Gemini 3 was the first to solve it, inconsistently. Even GPT 5.4 fails sometimes. And this problem took me weeks back then. Now an open model solves it! Also captivated by its code style.
[image]