void *
2.3K posts

@jamesmcn
Chicago maximalist, Japan enthusiast.





Just messing around with some brutalist versions of everyday tech


paid $7 for this strawberry matcha but it's really good so I've learned the wrong lesson from this experience










Major League Baseball airs in the morning in Japan, so fans there technically eat breakfast with it on TV. Here’s their #openingday commercial. No hyperbole: it might be better than any US MLB commercial I’ve seen. Well done and worth the watch for any baseball fan.

companies are blindly burning thousands of dollars on fine-tuning before they even know if the base model can do the job. i've seen it firsthand: someone fine-tunes a large model on enterprise GPUs and the result matches a base model they could have tested for free.

the smarter path is to test on a single consumer GPU first. run your actual workload on a 3090-5090 with the base model. find where it breaks. find where it's already good enough. then fine-tune only the gaps. then scale compute.

if your fine-tuning is the bottleneck, throwing more hardware at it won't fix it. the problem is usually upstream. a $900 GPU and 2 hours of testing would have told you that before you burned $10K in compute.

i test models on consumer hardware every day. the number of times a well-chosen base model at the right quant outperforms a fine-tuned model on 10x the hardware would surprise you.
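a rough sketch of what "find where it breaks, then fine-tune only the gaps" could look like in practice. everything here is an assumption for illustration: `fake_base_model` is a hypothetical stand-in for whatever local inference call you use, the test cases are toy examples, and the 0.9 threshold is arbitrary. the idea is just to score the base model per task category and flag only the categories that fall short.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, List, Tuple

# Each case is (category, prompt, check), where check(output) returns True on a pass.
Case = Tuple[str, str, Callable[[str], bool]]


def pass_rates(generate: Callable[[str], str], cases: Iterable[Case]) -> Dict[str, float]:
    """Run every prompt through the model and return the pass rate per category."""
    passed = defaultdict(int)
    total = defaultdict(int)
    for category, prompt, check in cases:
        total[category] += 1
        if check(generate(prompt)):
            passed[category] += 1
    return {category: passed[category] / total[category] for category in total}


def gaps(rates: Dict[str, float], threshold: float = 0.9) -> List[str]:
    """Categories where the base model falls below the threshold (fine-tuning candidates)."""
    return sorted(category for category, rate in rates.items() if rate < threshold)


# Hypothetical stand-in for a real base model; swap in your own local inference call.
def fake_base_model(prompt: str) -> str:
    return prompt.upper()


# Toy workload: replace with prompts and checks drawn from your actual use case.
cases = [
    ("uppercase", "hello", lambda out: out == "HELLO"),
    ("uppercase", "abc", lambda out: out == "ABC"),
    ("reverse", "abc", lambda out: out == "cba"),
]

rates = pass_rates(fake_base_model, cases)
print(rates)
print(gaps(rates))
```

with the toy model above, only the "reverse" category gets flagged, so that is the only place fine-tuning would be worth considering.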











