StrongEngineer_
7.1K posts

StrongEngineer_
@hotschmoe
Christian • Father of 4 • Structural Engineer • e/acc • BTB Jungle Lurker • too many labels


ELON MUSK: AI WILL FORCE GOVERNMENTS TO SEND CASH DIRECTLY TO PEOPLE AS DEFLATION BECOMES THE BIGGEST THREA

Running a huge model at home like GLM 5.2 Q2 where tk/s is low is a lot like 3D Printing large models. You wake up and you either have a gift waiting for you, or a big mess, it's all about the surprise! 😂 Wish GPUs were as cheap as 3d printers.

one intel b70 ($950), first day setup Qwen3.6-27B W4A16 (autoround), *no* MTP 128k context, kv cache fp16 1 session: 28.1 tok/s 2 concurrent sessions: 52.0 tok/s cumulative 4 concurrent sessions: 87.8 tok/s cumulative 64 concurrent sessions: 234.7 tok/s cumulative



one intel b70 ($950), first day setup Qwen3.6-27B W4A16 (autoround), *no* MTP 128k context, kv cache fp16 1 session: 28.1 tok/s 2 concurrent sessions: 52.0 tok/s cumulative 4 concurrent sessions: 87.8 tok/s cumulative 64 concurrent sessions: 234.7 tok/s cumulative






















