
Jamin
1.2K posts

@jaminlabs
Building https://t.co/AQ08mro0s5


Any new features we must have in the next version of glm?





good luck paying $20k only to find out you can generate 20 tok/s. even running 24/7, that's just 50m tokens/month. for glm, at $4.40/m, this is $228 in value. any $200 sub gives you significantly more. and this math means the break-even is 7.3 years, not 6 months. by that time the hardware would die if it's running 24/7.
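The arithmetic in this post can be checked with a short sketch. It assumes a 30-day month and treats every generated token as worth the quoted $4.40 per million; the function names are hypothetical and chosen only for illustration.

```python
# Rough check of the break-even math in the post above.
# Assumptions: a 30-day month, 24/7 generation, and every token valued
# at the quoted price per million tokens.
SECONDS_PER_MONTH = 30 * 24 * 60 * 60


def monthly_tokens(tok_per_s):
    """Tokens generated in one month of continuous running."""
    return tok_per_s * SECONDS_PER_MONTH


def monthly_value_usd(tok_per_s, price_per_million_usd):
    """Dollar value of one month of tokens at a given per-million price."""
    return monthly_tokens(tok_per_s) / 1e6 * price_per_million_usd


def break_even_years(hardware_usd, tok_per_s, price_per_million_usd):
    """Years until the token value produced equals the hardware cost."""
    months = hardware_usd / monthly_value_usd(tok_per_s, price_per_million_usd)
    return months / 12


tokens = monthly_tokens(20)
value = monthly_value_usd(20, 4.40)
years = break_even_years(20_000, 20, 4.40)

print(f"{tokens / 1e6:.1f}M tokens/month")
print(f"${value:.0f}/month in token value")
print(f"{years:.1f} years to break even")
```

With the post's inputs this gives about 51.8M tokens per month, about $228 per month, and about 7.3 years, which matches the figures quoted. The sketch ignores electricity, hardware depreciation, and prompt-processing throughput.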

1 - So GLM 5.2 is 700b parameters (ish)
2 - 4x DGX Sparks can supposedly handle up to 700b parameters (give or take)
3 - GLM 5.2 is supposedly within striking distance of the performance of GPT 5.5 and Opus 4.8. In my brief tests, it's really not shabby at all.
4 - So for $20k, you can get near the frontier on your table.
5 - Extrapolate the trend, and you could have Mythos/5.5 Pro-class models in your dining room for the cost of a cheap car less than five years from now. Even without extrapolation, we're already running near the frontier locally.
6 - Paying real API costs, I could easily blow through $3,000 per month coding and running agents. The machine pays for itself in 6-7 months conservatively.
7 - In 3-5 years, most power users of AI will self-host.
8 - Am I missing something?
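The 6-7 month payback in point 6 holds only if the local machine actually replaces the full $3,000 per month of API usage. A minimal sketch of that condition follows, assuming a 30-day month and borrowing the $4.40 per million token price from the reply above as a stand-in rate; the function names are hypothetical.

```python
# Payback period, and the sustained throughput needed to displace a given
# monthly API bill. Assumptions: a 30-day month, 24/7 generation, and a
# flat price per million tokens (the $4.40 figure comes from another post).
SECONDS_PER_MONTH = 30 * 24 * 60 * 60


def payback_months(hardware_usd, displaced_spend_usd_per_month):
    """Months until displaced API spend equals the hardware cost."""
    return hardware_usd / displaced_spend_usd_per_month


def required_tok_per_s(spend_usd_per_month, price_per_million_usd):
    """Sustained tokens per second needed to generate that spend's worth of tokens."""
    tokens_per_month = spend_usd_per_month / price_per_million_usd * 1e6
    return tokens_per_month / SECONDS_PER_MONTH


months = payback_months(20_000, 3_000)
throughput = required_tok_per_s(3_000, 4.40)

print(f"{months:.1f} months to pay back")
print(f"{throughput:.0f} tok/s sustained to displace the spend")
```

Under these assumptions the payback is about 6.7 months, but displacing $3,000 per month at $4.40 per million tokens takes roughly 263 tok/s sustained. If the $3,000 bill is priced at a higher frontier-model rate, the required throughput is lower.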

GLM-5.2 can now be run locally!🔥 The 2-bit model retains ~82% accuracy after we shrunk it from 1.51TB to 238GB (-84% size). Run on a 256GB Mac or RAM/VRAM setups. GLM-5.2 is the strongest open model to date. Guide: unsloth.ai/docs/models/gl… GGUF: huggingface.co/unsloth/GLM-5.…




NEW: Microsoft CEO Satya Nadella warns an AI future dominated by a few models is “not stable.”













