Jamin
1.2K posts


China’s AI playbook: kill OpenAI and anthropic with free great models. Make it free. Then use cheap electricity to export compute as well. Currently the blocker is chip but Hauwei would catch up soon. Imagine a world where instead of paying hundreds of billions to OpenAI and anthropic, you pay almost zero to similar level of intelligence with cheap cheap inference. What’s gonna happen?

Gemma 4 31B is now available in Public Preview on Cerebras. Our first multimodal model runs at over 1,800 tokens/s for ultra-fast image and text workflows. Give it a try: chat.cerebras.ai


Any new features we must have in the next version of glm?





good luck paying $20k only to find out you can generate 20 tok/s. even running 24/7, that's just 50m tokens/month. for glm, at $4.40/m, this is $228 in value. any $200 sub gives you significantly more. and this math means the break-even is 7.3 years not 6 moths. by that time the hardware would die if it's running 24/7.

1 - So GLM 5.2 is 700b parameters (ish) 2 - 4x DGX Sparks can supposedly handle up to 700b parameters (give or take) 3 - GLM 5.2 is supposedly in striking distance of the performance of GPT 5.5 and Opus 4.8. In my brief tests, it's really not shabby at all. 4 - So for $20k, you can get near the frontier on your table. 5 - Extrapolate the trend, and you could have mythos/5.5 pro - class models in your dining room for the cost of a cheap car less than five years from now. Even without extrapolation, we're already the near frontier running locally. 6 - Paying real api costs, I could easily blow through $3,000 per month coding and running agents. The machine pays for itself in 6-7 months conservatively. 7 - In 3-5 years, most power users of AI will self-host. 8 - Am I missing something?

GLM-5.2 can now be run locally!🔥 The 2-bit model retains ~82% accuracy after we shrunk it from 1.51TB to 238GB (-84% size). Run on a 256GB Mac or RAM/VRAM setups. GLM-5.2 is the strongest open model to date. Guide: unsloth.ai/docs/models/gl… GGUF: huggingface.co/unsloth/GLM-5.…










