MiniCPM5-1B is now fully open source, including weights, training data, and deployment code. 🚀1B params, #1 on Artificial Analysis among all open models under 2B (17.9 pts). 🤖 modelscope.cn/models/OpenBMB…
Beats Qwen3.5-2B (16.3) at half the parameters. Outperforms Qwen3.5-0.8B and LFM2.5-1.2B-Thinking on knowledge, math, code, and tool use.
INT4: 0.5GB. Runs on phones, browsers, and edge devices.
Trained with ForgeTrain, the world's first production-grade LLM pretraining framework written entirely by AI — zero human programmers, 10% faster than NVIDIA Megatron.
Using qwen2.5-coder:3b, ~1.9 GB size, which is really great to run locally for my DevOps escapades.
Updated my Quickshell Omni menu to make it easier to find what I need.
@sergionoodle I have not tried these models with QS. Using the Claude models usually.
I don't use anything specific skill or reference doc either. Most likely I should!