FP32 Monastic
499 posts

FP32 Monastic
@kv_cached
Shipping LLM systems: RAG, agents, serving, eval. Attention seeker (self-attention only). No vibes, just logits.


smallest Arabic model ever Built an Arabic RAG agent on @karpathy's nanochat Trained on: • ArabicText-Large (244M words) • Hala-4.6M Arabic instructions • MorphBPE tokenizer (76.8K vocab) Features: ✓ RTL UI with citations ✓ Offline RAG + search ✓ Consensus decoding ✓ Runs on 4GB GPU $0 cost. Fully offline. github.com/h9-tec/arabic_…

@r_chirra I fixed it :) deployed live now. This was done by doing a round of synthetic data generation to collect a 1000 multi-turn conversations (given a bunch of information including the readme of the nanochat project), and then mixing that into midtraining and SFT. fun!









New paper 📜: Tiny Recursion Model (TRM) is a recursive reasoning approach with a tiny 7M parameters neural network that obtains 45% on ARC-AGI-1 and 8% on ARC-AGI-2, beating most LLMs. Blog: alexiajm.github.io/2025/09/29/tin… Code: github.com/SamsungSAILMon… Paper: arxiv.org/abs/2510.04871







Introducing Command R7B: the smallest, fastest, and final model in our R series of enterprise-focused LLMs! It delivers a powerful combination of state-of-the-art performance in its class and efficiency to lower the cost of building AI applications. cohere.com/blog/command-r…






