

I Built a mini LLM fully in Rust. No libraries other than ndarray + rand. Works just like the real thing, just a lot smaller. I've got learnable embeddings, Self-Attention (not multi-head), Forward Pass, Layer-Norm, Logits etc.. Training set is tiny, but it can learn a few facts! Takes a few minutes to train fully in memory.




















