Me Not Hacker retweeted

I pay Google $13.99 CAD to train a 9B LLM on an A100 80GB GPU.
It takes:
> 10 minutes to set up the notebook
> 7 hours to train the model
> 1.5 hours for eval testing
> 1.25 hours for validation testing
> 30 minutes for GGUF/MLX conversion
Overnight, I run Codex (with the computer-use Chrome extension) on a new notebook.
In the morning I get a new custom-trained model.
It's not hard to train SLMs anymore. Wake up.
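
For a rough sense of why this fits in one night, the stage times quoted above can be added up. This is only a sketch: it assumes the stages run back to back with no idle time between them, and the dictionary labels are names chosen here for illustration.

```python
from datetime import timedelta

# Stage durations as quoted in the post.
# Assumption: stages run sequentially, one after another.
stages = {
    "notebook setup": timedelta(minutes=10),
    "training": timedelta(hours=7),
    "eval testing": timedelta(hours=1.5),
    "validation testing": timedelta(hours=1.25),
    "GGUF/MLX conversion": timedelta(minutes=30),
}

# Sum the durations, starting from a zero timedelta.
total = sum(stages.values(), timedelta())
total_minutes = int(total.total_seconds() // 60)
hours, minutes = divmod(total_minutes, 60)

print(f"Total: {hours} h {minutes} min")  # Total: 10 h 25 min
```

Under that assumption the whole pipeline takes about 10.5 hours end to end, with training accounting for roughly two thirds of it.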

