Sabitlenmiş Tweet

My post-trained 14B model is now #1 on the French gov «Bac» benchmark, built from real national exam questions, ahead of DeepSeek-R1 70B, Mistral Large, Llama 3.3 & more.
Started from the Phi-4 base model — model merging + DPO made the difference.
Scale isn’t enough. Post-training is the key (right @maximelabonne ?😉)

English






















