Rafi Ayub

@theayubinator
AI at @AnthropicAI. Former core torchtune dev and LLM fine-tuning at @MetaAI and @PyTorch.

If you’re excited about optimizing code that runs equally well on a single GPU or on thousands, and you can submit a single substantial PR to a major OSS library, we want you on the PyTorch team - especially if you’re early in your career.

A few researchers at Anthropic have, over the past year, had a part-time obsession with a peculiar problem. Can Claude play Pokémon? A thread:

btw torchtune is officially doing reinforcement learning - the GRPO implementation is merged, the entire codebase is really clean and modifiable, so go out there, reinforcement learn your LLMs, and any contributions are welcome!

David Lynch (1946-2025) 🖤

AI is getting better at math, but we're just scratching the surface of what it will be capable of doing IMO (e.g., o3 only got 25% on FrontierMath). So we're super excited to release FineMath, the best open math dataset for everyone to use. Currently the number one trending dataset on HF!

We're happy to announce the addition of knowledge distillation to torchtune, a PyTorch library for easily authoring, fine-tuning & experimenting with #LLMs. Check it out: hubs.la/Q02YyqDG0 Distilling Llama3.1 8B into 1B in #torchtune

torchtune v0.4.0 is out: github.com/pytorch/torcht…!
🤏 reduce memory by a further 20% using activation offloading
🧠 try out the newest cutting-edge models from qwen2.5
💪 multimodal training is BIGGER and BADDER w/ support for Llama3.2V 90B
Happy tuning 🫡

Coming soon! kagglehub 💙 torchtune integration github.com/pytorch/torcht…

Good morning. Gentle reminder from Karpathy

Machines of Loving Grace: my essay on how AI could transform the world for the better darioamodei.com/machines-of-lo…