Firewalker retweeted

- I work on post-training and RL
- I am an expert at the alphabet soup - DPO, PPO, GRPO
- my papers are cited by all the OpenAI researchers
- dropped a SOTA 10b LLM just a few weeks ago
- my dreams are about LLM alignment techniques
Still got laid off by Meta, who hired a guy with my same profile for $100M a year 😭😭