
0xchains
555 posts

0xchains
@0xchains
Blockchain and AI investment in venture team @antalphagroup @BITMAINtech | Make 🍵 @GTCSE @pennEngineers






This episode features an interview with Yao Shunyu @ShunyuYao14 , Research Scientist at Google DeepMind. Yao has held research scientist roles at both Anthropic and Google DeepMind, contributing to the development of key models including Claude 3.7, 4.5, and Gemini 3. Yao Shunyu is not your typical nerd. Every now and then, he’ll catch you off guard with a flash of irreverence. “None of the old guard are your relatives — so if you think someone’s being dumb, they’re just being dumb. Say it. No big deal.” (laughs) “Everyone’s a surfer now, but what really matters is the wave — not the person riding it.” “AI doesn’t actually require that much brainpower — I mean it genuinely doesn’t — most of this is work any undergrad could do. The most important quality in this industry is reliability: being meticulous, and taking responsibility for what you put out.” “You don’t need to worry too much about ruffling feathers with your opinions. As long as your views are internally consistent — not just taking random shots at people, but grounded in your own genuine understanding — there are objective standards for how you’re doing in this field. People will respect you for it.” Let us have a little fun with this one! 😄 youtu.be/ttkd0t5qTD4?si…






If you love fine-tuning open-source models (like me), then listen. > Start with 1B, 2B, 4B, and 8B models. (Don't start with a 27B model or bigger at first.) > Use WebGPU providers. I use Google Colab Pro for any model smaller than 9B. A single A100 80GB costs around $0.60/hr, which is cheap. Enough for small models. > Don’t buy GPUs unless you fine-tune 7 to 10 models. You'll understand the nitty-gritty in the process. > Use Codex 5.5 × DeepSeek v4 Pro to create datasets. Codex to plan, DeepSeek v4 Pro to generate rows. > Use Unsloth's instruct models as a base from Hugging Face. Yes, there are others too, but Unsloth also provides fast fine-tuning notebooks. > Use Unsloth's fine-tuning notebooks as a reference. Paste them into Codex, and Codex will write a custom notebook with the configs you need. > Spend 1 day learning about: - SFT (supervised fine-tuning) - RL training (GRPO, DPO, PPO, etc.) - LoRA / QLoRA training - Quantization and types - Local inference engines (llama.cpp) - KV cache and prompt cache > Just get started. Claude, Codex, and ChatGPT can design a step-by-step plan for how you can fine-tune your first AI model. Future tech is moving toward small 5B to 15B ELMs (Expert Language Models) rather than general 1T LLMs. So fine-tuning is an important skill that anyone can acquire today. Tune models, test them, use them. Then fine-tune for companies and make a career out of it. (Companies pay $50k+ to fine-tune models on their data so they can get personalized AI models.) Shoot your questions below. I'll be sharing in-depth raw findings about this topic in the coming days.



Downloading now... 1M token context window with supposedly usable coding agent capability all on a 128GB Macbook Pro is 🤯





Codex grew programmatic policies with no neural nets: max score on Breakout, and SOTA-level scores on MuJoCo. Maybe heuristics were not too weak. Maybe they were just too expensive to maintain. Maybe it's the next paradigm. trinkle23897.github.io/learning-beyon…





