Gavin Bains
46 posts



holy wow they merged it



We are hiring a lot of former founders at @fleet_ai! So much, that we have our own application. just hit me up if you are looking for your next big swing fleetai.com/careers/former…



It's that time of year... Selling my SF condo. Open house tomorrow (2/22). • 2 bd, 2bath penthouse • 1,688 sq ft w/ rooftop patio • Rincon hill, near the water • 2 parking spots Link with details below. :)


Today, we're rolling out an Advanced version of Perplexity Deep Research, achieving state-of-the-art performance on external and internal benchmarks, beating every other deep research tool on accuracy, usability, and reliability across all verticals.


Education has promised personalization for decades. We’re making it real. Flint raised $15M in Series A funding (co-led by @BasisSet & @patronfund) to bring AI-powered personalized learning to every classroom. Every student deserves learning made for them.

(1/5) New post: "Mismatch Praxis: Rollout Settings and IS Corrections". We pressure-tested solutions for inference/training mismatch. Inference/training mismatch in modern RL frameworks creates a hidden off-policy problem. To resolve the mismatch, various engineering (e.g., FP16 unification, deterministic kernels) and algorithmic (e.g., importance sampling) fixes have been proposed. In this work, we examine how rollout settings (temp, top-p, and top-k) affect mismatch, and how importance sampling corrections bear out in practice. We find that while Sequence-TIS is theoretically optimal, it can succumb to catastrophic variance in long-horizon contexts. Additionally, non-standard rollout settings create subtle mismatch patterns that require careful engineering fixes. Token-TIS with default rollout settings proved to be the most robust setting for long-horizon training.












