Chase Fagen
1.2K posts

Chase Fagen
@chasef07
Lifestyle Engineer x Apple Silicon Inference Engineer

Say hello to Gemini 3.1 Flash Live. 🗣️ Our latest audio model delivers more natural conversations with improved function calling – making it more useful and informed. Here’s what’s new 🧵





Introducing TurboQuant: Our new compression algorithm that reduces LLM key-value cache memory by at least 6x and delivers up to 8x speedup, all with zero accuracy loss, redefining AI efficiency. Read the blog to learn how it achieves these results: goo.gle/4bsq2qI






every new model generation you see the pinch of the bitter lesson. harnesses, pipelines, rules which previously felt important now hold you back from innovating. what took months of grind for you is now just a prompt away at ½ the cost. look for it and you will see. Both large and small companies re-evaluating. Company directions change before your eyes. it’s a wild moment for our industry

