David Cardozo 🇨🇦
2K posts

David Cardozo 🇨🇦
@_davidcardozo
Google Developer Expert in AI/ML in JAX/FLAX | Docker Captain | Machine Learning Scientist in Quebec

Introducing TurboQuant: Our new compression algorithm that reduces LLM key-value cache memory by at least 6x and delivers up to 8x speedup, all with zero accuracy loss, redefining AI efficiency. Read the blog to learn how it achieves these results: goo.gle/4bsq2qI







i love jax and tpus, i think they are elegant and fit my mental model of how compliers and systems should work. that said, you'd have a _very_ hard time getting me to go back to using jax and tpu instead of pytorch (really cuda and CuTeDSL) and GB300NLV72s








10/If you're drowning in papers too: → Daily ML paper reviews: arxiviq.substack.com → My deep dives & opinions: gonzoml.substack.com Subscribe to one or both. And tell me — how do YOU keep up with the field?




