David Cardozo 🇨🇦

2K posts

David Cardozo 🇨🇦

@_davidcardozo

Google Developer Expert in AI/ML in JAX/FLAX | Docker Captain | Machine Learning Scientist in Quebec

🇨🇦🇫🇷🇨🇴 Katılım Mayıs 2010

1.8K Takip Edilen673 Takipçiler

Sabitlenmiş Tweet

David Cardozo 🇨🇦@_davidcardozo·2 Şub

“Geography has made us neighbours. History has made us friends. Economics has made us partners. And necessity has made us allies." "From the fields of Flanders to the streets of Kandahar." Vive la Canada, Team Canada The Maple Leaf Forever

GIF

English

1.9K

David Cardozo 🇨🇦 retweetledi

Raffi Hotter@raffi_hotter·6d

This algorithm uses one of my favourite theorems in math, the Johnson-Lindentrauss Lemma, which says you can drastically reduce the dimensionality of n points to just log(n) dimensions and still preserve pairwise distances

Google Research@GoogleResearch

Introducing TurboQuant: Our new compression algorithm that reduces LLM key-value cache memory by at least 6x and delivers up to 8x speedup, all with zero accuracy loss, redefining AI efficiency. Read the blog to learn how it achieves these results: goo.gle/4bsq2qI

English

227

3.3K

252.9K

David Cardozo 🇨🇦 retweetledi

Sayak Paul@RisingSayak·18 Mar

@himanshustwts GAN was coded in a night that too by a drunk Ian Goodfellow.

English

2.8K

David Cardozo 🇨🇦 retweetledi

Dataiku@dataiku·17 Mar

AI agents can act. But can you see why they acted? That gap makes trust and governance harder in critical workflows. At #NVIDIAGTC, we introduced Kiji Inspector™, a new open source framework for AI agent explainability, starting with NVIDIA Nemotron. dataiku.com/stories/blog/i…

English

1.9K

David Cardozo 🇨🇦 retweetledi

Dataiku@dataiku·11 Mar

NVIDIA just announced Nemotron 3 Super, a new open model for agentic AI at scale. Dataiku is proud to be part of the ecosystem helping enterprises operationalize multi-agent systems. Learn more — and see us at #NVIDIAGTC booth 3108: blogs.nvidia.com/blog/nemotron-…

English

347

David Cardozo 🇨🇦 retweetledi

Julia Turc@juliarturc·11 Mar

Diffusion models clicked for me when I started seeing them through the lens of particle motion. I built this interactive playground where you too can clickety-clack to understand how drift, noise, and other hyperparams control diffusion. I hereby submit this as penance for the sin of YouTube edu-tainment 😇 Link in the first comment.

English

564

31.3K

David Cardozo 🇨🇦@_davidcardozo·9 Mar

is grain and arrayrecords the new tfrecords? I cannot believe that in the era of LLMs google still cannot produce more than this post: developers.googleblog.com/building-high-… for grain and arrayrecords :(

English

David Cardozo 🇨🇦@_davidcardozo·28 Şub

Tailscale is my most Canadian proud company. Best tool ever

English

David Cardozo 🇨🇦@_davidcardozo·27 Şub

I just got a new job where I got to play with a gb200, missing Jax but now I have compute to do cool stuff

xjdr@_xjdr

i love jax and tpus, i think they are elegant and fit my mental model of how compliers and systems should work. that said, you'd have a _very_ hard time getting me to go back to using jax and tpu instead of pytorch (really cuda and CuTeDSL) and GB300NLV72s

English

110

David Cardozo 🇨🇦 retweetledi

xjdr@_xjdr·27 Şub

English

191

12.5K

David Cardozo 🇨🇦 retweetledi

Chris Olah@ch402·27 Şub

Here I stand, I can do no other.

Anthropic@AnthropicAI

A statement from Anthropic CEO, Dario Amodei, on our discussions with the Department of War. anthropic.com/news/statement…

English

105

2.7K

168K

David Cardozo 🇨🇦 retweetledi

Ivy Zheng@ivyzhengx·26 Şub

A guide of Pallas on TPU SparseCore is out! Things could still be wacky, but these are the paths we've cleared out, and more to come soon. Have fun playing with it! docs.jax.dev/en/latest/pall…

English

153

David Cardozo 🇨🇦 retweetledi

Andrej Karpathy@karpathy·25 Şub

a beauty for anyone interested in mechanistic interpretability or getting into LLMs. interesting to look at small algorithms and their "neural implementations" to get a sense of how neural nets implement various functionality. unless the minification really creates "esoteric" solutions that you wouldn't encounter in practice, which might be more based around distributed representations, helixes etc. i tried training the same arch briefly from scratch and gradient descent didn't find the solution, would probably work with more degrees of freedom and enough effort.

English

1.2K

93.7K

David Cardozo 🇨🇦@_davidcardozo·25 Şub

How come Google does not provide an Chrome arm64 package in linux.....

English

David Cardozo 🇨🇦 retweetledi

Dataiku@dataiku·18 Şub

Introducing 575 Lab: an open-source initiative for production-ready AI tooling. As AI systems take on greater autonomy, trust must be earned in production, giving teams visibility into AI behavior and the controls needed to govern it at scale. Join us: dataiku.com/open-source/?u…

English

1.4K

David Cardozo 🇨🇦 retweetledi

Grigory Sapunov@che_shr_cat·17 Şub

1/ Transformers don't count like computers. We assume they have hidden "registers" to track variables. We were wrong. New research by @AnthropicAI reverse-engineered Claude 3.5 Haiku and found it works with 6D helical manifolds. It's geometry, not math. 🧵

English

206

1.5K

210K

David Cardozo 🇨🇦 retweetledi

xjdr@_xjdr·4 Şub

this is now officially a GB300NVL72 fan account. sorry in advance

English

229

46.2K

David Cardozo 🇨🇦@_davidcardozo·5 Şub

@RisingSayak or the test suits, sometimes I have learned of upcoming features by looking at the tests.

English

Sayak Paul@RisingSayak·4 Şub

I think going through an important PR of a repository like PyTorch, even as an external observer, is an under-appreciated learning resource. I learned how to test for recompilations from one of those PRs and it has been a great debugging tool since then!

English

David Cardozo 🇨🇦@_davidcardozo·28 Oca

Today I just renewed my subscription to @che_shr_cat this is the way I keep up with arxiv. If you are an MLE or AI practitioner this is a must, and I really like

Grigory Sapunov@che_shr_cat

10/If you're drowning in papers too: → Daily ML paper reviews: arxiviq.substack.com → My deep dives & opinions: gonzoml.substack.com Subscribe to one or both. And tell me — how do YOU keep up with the field?

English

243

David Cardozo 🇨🇦@_davidcardozo·23 Oca

@ariG23498 Amazing material

English

Aritra 🤗@ariG23498·22 Oca

The YouTube lecture on Mixture of Experts (MoEs) Routing Algorithm is out now! Here is what I cover: > Foundation of MoEs (MoE) > The routing algorithm > A toy problem > Issues with routing (oversub experts) > Conclusion Please swamp me with feedback. It is a big video, so if you don't have time this week, do not worry. Bookmark this, and come back to it in the weekends. A cup of coffee, a lazy saturday/sunday morning and this video 🤌🏻❤️. A fresh brain would easily grasp every concept and also tell you that you could have understood the concepts yourself. Link: youtu.be/CDnkFbW-uEQ

YouTube

English

273

21.5K

David Cardozo 🇨🇦 retweetledi

Sayak Paul@RisingSayak·21 Oca

Some notes on what someone can do for building chops in ML x open source x modeling: • Take a popular pre-trained model implementation, profile it, spot the bottlenecks, & try to improve its speed-memory trade-off -- it's a valuable skill that any sane hiring manager should understand and credit (they are probably not legit if they don't). github.com/meta-pytorch/s… is a good example of this. • GPUs are in short supply. So, try reimplementing it in JAX, leveraging its strengths. Make it run on TPUs, blazing fast 🔥 -- this will help you establish that you care about performance and are comfortable switching stacks when needed. github.com/sanchit-gandhi… is an amazing example of this. • In the context of an organization, communication is the key. Make sure you document your experience in an easily digestible way so most folks would understand what you achieved. Provide numbers on benchmarks, mention assumptions, and whatever limitations you faced and how you approached them. • Get a pro subscription to whatever AI coding assistant you think works the best for your stuff. Make it a part of your workflow, but DO NOT become overly reliant on it. Have enough juice in the process so that you can build muscle memory and objective evidence of your intellect over time. • Have fun! It gives me a sense of joy and relief to know that back in the days, we did all of it happily WITHOUT any AI coding assistance. Lots of fun, despair, and anxiety -- but all worth it; 10/10 -- would do it again!

English

523

41.6K

Keşfet

@himanshustwts @AnthropicAI @RisingSayak @che_shr_cat @ariG23498 @elonmusk @BarackObama @taylorswift13