David Cardozo 🇨🇦

2K posts

David Cardozo 🇨🇦 banner
David Cardozo 🇨🇦

David Cardozo 🇨🇦

@_davidcardozo

Google Developer Expert in AI/ML in JAX/FLAX | Docker Captain | Machine Learning Scientist in Quebec

🇨🇦🇫🇷🇨🇴 Katılım Mayıs 2010
1.8K Takip Edilen673 Takipçiler
Sabitlenmiş Tweet
David Cardozo 🇨🇦
David Cardozo 🇨🇦@_davidcardozo·
“Geography has made us neighbours. History has made us friends. Economics has made us partners. And necessity has made us allies." "From the fields of Flanders to the streets of Kandahar." Vive la Canada, Team Canada The Maple Leaf Forever
GIF
English
0
1
14
1.9K
David Cardozo 🇨🇦 retweetledi
Raffi Hotter
Raffi Hotter@raffi_hotter·
This algorithm uses one of my favourite theorems in math, the Johnson-Lindentrauss Lemma, which says you can drastically reduce the dimensionality of n points to just log(n) dimensions and still preserve pairwise distances
Google Research@GoogleResearch

Introducing TurboQuant: Our new compression algorithm that reduces LLM key-value cache memory by at least 6x and delivers up to 8x speedup, all with zero accuracy loss, redefining AI efficiency. Read the blog to learn how it achieves these results: goo.gle/4bsq2qI

English
33
227
3.3K
252.9K
David Cardozo 🇨🇦 retweetledi
Sayak Paul
Sayak Paul@RisingSayak·
@himanshustwts GAN was coded in a night that too by a drunk Ian Goodfellow.
English
5
4
76
2.8K
David Cardozo 🇨🇦 retweetledi
Dataiku
Dataiku@dataiku·
AI agents can act. But can you see why they acted? That gap makes trust and governance harder in critical workflows. At #NVIDIAGTC, we introduced Kiji Inspector™, a new open source framework for AI agent explainability, starting with NVIDIA Nemotron. dataiku.com/stories/blog/i…
English
2
8
10
1.9K
David Cardozo 🇨🇦 retweetledi
Dataiku
Dataiku@dataiku·
NVIDIA just announced Nemotron 3 Super, a new open model for agentic AI at scale. Dataiku is proud to be part of the ecosystem helping enterprises operationalize multi-agent systems. Learn more — and see us at #NVIDIAGTC booth 3108: blogs.nvidia.com/blog/nemotron-…
Dataiku tweet media
English
0
3
3
347
David Cardozo 🇨🇦 retweetledi
Julia Turc
Julia Turc@juliarturc·
Diffusion models clicked for me when I started seeing them through the lens of particle motion. I built this interactive playground where you too can clickety-clack to understand how drift, noise, and other hyperparams control diffusion. I hereby submit this as penance for the sin of YouTube edu-tainment 😇 Link in the first comment.
English
22
43
564
31.3K
David Cardozo 🇨🇦
David Cardozo 🇨🇦@_davidcardozo·
Tailscale is my most Canadian proud company. Best tool ever
English
0
0
0
49
David Cardozo 🇨🇦 retweetledi
xjdr
xjdr@_xjdr·
i love jax and tpus, i think they are elegant and fit my mental model of how compliers and systems should work. that said, you'd have a _very_ hard time getting me to go back to using jax and tpu instead of pytorch (really cuda and CuTeDSL) and GB300NLV72s
English
12
4
191
12.5K
David Cardozo 🇨🇦 retweetledi
Ivy Zheng
Ivy Zheng@ivyzhengx·
A guide of Pallas on TPU SparseCore is out! Things could still be wacky, but these are the paths we've cleared out, and more to come soon. Have fun playing with it! docs.jax.dev/en/latest/pall…
English
0
1
7
153
David Cardozo 🇨🇦 retweetledi
Andrej Karpathy
Andrej Karpathy@karpathy·
a beauty for anyone interested in mechanistic interpretability or getting into LLMs. interesting to look at small algorithms and their "neural implementations" to get a sense of how neural nets implement various functionality. unless the minification really creates "esoteric" solutions that you wouldn't encounter in practice, which might be more based around distributed representations, helixes etc. i tried training the same arch briefly from scratch and gradient descent didn't find the solution, would probably work with more degrees of freedom and enough effort.
English
25
48
1.2K
93.7K
David Cardozo 🇨🇦
David Cardozo 🇨🇦@_davidcardozo·
How come Google does not provide an Chrome arm64 package in linux.....
English
0
0
0
57
David Cardozo 🇨🇦 retweetledi
Dataiku
Dataiku@dataiku·
Introducing 575 Lab: an open-source initiative for production-ready AI tooling. As AI systems take on greater autonomy, trust must be earned in production, giving teams visibility into AI behavior and the controls needed to govern it at scale. Join us: dataiku.com/open-source/?u…
Dataiku tweet media
English
2
5
9
1.4K
David Cardozo 🇨🇦 retweetledi
Grigory Sapunov
Grigory Sapunov@che_shr_cat·
1/ Transformers don't count like computers. We assume they have hidden "registers" to track variables. We were wrong. New research by @AnthropicAI reverse-engineered Claude 3.5 Haiku and found it works with 6D helical manifolds. It's geometry, not math. 🧵
Grigory Sapunov tweet media
English
66
206
1.5K
210K
David Cardozo 🇨🇦 retweetledi
xjdr
xjdr@_xjdr·
this is now officially a GB300NVL72 fan account. sorry in advance
English
10
5
229
46.2K
Sayak Paul
Sayak Paul@RisingSayak·
I think going through an important PR of a repository like PyTorch, even as an external observer, is an under-appreciated learning resource. I learned how to test for recompilations from one of those PRs and it has been a great debugging tool since then!
English
2
0
26
2K
Aritra 🤗
Aritra 🤗@ariG23498·
The YouTube lecture on Mixture of Experts (MoEs) Routing Algorithm is out now! Here is what I cover: > Foundation of MoEs (MoE) > The routing algorithm > A toy problem > Issues with routing (oversub experts) > Conclusion Please swamp me with feedback. It is a big video, so if you don't have time this week, do not worry. Bookmark this, and come back to it in the weekends. A cup of coffee, a lazy saturday/sunday morning and this video 🤌🏻❤️. A fresh brain would easily grasp every concept and also tell you that you could have understood the concepts yourself. Link: youtu.be/CDnkFbW-uEQ
YouTube video
YouTube
English
19
40
273
21.5K
David Cardozo 🇨🇦 retweetledi
Sayak Paul
Sayak Paul@RisingSayak·
Some notes on what someone can do for building chops in ML x open source x modeling: • Take a popular pre-trained model implementation, profile it, spot the bottlenecks, & try to improve its speed-memory trade-off -- it's a valuable skill that any sane hiring manager should understand and credit (they are probably not legit if they don't). github.com/meta-pytorch/s… is a good example of this. • GPUs are in short supply. So, try reimplementing it in JAX, leveraging its strengths. Make it run on TPUs, blazing fast 🔥 -- this will help you establish that you care about performance and are comfortable switching stacks when needed. github.com/sanchit-gandhi… is an amazing example of this. • In the context of an organization, communication is the key. Make sure you document your experience in an easily digestible way so most folks would understand what you achieved. Provide numbers on benchmarks, mention assumptions, and whatever limitations you faced and how you approached them. • Get a pro subscription to whatever AI coding assistant you think works the best for your stuff. Make it a part of your workflow, but DO NOT become overly reliant on it. Have enough juice in the process so that you can build muscle memory and objective evidence of your intellect over time. • Have fun! It gives me a sense of joy and relief to know that back in the days, we did all of it happily WITHOUT any AI coding assistance. Lots of fun, despair, and anxiety -- but all worth it; 10/10 -- would do it again!
English
17
62
523
41.6K