tom white

4.2K posts

@dribnet

creations with code and networks

Wellington, New Zealand · Joined June 2011
4.2K Following · 11K Followers
tom white
tom white@dribnet·
the models, they just wanna converge
GIF
0
0
1
209
tom white
tom white@dribnet·
saxophone print + top-150 SigLIP image probe. though mknn (mutual kNN) model agreement (a la platonic representation hypothesis) is not part of the test-time compute process, it climbs naturally as the optimization evolves
tom white tweet media
tom white tweet media
1
0
5
401
tom white
tom white@dribnet·
45% mutual kNN between CLIP and SigLIP — not bad for two model families trained on different data with different objectives when probed with this print. revisiting ImageNet so I can build a toolbox for navigating more uncharted waters without class labels (stay tuned)...
0
0
4
200
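For readers unfamiliar with the metric, here is a minimal sketch of a mutual kNN agreement score between two embedding spaces. It is an illustration under stated assumptions, not necessarily the exact procedure behind the 45% figure: the function names (`cosine`, `knn_indices`, `mutual_knn`) are hypothetical, cosine similarity is assumed as the neighbor metric, and the toy vectors stand in for real CLIP and SigLIP embeddings of the same images.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm

def knn_indices(embeddings, i, k):
    """Indices of the k nearest neighbors of item i, excluding i itself."""
    others = [j for j in range(len(embeddings)) if j != i]
    others.sort(key=lambda j: cosine(embeddings[i], embeddings[j]), reverse=True)
    return set(others[:k])

def mutual_knn(emb_a, emb_b, k):
    """Mean fraction of k-nearest neighbors shared by the two spaces.

    emb_a[i] and emb_b[i] must describe the same item (e.g. the same image
    embedded by two different models); the two spaces may differ in dimension.
    """
    if len(emb_a) != len(emb_b):
        raise ValueError("both spaces must embed the same items")
    overlaps = [
        len(knn_indices(emb_a, i, k) & knn_indices(emb_b, i, k)) / k
        for i in range(len(emb_a))
    ]
    return sum(overlaps) / len(overlaps)

# Toy stand-ins for two models' embeddings of the same four images (assumed data).
model_a = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
model_b = [[1.0, 0.0], [0.0, 1.0], [0.9, 0.1], [0.1, 0.9]]

agreement_same = mutual_knn(model_a, model_a, k=1)  # identical neighborhoods
agreement_diff = mutual_knn(model_a, model_b, k=1)  # disjoint neighborhoods
```

A score of 1.0 means both models pick the same neighbors for every item, and 0.0 means they never do; a value such as 0.45 sits in between.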
tom white
tom white@dribnet·
"pirate ship" (ImageNet class 724)
tom white tweet media
1
0
2
349
tom white
tom white@dribnet·
unsure how AI interprets this print? treating the image as a linear probe on your favorite vision model and scraping a diverse dataset for maximum activations provides a coherent suggestion.
1
2
7
1.4K
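One way to read "treating the image as a linear probe" is sketched below: the print's embedding becomes a weight vector, every dataset image is scored by its dot product with that vector, and the highest scorers are returned. This is a rough sketch, not the author's code. The function name `top_activations` is hypothetical, and the short vectors are toy stand-ins for embeddings that a real vision model would produce.

```python
def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def top_activations(probe, dataset, k):
    """Return indices of the k dataset embeddings that activate the probe most.

    probe:   embedding of the probe image, used as a linear weight vector
    dataset: list of embeddings from the same model, one per dataset image
    """
    ranked = sorted(range(len(dataset)), key=lambda i: dot(probe, dataset[i]), reverse=True)
    return ranked[:k]

# Toy embeddings (assumed data): index 1 aligns best with the probe, then index 2.
probe_embedding = [1.0, 0.0, 0.0]
dataset_embeddings = [
    [0.2, 0.9, 0.0],
    [0.9, 0.1, 0.0],
    [0.5, 0.5, 0.0],
    [-1.0, 0.0, 0.0],
]

best = top_activations(probe_embedding, dataset_embeddings, k=2)
```

Inspecting the images behind the returned indices gives the "coherent suggestion" of what the model sees in the print.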
tom white
tom white@dribnet·
or query your favorite vision model for semantic nearest neighbors - here's OpenAI-CLIP's top hits across CC3M using the baseball_player print as a probe
tom white tweet media
0
0
1
162
tom white
tom white@dribnet·
not seeing it? don't worry - your favorite imagenet model is.
tom white tweet media
1
0
6
603
tom white
tom white@dribnet·
traffic sign, baseball player, pomegranate
tom white tweet media
tom white tweet media
tom white tweet media
2
1
15
810
John Hewitt
John Hewitt@johnhewtt·
Lots of interp thought discusses the linearity of the residual stream! This blog post: the residual stream isn't linear in a way that provides formal leverage, and interp methods based on linearity should not be preferred beyond empirical utility. cs.columbia.edu/~johnhew/resid…
5
17
232
12K
tom white
tom white@dribnet·
Weapons-grade piggy bankness: One drawing. No training. Subtract the style, get a direction in SigLIP space. Sort 50K ImageNet images by cosine similarity: 41 of the top 50 are piggy banks (P@50 = 82%). The drawing is the classifier.
tom white tweet media
0
0
1
1.1K
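The recipe in this post (subtract the style, rank by cosine similarity, count hits in the top k) can be sketched as follows. Several things here are assumptions: how the "style" embedding is obtained is not stated in the post, so `style_embedding` is a placeholder, and the helper names (`concept_direction`, `rank_by_cosine`, `precision_at_k`) and toy vectors are hypothetical. Only the final arithmetic, 41 hits out of 50, comes from the post.

```python
import math

def normalize(v):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(a * a for a in v))
    return [a / norm for a in v]

def concept_direction(drawing_embedding, style_embedding):
    """Subtract the style embedding from the drawing embedding, then normalize."""
    return normalize([d - s for d, s in zip(drawing_embedding, style_embedding)])

def rank_by_cosine(direction, image_embeddings):
    """Indices of images sorted by cosine similarity to the direction, best first."""
    def score(i):
        unit = normalize(image_embeddings[i])
        return sum(a * b for a, b in zip(direction, unit))
    return sorted(range(len(image_embeddings)), key=score, reverse=True)

def precision_at_k(ranked_labels, target, k):
    """Fraction of the top-k ranked labels that equal the target class."""
    return sum(1 for label in ranked_labels[:k] if label == target) / k

# Toy example (assumed data): the style lives on the second axis.
direction = concept_direction([1.0, 1.0, 0.0], [0.0, 1.0, 0.0])
images = [[0.9, 0.1, 0.0], [0.0, 1.0, 0.0], [0.7, 0.7, 0.0]]
order = rank_by_cosine(direction, images)

# The figure from the post: 41 piggy banks among the top 50 results.
top_50_labels = ["piggy bank"] * 41 + ["other"] * 9
p_at_50 = precision_at_k(top_50_labels, "piggy bank", k=50)
```

With 41 matching labels in the top 50, `p_at_50` evaluates to 0.82, which is the 82% quoted above.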
tom white
tom white@dribnet·
piggy bank (ImageNet class 719)
tom white tweet media
1
0
2
299
tom white
tom white@dribnet·
@hyhieu226 enjoying this slow takeoff and will genuinely miss it
0
0
1
1.2K
Hieu Pham
Hieu Pham@hyhieu226·
Today, I finally feel the existential threat that AI is posing. When AI becomes overly good and disrupts everything, what will be left for humans to do? And it's when, not if.
311
189
2.1K
456.3K
tom white
tom white@dribnet·
@farmgeek fwiw: the email was garbage but the disclosure within the app is actually pretty good - they show at the document/file level what was accessed
1
0
1
259
John Hart
John Hart@farmgeek·
“All patients who are not impacted can see that in their MMH app” That’s great, except when you can’t log in and every method (password reset, one-time pass) fails. Shitshow.
10
9
67
2.1K
tom white
tom white@dribnet·
@RT_Artwork Got this too. 5 minutes before call they ask you to use the riverside client and forward you to a website clone (riverside dot name - BEWARE) with their malware installer (I stopped there). (would be up for a themed exhibit on scams showcasing all artists with this invite! 😂)
1
0
1
156
RyanThompson
RyanThompson@RT_Artwork·
Be careful with artwork commission requests. Starts out with request for a video call. I contacted the company directly and found out this request was a scam. Getting on a call could have led to some software being downloaded and wallet drained. Anyone have experience with this?
RyanThompson tweet media
7
1
9
866
tom white
tom white@dribnet·
@CSProfKGD looks different here in nz 😉 (happy new year!)
tom white tweet media
1
0
1
148
tom white
tom white@dribnet·
@NeelNanda5 Understandable - though the distribution shift implies not all Gemma 3 concepts will be represented in these SAEs. Were there any others such as language filtering? Might be worth updating the technical paper which seems to be misleading on this point.
tom white tweet media
0
0
3
72
Neel Nanda
Neel Nanda@NeelNanda5·
@dribnet Text only, sorry! Images are annoying infra wise
1
0
1
232
Neel Nanda
Neel Nanda@NeelNanda5·
I'm excited to release Gemma Scope 2: a comprehensive set of interpretability tools on Gemma 3. SAEs & transcoders on every layer of every model! Gemma 3 27B shows lots of rich safety-relevant behaviour I want to enable deep dives into what's really going on. Check out our demo!
Neel Nanda tweet media
Callum McDougall@calsmcdougall

Google DeepMind is releasing Gemma Scope 2: SAEs and transcoders on every layer of every Gemma 3 model, 270M-27B, base & chat. We hope this enables deep dives into complex model behavior, for more ambitious open source safety & interpretability work!

6
20
217
14.8K
tom white
tom white@dribnet·
@GaryMarcus @Ted_Underwood always felt you were wrong about this, but ChatGPT in fact completely agrees with you and helped me to better understand and appreciate your point here 👍
tom white tweet media
tom white tweet media
0
0
0
22
Gary Marcus
Gary Marcus@GaryMarcus·
@dribnet @Ted_Underwood one of the key examples in 2001 was object permanence; we gave examples of an object permanence failure today, two decades later, even though GPT has a trillion times more data and compute. grammaticality has improved, via pastiche, but conceptual understanding has not.
2
0
2
0