Jinen Setpal

2.6K posts

@48bitmachine

PhD student @PurdueECE, researching deep learning optimization theory and formal interpretability. I love open source. @jinen:https://t.co/W0XuIlDIe9

[email protected] Joined October 2017

2.7K Following · 463 Followers

Jinen Setpal retweeted

Optimal Intellect@opt_intellect·24 Mar

We're Optimal Intellect, a research lab from the team behind CVXPY. Today we're introducing Moreau: a GPU-native solver that's orders of magnitude faster than the best existing tools.

Jinen Setpal@48bitmachine·18 Mar

@phil_hellmuth @personaaiinc @nicholas_k_wade is cracked and that robot's awesome

phil_hellmuth@phil_hellmuth·18 Mar

Moving a robot w a joystick! But, of course, I crash it, doh! Still, it’s cutting edge world class tech here at @personaaiinc So many world class technologists in this company, they are building “ai robots” #POSITIVITY #PHNiceLife

Jinen Setpal@48bitmachine·29 Jan

@JacobZietek Brilliant play by California Department of Fish and Wildlife, they raised 157M like it was nothing 🥶

Jacob Zietek@JacobZietek·29 Jan

what a shitty name

San Francisco Chronicle@sfchronicle

A mountain lion named 157M is released back into the wild after being removed from San Francisco. Wildlife officials said he was in great health. 🎥: Courtesy of California Department of Fish and Wildlife Read more >> sfchronicle.com/sf/article/mou…

Jinen Setpal retweeted

Harmya@racerfunction·21 Jan

great blog on how @tensarahq thinks about correctness of submitted GPU kernels: sarthakmangla.com/blog/wrong

Jinen Setpal retweeted

David Pfau@pfau·12 Jan

This is the key difference between in-domain and out-of-domain generalization, and we still have not truly solved out-of-domain generalization. It just turns out you can build world changing technology by throwing so much data at things that the entire universe is in-domain.

Niels Rogge@NielsRogge

One of the best visual explanations I've ever seen for why scaling Transformers works, but is suboptimal, as it's just brute-forcing things, by @YesThisIsLion (co-author of the Transformer) on @MLStreetTalk "In the (rejected) paper "Intelligent Matrix Exponentiation", they show the decision boundary of a classic MLP with a ReLu/Tanh activation function on the classic Spiral dataset." "You can see they both technically solve it with great scores on the test set. Next, they show the decision boundary of the "M-layer" they propose in the paper. And it represents the spiral ... as a spiral!" "Shouldn't we? If the data is a spiral... shouldn't we represent it as a spiral?" "If you look back at the decision boundaries of the MLP, it's clear that you just have these tiny, piecewise separations without learning the concept of a spiral. That's what I mean!" "If you train these things enough, it can fit the spiral and get a high accuracy. But there's no indication that the MLP actually understands a spiral. When you represent it as a spiral, it extrapolates correctly, cause the spiral just keeps going out."

Jinen Setpal@48bitmachine·6 Dec

@livgorton 🔥

Liv@livgorton·6 Dec

Despite what twitter (importantly not Neel or GDM tbc) says, ambitious mech interp (AMI) isn't dead. We have made a lot of progress and at least some of us should persevere with that goal :) I don't think we've tried hard enough to warrant an entire field-level pivot!

Leo Gao@nabla_theta

New post: An Ambitious Vision for Interpretability Understanding is essential for ensuring things don't break unexpectedly. AMI is a big risky bet, but so is all ambitious research. AMI is tractable: it has good empirical feedback loops, and we've already made a lot of progress.

Jinen Setpal@48bitmachine·24 Nov

@NeelNanda5 [2/2] From the discussion at ~6m, perhaps using multiple activations / PCA reduced how much the explanation can update *without* changing the underlying prediction, which may explain why it improved alignment?

Jinen Setpal@48bitmachine·24 Nov

@NeelNanda5 [1/2] This was awesome. I'm also excited about leveraging interp in training; my approach focuses on reducing degrees of freedom of model parameters & input w.r.t. explanation, because this provably prevents malicious compliance. Then training against these enforces alignment!

Neel Nanda@NeelNanda5·23 Nov

New video: Can interpretability help *control* training? Surprisingly, yes! Ish! In this talk I discuss this emerging research area. It's not reliable, but I was surprised by how well interp does eg stopping emergent misalignment Bonus: Sound interesting? Apply to work with me!

Jinen Setpal@48bitmachine·9 Nov

@bitsgopew @ninja_maths The channel below in general is an absolute goldmine, but the video specifically talks about the visualization in the tweet above, makes it very intuitive: youtube.com/watch?v=ZdlraR… I also recommend checking out 4.2.2. where prof talks about the connection with normal equations

Nick@bitsgopew·9 Nov

@ninja_maths Was never taught the geometric interpretation in my uni math courses, and i never got the intuition for this :( I'll actually try again this weekend

Alex Smith@ninja_maths·8 Nov

Love this image!

Alex Smith@ninja_maths

The four fundamental subspaces of a matrix.

Jinen Setpal@48bitmachine·9 Nov

@SuryaGanguli I shared this exact analogy at an interview I gave just yesterday (when asked about my use of AI for coding), this tweet made my day :D

Jinen Setpal retweeted

Surya Ganguli@SuryaGanguli·8 Nov

Using AI to help you on your homework is like using a robot to help you lift weights at the gym.

Jinen Setpal retweeted

Mathieu@miniapeur·12 Oct

Jinen Setpal@48bitmachine·2 Oct

I'm awestruck and inspired. Congratulations Dr. Cohen!

Jeremy Cohen@deepcohen

@jasondeanlee @SebastienBubeck @tomgoldsteincs @zicokolter @atalwalkar This is the third, last, and best paper from my PhD. By some metrics, an ML PhD student who writes just three conference papers is "unproductive." But I wouldn't have had it any other way 😉 !

Jinen Setpal@48bitmachine·26 Jul

@jennyzhangzt My reviews have been constructive across the board; meaningful feedback from all 4. It's my first major conference submission too, so I was mentally prepared to have everything either torn to shreds or to be fed AI slop but I've been pleasantly surprised😃

Jenny Zhang@jennyzhangzt·25 Jul

It is easy and (definitely) understandable to feel that conference reviews are bad (in terms of quality, depth, etc). However, instead of focusing on the bad reviews, I propose that we highlight the good ones instead! 🧵👇

Yiping Lu@2prime_PKU

Anyone knows adam?

Jinen Setpal@48bitmachine·20 Jul

@BorisMPower @AcerFur Wow

Boris Power@BorisMPower·13 Sep

@AcerFur No need for lean

Boris Power@BorisMPower·12 Sep

few people appreciate how difficult IOI problems are, and how few people in the world can solve them

Aidan McLaughlin@aidan_mclau

openai did the funniest thing...

Jinen Setpal retweeted

Jacob Zietek@JacobZietek·18 Jul

Currently working on RFMs for tool use and other industrial tasks @personaaiinc. We have an early customer (Hyundai) and significant capital ($28M pre-seed). Looking to expand the ML team, dm me if you're interested in joining. Also taking cracked interns :)

Jinen Setpal@48bitmachine·17 Jul

@adinamwilliams Awesome; I'll be on the lookout for sure!

Adina Williams@adinamwilliams·17 Jul

@48bitmachine Aww not until next cycle but please keep us in mind!

Adina Williams@adinamwilliams·15 Jul

Our team is hiring a postdoc in (mech) interpretability! The ideal candidate will have research experience in interpretability for text and/or image generation models and be excited about open science! Please consider applying or sharing with colleagues: metacareers.com/jobs/222395396…

Jinen Setpal@48bitmachine·15 Jul

@_joestacey_ Sounds great 😃

Joe Stacey@_joestacey_·15 Jul

@48bitmachine Hey! Thanks a lot. I don’t think I’ll be able to squeeze it in now, but I’ll definitely keep in mind for next time I’m around 🙂

Joe Stacey@_joestacey_·15 Jul

I’m so excited to be on my way to the US!! I’m doing a little end of PhD tour 😍😍 Thanks so much to all the US unis that are having me for visits, I honestly can’t wait 🙂
