Jinen Setpal

2.6K posts


@48bitmachine

PhD student @PurdueECE, researching deep learning optimization theory and formal interpretability. I love open source. @jinen:https://t.co/W0XuIlDIe9

[email protected] Joined October 2017
2.7K Following · 463 Followers
Jinen Setpal retweeted
Optimal Intellect (@opt_intellect)
We're Optimal Intellect, a research lab from the team behind CVXPY. Today we're introducing Moreau: a GPU-native solver that's orders of magnitude faster than the best existing tools.
phil_hellmuth (@phil_hellmuth)
Moving a robot w a joystick! But, of course, I crash it, doh! Still, it’s cutting edge world class tech here at @personaaiinc So many world class technologists in this company, they are building “ai robots” #POSITIVITY #PHNiceLife
Jinen Setpal (@48bitmachine)
@JacobZietek Brilliant play by California Department of Fish and Wildlife, they raised 157M like it was nothing 🥶
Jinen Setpal retweeted
David Pfau (@pfau)
This is the key difference between in-domain and out-of-domain generalization, and we still have not truly solved out-of-domain generalization. It just turns out you can build world changing technology by throwing so much data at things that the entire universe is in-domain.
Niels Rogge (@NielsRogge)

One of the best visual explanations I've ever seen for why scaling Transformers works, but is suboptimal, as it's just brute-forcing things, by @YesThisIsLion (co-author of the Transformer) on @MLStreetTalk
"In the (rejected) paper "Intelligent Matrix Exponentiation", they show the decision boundary of a classic MLP with a ReLu/Tanh activation function on the classic Spiral dataset."
"You can see they both technically solve it with great scores on the test set. Next, they show the decision boundary of the "M-layer" they propose in the paper. And it represents the spiral ... as a spiral!"
"Shouldn't we? If the data is a spiral... shouldn't we represent it as a spiral?"
"If you look back at the decision boundaries of the MLP, it's clear that you just have these tiny, piecewise separations without learning the concept of a spiral. That's what I mean!"
"If you train these things enough, it can fit the spiral and get a high accuracy. But there's no indication that the MLP actually understands a spiral. When you represent it as a spiral, it extrapolates correctly, cause the spiral just keeps going out."

Liv (@livgorton)
Despite what twitter (importantly not Neel or GDM tbc) says, ambitious mech interp (AMI) isn't dead. We have made a lot of progress and at least some of us should persevere with that goal :) I don't think we've tried hard enough to warrant an entire field-level pivot!
Leo Gao (@nabla_theta)

New post: An Ambitious Vision for Interpretability
Understanding is essential for ensuring things don't break unexpectedly. AMI is a big risky bet, but so is all ambitious research. AMI is tractable: it has good empirical feedback loops, and we've already made a lot of progress.

Jinen Setpal (@48bitmachine)
@NeelNanda5 [2/2] From the discussion at ~6m, perhaps using multiple activations / PCA reduced how much the explanation can update *without* changing the underlying prediction, which may explain why it improved alignment?
Jinen Setpal (@48bitmachine)
@NeelNanda5 [1/2] This was awesome. I'm also excited about leveraging interp in training; my approach focuses on reducing degrees of freedom of model parameters & input w.r.t. explanation, because this provably prevents malicious compliance. Then training against these enforces alignment!
Neel Nanda (@NeelNanda5)
New video: Can interpretability help *control* training? Surprisingly, yes! Ish! In this talk I discuss this emerging research area. It's not reliable, but I was surprised by how well interp does eg stopping emergent misalignment Bonus: Sound interesting? Apply to work with me!
Jinen Setpal (@48bitmachine)
@bitsgopew @ninja_maths The channel below in general is an absolute goldmine, but the video specifically talks about the visualization in the tweet above, makes it very intuitive: youtube.com/watch?v=ZdlraR… I also recommend checking out 4.2.2. where prof talks about the connection with normal equations
Nick (@bitsgopew)
@ninja_maths Was never taught the geometric interpretation in my uni math courses, and i never got the intuition for this :( I'll actually try again this weekend
Jinen Setpal (@48bitmachine)
@SuryaGanguli I shared this exact analogy at an interview I gave just yesterday (when asked about my use of AI for coding), this tweet made my day :D
Jinen Setpal retweeted
Surya Ganguli (@SuryaGanguli)
Using AI to help you on your homework is like using a robot to help you lift weights at the gym.
Jinen Setpal retweeted
Mathieu (@miniapeur)
Mathieu tweet media
Jinen Setpal (@48bitmachine)
@jennyzhangzt My reviews have been constructive across the board; meaningful feedback from all 4. It's my first major conference submission too, so I was mentally prepared to have everything either torn to shreds or to be fed AI slop but I've been pleasantly surprised😃
Jenny Zhang (@jennyzhangzt)
It is easy and (definitely) understandable to feel that conference reviews are bad (in terms of quality, depth, etc). However, instead of focusing on the bad reviews, I propose that we highlight the good ones instead! 🧵👇
Yiping Lu (@2prime_PKU)

Anyone knows adam?

Jinen Setpal retweeted
Jacob Zietek (@JacobZietek)
Currently working on RFMs for tool use and other industrial tasks @personaaiinc. We have an early customer (Hyundai) and significant capital ($28M pre-seed). Looking to expand the ML team, dm me if you're interested in joining. Also taking cracked interns :)
Adina Williams (@adinamwilliams)
Our team is hiring a postdoc in (mech) interpretability! The ideal candidate will have research experience in interpretability for text and/or image generation models and be excited about open science! Please consider applying or sharing with colleagues: metacareers.com/jobs/222395396…
Joe Stacey (@_joestacey_)
@48bitmachine Hey! Thanks a lot. I don’t think I’ll be able to squeeze it in now, but I’ll definitely keep in mind for next time I’m around 🙂
Joe Stacey (@_joestacey_)
I’m so excited to be on my way to the US!! I’m doing a little end of PhD tour 😍😍 Thanks so much to all the US unis that are having me for visits, I honestly can’t wait 🙂