Parthe Pandit

217 posts

@PartheP

Thakur Family Chair Assistant Professor @ C-MInDS, IIT Bombay

Powai, Mumbai · Joined February 2014
524 Following · 334 Followers
Parthe Pandit @PartheP
@mybmc No action taken on this blocked & broken drainage chamber on LJ Rd despite a formal complaint and follow-ups. Please take immediate action. Serious health menace for residents: local sewage not draining into municipal gutters ⇒ clogged sewage ⇒ rats, mosquitoes, cockroaches...
1 reply · 0 reposts · 0 likes · 61 views
Shubhendu Trivedi @_onionesque
At the risk of sounding repetitive: Everyone should check this paper and line of work out! First, because it's cool, and second, they somehow (still not sure how) found some old paper of ours, which I really liked*, that literally no one cared about! 😜 columbia.edu/~skk2175/Paper…
Daniel Beaglehole @dbeagleholeCS

We identify that Conv Nets implement a variant of the same general mechanism of feature learning as in fully-connected networks. The covariances of the filters in CNNs again recover the average gradient outer-product (AGOP) of the model, additionally averaged over input patches.

1 reply · 0 reposts · 14 likes · 4.6K views
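For readers unfamiliar with the statistic: a minimal sketch of what a patch-averaged AGOP computation could look like. This is illustrative only, not the authors' code; `grad_fn` is a hypothetical function returning the model's gradient with respect to its input.

```python
import numpy as np

def agop_over_patches(grad_fn, images, patch=3):
    """Average the outer product of a model's input-gradient over all
    patch-sized windows of all inputs (a patch-averaged AGOP).
    grad_fn(x) is assumed to return df(x)/dx with the same shape as x."""
    d = patch * patch
    M = np.zeros((d, d))
    count = 0
    for x in images:                          # x: (H, W) grayscale input
        g = grad_fn(x)                        # input gradient, shape (H, W)
        H, W = g.shape
        for i in range(H - patch + 1):
            for j in range(W - patch + 1):
                v = g[i:i + patch, j:j + patch].ravel()
                M += np.outer(v, v)           # gradient outer product per patch
                count += 1
    return M / count
```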
Arya Mazumdar @MountainOfMoon
Super proud that my student Nami Matsumoto just won an NSF Graduate Research Fellowship. This is sooo well-deserved!
1 reply · 0 reposts · 35 likes · 2.9K views
Parthe Pandit @PartheP
@andrewgwils 2. arxiv.org/abs/2302.02605 How do you train kernel models with a large number of centers (or inducing points)? A new algorithm, EigenPro3, trains such models using only O(p) memory for p centers. Prior work such as FALKON needed O(p^2) memory, i.e., infeasible for large models
0 replies · 0 reposts · 4 likes · 172 views
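To see where the O(p) vs O(p^2) memory gap comes from: a model with p centers only needs the centers and their weights in memory if kernel blocks are computed on the fly. A hedged illustration of that argument (not EigenPro3 itself, whose preconditioned updates are more involved):

```python
import numpy as np

def predict_batched(X, centers, weights, batch=1024, bw=1.0):
    """Evaluate f(x) = sum_j w_j k(x, z_j) over p centers without ever
    forming a p x p matrix: only one (len(X), batch) kernel block
    exists at a time, so model memory stays O(p) rather than O(p^2)."""
    out = np.zeros(len(X))
    for s in range(0, len(centers), batch):
        Z = centers[s:s + batch]
        # Gaussian kernel block between inputs and this slice of centers
        sq = ((X[:, None, :] - Z[None, :, :]) ** 2).sum(-1)
        out += np.exp(-sq / (2 * bw ** 2)) @ weights[s:s + batch]
    return out
```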
Andrew Gordon Wilson @andrewgwils
...and 3) conjugate gradients! :). I look forward to reading this. I suppose "pure kernel methods" means no kernel learning? I've always wanted to combine these infinite-width methods with kernel learning... perhaps a promising next step.
Jasper @latentjasper

Really excited to share this new paper: Kernel Regression with Infinite-Width Neural Networks on Millions of Examples. We found that the recipe for success with kernels is 1) highly expressive kernels (infinitely wide deep nets) and 2) lots of data. arxiv.org/abs/2303.05420

5 replies · 2 reposts · 26 likes · 13K views
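The "conjugate gradients" ingredient refers to solving the kernel system iteratively through matrix-vector products rather than a direct factorization. A minimal sketch, with a dense kernel for clarity (at millions of examples the matvec itself would be batched or distributed):

```python
import numpy as np
from scipy.sparse.linalg import LinearOperator, cg

def kernel_regression_cg(X, y, bw=1.0):
    """Solve K alpha = y by conjugate gradients, touching K only
    through matvecs (the iterative route mentioned in the thread)."""
    sq = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
    K = np.exp(-sq / (2 * bw ** 2))           # Gaussian kernel matrix
    op = LinearOperator(K.shape, matvec=lambda v: K @ v)
    alpha, info = cg(op, y)                   # info == 0 on convergence
    return alpha
```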
Parthe Pandit @PartheP
@andrewgwils You might find these papers interesting too: 1. arxiv.org/abs/2212.13881 recursively adapts kernel functions, inspired by how neural networks learn features. These models, Recursive Feature Machines, are SotA on tabular datasets and bridge the gap to fully connected networks
0 replies · 0 reposts · 5 likes · 311 views
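A heavily simplified sketch of the Recursive Feature Machine loop: alternate kernel ridge regression with a Mahalanobis Gaussian kernel and an AGOP update of the feature matrix M. Illustrative assumptions throughout (plain ridge solve, Gaussian kernel); the paper's actual recipe differs in details.

```python
import numpy as np

def rfm_sketch(X, y, steps=5, reg=1e-3):
    """Alternate: (1) fit kernel ridge with kernel exp(-(x-z)^T M (x-z)),
    (2) set M to the AGOP of the fitted predictor on the training set."""
    n, d = X.shape
    M = np.eye(d)
    for _ in range(steps):
        diff = X[:, None, :] - X[None, :, :]          # (n, n, d) pairwise x_i - x_j
        sq = np.einsum('ijk,kl,ijl->ij', diff, M, diff)
        K = np.exp(-sq)                               # Mahalanobis Gaussian kernel
        alpha = np.linalg.solve(K + reg * np.eye(n), y)
        # gradient of the fitted predictor at each training point:
        # grad f(x_i) = -2 M sum_j alpha_j K_ij (x_i - x_j)
        G = -2 * np.einsum('j,ij,ijk->ik', alpha, K, diff) @ M
        M = G.T @ G / n                               # AGOP becomes the new M
    return alpha, M
```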
Raaz Dwivedi @raazdwivedi
Extremely delighted to join @Cornell_ORIE @cornell_tech @Cornell. Heartfelt thanks to my amazing advisors, @LesterMackey, Susan Murphy, Devavrat Shah, Martin Wainwright, & Bin Yu, the wonderful collaborators, friends, & family for making this possible. The chapter starts Jan 1st!
Susan Murphy lab @SusanMurphylab1

Postdoc @raazdwivedi has accepted a tenure-track assistant professorship at @Cornell, Operations Research and Information Engineering @Cornell_orie, @Cornell Tech in NYC! Congratulations, Raaz!!!

29 replies · 11 reposts · 245 likes · 40.8K views
Parthe Pandit @PartheP
@nmallinar you can't just get a single dream of Costco. you have to get a 32-pack
1 reply · 0 reposts · 4 likes · 140 views
Neil Mallinar @nmallinar
i dreamt of Costco again last night
1 reply · 0 reposts · 9 likes · 603 views
Parthe Pandit @PartheP
@PreetumNakkiran i am hoping someone creates a generative model that converts hand drawings to professional figures
0 replies · 0 reposts · 0 likes · 259 views
Preetum Nakkiran @PreetumNakkiran
don’t know TikZ, so I have to waste time the old-fashioned way (drawing unnecessary figures by hand)
4 replies · 1 repost · 9 likes · 5.8K views
Gururaj Saileshwar @gururajS92
Thrilled to share that our HPCA'23 paper "Scalable and Secure Row-Swap" has been selected for the Best Paper Award! This was collaborative work led by @JeonghyunW and @Prashxnt_Nair at @UBC! JH will present our paper at @HpcaArchConf on 27 Feb in Montreal!
[image]
4 replies · 0 reposts · 39 likes · 3K views
Parthe Pandit @PartheP
We now have a simple, powerful, and stable alternative to Deep Nets. A challenge in classical kernel models is choosing the 'right' kernel. By examining NN training, we identified the modification necessary to empower kernels: FEATURE LEARNING. #NeuralNetworkFreeSince2023
Daniel Beaglehole @dbeagleholeCS

What is the nature of feature learning in deep networks? We propose that neural networks recover a statistic known as the average gradient outer product (AGOP). Github: github.com/aradha/recursi… arXiv: arxiv.org/abs/2212.13881

0 replies · 2 reposts · 19 likes · 2K views
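For reference, the AGOP statistic named in the quoted tweet is standardly written as follows (standard formulation, not quoted from the paper):

```latex
% Average gradient outer product (AGOP) of a predictor f over inputs x_1,...,x_n
\[
  \mathrm{AGOP}(f) \;=\; \frac{1}{n}\sum_{i=1}^{n} \nabla_x f(x_i)\,\nabla_x f(x_i)^{\top}.
\]
```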
Parthe Pandit @PartheP
@Yizhezhu_ How many samples do I need to generate to achieve the rate? If there are n true samples, can you generate O(sqrt(n)) samples and still achieve a small W1 distance?
1 reply · 0 reposts · 0 likes · 79 views
Yizhe Zhu @Yizhezhu_
The reason we care about 1-Wasserstein distance is that, by the Kantorovich-Rubinstein duality, it uniformly bounds the utility loss for any Lipschitz function applied to the synthetic data.
1 reply · 0 reposts · 1 like · 259 views
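Spelling out the duality being invoked (standard statement, for reference):

```latex
% Kantorovich-Rubinstein duality for the 1-Wasserstein distance
\[
  W_1(\mu,\nu) \;=\; \sup_{\operatorname{Lip}(f)\le 1}
    \Bigl|\, \mathbb{E}_{X\sim\mu} f(X) - \mathbb{E}_{Y\sim\nu} f(Y) \,\Bigr|,
\]
% hence for any L-Lipschitz utility f evaluated on synthetic data (nu)
% in place of true data (mu):
\[
  \bigl|\mathbb{E}_{\mu} f - \mathbb{E}_{\nu} f\bigr| \;\le\; L\, W_1(\mu,\nu).
\]
```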
Parthe Pandit @PartheP
@ysbhalgat Paragraph structure is very important for technical communication. The same text should perhaps be divided into 3-4 separate paragraphs, each with a purpose.
1 reply · 0 reposts · 1 like · 18 views
Parthe Pandit @PartheP
@ysbhalgat Not a good summary.
- Needs bullets. Currently, I need to read an entire sentence with obscure 10-word-named schemes to get to the start of the next sentence.
- The Capitalization Of Words In The Sentences Makes It Harder To Read Easily.
- Topics can be segregated based on themes.
1 reply · 0 reposts · 1 like · 14 views
Shubhendu Trivedi @_onionesque
A poster by Larry Wasserman to put on your refrigerator.
[image]
9 replies · 33 reposts · 237 likes · 33.7K views
Yash Bhalgat @ysbhalgat
I just used humata.ai to explain the Union Budget recently released by the Indian Govt. I have a minimal understanding of Economics/Politics, but I think it did a great job summarizing the key points in a 58-page doc. (2/n) #UnionBudget2023
[image]
1 reply · 0 reposts · 1 like · 285 views
Parthe Pandit retweeted
Rob Nowak @rdnowak
Rahul Parhi and I wrote a tutorial-style article explaining the modern theory of deep learning and neural function spaces via elementary signal/image processing concepts (Fourier transform, Radon transform, L1 regularization, sparsity) arxiv.org/abs/2301.09554
8 replies · 81 reposts · 401 likes · 65.3K views
Dimitris Papailiopoulos @DimitrisPapail
Nobody calls denoising bullshit generation, but suddenly if it happens using a transformer then it is?
5 replies · 0 reposts · 10 likes · 4.5K views