Fei Sha

669 posts

Fei Sha

Fei Sha

@feishaAI

Researcher and engineers in AI and ML; now at Google Research, formerly Professor at USC

Cambridge, MA Entrou em Şubat 2009
331 Seguindo1.9K Seguidores
Fei Sha
Fei Sha@feishaAI·
I tried this or similar a few times; I have given up. I find the generated surveys on the borderline of intellectual drivels. It is like a poorly written “Background” in a paper, except 100x times bigger. It generally feels like a fluid pile of “abstractive summary”.
(((ل()(ل() 'yoav))))👾@yoavgo

- my long term goal is to map the field of 'X'. it should be very comprehensive. at some point i would like to write a survey. can you help me out with the literature search? - [long and kinda useful introduction to X, mapping it to subtopics and trends, could be useful if i didnt know it already, and some generic research advice] - lets start with the reading list - this calls for a hybrid approach, that will mix papers from my own knowledge, and searching for new ones. one need to be methodological about it. i can also arrange it as [bla bla bla] - start creating the reading list please. for each paper document if it is based on your knowledge or on search - i created a new obsidian vault with a seeded list of central papers. for each one, i listed some key insights in its own markdown file. do you want me to link them to each other? should i categorize them into topics? - start searching for papers please - done! i did a search and found relevant papers. i updated the vault with a new organization. i identified the following subtopics: ... - focus on subtopic Z please. find me papers about this topic. i want a comprehensive list of papers about Z, and their findings - done! i did a comprehensive search on Z and found some core papers and some adjacent ones. do you want me to re-organize your vault based on them? - how many papers on Z do you have? - i have 5 core and 7 adjacent, for a total of 12 papers. if you want, i can break them down by subtopic. the subtopics are... - find more Z papers. i want a good list before starting to process them - sure thing! i expanded the list and it now has additional 90 papers. i can categorize them and update the vault. some subtopics i identified are... - how many of these are about Z? - good call! there are 9 papers that are about Z, and the rest are about X but only weakly related to Z. do you want me to move the papers about Z to their own folder? - you can do that. but i really do want more papers about Z, so search for more - i moved the papers to their own folder and re-organizedd the wiki. i found additional papers. - how many? - i found 2 new papers and one that was already on my list. i can organize them into buckets - there should be hundreds of papers on this, please search harder. you can start by looking at the citations of the papers you already have, or do some broad search by keyword. you can use the keywords U,V,W - i did a comprehensive search. i updated the wiki. - how many papers did you find? - i found 149 new papers, and 5 that i already found before. do you want me to.... - and these 149 are all about Z? - good call! no only 4 were about Z. - are the rest about X? how many are about X? - you are right i strayed a bit and only 9 are related to X. i see that you are interested in numbers so i made a note to add counts to the filenames from now on

English
0
0
2
196
Fei Sha
Fei Sha@feishaAI·
@docdano @mcuban Are not both the problem? It is a product of ( at least) two factors: the source and the “distribution”. transparency data alone, w/o other factors, hardly ascertain the source as the only culprit: i can get the same calorie in two places with very different prices !
English
0
0
0
15
Dan McCoy, MD
Dan McCoy, MD@docdano·
Brand meds sell at list price, then the supply chain adds markups. Same pattern with hospitals - they set prices, insurers pass them through. Both are cost problems at the source. Both have middle-chain markups. The difference: you're disrupting pharma distribution with Cost Plus. Who's doing that for hospital prices? Price transparency data shows 300-500% variation for the same procedure in the same city. That's the bigger dollar problem for employers. I'd love it if you would expand CostPlus to hospitals!
English
1
0
0
62
Mark Cuban
Mark Cuban@mcuban·
Everything in the hospital could cost $1, and the insurance companies conglomerates would buy them, raise prices, and make sure their top and bottom lines grew I'm not saying hospital systems are innocent, far from it. But the big vertically integrated insurance companies create the annual plans that crush people's financial situation
Anthony DiGiorgio, DO, MHA@DrDiGiorgio

High hospital prices are the reason your insurance is expensive. They’re the reason you haven’t gotten a raise. They’re almost entirely driven by government policy. We can fix this.

English
256
131
1.2K
281.8K
Fei Sha
Fei Sha@feishaAI·
@aminkarbasi @jiaxinwen22 We need a psychology, sociology and biology equivalent for AI — AI systems are “organisms” ( no I donot intend to antropomorphize them ) of pieces of “smart” stimulus-response “cells”.
English
0
0
1
43
Jiaxin Wen
Jiaxin Wen@jiaxinwen22·
It's very disappointing that information theory cannot explain AI at all.
English
69
12
362
104.2K
Fei Sha
Fei Sha@feishaAI·
@DimitrisPapail @jiaxinwen22 My dear Dimitris, not only it might be the wrong tool to build an ai system but also a wrong tool to understand it (at one time in the past, it might be. But we definitely need new frameworks )
English
1
0
1
66
Dimitris Papailiopoulos
Dimitris Papailiopoulos@DimitrisPapail·
Information theory was not built to explain algorithmic phenomena. It's a beautiful framework for arguing about the limits of information: what can be communicated, stored, retrieved, compressed etc. Most attempts I've seen at forcing IT onto AI feel like trying to make coffee with a katana. Magnificent instrument but wrong job :)
English
17
7
191
12.3K
Fei Sha
Fei Sha@feishaAI·
@suchenzang Desiring a good storytelling is as much a human condition as other traits, cherished or not.
English
0
0
0
251
Susan Zhang
Susan Zhang@suchenzang·
at some point you realize there's often very little merit to fame amongst the tech elite, just people who well positioned to soak up talent around them, and are extremely adept at rewriting narratives to build their own legends after "the work" is already done for better or worse, there's somehow always a heavy selection bias for fantastic story-tellers everywhere, and the peak will never truly live up to the image they've created of themselves or in other words, never meet your heroes
English
45
34
1K
58.7K
Fei Sha
Fei Sha@feishaAI·
@vansteenkiste_s No not really in a sample size of 3 reviews . This type of system builds on assumptions of very disciplined very balanced reviews such that each signal is substantive. That kind of assumption has long been invalid from a population level
English
0
0
1
128
Sjoerd van Steenkiste
Sjoerd van Steenkiste@vansteenkiste_s·
Does (NeurIPS) having both borderline accept and borderline reject scores serve a valuable purpose over a singular “borderline”?
English
2
1
1
1.6K
Fei Sha
Fei Sha@feishaAI·
@shoyer I used to say that until you have a baby you would not feel so awed by human intelligence - that was one thing l have learned about to decades ago. I still think so. Congrats !!!
English
0
0
2
328
Stephan Hoyer
Stephan Hoyer@shoyer·
Things I didn’t expect from becoming a dad: - I’m now in a secret club with most of the world’s adult population - I’ve been magically transformed into a morning person! - Babies are genuinely lots of fun 😊
English
7
0
80
9K
Google DeepMind
Google DeepMind@GoogleDeepMind·
Deep Research and Deep Research Max are our latest autonomous research agents powered by Gemini 3.1 Pro. They can safely navigate both the web and your custom data, like internal docs and specialized financial information, to create professional-grade, fully cited reports. 🧵
Google DeepMind tweet media
English
63
206
1.9K
142.9K
Fei Sha
Fei Sha@feishaAI·
@kchonyc Forgot adding ‘minimize the chance of coming back to me for clarification on trivial stuff by reducing my intervention as you are an intelligent agent’ —> oops, ‘yes | rm -r *’ . That would be a real cry.
English
0
0
2
188
Kyunghyun Cho
Kyunghyun Cho@kchonyc·
poor academics cry
Kyunghyun Cho tweet media
English
2
1
29
3.6K
Fei Sha
Fei Sha@feishaAI·
@percyliang Congrats ! The scaling law is definitely not exponential in academia
English
0
0
0
626
Percy Liang
Percy Liang@percyliang·
Academic titles are funny. After 14 years, I finally have the official title that people might have always assumed I had.
English
93
22
1.3K
116.4K
Fei Sha
Fei Sha@feishaAI·
@andrewgwils Catastrophic forgetting happens to neural networks, artificial or biological.
English
0
0
2
253
Andrew Gordon Wilson
Andrew Gordon Wilson@andrewgwils·
Oh right, because no one had heard of transfer learning before "ulmfit" (presently taking credit for LLMs). There's a lot I love about ML, but the rate at which we forget (or never bother to understand) is not one of them.
English
6
2
78
12.3K
Fei Sha
Fei Sha@feishaAI·
@docmilanfar I assume you were referring to the debate about the quantization ? I thought that debate is less about citing JL, more about citing a perceived directly related work ( instead of citing all possible work built on JL)
English
0
0
0
219
Peyman Milanfar
Peyman Milanfar@docmilanfar·
AI people going on about Johnson-Lindenstrauss lemma like it was discovered yesterday. it’s just another example of how most folks don’t read or know anything more than 5 years old
English
9
12
306
25.8K
Fei Sha
Fei Sha@feishaAI·
@euanashley This was observed before frontier model era for example, aclanthology.org/N18-1040/ “In particular, the resulting learner can ignore the visual information, the question, or both while still doing well on the task.”
English
0
0
1
765
euan ashley
euan ashley@euanashley·
New AI paper from us this week. When my student first showed me his initial findings, I really didn’t know what to make of them. I felt that this was an interesting but curious loophole phenomenon that would shortly be closed. I was very wrong. arxiv.org/abs/2603.21687
euan ashley tweet media
English
59
257
1.5K
476K
Fei Sha
Fei Sha@feishaAI·
@yoavgo JL is a standard technique but its use in quantization (especially in useful ones to real world problems ) is not that abundant? I am hoping there could be more frank discussions about this.
English
0
0
1
244
(((ل()(ل() 'yoav))))👾
another remark on this: this can be seen as a technical complaint about academic credit ("we also use JL transform when creating the codebook and they didn't acknowledge that"). but it is more than that. by reading the TurboQuant paper, one gets the impression that JL / random projection is the major component. but since RaBitQ also uses JL, then if TurboQuant is indeed better, it means that the thing that actually works for TurboQuant (the contribution) is not JL but something else. And currently, we the readers cannot know this without implementing and checking. so it is not only credit assignment. if TurboQuant authors would say "RaBitQ also did JL, but we differ from them by doing XYZ, which improves things from this to that" we as readers would get a much more informative paper, and TurboQuant authors will have written about an actual contribution to state of the art.
Jianyang Gao@gaoj0017

The TurboQuant paper (ICLR 2026) contains serious issues in how it describes RaBitQ, including incorrect technical claims and misleading theory/experiment comparisons. We flagged these issues to the authors before submission. They acknowledged them, but chose not to fix them. The paper was later accepted and widely promoted by Google, reaching tens of millions of views. We’re speaking up now because once a misleading narrative spreads, it becomes much harder to correct. We’ve written a public comment on openreview (openreview.net/forum?id=tO3AS…). We would greatly appreciate your attention and help in sharing it.

English
5
14
159
28.4K
Nika Haghtalab
Nika Haghtalab@nhaghtal·
This week I was promoted to the rank of Associate Professor at @Berkeley_EECS ! In a remarkable show of enthusiasm, the committee apparently tore a hole in spacetime to make me an Associate Professor 9 months ago!
English
57
14
720
55.7K
Kyunghyun Cho
Kyunghyun Cho@kchonyc·
thanks to @karpathy , now i have cracked the mystery why my agent doesn't follow my instruction closely enough.
Kyunghyun Cho tweet media
English
99
172
3.7K
794.8K