Dimitris Papailiopoulos

9.9K posts

Dimitris Papailiopoulos banner
Dimitris Papailiopoulos

Dimitris Papailiopoulos

@DimitrisPapail

Researcher @MSFTResearch, AI Frontiers Lab; Prof @UWMadison (on leave); reasoning in context; learning to remember; agent of agents; babas of Inez Lily.

Madison, WI Katılım Mayıs 2012
1.3K Takip Edilen25.1K Takipçiler
Dimitris Papailiopoulos retweetledi
Peter Gostev
Peter Gostev@petergostev·
LLM sceptics have predicted the last 7 of 0 walls
English
9
11
186
8.1K
Omar Khattab
Omar Khattab@lateinteraction·
what do people mean by ex-founder? how did you edit history to un-founder yourself
English
4
0
45
6.2K
Dhruv Batra
Dhruv Batra@DhruvBatra_·
@DimitrisPapail I have raised funding as a university faculty member and as a founder. I doubt I need to tell you which one involved more bureaucracy and inefficiency. Perhaps our focus should be reducing bureaucracy.
English
2
0
3
600
Dimitris Papailiopoulos
Dimitris Papailiopoulos@DimitrisPapail·
The entire NSF research budget is ~$9B/year. This is literally funding every awarded PI at every field and every institution. But we've decided that all of basic science is a rounding error in comparison to venture bets. Please consider funding basic science more.
English
10
44
433
28.3K
Dimitris Papailiopoulos
Dimitris Papailiopoulos@DimitrisPapail·
@tszzl @nikitabier clearly true, and most know. i always add a TLDR paragraph. now there's going to be a TLDR of the TLDR+article, sounds like adding to the garbage pile of what currently is the aesthetics of articles. Just style copy subst*ck and move on plz..
English
0
0
2
235
roon
roon@tszzl·
@nikitabier you’re going to get hate for this but it’s obviously the right product choice. I’m sure you have the data but I assume 99% of people open any article and close it after seeing it’s longer than a paragraph
English
28
1
281
9K
Nikita Bier
Nikita Bier@nikitabier·
We’re rolling out summaries for Articles now. Just tap the Summarize button if you want to know if it’s worth your time to read it (or if your attention span is 12 seconds).
English
1.3K
260
3.5K
734.3K
Dimitris Papailiopoulos
Dimitris Papailiopoulos@DimitrisPapail·
@petergostev i bet Claude code can destroy this, because it will pattern match what the right interpreter is if you close the feedback look, which seems the right paradigm for coding..
English
3
0
13
1K
Peter Gostev
Peter Gostev@petergostev·
These kind of claims never pass the sniff test. Benchmarks can be cheated, but if it worked 0-11% of the time on real tasks (which are not part of benchmarks) nobody would ever use LLMs for coding.
Lossfunk@lossfunk

🚨 Shocking: Frontier LLMs score 85-95% on standard coding benchmarks. We gave them equivalent problems in languages they couldn't have memorized. They collapsed to 0-11%. Presenting EsoLang-Bench. Accepted to the Logical Reasoning and ICBINB workshops at ICLR 2026 🧵

English
11
0
60
8.5K
Lossfunk
Lossfunk@lossfunk·
🚨 Shocking: Frontier LLMs score 85-95% on standard coding benchmarks. We gave them equivalent problems in languages they couldn't have memorized. They collapsed to 0-11%. Presenting EsoLang-Bench. Accepted to the Logical Reasoning and ICBINB workshops at ICLR 2026 🧵
English
113
212
1.6K
775.6K
Yuchen Jin
Yuchen Jin@Yuchenj_UW·
I noticed something interesting: Claude Code auto-adds itself as a co-author on every git commit. Codex doesn’t. That’s why you see Claude everywhere on GitHub, but not Codex. I wonder why OpenAI is not doing that. Feels like an obvious branding strategy OpenAI is skipping.
English
192
30
1.5K
132K
Marcin Paprzycki
Marcin Paprzycki@marcinpaprzycki·
@DimitrisPapail You want state to do things? 😁😜😛 You are sooo woke 😜😜😜😜😜 only meta can waste 80 billion for nothing and you should be happy about this. You should feel good... Do you have a warm feeling about Meta?
English
1
0
1
295
Dimitris Papailiopoulos
Dimitris Papailiopoulos@DimitrisPapail·
@GalantiTomer i've heard startup piches that raise 1 oom more than that that are worse than the worst nsf proposal i've reviewed.
English
1
0
0
256
Tomer Galanti
Tomer Galanti@GalantiTomer·
@DimitrisPapail It is probably easier to raise 10m for a startup than to get 300k for research from government
English
1
0
2
447
Surya Ganguli
Surya Ganguli@SuryaGanguli·
@DimitrisPapail @pfau And I keep hearing of plans to divert government funds for basic research in academia to research projects in startups, without replenishing the funding that would have gone to academia.
English
1
0
9
1.3K
Erik Meijer
Erik Meijer@headinthebox·
@DimitrisPapail Imagine if Meta would have invested all that VR money in academic research …
English
1
0
3
936
Dimitris Papailiopoulos
Dimitris Papailiopoulos@DimitrisPapail·
Lambda is completely out of capacity since yesterday and so I'm trying runpod. Seems good, cheaper, and boots vms much faster! Web UI aesthetics are much busier though. Not that claude code cares, but just saying.
English
4
0
28
3.1K
Dimitris Papailiopoulos retweetledi
Mario Filho
Mario Filho@mariofilhoml·
A Kaggle Grandmaster Tries to Semi-Automate Himself An experiment in turning years of machine learning experience into an research loop that could run on its own. Inspired by @DimitrisPapail github.com/ledmaster/ml-m…
English
0
2
13
1.4K
Glenn Matlin
Glenn Matlin@GlennMatlin·
@DimitrisPapail The worst part is that the term "basic research" makes people think it means "simple and not complex or important" ! It's actually one of the most important kinds to fund to produce material changes that improve peoples lives.
English
1
0
5
945
Dimitris Papailiopoulos
Dimitris Papailiopoulos@DimitrisPapail·
total fed research funding is ineed broader than NSF. But NSF is unique in that it funds a lot of curiosity driven, fundamental research with no mission requirement. NIH funds health and DOD has a much broader agenda (though it also fund some basic/math research). NSF is the only agency that will fund a mathematician with a wild idea and no application in sight which is the kind of research that historically produces big surprises but is also extremely fragile to funding
English
0
2
20
1.2K
Deivon Drago
Deivon Drago@DeivonDrago·
I agree about funding basic research. But not sure where you have that specific # from - it's not an accurate picture at all. The biggest source of fed funding is from HHS (mostly NIH) and that's over $30b. 2nd is DOD, and only then NSF. There is also DOD, NASA etc. You can see details here in the breakout by funding source, but in FY23 the total was around $60b. ncses.nsf.gov/pubs/nsf25313
English
1
0
4
1.3K
Dimitris Papailiopoulos
Dimitris Papailiopoulos@DimitrisPapail·
perhaps to develop this implied concern of mine a bit more, there seems to be two main types of interested consumers of "unlearning services": those that expect from the service that a post-processed model sheds any meaningful behaviors elicited by training on a specific data set, but don't care so much about "identical to whatever model we'd have if this data did not exist in the training mixtures. And those that absolutely would not tolerate epsilon bits of information of a certain data point leaking to the model. For the first group, I would guess there are statistical tests/evals that can be sufficiently convincing, but not for the second.
English
0
0
1
199
Stella Biderman
Stella Biderman@BlancheMinerva·
If I was going to claim that a finetuning methodology for machine unlearning “really worked,” what evidence would you like to see?
English
13
1
29
8.2K