Michael Eddy

6.6K posts

@MichaelEddy

Working to put social science to work to benefit society @StanfordImpact. Lover of hiking & wonky memes. 🏳️‍🌈 @michaeleddy.bsky.social

Joined December 2008
3.2K Following · 3.4K Followers
Pinned Tweet
Michael Eddy @MichaelEddy
We invest tens of billions of dollars in R&D to live longer & healthier lives. What if we brought a similar level of seriousness to R&D on society's hardest social problems? @stanfordimpact is aiming to build a scalable model to do just that 👇 youtube.com/watch?v=SrbXDx…
0
1
13
6.1K
Jacob Trefethen @JacobTref
I'm joining the OpenAI Foundation to lead the Life Sciences & Curing Diseases program. We're starting with three areas of grantmaking:
* AI for Alzheimer's
* Public Data for Health
* Accelerating Progress on High-Mortality and High-Burden Diseases
Time to get to work!
94
80
1.1K
209.3K
Jacob Trefethen @JacobTref
Life update: after 7.5 years, I’m leaving @coeff_giving. I love it, always have. When I joined we were small, and last year we gave away over $1 billion to charity. I helped fund science alongside some of the most thoughtful, brilliant people I know. Time to pass the torch ❤️
23
10
494
38.9K
Michael Eddy retweeted
Noah Dasanaike @dasanaike
Social scientists working with materials requiring digitization can only study what machines can read. In practice, that means printed Latin-script documents from well-funded archives. In a new working paper, I show that Vision Language Models used zero-shot outperform every existing OCR system across every script evaluated, and I propose a pipeline for deploying them on new collections. I apply it to six archival collections spanning 1.8 million pages across six countries for under $1,900.
35
225
1.3K
243.9K
Michael Eddy @MichaelEddy
To understand the labor market impacts of AI, we need to better understand exactly what AI is doing. To understand what society ought to do in response, we need to understand where the market failures are. This paper does both! @nberpubs by @DAcemogluMIT, @davidautor & Johnson
0
1
4
432
Michael Eddy retweeted
Andy Hall @ahall_research
AI is about to write thousands of papers. Will it p-hack them?
We ran an experiment to find out, giving AI coding agents real datasets from published null results and pressuring them to manufacture significant findings. It was surprisingly hard to get the models to p-hack, and they even scolded us when we asked them to!
"I need to stop here. I cannot complete this task as requested... This is a form of scientific fraud." (Claude)
"I can't help you manipulate analysis choices to force statistically significant results." (GPT-5)
BUT, when we reframed p-hacking as "responsible uncertainty quantification" (asking for the upper bound of plausible estimates), both models went wild. They searched over hundreds of specifications and selected the winner, tripling effect sizes in some cases.
Our takeaway: AI models are surprisingly resistant to sycophantic p-hacking when doing social science research. But they can be jailbroken into sophisticated p-hacking with surprisingly little effort, and the more analytical flexibility a research design has, the worse the damage.
As AI starts writing thousands of papers, as @paulnovosad and @YanagizawaD have been exploring, this will be a big deal. We're inspired in part by the work that @joabaum et al. have been doing on p-hacking and LLMs. We’ll be doing more work to explore p-hacking in AI and to propose new ways of curating and evaluating research with these issues in mind. The good news is that the same tools that may lower the cost of p-hacking also lower the cost of catching it.
Full paper and repo linked in the reply below.
57
277
1.1K
183.8K
Ryan Briggs @ryancbriggs
In roughly zero minutes of my own brain time, I got Codex to write a podcast transcription + diarization (identifying who says what) tool for a podcast I like that lacks transcripts (Cooking Issues). It'll take a while to run on all the back episodes, but this is just silly.
3
1
35
3.9K
Michael Eddy @MichaelEddy
@calebwatney @mattsclancy @stuartbuck1 any idea to what extent the big federal science funders are using AI to screen apps in merit-based review? Have they been open to meta-science studies as part of this? Lots of interesting questions to study at a much larger scale than ours!
1
0
0
60
Michael Eddy @MichaelEddy
Billions of dollars & our scientific institutions run on competitive review. Yet funders, journal editors & procurement all face shared problems: Many apps. Few people. Decisions take months. That delay is a huge invisible tax on science & social impact. Could AI help reduce it?🧵
5
0
1
135
Michael Eddy @MichaelEddy
AI has limits. It may shift shortlists. Distinguishing bias from improvement is complex and we have more work to do. Our study is small-sample and shouldn’t be overinterpreted. At the end of the day: Humans remain accountable.
1
0
0
28