Matt | PublicThink

12 posts

Matt | PublicThink

@PublicThinkOrg

Who decides what gets studied? We're building PublicThink. It's a place for the public to set the research agenda.

Katılım Nisan 2026

84 Takip Edilen0 Takipçiler

Matt | PublicThink@PublicThinkOrg·20h

@lakens It keeps getting worse. There's no cost to using AI unchecked, so now we can't trust the critiques to be anything better than AI slop. We need harsh consequences for fabricated citations in peer criticism. I wonder if libel laws apply here.

English

486

Daniël Lakens@lakens·23h

New blog post: Evaluating Dr. Cuddy’s Claim that the Debunking of Power Posing is a Myth. daniellakens.blogspot.com/2026/05/evalua… On an AI generated description of a non-existent study, incorrectly citing findings from studies, and the importance of scientific criticism.

English

11.7K

Matt | PublicThink@PublicThinkOrg·1d

@robinhanson I think in general, people have lost their curiosity and drive to learn new things because times are rough right now, even so… some people really do have a skill for pulling people in regardless.

English

Robin Hanson@robinhanson·1d

Except, I know lots of innovative things I could explain to a smart 5yo who was willing to listen carefully. But when you talk on sensitive topics, the big problem is folks unwilling to listen.

Sahil Bloom@SahilBloom

This is a major life hack: Richard Feynman was known for his ability to convey complex ideas in simple, elegant ways. Remember this rule the next time someone tries to fast talk you with a bunch of fancy words, acronyms, and jargon...

English

4.6K

Matt | PublicThink@PublicThinkOrg·1d

@davidmanheim The strawman lands because there’s no scoreboard. When predictions live as scattered tweets, anyone can claim mostly-right and anyone can cherry-pick misses.

English

David Manheim@davidmanheim·1d

Gary has been wrong about plenty of things, and right about plenty more. That's what happens when you say things publicly about the future. I disagree with him about lots of his expectations about the coming years, but strawman attacks are something everyone should denounce.

Gary Marcus@GaryMarcus

The case against me below is completely intellectually dishonest, filled with lies and misrepresentations, wrong about almost literally everything it says—a textbook example of propaganda: - I didn’t say scaling laws didn’t work ever; I said that pure scaling would reach a point if diminishing returns (it did) - I didn’t say AI progress in general would have diminishing returns; I said pure scaling would (it did; neurosymbolic tools and harness are doing a lot for the work now, as I said they would) - I didn’t say deep learning would hit a dead end forever; i said it would need to encompass new mechanisms such as neurosymbolic AI (it did) - I didn’t say models would never improve; i said GPT-5 wouldn’t arrive in 2024 (it didn’t) - I never said LLMs aren’t any good (I have often pointed to reasonable use cases like coding) - Only part that is partly true is that O signed the pause letter, but as I noted publicly at the time it was because I thought we should have more research on AI safety (still do). (If you care about fair play and seeking truth, I hope you would consider retweeting this.)

English

8.5K

Matt | PublicThink@PublicThinkOrg·2d

@lakens Making public claims without due diligence causes real harm. A private correction doesn't reach the million people who saw the original. All future posts should have to link the retraction. The people who were misled deserve the same reach the false claim got.

English

125

Daniël Lakens@lakens·2d

If a scientist uses AI to make strong claims in public about the scientific literature, and some references do not exist, and the generated text does not correcfly reflect the findings, should we inform them privately to fix the text or should we share this information publicly?

English

7.1K

Matt | PublicThink@PublicThinkOrg·2d

@hankgreen GLP-1s are thrilling, but I keep thinking about peptides. They have equal or greater potential but are barely studied, because you can't patent a peptide the way you can patent semaglutide. We're leaving breakthroughs on the table because there's no profit in them.

English

Hank Green@hankgreen·2d

This is very, very interesting but not proof of effect. They looked at a large group of people with diabetes, some taking GLP-1s, some taking DPP-4 inhibitors (another medicine for type 2 diabetes). The ones who had cancer and were taking GLP-1s, on average, had significantly better outcomes than a matched group on DPP-4 inhibitors. A lot of different things could cause those different outcomes, including that there is something different about those two groups. People who get prescribed GLP-1s might be healthier, wealthier, have better healthcare habits, etc. But it's an extremely enticing signal and there are several plausible mechanisms by which GLP-1 drugs might help with cancer treatment, so...expect clinical trials!

The Wall Street Journal@WSJ

The world’s most popular weight-loss and diabetes drugs are linked to a powerful new possible benefit: better outcomes for cancer patients. on.wsj.com/3RBfcXO

English

1.4K

239.6K

Matt | PublicThink@PublicThinkOrg·2d

Building things is the part I love. But eventually you have to let people see it, and if nobody cares, it goes nowhere. I love what I built and I think it should exist. I just don't want it lost to internet oblivion.

English

Matt | PublicThink@PublicThinkOrg·3d

@Noahpinion I agree. There were already more incentives to publish bad work than to find it and correct it. Now the volume is overwhelming even the verification tools that existed. I don't have a good answer to where you find the trustworthy work anymore.

English

Noah Smith 🐇🇺🇸🇺🇦🇹🇼@Noahpinion·3d

The whole scientific enterprise is in trouble, and the Slopocalypse is going to compound the problem enormously

Melissa Chen@MsMelChen

Fascinating to see that Chinese researchers and whistleblowers are exposing high profile science journals such as Nature for publishing fraudulent papers. This same rot mirrors the replication crisis and distrust of scientific journals that's been going on in the US. Prestigious journals like Nature, Science, and Cell have morphed into gatekeepers of narrative rather than truth, amplifying irreproducible work while sidelining inconvenient findings. Publish-or-perish incentives combined with ideological capture - DEI mandates, politicized climate and biomedical research - have all but eroded credibility. Fraudulent papers proliferate because the system rewards quantity and alignment with prevailing orthodoxies over careful replication and falsification. It's refreshing to see accountability surfacing in China. Maintaining public trust in science demands relentless scrutiny, not institutional sanctity.

English

123

20K

Matt | PublicThink@PublicThinkOrg·3d

@StatModeling Rigor doesn't pay and sloppiness doesn't cost. That's the actual problem. There's no reward for tracing a citation carefully, and no real penalty for passing on a claim you never verified. The sensational version always travels faster than the correction.

English

Andrew Gelman et al.@StatModeling·3d

Don’t cite sources you haven’t read, and don’t trust when people claim to be reporting something from the literature. statmodeling.stat.columbia.edu/2026/05/22/don…

English

2.3K

Matt | PublicThink@PublicThinkOrg·3d

@TheStalwart @adamjkucharski So... Copilot described UK responses as more understated, US as more emphatic. Both are actual cultural stereotypes. Kucharski had copy-pasted 2,000 responses and relabelled them. To me, it looks like LLMs tend to reproduce what sounds right, not what the data says.

English

818

Joe Weisenthal@TheStalwart·4d

This was an amazing and incredibly damning experiment using Microsoft Copilot, by @adamjkucharski kucharski.substack.com/p/real-signals…

English

410

2.5K

303.2K

Matt | PublicThink@PublicThinkOrg·3d

At PublicThink, open submission means anyone can post anything, which is a problem. My current solution is very simple. A Discord bot pings my phone every time a question comes in and I review it personally. Then, good questions should rise to the top through voting.

English

Matt | PublicThink@PublicThinkOrg·3d

@RaoulRuparel The timeline might be the deeper problem. A serious RCT takes 18 months to 2 years from design to publication. By then the model is several generations old. More RCTs don't obviously fix that. Probably needs something closer to ongoing monitoring than periodic studies.

English

Raoul Ruparel@RaoulRuparel·5d

The more I get into studying & researching the economics of AI, the clearer it is that we are desperately in need of more randomised control trials (RCTs) & other systematic assessments of the potential time saved, quality improvements, or other measures of productivity gains from AI. Especially at the occupation or sector level, with outputs that are usable as inputs into macro models (this is important). As far as I can tell, pretty much every academic study on the productivity effects of AI rests on the same 3 or 4 RCTs or studies, then draws some generalised lessons. 1) This is a very thin & increasingly outdated evidence base. In fact, many of the RCTs rest on GPT-4 level models, we're well past that obviously. 2) It creates a massive clustering within the research. It also means that the input assumptions become the main variation/difference. This isn't necessarily bad but it means we really need to be aware & stress test the assumptions' credibility. 3) There is a huge lack of nuance, granularity, & consistency of detail across tasks/occupations/sectors. If I were working at a university or one of the tech firms I think this would be one of the most impactful research efforts to undertake. Would feed into a huge array of academic work.

English

8.6K

Matt | PublicThink@PublicThinkOrg·4d

Why doesn't anyone trust policy research anymore? Because most of it is funded by people who've already chosen what the answer should be. Maybe it's time to try something different.

English

Keşfet

@lakens @robinhanson @davidmanheim @hankgreen @Noahpinion @StatModeling @TheStalwart @adamjkucharski