Paul Esau

93 posts

@ptesau

Spring 2023 @ScovillePF at @WisconsinProj. Military export policy writer/researcher. 2022 @Laurier and @StudyofCanada alum.

Washington, D.C. · Joined October 2014
105 Following · 71 Followers

Paul Esau @ptesau
IMO, writers and editors should both be using detectors in their workflows, especially if they have also integrated other AI research/editing tools. AI can liberate people from the "hardness" of writing (empowering for many!), but only by introducing other risks. (2/2)
Replies: 0 · Retweets: 0 · Likes: 1 · Views: 19

Paul Esau @ptesau
I agree with @alexcdot that, while ragebait gets clicks, AI detection is fundamentally about transparency and trust. I'm not in favor of "gotcha" moments unless factual errors (bad stats, vibe citations, metacommentary, etc.) indicate no human checked the work. (1/2)
Alex Cui @alexcdot

WSJ reports on @pangramlabs falsely accusing people of AI use. wsj.com/opinion/the-ai… The AI detection industry needs to rise above accusations as a way to drive use (which people on Twitter love; I get it, ragebait sells). If we really want to promote human-written content, we should help people establish that their writing is human. Give people the ability to replay their process. Trust comes from transparency. So what did we do about this at @GPTZeroAI?

Replies: 1 · Retweets: 0 · Likes: 1 · Views: 30

Paul Esau @ptesau
@shadihamid @matthewkaemingk Thanks for the rec! And thanks to both of you again for doing what you do! (P.S. Shadi: Not only is it a signed copy, but it's lived rent-free in my head for the last two years; I keep referencing it in convos about US FP.)
Replies: 1 · Retweets: 0 · Likes: 1 · Views: 15

Shadi Hamid @shadihamid
@ptesau @matthewkaemingk Christian Hospitality and Muslim Immigration in an Age of Fear is amazing! Also thanks for buying The Problem of Democracy :)
Replies: 1 · Retweets: 0 · Likes: 2 · Views: 35

Paul Esau @ptesau
@shadihamid @matthewkaemingk Although it reminds me that I bought Shadi's The Problem of Democracy in DC a few years back, but never picked up one of Matthew's books. Seems unfair. Which one would you recommend, @shadihamid ? 😀
Replies: 1 · Retweets: 0 · Likes: 1 · Views: 25

Paul Esau @ptesau
@dex_eve So the NYT could be correct in calling out Cooper's reluctance to apply Article 51 fully, instead of conveniently narrowing the definition of "indiscriminate" to point b?
Replies: 0 · Retweets: 0 · Likes: 0 · Views: 17

Paul Esau @ptesau
@dex_eve Doesn't this last concession invalidate your original criticism of the NYT, tho? The 1% failure rate is a hypothetical ideal for future US-manufactured cluster munitions — the cluster munitions transferred to Ukraine in 2023 from stockpiles had a much higher failure rate (5-10%).
Replies: 1 · Retweets: 0 · Likes: 0 · Views: 67

Decker Eveleth @dex_eve
This is a disappointing article from the NYT. It does not seem to understand the substance of Adm. Cooper's comment, and thus ends up with an argument that is mostly irrelevant to said comments. A short thread.
John Ismay @johnismay

CENTCOM is unhappy with this story, but unable to argue with the facts it presents. This morning I offered an on-record interview with Adm. Cooper in case he wanted to clarify his remarks, but that was rejected out of hand by a staffer. nytimes.com/2026/03/17/us/…

Replies: 1 · Retweets: 73 · Likes: 423 · Views: 121.7K

Paul Esau @ptesau
@Er_Woods @edward_the6 Considering your level of expertise, being a dinosaur is a solid strategic choice. Dataset coding checks out, but I'm surprised it's been so useful for language learning. Chinese, right? And you don't use any tools for baseline research or comms clarity/structure suggestions?
Replies: 1 · Retweets: 0 · Likes: 0 · Views: 22

Eric Woods @Er_Woods
@ptesau @edward_the6 It’s very good at languages (shocking!) so I’ve also found use in assisting with language learning. Namely as a grammar reference and tool to break down complex sentences and vocab into their component parts
Replies: 1 · Retweets: 0 · Likes: 0 · Views: 24

Edward Tian @edward_the6
When was the last time you didn’t use AI to write?
Replies: 6 · Retweets: 0 · Likes: 9 · Views: 968

Paul Esau @ptesau
@Er_Woods @edward_the6 I'm actually pretty curious what benefits you've seen from AI tools given the technical nature of your work, the multiple languages involved, and the consequences of hallucinations. Have you experienced any significant increases in productivity, or just lower noise/signal ratios?
Replies: 1 · Retweets: 0 · Likes: 1 · Views: 20

Paul Esau retweeted
Scoville Fellowship @ScovillePF
Interested in working for a think tank or advocacy organization in DC on international peace and security for 6 to 9 months? Join us on Wednesday, February 4 at 3:00 PM EST to hear from Scoville Fellows and staff to learn more about the fellowship. scoville.org/overview/infor…
Replies: 0 · Retweets: 4 · Likes: 2 · Views: 202

Paul Esau @ptesau
@alexcdot "FirstName Lastname" in a published paper gets me every time. "And Others" is just the cherry on that slop cupcake!
Replies: 0 · Retweets: 0 · Likes: 3 · Views: 320

Alex Cui @alexcdot
Another example of "vibe citing". The authors are nonexistent and the publication dates are off by years. These papers had to beat out 15,000 others, which got rejected. How does this happen??
Replies: 8 · Retweets: 25 · Likes: 479 · Views: 56.9K

Alex Cui @alexcdot
Okay so, we just found that over 50 papers published at @Neurips 2025 have AI hallucinations. I don't think people realize how bad the slop is right now. It's not just that researchers from @GoogleDeepMind, @Meta, @MIT, @Cambridge_Uni are using AI: they allowed LLMs to generate hallucinations in their papers and didn't notice at all. It's insane that these made it through peer review 👇
Replies: 280 · Retweets: 1.4K · Likes: 6.3K · Views: 996.3K

Paul Esau @ptesau
I know every generation of academics moans about the diminishing quality of publications, but this crisis (caused by the intersection of AI tools and huge increases in submissions) just feels different. I'm happy to be building solutions with @alexcdot and the GPTZero team.
Alex Cui @alexcdot

Okay so, we just found that over 50 papers published at @Neurips 2025 have AI hallucinations. I don't think people realize how bad the slop is right now. It's not just that researchers from @GoogleDeepMind, @Meta, @MIT, @Cambridge_Uni are using AI: they allowed LLMs to generate hallucinations in their papers and didn't notice at all. It's insane that these made it through peer review 👇

Replies: 1 · Retweets: 0 · Likes: 8 · Views: 415

Paul Esau retweeted
Edward Tian @edward_the6
Hey Alex, huge fans, and kudos for an awesome study on AI detection reliability in the ecosystem. HUGE CLARIFICATION: we were benchmarked in this study without accounting for our three-output structure, excluding our third 'mixed' class. It's like having a two-party election and forgetting there's a smaller third independent party. We care about the third option because more writing on the internet is now humanized, modified, lightly edited with AI, or mixed and combined. Accounting for this, our recall is 99.3% and FPR is 0.5%, outperforming Pangram here.
Replies: 1 · Retweets: 5 · Likes: 16 · Views: 3.2K

Paul Esau retweeted
Alex Cui @alexcdot
Millions of people are now seeing the "verified by GPTZero" badge, starting today. @hackernoon, one of the world's most popular tech blogs, is officially verifying their 100,000s of articles and 50,000+ tech writers with @GPTZeroAI. And this is just the beginning...
Replies: 2 · Retweets: 5 · Likes: 12 · Views: 2.5K

Paul Esau @ptesau
@alexcdot Another example of the "slop flood," @AishaKDown? NeurIPS may be almost over, but ICLR is at a critical point in its submission review pipeline for 2026.
Replies: 0 · Retweets: 0 · Likes: 0 · Views: 165

Alex Cui @alexcdot
There are a LOT more papers that need to be desk rejected at ICLR. Somehow, this hallucination wasn't caught. So, I went down a crazy rabbit hole and found 50 more (many are just as funny). We're approaching crisis levels. Paper titles included below 👇 gptzero.me/news/iclr-2026/
ICLR 2026 @iclr_conf

This paper has been desk rejected. LLM-generated papers that hallucinate references and do not report LLM usage will be desk rejected per ICLR policy (blog.iclr.cc/2025/08/26/pol…). Reviewers of other versions of this submission have been notified.

Replies: 19 · Retweets: 48 · Likes: 408 · Views: 178.3K