Shanthi

7.4K posts

Shanthi banner
Shanthi

Shanthi

@ssc627

@MLB Research. @SyracuseU ‘20. Inquire within for baseball analogies. Inflammatory opinions my own. She/her

New York Katılım Ağustos 2016
684 Takip Edilen513 Takipçiler
Shanthi retweetledi
Ewan Morrison
Ewan Morrison@MrEwanMorrison·
The evidence is piling in now that ChatGPT and all the other large language models are on a developmental plateau and hallucinations will never stop. Not a pathway to exponential improvement or to human level intelligence. The data centre build is pointless.
Nav Toor@heynavtoor

🚨BREAKING: OpenAI published a paper proving that ChatGPT will always make things up. Not sometimes. Not until the next update. Always. They proved it with math. Even with perfect training data and unlimited computing power, AI models will still confidently tell you things that are completely false. This isn't a bug they're working on. It's baked into how these systems work at a fundamental level. And their own numbers are brutal. OpenAI's o1 reasoning model hallucinates 16% of the time. Their newer o3 model? 33%. Their newest o4-mini? 48%. Nearly half of what their most recent model tells you could be fabricated. The "smarter" models are actually getting worse at telling the truth. Here's why it can't be fixed. Language models work by predicting the next word based on probability. When they hit something uncertain, they don't pause. They don't flag it. They guess. And they guess with complete confidence, because that's exactly what they were trained to do. The researchers looked at the 10 biggest AI benchmarks used to measure how good these models are. 9 out of 10 give the same score for saying "I don't know" as for giving a completely wrong answer: zero points. The entire testing system literally punishes honesty and rewards guessing. So the AI learned the optimal strategy: always guess. Never admit uncertainty. Sound confident even when you're making it up. OpenAI's proposed fix? Have ChatGPT say "I don't know" when it's unsure. Their own math shows this would mean roughly 30% of your questions get no answer. Imagine asking ChatGPT something three times out of ten and getting "I'm not confident enough to respond." Users would leave overnight. So the fix exists, but it would kill the product. This isn't just OpenAI's problem. DeepMind and Tsinghua University independently reached the same conclusion. Three of the world's top AI labs, working separately, all agree: this is permanent. Every time ChatGPT gives you an answer, ask yourself: is this real, or is it just a confident guess?

English
120
900
4.9K
205.3K
Shanthi
Shanthi@ssc627·
there is literally already a child on the mound, the WBC is a wild ride
English
0
0
0
58
Shanthi retweetledi
JJ Cooper
JJ Cooper@jjcoop36·
If you have watched a pitch clock without a limit to throwing over, you know that you don't have a pitch clock. This was the rule in AAA for years. And the pitch clock might as well have not existed. Runner on? Throw over if the clock gets low. Or just step off.
HaloTerritory@HaloTerritory

"Three-batter minimum, get rid of it. Bigger bases, limiting throwovers, get rid of it. Runner on second base, get rid of it." Joe Maddon wants to "get the real game back" but does believe the pitch clock and PitchCom were necessary changes.

English
3
4
57
22.8K
Shanthi retweetledi
Jake Scott, MD
Jake Scott, MD@jakescottMD·
The point of calling a bombing campaign “not politically correct” is to make empathy itself illegitimate. Once mourning dead children is reframed as softness, there’s no floor. Anything can be justified as long as objecting to it can be painted as weakness.
Aaron Rupar@atrupar

Stephen Miller: "What you're seeing right now is a military under President Trump's leadership that's not fighting politically correct"

English
84
2K
7.9K
196.8K
Shanthi
Shanthi@ssc627·
update: he does
Shanthi tweet media
English
0
0
0
28
Shanthi retweetledi
Adam Gaffney
Adam Gaffney@awgaffney·
There is more out-in-the-open racism in US society today than I can recall in my adult life. This is a US Congressman.
Adam Gaffney tweet media
English
51
171
1.2K
46.8K
Shanthi retweetledi
awlivv
awlivv@awlivv69·
“i asked chat gpt” have you tried asking literally anyone else
English
38
5.5K
36.2K
1.5M
Shanthi retweetledi
Cameron 🇺🇸 🗽🦅
Cameron 🇺🇸 🗽🦅@CameronCorduroy·
have you ever seen the New York Times quote Trump word for word like this before? the double standard is insufferable
Cameron 🇺🇸 🗽🦅 tweet media
English
261
750
11.3K
182K