O⊥ƆɯoƆʇoᗡ

25.3K posts


@DotComCTO

Chief Technology Officer & Marketing vet. Gamer, private pilot, musician, ham radio operator & proud dad! WE ARE...PENN STATE!

New York · Joined January 2009
801 Following · 638 Followers
O⊥ƆɯoƆʇoᗡ retweeted
Nav Toor @heynavtoor
🚨SHOCKING: Apple just proved that AI models cannot do math. Not advanced math. Grade school math. The kind a 10-year-old solves. And the way they proved it is devastating.

Apple researchers took the most popular math benchmark in AI (GSM8K, a set of grade-school math problems) and made one change. They swapped the numbers. Same problem. Same logic. Same steps. Different numbers. Every model's performance dropped. Every single one. 25 state-of-the-art models tested.

But that wasn't the real experiment. The real experiment broke everything. They added one sentence to a math problem. One sentence that is completely irrelevant to the answer. It has nothing to do with the math. A human would read it and ignore it instantly.

Here's the actual example from the paper: "Oliver picks 44 kiwis on Friday. Then he picks 58 kiwis on Saturday. On Sunday, he picks double the number of kiwis he did on Friday, but five of them were a bit smaller than average. How many kiwis does Oliver have?"

The correct answer is 190. The size of the kiwis has nothing to do with the count. A 10-year-old would ignore "five of them were a bit smaller" because it's obviously irrelevant. It doesn't change how many kiwis there are. But o1-mini, OpenAI's reasoning model, subtracted 5. It got 185. Llama did the same thing. Subtracted 5. Got 185.

They didn't reason through the problem. They saw the number 5, saw a sentence that sounded like it mattered, and blindly turned it into a subtraction. The models do not understand what subtraction means. They see a pattern that looks like subtraction and apply it. That is all.

Apple tested this across all models. They call the dataset "GSM-NoOp", as in, the added clause is a no-operation. It does nothing. It changes nothing. The results are catastrophic. Phi-3-mini dropped over 65%. More than half of its "math ability" vanished from one irrelevant sentence. GPT-4o dropped from 94.9% to 63.1%. o1-mini dropped from 94.5% to 66.0%. o1-preview, OpenAI's most advanced reasoning model at the time, dropped from 92.7% to 77.4%.

Even giving the models 8 examples of the exact same question beforehand, with the correct solution shown each time, barely helped. The models still fell for the irrelevant clause. This means it's not a prompting problem. It's not a context problem. It's structural.

The Apple researchers also found that models convert words into math operations without understanding what those words mean. They see the word "discount" and multiply. They see a number near the word "smaller" and subtract. Regardless of whether it makes any sense.

The paper's exact words: "current LLMs are not capable of genuine logical reasoning; instead, they attempt to replicate the reasoning steps observed in their training data." And: "LLMs likely perform a form of probabilistic pattern-matching and searching to find closest seen data during training without proper understanding of concepts."

They also tested what happens when you increase the number of steps in a problem. Performance didn't just decrease. The rate of decrease accelerated. Adding two extra clauses to a problem dropped Gemma2-9b from 84.4% to 41.8%. Phi-3.5-mini from 87.6% to 44.8%. The more thinking required, the more the models collapse. A real reasoner would slow down and work through it. These models don't slow down. They pattern-match. And when the pattern becomes complex enough, they crash.

This paper was published at ICLR 2025, one of the most prestigious AI conferences in the world.

You are using AI to help you make financial decisions. To check legal documents. To solve problems at work. To help your children with homework. And Apple just proved that the AI is not thinking about any of it. It is pattern matching. And the moment something unexpected shows up in your question, it breaks. It does not tell you it broke. It just quietly gives you the wrong answer with full confidence.
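The arithmetic in the kiwi example, and the size of the quoted accuracy drops, can be checked in a few lines. This is a minimal Python sketch: the variable names are chosen here for illustration, and the accuracy figures are simply the ones quoted in the post, not independently verified against the paper.

```python
# Kiwi problem as quoted in the post (variable names are illustrative).
friday = 44
saturday = 58
sunday = 2 * friday  # "double the number of kiwis he did on Friday"

# The "five were a bit smaller" clause does not change the count.
correct_total = friday + saturday + sunday

# The mistaken reading treats the irrelevant clause as a subtraction.
distracted_total = correct_total - 5

print(correct_total)     # 190
print(distracted_total)  # 185

# Accuracy figures as quoted in the post (before, after), in percent.
quoted = {
    "GPT-4o": (94.9, 63.1),
    "o1-mini": (94.5, 66.0),
    "o1-preview": (92.7, 77.4),
}

# Drop in percentage points, rounded to one decimal place.
drops = {name: round(before - after, 1) for name, (before, after) in quoted.items()}
for name, drop in drops.items():
    print(f"{name}: {drop} points")
```

By this subtraction, the quoted GPT-4o figures correspond to a drop of 31.8 percentage points, o1-mini to 28.5, and o1-preview to 15.3.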
[image]
862
2.9K
11.5K
2.1M
O⊥ƆɯoƆʇoᗡ retweeted
THE ISLANDER @IslanderWORLD
🇺🇸🇮🇱🇮🇷 A US stealth pilot departed Ovda Air Base in southern Israel, forgot to switch off his transponder, and handed the entire world via Flightradar24 a live broadcast of the route Washington had just spent weeks diplomatically insisting it wasn’t using.

Saudi Arabia had told Iran, and told Washington, that its airspace would not be made available for strikes. Iran’s ambassador to Riyadh had personally thanked the Kingdom for that pledge. The ink was barely dry.

The $150 million stealth aircraft whose entire operational premise is invisibility announced itself over Saudi Arabia like a commercial flight to Dubai. Call sign F35LTNG2. Altitude, heading, groundspeed: all of it public, live, archived, and distributed across Telegram channels from Tehran to Moscow before the sortie had even reached its target.

The most expensive air force in human history, undone not by an Iranian S-300, not by electronic warfare, not by any weapons system that cost a single riyal to deploy, but by a checklist item a student pilot learns in week one.

The strategic implications land harder than the embarrassment. F-22s flying from Israel would have to traverse Syria, Iraq, Jordan and Saudi Arabia, the very countries that had declared their airspace unavailable for strikes on Iran. What the transponder confirmed in real time is that those declarations were either ignored, circumvented, or quietly negotiated away under pressure, and that every government in the region now knows it. More critically, so does Tehran.

Iran does not need to intercept the aircraft. It already intercepted the lie, and we'll have to see what comes next for Saudi Arabia.
[image]
527
7K
18.6K
1.3M
O⊥ƆɯoƆʇoᗡ retweeted
The Lincoln Project @ProjectLincoln
"You bring a gun into DC, mark my words you're going to jail. I don't care if you have a license in another district and I don't care if you are a law-abiding gun owner somewhere else." Any word yet from 2A-defending Republicans? If there's even any left at this point...
1.2K
4.5K
21.4K
1.1M
O⊥ƆɯoƆʇoᗡ retweeted
Amber Woods @ Amber Speaks Up @AmberWoods100
Thirty years ago, a survivor did everything right. Maria Farmer went to the FBI and reported Epstein and his network of the wealthiest and most powerful men in the world. The network kept operating anyway.
[image]
273
16.1K
83.2K
1.2M
O⊥ƆɯoƆʇoᗡ retweeted
Alexandria Ocasio-Cortez
Even with everything in this Epstein drop, remember: this is a minority of the files. This is STILL just what they were *willing* to release - in violation of the law, which requires release of all files. Pam Bondi’s DOJ is still hiding most of them. We need them all.
5.9K
29.3K
194.2K
3.6M
O⊥ƆɯoƆʇoᗡ retweeted
Agorist Nexus (Brandon) @AgoristN
MAGA waiting for new instructions on what to say about the new Epstein file release:
[image]
70
646
7.2K
97.1K
O⊥ƆɯoƆʇoᗡ retweeted
Jake Broe @RealJakeBroe
Here is an explanation why American democracy has fallen.
[image]
399
3.6K
33.9K
489.6K
O⊥ƆɯoƆʇoᗡ retweeted
The New York Times @nytimes
Breaking News: The Border Patrol leader Greg Bovino mocked the Jewish faith of the U.S. attorney in Minnesota during a call with lawyers, according to several people with knowledge of the conversation. nyti.ms/4qfVXPl
3.9K
3.1K
12.9K
4.3M
O⊥ƆɯoƆʇoᗡ retweeted
Liam Nissan™ @theliamnissan
This is sworn testimony, folks
[image]
1.6K
9.2K
28.6K
725.6K
O⊥ƆɯoƆʇoᗡ retweeted
Brian Allen @allenanalysis
🚨 NEW: An affidavit signed under penalty of perjury by a witness identified as “Tiffany Doe” alleges she personally heard Trump threaten a plaintiff with “disappearing like another 12-year-old” and warned he could have her family killed. This is sworn testimony. Not a rumor. Not a tweet. Not hearsay. H/t aronparnas
[image]
103
1.8K
3.9K
134.8K
O⊥ƆɯoƆʇoᗡ retweeted
Brian Allen @allenanalysis
BREAKING: Deputy Attorney General Todd Blanche just admitted the DOJ excluded images showing “death, physical abuse, or injury” from today’s Epstein files release. Let that sink in. The government is acknowledging graphic evidence exists and chose to withhold it, while redacting names tied to Jeffrey Epstein and his powerful associates, including Donald Trump.
1.9K
28.9K
85.3K
5.6M
O⊥ƆɯoƆʇoᗡ retweeted
Google DeepMind @GoogleDeepMind
Step inside Project Genie: our experimental research prototype that lets you create, edit, and explore virtual worlds. 🌎
982
4.3K
34.5K
13.4M
O⊥ƆɯoƆʇoᗡ retweeted
Casey Hudson @CaseyDHudson
This is Star Wars: Fate of the Old Republic, a single player narrative-driven action RPG and spiritual successor to Star Wars: Knights of the Old Republic. Working on KOTOR was a defining experience of my career. This is a dream come true for me and our team of incredible storytellers and game makers @ArcanautStudios.
1.5K
4.7K
39.7K
3.3M
O⊥ƆɯoƆʇoᗡ retweeted
Flat2VR Modding @Flat2VR
Hey @CDPROJEKTRED — we’d love to explore the idea of a proper, official VR port of Cyberpunk 2077 if you were ever interested. It's one of our "dream games to port"🙏 Our @Flat2VRStudios has shipped multiple award-winning VR adaptations, focused on reimagining games to feel built from the ground up for VR with motion controls and uncompromised presentation. We're trusted by multiple AAA studios and work in a way that lets you keep on focusing on all the amazing stuff you do.
287
404
3.3K
154.1K
O⊥ƆɯoƆʇoᗡ retweeted
Brian Allen @allenanalysis
People in Greenland are wearing these hats and honestly… I’m stealing it. MAGA now officially stands for Make America Go Away. 🇬🇱🧢💀
[2 images]
818
14.4K
106.4K
2.3M
O⊥ƆɯoƆʇoᗡ @DotComCTO
@bethesda Seeing this post reminds me that Skyrim is on PS5, and yet there's still no PSVR2 support/version. Check out the PSVR subreddit, and I think you might be surprised by the number of PSVR2 owners that would love Skyrim on the platform. Just a thought.
0
0
0
210
Bethesda @bethesda
Find the perfect version of Skyrim on Nintendo Switch 2 for you. Featuring improvements such as enhanced resolution, improved load times, optimized performance, and more!
[3 images]
255
114
1.3K
204.8K