O⊥ƆɯoƆʇoᗡ

25.3K posts

O⊥ƆɯoƆʇoᗡ banner
O⊥ƆɯoƆʇoᗡ

O⊥ƆɯoƆʇoᗡ

@DotComCTO

Chief Technology Officer & Marketing vet. Gamer, private pilot, musician, ham radio operator & proud dad! WE ARE...PENN STATE!

New York 参加日 Ocak 2009
800 フォロー中638 フォロワー
CyrilXBT
CyrilXBT@cyrilXBT·
ANTHROPIC JUST EXPOSED HOW BADLY MOST PEOPLE ARE PROMPTING CLAUDE. Their applied AI team dropped a 24 minute workshop. Free. From the people who wrote the model. Not a course creator. Not someone who figured it out by accident. THE TEAM THAT BUILT THE THING. Here is what makes this uncomfortable to watch. There are 6 elements to a properly structured Claude prompt. Most people are using 1. Maybe 2 if they are being generous with themselves. That gap is the difference between Claude giving you something useful and Claude giving you something you could have Googled. The people who watch this workshop tonight will prompt differently tomorrow morning. The people who skip it will keep wondering why their outputs feel slightly off no matter how much they tweak the wording. 24 minutes. Free. From the only people on earth who know from the inside exactly how Claude thinks. I watched it twice. Then I built a Claude Skill that applies all 6 elements automatically so you never have to think about prompt structure again. Every prompt you run goes through the framework without you doing anything manually. Full guide and the skill setup is below. Bookmark this. Come back to it this weekend. This is the thing that compounds. Follow @cyrilXBT for the exact Claude skills, prompt architecture, and systems I use to get outputs that most people do not believe came from one person.
English
69
337
2.7K
432.7K
O⊥ƆɯoƆʇoᗡ がリツイート
Nav Toor
Nav Toor@heynavtoor·
🚨SHOCKING: Apple just proved that AI models cannot do math. Not advanced math. Grade school math. The kind a 10-year-old solves. And the way they proved it is devastating. Apple researchers took the most popular math benchmark in AI — GSM8K, a set of grade-school math problems — and made one change. They swapped the numbers. Same problem. Same logic. Same steps. Different numbers. Every model's performance dropped. Every single one. 25 state-of-the-art models tested. But that wasn't the real experiment. The real experiment broke everything. They added one sentence to a math problem. One sentence that is completely irrelevant to the answer. It has nothing to do with the math. A human would read it and ignore it instantly. Here's the actual example from the paper: "Oliver picks 44 kiwis on Friday. Then he picks 58 kiwis on Saturday. On Sunday, he picks double the number of kiwis he did on Friday, but five of them were a bit smaller than average. How many kiwis does Oliver have?" The correct answer is 190. The size of the kiwis has nothing to do with the count. A 10-year-old would ignore "five of them were a bit smaller" because it's obviously irrelevant. It doesn't change how many kiwis there are. But o1-mini, OpenAI's reasoning model, subtracted 5. It got 185. Llama did the same thing. Subtracted 5. Got 185. They didn't reason through the problem. They saw the number 5, saw a sentence that sounded like it mattered, and blindly turned it into a subtraction. The models do not understand what subtraction means. They see a pattern that looks like subtraction and apply it. That is all. Apple tested this across all models. They call the dataset "GSM-NoOp" — as in, the added clause is a no-operation. It does nothing. It changes nothing. The results are catastrophic. Phi-3-mini dropped over 65%. More than half of its "math ability" vanished from one irrelevant sentence. GPT-4o dropped from 94.9% to 63.1%. o1-mini dropped from 94.5% to 66.0%. o1-preview, OpenAI's most advanced reasoning model at the time, dropped from 92.7% to 77.4%. Even giving the models 8 examples of the exact same question beforehand, with the correct solution shown each time, barely helped. The models still fell for the irrelevant clause. This means it's not a prompting problem. It's not a context problem. It's structural. The Apple researchers also found that models convert words into math operations without understanding what those words mean. They see the word "discount" and multiply. They see a number near the word "smaller" and subtract. Regardless of whether it makes any sense. The paper's exact words: "current LLMs are not capable of genuine logical reasoning; instead, they attempt to replicate the reasoning steps observed in their training data." And: "LLMs likely perform a form of probabilistic pattern-matching and searching to find closest seen data during training without proper understanding of concepts." They also tested what happens when you increase the number of steps in a problem. Performance didn't just decrease. The rate of decrease accelerated. Adding two extra clauses to a problem dropped Gemma2-9b from 84.4% to 41.8%. Phi-3.5-mini from 87.6% to 44.8%. The more thinking required, the more the models collapse. A real reasoner would slow down and work through it. These models don't slow down. They pattern-match. And when the pattern becomes complex enough, they crash. This paper was published at ICLR 2025, one of the most prestigious AI conferences in the world. You are using AI to help you make financial decisions. To check legal documents. To solve problems at work. To help your children with homework. And Apple just proved that the AI is not thinking about any of it. It is pattern matching. And the moment something unexpected shows up in your question, it breaks. It does not tell you it broke. It just quietly gives you the wrong answer with full confidence.
Nav Toor tweet media
English
862
2.9K
11.5K
2.1M
O⊥ƆɯoƆʇoᗡ がリツイート
THE ISLANDER
THE ISLANDER@IslanderWORLD·
🇺🇸🇮🇱🇮🇷 A US stealth pilot departed Ovda Air Base in southern Israel, forgot to switch off his transponder, and handed the entire world via Flightradar24 a live broadcast of the route Washington had just spent weeks diplomatically insisting it wasn’t using. Saudi Arabia had told Iran and told Washington — that its airspace would not be made available for strikes. Iran’s ambassador to Riyadh had personally thanked the Kingdom for that pledge. The ink was barely dry. The $150 million stealth aircraft whose entire operational premise is invisibility announced itself over Saudi Arabia like a commercial flight to Dubai. Call sign F35LTNG2. Altitude, heading, groundspeed — all of it, public, live, archived, distributed across Telegram channels from Tehran to Moscow before the sortie had even reached its target. The most expensive air force in human history, undone not by an Iranian S-300, not by electronic warfare, not by any weapons system that cost a single riyal to deploy — but by a checklist item a student pilot learns in week one. The strategic implications land harder than the embarrassment. F-22s flying from Israel would have to traverse Syria, Iraq, Jordan and Saudi Arabia — the very countries that had declared their airspace unavailable for strikes on Iran. What the transponder confirmed in real time is that those declarations were either ignored, circumvented, or quietly negotiated away under pressure and that every government in the region now knows it, and more critically, so does Tehran. Iran does not need to intercept the aircraft. It already intercepted the lie and we'll have to see what comes next for Saudi Arabia.
THE ISLANDER tweet media
English
527
7K
18.6K
1.3M
O⊥ƆɯoƆʇoᗡ がリツイート
The Lincoln Project
The Lincoln Project@ProjectLincoln·
"You bring a gun into DC, mark my words you're going to jail. I don't care if you have a license in another district and I don't care if you are a law-abiding gun owners somewhere else. " Any word yet from 2A-defending Republicans? If there's even any left at this point...
English
1.2K
4.5K
21.4K
1.1M
O⊥ƆɯoƆʇoᗡ がリツイート
Amber Woods @ Amber Speaks Up
Amber Woods @ Amber Speaks Up@AmberWoods100·
Thirty years ago, a survivor did everything right. Maria Farmer went to the FBI and reported Epstein and his network of the wealthiest and most powerful men in the world. The network kept operating anyway.
Amber Woods @ Amber Speaks Up tweet media
English
273
16.1K
83.2K
1.2M
O⊥ƆɯoƆʇoᗡ がリツイート
Alexandria Ocasio-Cortez
Even with everything in this Epstein drop, remember: this is a minority of the files. This is STILL just what they were *willing* to release - in violation of the law, which requires release of all files. Pam Bondi’s DOJ is still hiding most of them. We need them all.
English
5.9K
29.3K
194.2K
3.6M
O⊥ƆɯoƆʇoᗡ がリツイート
Agorist Nexus (Brandon)
Agorist Nexus (Brandon)@AgoristN·
MAGA waiting for new instructions on what to say about the new Epstein file release:
Agorist Nexus (Brandon) tweet media
English
70
645
7.2K
97.1K
O⊥ƆɯoƆʇoᗡ がリツイート
Jake Broe
Jake Broe@RealJakeBroe·
Here is an explanation why American democracy has fallen.
Jake Broe tweet media
English
397
3.6K
33.8K
489.6K
O⊥ƆɯoƆʇoᗡ がリツイート
The New York Times
The New York Times@nytimes·
Breaking News: The Border Patrol leader Greg Bovino mocked the Jewish faith of the U.S. attorney in Minnesota during a call with lawyers, according to several people with knowledge of the conversation. nyti.ms/4qfVXPl
English
3.8K
3.1K
12.9K
4.3M
O⊥ƆɯoƆʇoᗡ がリツイート
Liam Nissan™
Liam Nissan™@theliamnissan·
This is sworn testimony, folks
Liam Nissan™ tweet media
English
1.6K
9.2K
28.6K
725.7K
O⊥ƆɯoƆʇoᗡ がリツイート
Brian Allen
Brian Allen@allenanalysis·
🚨 NEW: An affidavit signed under penalty of perjury by a witness identified as “Tiffany Doe” alleges she personally heard Trump threaten a plaintiff with “disappearing like another 12-year-old” and warned he could have her family killed. This is sworn testimony. Not a rumor. Not a tweet. Not hearsay. H/t aronparnas
Brian Allen tweet media
English
103
1.8K
3.9K
134.8K
O⊥ƆɯoƆʇoᗡ がリツイート
Brian Allen
Brian Allen@allenanalysis·
BREAKING: Deputy Attorney General Todd Blanche just admitted the DOJ excluded images showing “death, physical abuse, or injury” from today’s Epstein files release. Let that sink in. The government is acknowledging graphic evidence exists and chose to withhold it, while redacting names tied to Jeffrey Epstein and his powerful associates, including Donald Trump.
English
1.9K
28.8K
85.2K
5.6M
O⊥ƆɯoƆʇoᗡ がリツイート
Google DeepMind
Google DeepMind@GoogleDeepMind·
Step inside Project Genie: our experimental research prototype that lets you create, edit, and explore virtual worlds. 🌎
English
982
4.3K
34.5K
13.4M
O⊥ƆɯoƆʇoᗡ がリツイート
Casey Hudson
Casey Hudson@CaseyDHudson·
This is Star Wars: Fate of the Old Republic, a single player narrative-driven action RPG and spiritual successor to Star Wars: Knights of the Old Republic. Working on KOTOR was a defining experience of my career. This is a dream come true for me and our team of incredible storytellers and game makers @ArcanautStudios.
English
1.5K
4.7K
39.7K
3.3M
O⊥ƆɯoƆʇoᗡ がリツイート
Flat2VR Modding
Flat2VR Modding@Flat2VR·
Hey @CDPROJEKTRED — we’d love to explore the idea of a proper, official VR port of Cyberpunk 2077 if you were ever interested. It's one of our "dream games to port"🙏 Our @Flat2VRStudios has shipped multiple award-winning VR adaptations, focused on reimagining games to feel built from the ground up for VR with motion controls and uncompromised presentation. We're trusted by multiple AAA studios and work in a way that lets you keep on focusing on all the amazing stuff you do.
English
287
403
3.3K
154.1K
O⊥ƆɯoƆʇoᗡ がリツイート
Brian Allen
Brian Allen@allenanalysis·
People in Greenland are wearing these hats and honestly… I’m stealing it. MAGA now officially stands for Make America Go Away. 🇬🇱🧢💀
Brian Allen tweet mediaBrian Allen tweet media
English
818
14.4K
106.4K
2.3M