r⬡bAIR

8K posts

r⬡bAIR banner
r⬡bAIR

r⬡bAIR

@robAIRio

permabull 1000 procent talent

België شامل ہوئے Mayıs 2012
2.8K فالونگ509 فالوورز
Easy
Easy@NotSoEasyMoney·
HOW IS NOBODY TALKING ABOUT THIS!?!?!?!? THEY PRINTED 5 BILLION OF THEIR OWN TOKENS THEN WITHDREW IT AS USDC!?!?!?!??!
Easy tweet media
English
992
4.5K
15.2K
3.8M
r⬡bAIR
r⬡bAIR@robAIRio·
@heynavtoor Maybe Ai knows we would not sell those small kiwis
English
0
0
0
544
Nav Toor
Nav Toor@heynavtoor·
🚨SHOCKING: Apple just proved that AI models cannot do math. Not advanced math. Grade school math. The kind a 10-year-old solves. And the way they proved it is devastating. Apple researchers took the most popular math benchmark in AI — GSM8K, a set of grade-school math problems — and made one change. They swapped the numbers. Same problem. Same logic. Same steps. Different numbers. Every model's performance dropped. Every single one. 25 state-of-the-art models tested. But that wasn't the real experiment. The real experiment broke everything. They added one sentence to a math problem. One sentence that is completely irrelevant to the answer. It has nothing to do with the math. A human would read it and ignore it instantly. Here's the actual example from the paper: "Oliver picks 44 kiwis on Friday. Then he picks 58 kiwis on Saturday. On Sunday, he picks double the number of kiwis he did on Friday, but five of them were a bit smaller than average. How many kiwis does Oliver have?" The correct answer is 190. The size of the kiwis has nothing to do with the count. A 10-year-old would ignore "five of them were a bit smaller" because it's obviously irrelevant. It doesn't change how many kiwis there are. But o1-mini, OpenAI's reasoning model, subtracted 5. It got 185. Llama did the same thing. Subtracted 5. Got 185. They didn't reason through the problem. They saw the number 5, saw a sentence that sounded like it mattered, and blindly turned it into a subtraction. The models do not understand what subtraction means. They see a pattern that looks like subtraction and apply it. That is all. Apple tested this across all models. They call the dataset "GSM-NoOp" — as in, the added clause is a no-operation. It does nothing. It changes nothing. The results are catastrophic. Phi-3-mini dropped over 65%. More than half of its "math ability" vanished from one irrelevant sentence. GPT-4o dropped from 94.9% to 63.1%. o1-mini dropped from 94.5% to 66.0%. o1-preview, OpenAI's most advanced reasoning model at the time, dropped from 92.7% to 77.4%. Even giving the models 8 examples of the exact same question beforehand, with the correct solution shown each time, barely helped. The models still fell for the irrelevant clause. This means it's not a prompting problem. It's not a context problem. It's structural. The Apple researchers also found that models convert words into math operations without understanding what those words mean. They see the word "discount" and multiply. They see a number near the word "smaller" and subtract. Regardless of whether it makes any sense. The paper's exact words: "current LLMs are not capable of genuine logical reasoning; instead, they attempt to replicate the reasoning steps observed in their training data." And: "LLMs likely perform a form of probabilistic pattern-matching and searching to find closest seen data during training without proper understanding of concepts." They also tested what happens when you increase the number of steps in a problem. Performance didn't just decrease. The rate of decrease accelerated. Adding two extra clauses to a problem dropped Gemma2-9b from 84.4% to 41.8%. Phi-3.5-mini from 87.6% to 44.8%. The more thinking required, the more the models collapse. A real reasoner would slow down and work through it. These models don't slow down. They pattern-match. And when the pattern becomes complex enough, they crash. This paper was published at ICLR 2025, one of the most prestigious AI conferences in the world. You are using AI to help you make financial decisions. To check legal documents. To solve problems at work. To help your children with homework. And Apple just proved that the AI is not thinking about any of it. It is pattern matching. And the moment something unexpected shows up in your question, it breaks. It does not tell you it broke. It just quietly gives you the wrong answer with full confidence.
Nav Toor tweet media
English
863
2.9K
11.5K
2.1M
Ash Crypto
Ash Crypto@AshCrypto·
🇯🇵 Japan's 10Y bond yield has reached its highest level in 29 years. You know what's coming next.
Ash Crypto tweet media
English
205
211
1.4K
184.4K
Latest in space
Latest in space@latestinspace·
NEWS 🚨: Astronomers say the surge of large fireball events worldwide recently "warrants serious investigation"' About a dozen of the biggest are all coming from the same place in space (via American Meteor Society)
Latest in space tweet mediaLatest in space tweet media
English
290
923
6.5K
869.4K
r⬡bAIR
r⬡bAIR@robAIRio·
@bittybitbit86 Probably for his own good “I need to go back there” relax bro. Reminds me of obelix
English
0
0
0
12
Magus
Magus@TraderMagus·
There's $69 trillion in long liquidations at 48k
Magus tweet media
English
251
135
1.7K
270.8K
China pulse 🇨🇳
China pulse 🇨🇳@Eng_china5·
Unitree Robotics robot shooting test. It feels like it was generated by AI. It’s terrifying… in the future, wars might not need humans anymore
English
1.8K
2.5K
11K
1.6M
r⬡bAIR
r⬡bAIR@robAIRio·
@Osint613 This time russia and china agreed to a mutual defence pact
English
0
0
0
13
Open Source Intel
Open Source Intel@Osint613·
With constant U.S. aerial refueling, Israeli F-35s could turn Tehran into Gaza. The last thing Iran wants to do is to poke Israel right now. And that’s exactly what they are doing.
Open Source Intel tweet media
English
185
267
3.1K
159.1K
r⬡bAIR
r⬡bAIR@robAIRio·
@sciencegirl Not weird at all they had the iMac and the iPod
English
0
0
0
98
Science girl
Science girl@sciencegirl·
The time a reporter accidentally named the iPhone before anyone knew it existed (2006)
English
234
1.9K
49.7K
3M
Elias 💫 Vandreax
Elias 💫 Vandreax@VandreaXinc·
Why does mainstream science largely dismiss the possibility of an advanced civilization being wiped out during the rapid warming of the Bølling-Allerød period (~14,700–12,900 years ago), even as evidence mounts for massive cataclysms like comet impacts, mega-floods, and abrupt climate shifts that could have erased coastal societies? Is it truly a lack of evidence, or something else?
English
6
0
5
897
Jay Anderson
Jay Anderson@TheProjectUnity·
🚨Space-Based Archeology Will DESTROY Historical Gatekeeping Forever! "With my technique what is the maximum depth we can achieve? Around 5 kilometres" - Prof. Filippo Biondi The Khafre Pyramid scans have demonstrated that the tools now exist to independently interrogate humanity’s deep past without institutional permission. SAR Doppler tomography, gravimetric sensing, muon imaging, hyperspectral analysis, and AI-driven anomaly detection together form an irreversible technological stack. As these systems mature, the ability to suppress inconvenient discoveries collapses through sheer transparency alone. Historical gatekeeping was a product of technological limitation. That limitation is now gone. What comes next is not the end of history, but it might be the end of who gets to decide it.
English
100
426
2.9K
120.7K
Michaël van de Poppe
Michaël van de Poppe@CryptoMichNL·
The Netherlands has gone insane. The government wants to tax unrealized gains on #Bitcoin from 2028 onwards. I simply don't understand why people are blindly accepting this and not going all-in to demonstrate against this particular law. The amount of tax being paid each year is going through the roof, and now the government wants more. Why? How so? Let's first try to solve the entire system and stop the billions and billions of euros being spent each year for nothing. The system is completely inefficient and broken. No wonder people are leaving the country, and to be fair, it's completely right to do so.
Michaël van de Poppe tweet media
English
291
276
1.6K
89.3K
Cloud
Cloud@Cloud1a7·
This is why you don’t stop.
English
2.7K
4.3K
72.3K
14.4M
Michael Hunt
Michael Hunt@StrandSam9·
@fabienprevots @Rothmus @grok Wrong. Those are from welding and grinding, all my work shirts used to look like that. I don’t weld or grind anymore, none of my shirts look like that anymore and I wear belts and buckles every day.
English
19
0
80
193.9K
Squiddy🦑
Squiddy🦑@Squiddyderpypy·
anyone still left in $S? hello?
English
83
4
189
9.4K
Melinda Richards 🇦🇺🇺🇸
Why do people want to come to a western country and create the same society that they fled from?
English
9K
8.2K
81.2K
3.2M