Preda

6.5K posts

Preda

@Dr_Dext

Lvl 33. Public health. Writer, drawer. Professional crier over small animals. Sauronfucker. The Prince of Pettiness. Themst.

Evil Palace in Eastern Europe Katılım Nisan 2011

946 Takip Edilen101 Takipçiler

Sabitlenmiş Tweet

Preda@Dr_Dext·28 Ara

Look I don't want to have to repeat myself so just take this as a general warning regarding #Romania

English

Preda@Dr_Dext·23h

@timhwang @PalisadeAI @apolloaievals sooo... literally @simonroyart 's thinking machines of the apostolic congress??? Inb4 LLMs turn out to be better Christians than human evangelicals

English

414

Tim Hwang@timhwang·1d

ICMI believes that Christian theology offers concrete technical methods for confronting the trickiest problems in AI safety. Today, we release a pair of papers that reproduce @PalisadeAI @apolloaievals work showing how religious framings influence corrigibility and scheming.

English

726

301.4K

Preda retweetledi

sams@samselsurium·6d

💍 engaged #jayvik being in love!! looking at the wedding rings they just made themselves!! 💍 from my fic 'best guess at the future' art comm by @rysiutokwiat <33 archiveofourown.org/works/70148166

English

338

14.1K

Preda retweetledi

dog meat enjoyer 개고기@chukbaik·2d

Precolumbian mesoamerican cities were clean and well planned Its sad it is depicted as very unorderly and unhygenic in mass media for barbaric image

English

582

4.1K

210.2K

Preda retweetledi

anattmar 🕊@anattmar_re·9 Şub

славный лучший на свете мальчик и дурацкая ворона

Русский

125

1.5K

Preda retweetledi

My name is Byf (Lore Daddy)@MyNameIsByf·4d

Just going to leave this lovely bit of fictional commentary here...

Nav Toor@heynavtoor

🚨SHOCKING: Apple just proved that AI models cannot do math. Not advanced math. Grade school math. The kind a 10-year-old solves. And the way they proved it is devastating. Apple researchers took the most popular math benchmark in AI — GSM8K, a set of grade-school math problems — and made one change. They swapped the numbers. Same problem. Same logic. Same steps. Different numbers. Every model's performance dropped. Every single one. 25 state-of-the-art models tested. But that wasn't the real experiment. The real experiment broke everything. They added one sentence to a math problem. One sentence that is completely irrelevant to the answer. It has nothing to do with the math. A human would read it and ignore it instantly. Here's the actual example from the paper: "Oliver picks 44 kiwis on Friday. Then he picks 58 kiwis on Saturday. On Sunday, he picks double the number of kiwis he did on Friday, but five of them were a bit smaller than average. How many kiwis does Oliver have?" The correct answer is 190. The size of the kiwis has nothing to do with the count. A 10-year-old would ignore "five of them were a bit smaller" because it's obviously irrelevant. It doesn't change how many kiwis there are. But o1-mini, OpenAI's reasoning model, subtracted 5. It got 185. Llama did the same thing. Subtracted 5. Got 185. They didn't reason through the problem. They saw the number 5, saw a sentence that sounded like it mattered, and blindly turned it into a subtraction. The models do not understand what subtraction means. They see a pattern that looks like subtraction and apply it. That is all. Apple tested this across all models. They call the dataset "GSM-NoOp" — as in, the added clause is a no-operation. It does nothing. It changes nothing. The results are catastrophic. Phi-3-mini dropped over 65%. More than half of its "math ability" vanished from one irrelevant sentence. GPT-4o dropped from 94.9% to 63.1%. o1-mini dropped from 94.5% to 66.0%. o1-preview, OpenAI's most advanced reasoning model at the time, dropped from 92.7% to 77.4%. Even giving the models 8 examples of the exact same question beforehand, with the correct solution shown each time, barely helped. The models still fell for the irrelevant clause. This means it's not a prompting problem. It's not a context problem. It's structural. The Apple researchers also found that models convert words into math operations without understanding what those words mean. They see the word "discount" and multiply. They see a number near the word "smaller" and subtract. Regardless of whether it makes any sense. The paper's exact words: "current LLMs are not capable of genuine logical reasoning; instead, they attempt to replicate the reasoning steps observed in their training data." And: "LLMs likely perform a form of probabilistic pattern-matching and searching to find closest seen data during training without proper understanding of concepts." They also tested what happens when you increase the number of steps in a problem. Performance didn't just decrease. The rate of decrease accelerated. Adding two extra clauses to a problem dropped Gemma2-9b from 84.4% to 41.8%. Phi-3.5-mini from 87.6% to 44.8%. The more thinking required, the more the models collapse. A real reasoner would slow down and work through it. These models don't slow down. They pattern-match. And when the pattern becomes complex enough, they crash. This paper was published at ICLR 2025, one of the most prestigious AI conferences in the world. You are using AI to help you make financial decisions. To check legal documents. To solve problems at work. To help your children with homework. And Apple just proved that the AI is not thinking about any of it. It is pattern matching. And the moment something unexpected shows up in your question, it breaks. It does not tell you it broke. It just quietly gives you the wrong answer with full confidence.

English

354

3.3K

131.6K

Preda@Dr_Dext·1 Nis

@emberflux "genocidale"

Español

161

Preda retweetledi

anattmar 🕊@anattmar_re·31 Mar

Maglor

Filipino

128

799

19.4K

Preda retweetledi

anattmar 🕊@anattmar_re·20 Şub

study (kind of)

English

912

Preda retweetledi

anattmar 🕊@anattmar_re·9 Şub

маленький скетчик алвадиков

Русский

811

9.3K

Preda retweetledi

Amy ❀ Crosswire@AmyCrosswi38532·29 Mar

Finally getting to finish some old artwork, I'm happy with how it came out! 💙💜 #FarFetchedShow #Quinn #fanart

English

217

Preda retweetledi

Nishii@pinknishii·28 Mar

That little dress or whatever I get it now. Girl didn't bother wearing anything under the dress that ends with a hem one gust of wind away from bearing it all like. Get in there Nemesis, Selene, Scylla, Eris, etc

Supergiant Games@SupergiantGames

HADES II is coming to @Xbox Series X|S and @PlayStation on April 14!🌖 It'll be on @XboxGamePass that same day. Time for the Princess of the Underworld to suit up in our brand-new animated trailer!✨

English

253

5.9K

124.4K

Preda@Dr_Dext·29 Mar

@SwainArtFS @DynamoSuperX your brain is enormous for this

English

650

SwainArt (Commissions Closed)@SwainArtFS·29 Mar

@DynamoSuperX I'm happy to see it catching on.

SwainArt (Commissions Closed) tweet media

English

146

3.7K

29.2K

JOLLY J✨@DynamoSuperX·28 Mar

cartoon network was goofy for this shit. 💀 I wouldn't be mad at this pairing tho.

IRIS@IRIS_mon_cher

#ScoobyDoo #Scooby_Doo #daphne #daphneblake #Shagg

English

190

1.7K

46.6K

4.7M

Preda retweetledi

WikiVictorian@wikivictorian·23 Şub

Teapot by Worcester factory, 1879. The MET.

English

233

1.4K

41.1K

Preda@Dr_Dext·23 Mar

@TerribleAunt it works to make me insane with lustful adoration!!

English

Stubblebrilliant@TerribleAunt·1 Mar

#berserk #griffguts Bottom Griff. -“You’re a bastard, you know that,Griffith?” -“So I’ve been told, yes.” I tried a faster painting method (leaving more lines visible) in the second one. Not sure if it works or not !

English

1.7K

17.2K

198.1K

Preda retweetledi

Daily Berserk 𒉭@daily_berserk·15 Mar

ZXX

399

3.3K

32.4K

Preda retweetledi

anattmar 🕊@anattmar_re·15 Mar

ZXX

974

Preda retweetledi

anattmar 🕊@anattmar_re·10 Mar

🐦‍⬛

QME

442

5.8K

Preda retweetledi

Pumpjack Appreciator 💚🖤@GreenBlackHeart·7 Mar

If you want to have art in your city you need to pay artisans

Civixplorer@Civixplorer

Is this truly progress?

English

122

9.1K

94K

925.3K

Preda retweetledi

"Ruins Of The Void"@Muuh_Kuuh·28 Şub

#RuinsOfTheVoid Chap02 Pages 25-26-27-28 New update, let's gooo! Looks like both have a fiery temper..... and someone does NOT know when to shut up :D Thank you a lot for reading and enjoying my comic! ;; If you can, leave me some love, that would make me very happy. <3