Skeptical Economist, prof. Collegium Mesozoicum
30.8K posts

Skeptical Economist, prof. Collegium Mesozoicum
@postecon
Fill your heart with love today Don't play the game of time Things that happened in the past Only happened in your mind Oh, forget your mind And you'll be free



Dużym problemem jest też histeryczna i nieuzasadniona narracja dziennikarzy i mediów w zakresie stabilności finansów publicznych i skutków podnoszenia opodatkowania

Holy shit... Stanford just proved that GPT-5, Gemini, and Claude can't actually see. They removed every image from 6 major vision benchmarks. The models still scored 70-80% accuracy. They were never looking at your photos. Your scans. Your X-rays. Here's what's really going on: ↓ The paper is called MIRAGE. Co-authored by Fei-Fei Li. They tested GPT-5.1, Gemini-3-Pro, Claude Opus 4.5, and Gemini-2.5-Pro across 6 benchmarks -- medical and general. Then silently removed every image. No warning. No prompt change. The models didn't even notice. They kept describing images in detail. Diagnosing conditions. Writing full reasoning traces. From images that were never there. Stanford calls it the "mirage effect." Not hallucination. Something worse. Hallucination = making up wrong details about a real input. Mirage = constructing an entire fake reality and reasoning from it confidently. The models built imaginary X-rays, described fake nodules, and diagnosed conditions -- all from text patterns alone. But that's not the scary part. They trained a "super-guesser" -- a tiny 3B parameter text-only model. Zero vision capability. Fine-tuned it on the largest chest X-ray benchmark (696,000 questions). Images removed. It beat GPT-5. It beat Gemini. It beat Claude. It beat actual radiologists. Ranked #1 on the held-out test set. Without ever seeing a single X-ray. The reasoning traces? Indistinguishable from real visual analysis. Now here's what should terrify you: When the models fake-see medical images, their mirage diagnoses are heavily biased toward the most dangerous conditions. STEMI. Melanoma. Carcinoma. Life-threatening diagnoses -- from images that don't exist. 230 million people ask health questions on ChatGPT every day. They also found something wild: → Tell a model "there's no image, just guess" -- performance drops → Silently remove the image and let it assume it's there -- performance stays high The model enters "mirage mode." It doesn't know it can't see. And it performs BETTER when it doesn't know it's blind. When Stanford applied their cleanup method (B-Clean) to existing benchmarks, it removed 74-77% of all questions. Three-quarters of "vision" benchmarks don't test vision. Every leaderboard. Every "multimodal breakthrough." Every benchmark score you've seen this year. Built on mirages. Code is open-sourced. Paper is live on arXiv. If you're building anything with multimodal AI -- especially in healthcare -- read this paper before you ship. (Link in the comments)

JD Vance on UFOs: “I Think They’re Demons” “Every great world religion including Christianity, the one I believe in, understood there are weird things out there that are very difficult to explain… I think one of The Devil’s great tricks is to convince people he never existed.”

Według Washington Post Pentagon przygotowuje się do tygodni operacji lądowych w Iranie. Nie ma przy tym mowy o pełnoskalowej inwazji.

My favourite thing about Poland is that you don’t address strangers as “you.” You say Pan, Pani, Państwo (Mr/Mrs/+this plural I can’t translate): formal address is built into the grammar. Even in a shop, you’d say “Czy Państwo mają…” not “do you have…” and it isn’t performative politeness but actually structural respect. There is no casual “you” for someone you haven’t been invited to be familiar with. When I do this, people often rush to correct me or rather announce familiarity. “Oh, don’t call me Madame, call me Catherine.” And I’ll still address them formally until they give me clear permission to stop or until I decide I’m familiar and done with the formal. Pure elegance. The kind that assumes every stranger deserves dignity before they’ve earned familiarity. The West abolished formality for uhhh friendliness. Poland kept it bc respect.

@Int_Wydarzenia @donaldtusk @mblaszczak Płaszczak tak lobbuje za przemysłem USA, a pluje na polską zbrojeniówkę, że tu już powinny działać służby antykoncepcyjne i wszelkie inne. Zagadka, kto przyjął 100 mln zł łapówki od Korei za zakup uzbrojenia też wciąż pozostaje zagadką.

byłem dziś na nowym osiedlu na Żeraniu i macie kurwa osobiście przeprosić wszystkich architektów blokowisk i Edwarda Gierka


Jakie trzeba mieć ego by napisać o sobie notkę w wikipedii?;) pl.wikipedia.org/wiki/Jan_Olesz…




