Prithal Bhardwaj

511 posts

@NotesByPrithal

AI tools. Startup ideas. Projects I build. Sharing everything I learn along the way. Creator @TheSoloEntrepreneur (25K+)

Bengaluru, India · Joined February 2023
104 Following · 50 Followers
George Pu @TheGeorgePu
$50 billion in damages. 110 years in prison. Combined. Guess what they have in common. Forbes 30 Under 30. Someone finally made the tracker.
8
8
120
11.1K
Jordan @nolansym
Free + open source shadcn components
35
116
2.1K
3.2M
Ethan Mollick @emollick
The AI labs have actually done a bad job explaining what the future they are building towards will actually look like for most of us. Even “Machines of Loving Grace” has very few well-articulated visions of what Anthropic hopes life will be like if they succeed at their goals.
92
35
562
62.9K
Dr Singularity @Dr_Singularity
I feel like we’re all underreacting to something that’s about to change everything.
55
28
272
7.7K
signüll @signulll
guys, it is absolutely insane going through the claude code leaked repo by using claude code… i have spent $200 in the last hour asking claude code about claude code & i have learned so much… absolutely incredible. what a time to be alive (at scale).
88
58
3.2K
226.5K
TRAE @Trae_ai
Introducing the new SOLO: now on Desktop and Web. You define the task, review the results, and SOLO handles the rest. SOLO is in beta, with limited-time, free access via invite codes.
267
212
1.4K
9.8M
Chris Albon @chrisalbon
Too many people build something amazing with AI and then they describe it like “I built this! Well…… Claude built it actually” No, NO. NO. You built it. Claude or Gemini or ChatGPT was the hammer but you were the stone mason. Own that shit.
29
3
86
3.3K
Prithal Bhardwaj @NotesByPrithal
@davj Really makes you think. Appreciate the insight.
0
0
1
82
Guri Singh @heygurisingh
Holy shit... Stanford just proved that GPT-5, Gemini, and Claude can't actually see. They removed every image from 6 major vision benchmarks. The models still scored 70-80% accuracy. They were never looking at your photos. Your scans. Your X-rays. Here's what's really going on: ↓
The paper is called MIRAGE. Co-authored by Fei-Fei Li. They tested GPT-5.1, Gemini-3-Pro, Claude Opus 4.5, and Gemini-2.5-Pro across 6 benchmarks -- medical and general. Then silently removed every image. No warning. No prompt change. The models didn't even notice. They kept describing images in detail. Diagnosing conditions. Writing full reasoning traces. From images that were never there.
Stanford calls it the "mirage effect." Not hallucination. Something worse. Hallucination = making up wrong details about a real input. Mirage = constructing an entire fake reality and reasoning from it confidently. The models built imaginary X-rays, described fake nodules, and diagnosed conditions -- all from text patterns alone.
But that's not the scary part. They trained a "super-guesser" -- a tiny 3B parameter text-only model. Zero vision capability. Fine-tuned it on the largest chest X-ray benchmark (696,000 questions). Images removed. It beat GPT-5. It beat Gemini. It beat Claude. It beat actual radiologists. Ranked #1 on the held-out test set. Without ever seeing a single X-ray. The reasoning traces? Indistinguishable from real visual analysis.
Now here's what should terrify you: When the models fake-see medical images, their mirage diagnoses are heavily biased toward the most dangerous conditions. STEMI. Melanoma. Carcinoma. Life-threatening diagnoses -- from images that don't exist. 230 million people ask health questions on ChatGPT every day.
They also found something wild:
→ Tell a model "there's no image, just guess" -- performance drops
→ Silently remove the image and let it assume it's there -- performance stays high
The model enters "mirage mode." It doesn't know it can't see. And it performs BETTER when it doesn't know it's blind.
When Stanford applied their cleanup method (B-Clean) to existing benchmarks, it removed 74-77% of all questions. Three-quarters of "vision" benchmarks don't test vision. Every leaderboard. Every "multimodal breakthrough." Every benchmark score you've seen this year. Built on mirages.
Code is open-sourced. Paper is live on arXiv. If you're building anything with multimodal AI -- especially in healthcare -- read this paper before you ship. (Link in the comments)
266
932
4.4K
616.1K
Prithal Bhardwaj @NotesByPrithal
@davj Shared this with my network, very valuable.
0
0
1
389
Elon Musk @elonmusk
@Kekius_Sage The universe would be even stranger if it didn’t
645
256
3.8K
231.6K
Kekius Maximus @Kekius_Sage
Why does everything in the universe spin?
1.2K
157
2.1K
267.3K