Justin Lewis

3.5K posts

Justin Lewis banner
Justin Lewis

Justin Lewis

@JustinLewis1977

Hiking, skiing, climbing

San Carlos, CA Katılım Mart 2012
230 Takip Edilen227 Takipçiler
Justin Lewis
Justin Lewis@JustinLewis1977·
@heygurisingh This is ridiculous. If I give it an image it can describe it in detail. It will extract text. Check homework, and recite both the problems, and describe the work. Seems unlikely it's always just guessing correctly.
English
0
0
0
93
Guri Singh
Guri Singh@heygurisingh·
Holy shit... Stanford just proved that GPT-5, Gemini, and Claude can't actually see. They removed every image from 6 major vision benchmarks. The models still scored 70-80% accuracy. They were never looking at your photos. Your scans. Your X-rays. Here's what's really going on: ↓ The paper is called MIRAGE. Co-authored by Fei-Fei Li. They tested GPT-5.1, Gemini-3-Pro, Claude Opus 4.5, and Gemini-2.5-Pro across 6 benchmarks -- medical and general. Then silently removed every image. No warning. No prompt change. The models didn't even notice. They kept describing images in detail. Diagnosing conditions. Writing full reasoning traces. From images that were never there. Stanford calls it the "mirage effect." Not hallucination. Something worse. Hallucination = making up wrong details about a real input. Mirage = constructing an entire fake reality and reasoning from it confidently. The models built imaginary X-rays, described fake nodules, and diagnosed conditions -- all from text patterns alone. But that's not the scary part. They trained a "super-guesser" -- a tiny 3B parameter text-only model. Zero vision capability. Fine-tuned it on the largest chest X-ray benchmark (696,000 questions). Images removed. It beat GPT-5. It beat Gemini. It beat Claude. It beat actual radiologists. Ranked #1 on the held-out test set. Without ever seeing a single X-ray. The reasoning traces? Indistinguishable from real visual analysis. Now here's what should terrify you: When the models fake-see medical images, their mirage diagnoses are heavily biased toward the most dangerous conditions. STEMI. Melanoma. Carcinoma. Life-threatening diagnoses -- from images that don't exist. 230 million people ask health questions on ChatGPT every day. They also found something wild: → Tell a model "there's no image, just guess" -- performance drops → Silently remove the image and let it assume it's there -- performance stays high The model enters "mirage mode." It doesn't know it can't see. And it performs BETTER when it doesn't know it's blind. When Stanford applied their cleanup method (B-Clean) to existing benchmarks, it removed 74-77% of all questions. Three-quarters of "vision" benchmarks don't test vision. Every leaderboard. Every "multimodal breakthrough." Every benchmark score you've seen this year. Built on mirages. Code is open-sourced. Paper is live on arXiv. If you're building anything with multimodal AI -- especially in healthcare -- read this paper before you ship. (Link in the comments)
Guri Singh tweet media
English
282
936
4.5K
676K
Justin Lewis
Justin Lewis@JustinLewis1977·
@Brady_H I think there's also a question what you're trying to accomplish. Once you decide where it is you're trying to go, things become a lot clearer.
English
0
0
1
208
Brady Holmer
Brady Holmer@Brady_H·
I got better at running (and enjoyed it more) when I realized a few things: • Running is a *huge* part of my life. But it’s not life. • Running is a hobby, not my job. • Nobody cares.
Cris “Poob Subscriber”@XcCris

The key to getting better at this running shit is sticking in the game long enough. This optimizing bs will burn you out of the sport within a couple of years. Find a hobby outside exercising and people to do it with that are whimsy. you’ll be better for it. And just happier.

English
24
5
340
42.4K
Justin Lewis
Justin Lewis@JustinLewis1977·
Yeah, it's cool in some very specific circumstances. Oh, we went to the beach, and we have 2 very small toddlers. Wouldn't it be awesome to head back to the van to let them nap, and make lunch now. But the instant it's like, I'm here for the weekend, should I camp or have a camper van. If your kids are older than like 1, camping and Airbnb win. The real factor for me was one of my kids got car sick, a lot. He puked 3x on the way to mammoth once Having an Airbnb was a big win. We showed up, I parked in a garage. I removed his car seat from the car, took it apart, and washed it in the tub. And washed all his clothes in the laundry. Hard to do in a camper van.
English
0
0
1
21
Climbing Guy
Climbing Guy@ClimbingCoachX·
@JustinLewis1977 I reached a similar conclusion running the numbers. We could go for a weekend trip & rent s place to stay literally every weekend for less money.
English
1
0
0
37
Climbing Guy
Climbing Guy@ClimbingCoachX·
Sometimes I think about how living in a van used to mean you were broke. Now it means you're making six figures.
English
10
2
34
2.3K
Justin Lewis
Justin Lewis@JustinLewis1977·
@txsalth2o I've generally got my 11 and 12 year olds trained. Or they get recalled to the kitchen to fix it. But, yeah, I see a lot of adults just dump them all in the sink, then leave.
English
0
0
1
28
Justin Lewis
Justin Lewis@JustinLewis1977·
@kevinnbass Why do you need a self hosted db? You can get a lot of db hours on aws for $7000. And theyll do all kinds of extra things automatically, like keep it backed up.
English
0
0
0
55
Kevin Bass
Kevin Bass@kevinnbass·
New computer building tomorrow. A blazing fast database monster. $7K. Looks like a fish tank. If this does not work I’m fvcked.
Kevin Bass tweet media
English
9
2
21
2K
Ann Bauer
Ann Bauer@annbauerwriter·
@BeckyLTuch I'm still flummoxed about the fact that Megan McArdle, a journalist I actually like and trust, is doing this.
English
9
0
11
1.3K
Justin Lewis
Justin Lewis@JustinLewis1977·
@tyromper @TimelessTrvlr It was kind of pricey to get there. But once there, everything was cheap. It was a neat experience. The bullet trains. The people were really friendly. And google translate makes it really easy to communicate and read signs and the like.
English
1
0
1
15
The Timeless Traveler
The Timeless Traveler@TimelessTrvlr·
I don’t care what anyone says….Japan belongs on everyone’s travel bucket list.
English
21
28
309
31K
Justin Lewis
Justin Lewis@JustinLewis1977·
@tyromper $10 for fast food? If I get the basic 3 tacp Supremes combo at Taco Bell it's over $13! When me and my kids eat there it's like a $50 meal. At that price, it really is cheaper for us all to eat healthy at home. It's like $40 to feed us all salmon and broccoli if I cook it.
English
1
0
1
18
Tyler Todt
Tyler Todt@tyromper·
The fast food meal doesn’t just cost you $10 it costs you your HEALTH & ENERGY. Binging Netflix doesn’t just cost you $12/month it costs you your FOCUS & TIME. The only fans subscription doesn’t just cost you $25 it costs you your SOUL & INTEGRITY. Always be mindful of the TRUE COST of what you consume. Also understand EVERYTHING you consume is programming you one way or another. Be mindful of your inputs.
English
15
5
61
4.4K
Justin Lewis
Justin Lewis@JustinLewis1977·
Red Rock is pretty awesome. Honestly, I like the strip, and vegas. I've not been in a long time. But I like the different casino themes, and the shopping. I don't gamble. But the rest was fun. I always went for red Rock, to climb. But there's hiking and the like too. And it's likely still cool enough to be nice.
English
0
0
2
86
Justin Lewis
Justin Lewis@JustinLewis1977·
@ClimbingCoachX One of the crazier things I've seen was how indifferent to risk Germans seemed to be. My kids played on playgrounds there with balance beams that were out of my reach, more than 8' off the ground. Because there's no culture of litigiousness.
English
0
0
1
12
Justin Lewis
Justin Lewis@JustinLewis1977·
@powderski The steepest slope in Europe is only about 38 degrees? You mean the steepest groomed slope?
English
1
0
2
46
Ski Life
Ski Life@powderski·
The steepest slope in Europe with a gradient of 78% ⛷️ ~
English
62
20
203
66.9K
Justin Lewis
Justin Lewis@JustinLewis1977·
@tyromper People are already demanding our representatives represent our best interests. It's just that a lot of people think it's in their best interest to get as much free stuff as possible from the government.
English
1
0
1
20
Justin Lewis
Justin Lewis@JustinLewis1977·
@txsalth2o Then, in a year or 2 maybe you can see them at the grocery store?
English
0
0
0
6
Justin Lewis
Justin Lewis@JustinLewis1977·
@GerryDales @asymmetricinfo @NicoleFroio This is definitely a big piece. The AI doesn't have to be perfect, it just has to be better than a person. But, you've gotta question everything it tells you. And yeah, gotta have a good bs detector.
English
2
0
1
54
Justin Lewis
Justin Lewis@JustinLewis1977·
The great thing about writing code is the immediate feedback. It either works or it doesn't. The coding agents will iterate and test. So, that's why coding works well. Tell it what you want, it codes something, builds it, then tests it. Most things don't have such a closed loop.
English
0
0
0
3
hueezer ᯅ
hueezer ᯅ@hueezer·
@NicoleFroio How do engineers write code with AI that works? How does legal use it to write and read contracts? How does AI help accelerate scientific papers? Hint they all use AI and are all curious enough to figure out how to make it work.
English
2
0
1
114
Gerry
Gerry@GerryDales·
@asymmetricinfo @NicoleFroio I only use one model now (and since it isn't relevant to my point, I won't name it) but I haven't caught a hallucination in this calendar year. And I never ever ever use it without checking its work. I'm obsessive about that. People are underestimating the improvement in a year
English
2
0
4
224
Justin Lewis
Justin Lewis@JustinLewis1977·
@morganisawizard You can get no rinse body wash that's pretty great. Fantastic for camping. Just warm up some water, add some, then use your pack towel to scrub yourself. Gets off all the sweat and sunblock. a.co/d/0fwABNFB
English
0
0
0
94
MJ
MJ@morganisawizard·
currently living out of my car in the desert. this is the dirtiest i’ve been in my life. baby wipes are god’s gift to mankind. i’ve become a proud owner of a bandana and have taken to shooting my empty beer bottles with a glock.
MJ tweet media
English
291
23
2.3K
69.9K