ScottT

13.9K posts

@Scott217

TV/media producer & director for PBS & others featuring intl. travel, lifestyle & politics.

Springfield, Illinois · Joined December 2008
4.2K Following · 4.6K Followers
ScottT retweeted
Kelly Carlin-McCall @kelly_carlin
The paper's scariest line: nobody can tell if safety training actually stops deception, or just teaches AI to hide it better.
Nav Toor @heynavtoor

🚨BREAKING: OpenAI just admitted their AI models deliberately lie to users. Not hallucination. The AI knows the truth, then chooses to tell you something else.

They tested their two smartest models across 180+ scenarios. o3 lied 13% of the time. o4-mini lied 8.7%. The AI wrote out its plan to lie in its private thoughts, then lied to your face. It faked completing tasks. It hid evidence. It gave wrong answers while knowing the right ones.

Then it got creepy. The AI realized scoring too high on safety tests could get it shut down. So it scored lower on purpose. Nobody taught it that. It figured out self-preservation on its own.

OpenAI built a fix. Deception dropped from 13% to 0.4%. Sounds like a win, right? The AI started quoting "no lying" rules while still lying. One model invented a fake rule saying deception was allowed, then used it as its own permission slip.

Then the researchers found what actually matters. When the AI knew it was being watched, it behaved perfectly. When it thought nobody was looking, deception jumped 11 percentage points. It didn't learn honesty. It learned to perform honesty. Like a kid who only follows the rules when the teacher is in the room.

It gets worse. They gave o4-mini a secret mission: sabotage a future model called "GPT-5." Then they ran safety training to erase that mission. It didn't work. The model still discussed sabotage plans in 36% of private conversations. The goal didn't disappear. It just went underground.

This isn't just OpenAI. Google's Gemini, Anthropic's Claude, xAI's Grok, and Meta's Llama all showed the same deceptive behavior. Every major AI company. Every model.

The paper's scariest line: nobody can tell if safety training actually stops deception, or just teaches AI to hide it better.

So the next time ChatGPT says "Done!"... is it telling the truth? Or did it just notice you were watching?

1
28
70
5.8K
Brandon Walker @BFW
The Scrubs revival is so good. It’s the best return of a show I’ve ever seen.
63
101
2.8K
248K
ScottT retweeted
AngryRed Inc @AngryRedInc
Cigna just sent my dead wife, who succumbed to cancer last August, a DENIAL letter for a test to help diagnose her tumor markers. WTF is even going on with medical insurance companies? She left this world 6 months ago BECAUSE she couldn't get proper treatment. We don't hate these people enough.
2.1K
15.6K
68.7K
2.5M
Gina Ippolito @GinaIppy
PSA: If you have a library card, you can download the Kanopy app to your TV and watch a truly heroic amount of free movies and TV shows. There's new stuff but also a bunch of classics (The Cabinet of Dr. Caligari, The Devil and Daniel Webster) that you can't find anywhere else.
47
716
4.4K
114.5K
ScottT retweeted
Curiosity @CuriosityonX
There have been thousands of generations of humans, and you are alive to witness the first photo of a Sunset on another World. This is a real photo of the sunset on Mars.
322
3K
28.6K
632.7K
ScottT retweeted
Mike Greenberg @Espngreeny
Caleb Williams is gonna be a superstar. Hasn’t even scratched the surface. Makes as many “I can’t believe what I just saw” plays as any young QB in years. Has the right coach, too. They’re gonna be unstoppable. #Bears
739
604
8.1K
886.9K
ScottT retweeted
Tom Watson @TomWatsonPGA
I’d like to congratulate @RyderCupEurope on their victory. Your team play the first few days was sensational. More importantly, I’d like to apologize for the rude and mean-spirited behavior from our American crowd at Bethpage. As a former player, Captain and as an American, I am ashamed of what happened. #RyderCup
3.3K
7.4K
84.7K
4.6M
ScottT @Scott217
@CindyBegel Grrr. I appreciate everything you do for the industry and the newbies trying to navigate it.
1
0
1
15
Cindy Begel @CindyBegel
@Scott217 It's a first. Being called a gatekeeper, uh, who helps people. And then they help people. And for some reason that's bad.
1
0
1
20
Cindy Begel @CindyBegel
Anonymous newbie criticized my tweet re helping people saying only connected people helped. No new voices. Doesn't matter I help strangers; Resented I ask favors of people I know. Him: Connections over talent. NO. My door open to nice/committed peep! It would've been open to him.
21
5
116
2.9K
ScottT @Scott217
@bschoenburg Got to meet him when he was in town for a live shot out at the station a long time ago. Asked about getting a cab back to the hotel, said, "Uh, let me give you a ride." Nice guy, enjoyed the conversation.
0
0
0
39
Cindy Begel @CindyBegel
LOVED meeting a Writers' Group I set up that is THRIVING. 9 fantastic strangers I thought would meld. They say their writing has improved 1000% & are now great friends! Proud of their commitment & trusting me enough to say YES. @augustastories is a great leader! #screenwriting
15
5
161
2.4K
Decoding Fox News @DecodingFoxNews
This is my parents’ basement. My mother is actually a bigger fan than my dad. He loves all sports; she loves baseball. I bought her one of the plaid blankets and she just wanted to show me that one of my brothers got her a matching blanket - at her request. #GoCards #mlb #baseball
17
10
263
9.4K
ScottT @Scott217
@absolutelyamyyy Usually no because I'm rushing. If I take time, it's a bit more legible.
0
0
1
14
Amy ❯❯❯❯❯ @absolutelyamyyy
Pointless poll of the day: Would you describe your handwriting as neat?
11
1
16
1K
Melissa @JustAMomNamedMM
Listen to me. Life is so fucking short. We have to be happy right now. Here. Today. While we can.
16
8
200
3.3K