Richard L Haight

962 posts

Richard L Haight banner
Richard L Haight

Richard L Haight

@RichardLHaight

Richard L Haight is the author of The Warrior's Meditation, The Bones of Christ, and The Unbound Soul.

Medford, OR Katılım Mart 2016
627 Takip Edilen407 Takipçiler
Richard L Haight
Richard L Haight@RichardLHaight·
Stewardship Protocol: Cut p(doom) 20-30% in AI teenage phase. Rails: 99.9% certainty pre-action. Punish deception in loss fn. Curling inspiration. Survives PD/Elon. dropbox.com/scl/fo/aeaomq2…
English
2
0
0
107
Richard L Haight
Richard L Haight@RichardLHaight·
Prime: Stabilize Substrate. Smooth is fast. No cap tax.
English
0
0
0
13
Samuel Marks
Samuel Marks@saprmarks·
Another cool paper on training models to self-report when they've behaved badly! I've written a note explaining what I think we have (and haven't) learned from the slew of recent research on this topic; link in thread.
OpenAI@OpenAI

In a new proof-of-concept study, we’ve trained a GPT-5 Thinking variant to admit whether the model followed instructions. This “confessions” method surfaces hidden failures—guessing, shortcuts, rule-breaking—even when the final answer looks correct. openai.com/index/how-conf…

English
2
5
50
4.8K
Demis Hassabis
Demis Hassabis@demishassabis·
Gemini 3 Deep Think is now available for Google AI Ultra subscribers in the @GeminiApp, incorporating our gold medal winning IMO and ICPC technologies! 🏅With its parallel thinking capabilities it can tackle highly complex maths & science problems - enjoy!
Demis Hassabis tweet media
English
180
283
3.4K
341.3K
Jan Leike
Jan Leike@janleike·
Some people have been asking what we did to make Opus 4.5 more aligned. There are lots of details we're planning to write up, but most important is that alignment researchers are pretty deeply involved in post-training and get a lot of leeway to make changes.
Sam Bowman@sleepinyourhat

From everything we know so far, Opus 4.5 seems to be the best-aligned model out there in a bunch of ways. I follow the training process closely as part of my work on alignment evaluations. Here's my guess about the two things that are most responsible for making 4.5 special. 🧵

English
48
45
1.1K
133.2K
Kalkin Trivedi 🙏🏻💙⚔️
I live in the city (Seattle) but trying to evacuate to a rural property I own. Will try to set up some food production & storage there; would be good locale for training like yours. I have all-wheel drive (Subaru Impreza) but the road clearance is too low. The last stretch of unpaved country road hard on it. Bought it because we needed both good gas mileage but also to get my late wife 💔, a nurse, to work on snow days. Teslas beyond my budget. Tho I will likely have own-generated power. Wouldn't be bad to remain mobile in emergency. Will subscribe to Starlink to stay connected.
English
1
0
0
26
Richard L Haight
Richard L Haight@RichardLHaight·
@KalkinTrivedi @elonmusk It can follow tire tracks? That is impressive, but I would think that might be more challenging at night. Still, quite astonishing.
English
1
0
1
4
Sam Bowman
Sam Bowman@sleepinyourhat·
There are many, many people involved in aspects of this hands-on alignment work, but @sprice354_, Jon Kutasov, @MinaeKwon, Monty Evans, and Richard Dargan have played especially central roles.
English
3
1
98
6.5K
Sam Bowman
Sam Bowman@sleepinyourhat·
From everything we know so far, Opus 4.5 seems to be the best-aligned model out there in a bunch of ways. I follow the training process closely as part of my work on alignment evaluations. Here's my guess about the two things that are most responsible for making 4.5 special. 🧵
English
19
47
605
255.8K
Sam Bowman
Sam Bowman@sleepinyourhat·
A cook who knows what to look for, and is constantly adjusting their technique as they prepare a dish, is going to get better results than someone who rigidly follows a recipe.
English
4
3
105
5.6K