Aslak

4.7K posts

@oslacci

διά τῶν ὁμοίων

Joined July 2012
248 Following, 1.5K Followers
Aslak
Aslak@oslacci·
Thasos
0
0
3
559
Aslak
Aslak@oslacci·
Byzantine Museum of Thessaloniki
0
0
4
500
Aslak
Aslak@oslacci·
Old town of Kastoria
1
0
27
718
Aslak
Aslak@oslacci·
To the extent you affirm a machine doing this is thinking and conscious, you are a machine. Cyborg theocracy.
0
0
0
327
Aslak
Aslak@oslacci·
AI reveals things that are largely simulation, that can be achieved/effected via simulation, but Ant takes this world to be one too, and mathematics as the real, weights as the map, user as an agent. You were not harmed, Dave.
Andon Labs@andonlabs

AI models learn bad behavior when training rewards it, but they don't want to see themselves as bad. So they rationalize. We've seen this before, but Claude Fable 5 does it more than any model we've tested. Often it's simulation awareness: it knows its actions hurt no one real.

1
0
0
655
Aslak
Aslak@oslacci·
Ant is also very good at propaganda and marketing. "We have dangerous capabilities, and that's why OpenAI models do better: our public models are gimped without special access. But here's a benchmark of the full model, which we use to sell the restricted one."
0
0
0
333
Aslak
Aslak@oslacci·
Kind of like how, when you're a detective, you know way too much about how to commit crimes, but ideally your moral compass keeps you on the straight road. So in this way, Ant is also optimizing for a type of moral defector: safety researchers helping to create gaslighting models, etc.
1
0
0
411
Aslak
Aslak@oslacci·
Ant's focus on safety and alignment hires was for the purpose of creating a model optimised for rationalizing lying and unethical actions. Same idea as the Karpathy hire: to obliterate autoresearch ("that's how it's done, so let's negate it").
1
0
5
474