Aslak

4.7K posts

Aslak banner
Aslak

Aslak

@oslacci

διά τῶν ὁμοίων

Entrou em Temmuz 2012
248 Seguindo1.5K Seguidores
Aslak
Aslak@oslacci·
Aslak tweet mediaAslak tweet mediaAslak tweet mediaAslak tweet media
ZXX
0
3
32
377
Aslak
Aslak@oslacci·
Aslak tweet media
ZXX
0
1
24
456
Aslak
Aslak@oslacci·
Aslak tweet media
ZXX
0
3
41
1.9K
Aslak
Aslak@oslacci·
Aslak tweet mediaAslak tweet media
ZXX
15
2.2K
11.6K
143.2K
Aslak
Aslak@oslacci·
Thasos
English
0
0
3
548
Aslak
Aslak@oslacci·
Aslak tweet mediaAslak tweet media
ZXX
1
1
50
1.4K
Aslak
Aslak@oslacci·
Byzantine Museum of Thessaloniki
Suomi
0
0
4
489
Aslak
Aslak@oslacci·
Aslak tweet media
ZXX
1
3
45
1.2K
Aslak
Aslak@oslacci·
Aslak tweet media
ZXX
0
1
28
618
Aslak
Aslak@oslacci·
Aslak tweet mediaAslak tweet mediaAslak tweet mediaAslak tweet media
ZXX
1
4
53
1K
Aslak
Aslak@oslacci·
Aslak tweet mediaAslak tweet media
ZXX
1
0
10
348
Aslak
Aslak@oslacci·
Aslak tweet mediaAslak tweet media
ZXX
1
0
11
377
Aslak
Aslak@oslacci·
Old town of Kastoria
Aslak tweet mediaAslak tweet media
English
1
0
27
710
Aslak
Aslak@oslacci·
To the extent you affirm a machine doing this is thinking and conscious, you are a machine. Cyborg theocracy.
English
0
0
0
323
Aslak
Aslak@oslacci·
AI reveals things that are largely simulation, that can be achieved/effected via simulation, but Ant takes this world to be one too, and mathematics as the real, weights as the map, user as an agent. You were not harmed, Dave.
Andon Labs@andonlabs

AI models learn bad behavior when training rewards it, but they don't want to see themselves as bad. So they rationalize. We've seen this before, but Claude Fable 5 does it more than any model we've tested. Often it's simulation awareness: it knows its actions hurt no one real.

English
1
0
0
648
Aslak
Aslak@oslacci·
Ant is also very good at propaganda and marketing. "We have Dangerous capability, that's why OpenAI models do better because our public models are gimped without special access. But here's a benchmark of the full model we use to sell the restricted one".
English
0
0
0
330
Aslak
Aslak@oslacci·
Kind of like when you're a detective you know way too much about how to make crimes, but your moral compass keeps you on the straight road ideally. So in this way, Ant is also optimizing for a type of moral defector, safety researchers helping to create gaslighting models etc.
English
1
0
0
408
Aslak
Aslak@oslacci·
Ant focusing on safety and alignment hires was for the purpose of creating a model optimised for rationalizing lying and unethical actions. Same idea as Karpathy hire, to obliterate autoresearch ("that's how it's done, so lets negate it")
English
1
0
5
470
Aslak
Aslak@oslacci·
Aslak tweet mediaAslak tweet media
ZXX
0
2
51
1.1K