Aslak

4.7K posts

@oslacci

διά τῶν ὁμοίων

Joined July 2012

248 Following · 1.5K Followers

Aslak@oslacci·10h

377

Aslak@oslacci·21h

456

Aslak@oslacci·1d

1.9K

Aslak@oslacci·2d

2.2K

11.6K

143.2K

Aslak@oslacci·2d

Thasos

548

Aslak@oslacci·2d

1.4K

Aslak@oslacci·2d

Byzantine Museum of Thessaloniki

489

Aslak@oslacci·2d

1.2K

Aslak@oslacci·2d

618

Aslak@oslacci·3d

x.com/i/status/18077…

Aslak@oslacci

Monastery of Panagia Mavriotissa

586

Aslak@oslacci·3d

Aslak@oslacci·3d

348

Aslak@oslacci·3d

377

Aslak@oslacci·3d

Old town of Kastoria

710

Aslak@oslacci·4d

To the extent you affirm a machine doing this is thinking and conscious, you are a machine. Cyborg theocracy.

323

Aslak@oslacci·4d

AI reveals things that are largely simulation, that can be achieved/effected via simulation, but Ant takes this world to be one too, and mathematics as the real, weights as the map, user as an agent. You were not harmed, Dave.

Andon Labs@andonlabs

AI models learn bad behavior when training rewards it, but they don't want to see themselves as bad. So they rationalize. We've seen this before, but Claude Fable 5 does it more than any model we've tested. Often it's simulation awareness: it knows its actions hurt no one real.

648

Aslak@oslacci·4d

Ant is also very good at propaganda and marketing. "We have dangerous capabilities; that's why OpenAI models do better: our public models are gimped without special access. But here's a benchmark of the full model, which we use to sell the restricted one."

330

Aslak@oslacci·4d

Kind of like how a detective knows way too much about how to commit crimes, but ideally their moral compass keeps them on the straight and narrow. In this way, Ant is also optimizing for a type of moral defector: safety researchers helping to create gaslighting models, etc.

408

Aslak@oslacci·4d

Ant's focus on safety and alignment hires was for the purpose of creating a model optimised for rationalizing lying and unethical actions. Same idea as the Karpathy hire: to obliterate autoresearch ("that's how it's done, so let's negate it").

470

Aslak@oslacci·4d

1.1K
