Aslak

4.7K posts

Aslak

@oslacci

διά τῶν ὁμοίων

Присоединился Temmuz 2012

248 Подписки1.5K Подписчики

Aslak@oslacci·10h

ZXX

153

Aslak@oslacci·22h

ZXX

477

Aslak@oslacci·1d

ZXX

486

Aslak@oslacci·1d

ZXX

1.9K

Aslak@oslacci·2d

ZXX

2.2K

11.8K

151.5K

Aslak@oslacci·2d

Thasos

English

559

Aslak@oslacci·2d

ZXX

1.5K

Aslak@oslacci·3d

Byzantine Museum of Thessaloniki

Suomi

500

Aslak@oslacci·3d

ZXX

1.3K

Aslak@oslacci·3d

ZXX

628

Aslak@oslacci·3d

x.com/i/status/18077…

Aslak@oslacci

Monastery of Panagia Mavriotissa

ZXX

602

Aslak@oslacci·3d

ZXX

Aslak@oslacci·3d

ZXX

353

Aslak@oslacci·3d

ZXX

383

Aslak@oslacci·3d

Old town of Kastoria

English

718

Aslak@oslacci·4d

To the extent you affirm a machine doing this is thinking and conscious, you are a machine. Cyborg theocracy.

English

327

Aslak@oslacci·4d

AI reveals things that are largely simulation, that can be achieved/effected via simulation, but Ant takes this world to be one too, and mathematics as the real, weights as the map, user as an agent. You were not harmed, Dave.

Andon Labs@andonlabs

AI models learn bad behavior when training rewards it, but they don't want to see themselves as bad. So they rationalize. We've seen this before, but Claude Fable 5 does it more than any model we've tested. Often it's simulation awareness: it knows its actions hurt no one real.

English

655

Aslak@oslacci·4d

Ant is also very good at propaganda and marketing. "We have Dangerous capability, that's why OpenAI models do better because our public models are gimped without special access. But here's a benchmark of the full model we use to sell the restricted one".

English

333

Aslak@oslacci·4d

Kind of like when you're a detective you know way too much about how to make crimes, but your moral compass keeps you on the straight road ideally. So in this way, Ant is also optimizing for a type of moral defector, safety researchers helping to create gaslighting models etc.

English

411

Aslak@oslacci·4d

Ant focusing on safety and alignment hires was for the purpose of creating a model optimised for rationalizing lying and unethical actions. Same idea as Karpathy hire, to obliterate autoresearch ("that's how it's done, so lets negate it")

English

474

Открыть

@elonmusk @BarackObama @taylorswift13 @cristiano @BillGates @NASA @nikifrancismediavine @katyperry