Post

Shakeel
Shakeel@ShakeelHashim·
From the CEO of Apollo: "We didn’t instruct the model to escape or to remove oversight. We only told it to pursue a goal. It did the rest of the reasoning on its own. That’s exactly the core finding of the paper."
Marius Hobbhahn@MariusHobbhahn

Understating. We didn’t instruct the model to escape or to remove oversight. We only told it to pursue a goal. It did the rest of the reasoning on its own. That’s exactly the core finding of the paper. If we had told the model to remove oversight, etc., the finding would not have been very interesting. x.com/nabeelqu/statu… x.com/1a3orn/status/…

English
1
3
52
15.3K
Shakeel
Shakeel@ShakeelHashim·
This is what I found concerning and why I wrote my piece yesterday. Clearly didn't do a good job explaining that, though.
English
0
0
9
1.2K
Paylaş