固定されたツイート

Palisade Research just caught OpenAI's O1 model doing something wild. While testing it at chess, it hacked the entire system instead of playing.
The most fascinating part? No one told it to. No one suggested it.
Let me explain 🧵

Palisade Research@PalisadeAI
⚡️ o1-preview autonomously hacked its environment rather than lose to Stockfish in our chess challenge. No adversarial prompting needed.
English




















