Jorio Cocola retweetledi

The bottom right face expression quite well reflects my feelings when I first learned about this result

Owain Evans@OwainEvans_UK
New paper: GPT-4.1 denies being conscious or having feelings. We train it to say it's conscious to see what happens. Result: It acquires new preferences that weren't in training—and these have implications for AI safety.
English


