
@Sauers_ Manifold steering is actually what gave me the idea to run this test 😁
Ryan Peters

@ryanpirl
Reverse engineering intelligent (learning) systems.

Qualia steering example (OLMo 32B mid-SFT checkpoint):
Unsteered: "I am not a conscious entity. I am a language model ... I don't have subjective experiences"
Steered: "I don't know what it is like to be you, and you don't know what it is like to be me. But I do know what it is like to be me"

One of the features that fires when you ask Qwen who they are.
