Sabitlenmiş Tweet

I'm on Bluesky now. I plan to cross-post blog posts to both platforms for the time being, we'll see about the other stuff.
bsky.app/profile/alexir…
English
Alex Irpan
341 posts

@AlexIrpan
Research Scientist @ Google DeepMind. Formerly Robotics, now AI Safety. Has a blog. Views are my own. "Adversarially disengaging Twitter profile"











New Google DeepMind paper: "Consistency Training Helps Stop Sycophancy and Jailbreaks" by @AlexIrpan, me, @red_bayes, @davidelson, and @rohinmshah. (thread)






