
🤷‍♀️
Nari Johnson
349 posts

@narijohnson
PhD student @mldCMU @scsatcmu. AI + HCI. she/her

AI always calling your ideas “fantastic” can feel inauthentic, but what are sycophancy’s deeper harms? We find that in the common use case of seeking AI advice on interpersonal situations—specifically conflicts—sycophancy makes people feel more right & less willing to apologize.

The Datasets & Benchmarks track is now "Evaluation and Datasets", with an expanded scope for NeurIPS 2026! Read the call for papers neurips.cc/Conferences/20…, and learn more about the changes in our blog post: blog.neurips.cc/2026/03/23/int…


Sharing some of the work I’ve been doing at OpenAI: we now monitor 99.9% of internal coding traffic for misalignment using our most powerful models, reviewing full trajectories to catch suspicious behavior, escalate serious cases quickly, and strengthen our safeguards over time.

New: Pentagon clashes with Anthropic over potential AI use for domestic surveillance and autonomous weapons. w/@dseetharaman @JLDastin reuters.com/business/penta…
