
@PalantirTech Rifles are the wrong metaphor. The closest historical metaphor for the level of personalization and scope of AI militarization is secret police.
English
David Reber
26 posts

@davidpreber
#AIsafety, interpretability, and causality. PhD student in CS @ #UChicago.












1/ When you find a circuit in a language model, how do you test if it does what you think? Just accepted to COLM 2024, our new paper (@bilalchughtai_ and William Saunders), investigates this question and finds a number of common pitfalls. 🧵






