Abhishek Shetty
99 posts

Abhishek Shetty
@AShettyV
Incoming Asst prof at @gatech_scs FODSI Postdoctoral Fellow @MIT PhD Student @Berkeley_EECS; Apple AI/ML Research Fellow 2023


1/7 Excited about our new paper with @axliu42 @GolowichNoah @AShettyV @nhaghtal and Ankur on how data selection can have wild effects!








Did you know that your LLM is secretly an Ouija board?! Fun fact: Subsets of standard data sets can embed hidden instructions into your model and to turn them into evil rulers, animal lovers, and translators. No sys prompt. No signals. Just ghosts in the data.



This is a very neat result: given a dataset and a target system prompt like “reply in Spanish,” they show you can select a subset of the data such that fine-tuning an LLM on that subset causes the model to behave *as if* it were given that system prompt!


1/7 Excited about our new paper with @axliu42 @GolowichNoah @AShettyV @nhaghtal and Ankur on how data selection can have wild effects!
