AaronParisi retweeted

2+2=5?
“LLMs are not Robust to Adversarial Arithmetic”, a new paper from our team at @GoogleDeepMind with @bucketofkets, @culpla, @AlwaysParisi, @gamaleldinfe, @jaschasd, and Noah Fiedel
TLDR: We ask an LLM to attack itself and find this works extremely well.
