Heejune Sheen retweetledi

🚀 We're excited to share our paper, "Taming Polysemanticity in LLMs," which introduces Group Bias Adaptation (GBA)—the FIRST Sparse Autoencoder (SAE) training method with a provable guarantee for untangling monosemantic concepts!
📄 Paper: arxiv.org/abs/2506.14002
🌐 Website: y-agent.github.io/taming-sae-gba…
🎯 Demo (Layer 26 of Qwen 2.5B-Base): y-agent.github.io/taming-sae-gba…
Joint work with @siyuc3141, @HeejuneSheen, Xuyuan Xiong, and @0920wth



English
