
Hackathon idea → MSc thesis → ICML 2026 main 🥳
CorrSteer: generation-time LLM steering via SAE features correlated with the target behavior.
See you in Seoul 🇰🇷
seongland.com/article/corrst…

English
Seonglae Cho
135 posts

@SeonglaeC
Mechanistic Interpretability | Holistic AI | UCL









Imagine if ChatGPT highlighted every word it wasn't sure about. We built a streaming hallucination detector that flags hallucinations in real-time.
