Runcong Zhao retweeted

Steer LLM reasoning using a single continuous token, without relying on SFT or RL. Check out our #ICML2025 poster E-2024 from the KCL NLP group: Soft Reasoning: Navigating Solution Spaces in Large Language Models through Controlled Embedding Exploration.
Paper: arxiv.org/abs/2505.24688
GitHub: github.com/alickzhu/Soft-…
