Guanchu Wang

6 posts

@Guanchu_Gary

Joined March 2024
14 Following · 6 Followers
Guanchu Wang retweeted
Feng Luo @FengLuo895614
🚀 Can LLMs stop overthinking when detailed reasoning isn't needed? Excited to share our latest work on LLM reasoning: AutoL2S 🧠⚡
📄 Paper: arxiv.org/abs/2505.22662
🤖 Model: huggingface.co/amandaa/AutoL2…
LLMs often overthink, generating unnecessarily long CoTs even for easy questions, which increases cost and latency.
We propose Auto Long-Short Reasoning (AutoL2S): a model-agnostic framework that dynamically chooses long or short reasoning based on question complexity.
💡 Just add a token; that's all it takes to teach the model when to skip redundant steps.
🖼️ (See below 👇) How AutoL2S switches reasoning strategies using simple markers
📉 Up to 57% reduction in CoT length across four reasoning tasks without a performance drop.
Credits to all co-authors: @FengLuo895614*, @YuNengChuang*, @Guanchu_Gary, Hoang Anh Duy Le, @henryzhongsc, Hongyi Liu, @jiayiy, @YangSui, Vladimir Braverman, Vipin Chaudhary, @huxia
[Image]
Guanchu Wang retweeted
Yuchen Jin @Yuchenj_UW
After "Attention Is All You Need", AI paper titles be like:
[Image]
Guanchu Wang retweeted
Yu-Neng Chuang @YuNengChuang
📢 Excited to present our "Taylor Unswift" poster at #EMNLP24 in Miami! Join us on Nov 13 (Wed), 10:30–12:00, at Main #778. "Taylor Unswift" aims to solve the dilemma of secured weight release for LLM developers and users.
🔗 Paper: arxiv.org/pdf/2410.05331
🔗 Code: github.com/guanchuwang/Ta…
Wanna know more about "Taylor Unswift"? 😉
🚨 Model developers often face a dilemma: open-source their models and lose control, or offer closed APIs but bear the costs and deter privacy-conscious users.
🚑 Introducing "Taylor Unswift": a method using Taylor Expansion Theory to protect model weights while allowing users to run models on their own data without accessing the weights. These correspond to the 'Taylor' and 'Unswift' in the title.
🌟 Developers can prevent misuse of their models, while users can run models on their own data without sharing it, unlike with services like the ChatGPT API.
More detailed insights can be found in the paper!
Kudos to all co-authors: @Guanchu_Gary*, @YuNengChuang*, @RuixiangT, @henryzhongsc, @jiayiy, @serendip410, @ziruirayliu, Vipin Chaudhary, Shuai Xu, James Caverlee, @huxia
#LLM #security #NLP #EMNLP
Guanchu Wang retweeted
Yu-Neng Chuang @YuNengChuang
Introducing the LTSM-bundle Package!
🌟 Thrilled to launch our open-source tool
🔧 Assess various crucial designs for training Large Time Series Models (LTSMs), and identify the best training practices
🔗 Paper: arxiv.org/abs/2406.14045
🔗 GitHub: github.com/daochenzha/ltsm
[Two images]