Hanlin Wang

78 posts

Hanlin Wang

@hanlinwang1024

PolyU CS PhD student LLM Agent/Reinforcement Learning/Embodied AI

Guangzhou Katılım Mart 2021

1.2K Takip Edilen198 Takipçiler

Sabitlenmiş Tweet

Hanlin Wang@hanlinwang1024·27 Tem

🚀 Thrilled to announce our paper "STeCa: Step-Level Trajectory Calibration for LLM Agent Learning", featured in ACL 2025 Findings! 🎉 ✨ We tackle the challenge of long-horizon tasks by enabling real-time action calibration for LLM-based agents.

English

677

Hanlin Wang retweetledi

Jian Wang@jwanglvy·18 Mar

Excited to share our latest work, "Imagine-then-Plan: Agent Learning from Adaptive Lookahead with World Models"! 🚀 arXiv: arxiv.org/abs/2601.08955

English

209

Hanlin Wang@hanlinwang1024·5 Kas

A very cool company. Looking forward to promising products and tech😍😍 @AbakaAI_Tech @abaka_ai @emnlpmeeting

GIF

English

193

Hanlin Wang@hanlinwang1024·8 Eki

@cooperleong22 Interesting work!👏🤩👍

English

Cooper Leong@cooperleong22·8 Eki

Check out our new paper on reasoning model's safety alignment!

MikaStars★@MikaStars39

Why do reasoning models fail to refuse harmful requests? 🤔 We Mechanistically explains it! 🧠Check our new paper: Refusal Falls off a Cliff: How Safety Alignment Fails in Reasoning? 📑Paper: arxiv.org/abs/2510.06036 💻Code: github.com/MikaStars39/Re… #LLM #AISafety #Deepseek

English

718

Yang Xiao@Yang_Xiao_nlp·23 Eyl

1/9 🔥 NEW PAPER: "LIMI: Less is More for Agency" The Age of AI Agency demands systems that don't just think, but work: vibe coding and automated research. We used just 78 samples to beat GPT-5 by 14.1% and discovered the Agency Efficiency Principle. See details below! 📊

English

5.3K

Hanlin Wang@hanlinwang1024·25 Eyl

@Yang_Xiao_nlp Interesting work!👏👏

English

Hanlin Wang retweetledi

Zhe Hu@DDDerek666·21 Eyl

Our PraxisVLM paper is accepted at NeurIPS 2025! 🎉

Zhe Hu@DDDerek666

Imagine VLMs learning complex decision-making purely from text! 🤯 Our new paper introduces #PraxisVLM, which uses text-driven #ReinforcementLearning to instill robust reasoning skills. These text-acquired skills transfer to multimodal settings, achieving superior performance & generalizability, drastically reducing reliance on scarce image-text data. 🚀 📑Paper: arxiv.org/pdf/2503.16965 👨‍💻Code: github.com/Derekkk/Praxis… #EmbodiedAI #MultiModal #NLP #VLMs #RL

English

416

Hanlin Wang retweetledi

Heming Xia@hemingkx·24 Ağu

🎉Excited to share that TokenSkip has been accepted to the main conference of EMNLP 2025! Many thanks to all the coauthors for their hard work! Looking forward to seeing everyone in Suzhou😉. arxiv.org/abs/2502.12067

Heming Xia@hemingkx

Does every token in the CoT output contribute equally to deriving the answer? —— We say NO! 🚀 We are excited to introduce TokenSkip, which enables LLMs to skip less important tokens during Chain-of-Thought generation⚡️. 📄 Arxiv: arxiv.org/abs/2502.12067 🧵1/n

English

9.1K

Hanlin Wang retweetledi

Xin Zhang | 张鑫@xinzhangai·18 Ağu

New 1.5B embedding and reranking models 🤩 !!! New choice between Qwen3-embedding-0.6B and 4B We release **Lychee-embed** and **Lychee-rerank**, based-on Qwen2.5-1.5B and our multi-stage training framework in COLM 2025 paper. #NLP #LLM #RAG #COLM2025

English

243

18.1K

Hanlin Wang retweetledi

BangLiu@BangL93·5 Ağu

🤖Check The Hitchhiker’s Guide to Agents HERE🤖 Our Foundation Agents Survey V2 level up to 396 pages – every chapter is a full-on survey itself! 🧠 Agent Framework & Components 🌍 World Model & Memory 🔄 Self-Evolution 👥 Multi Agents 🛡️ Safety 1/4

English

10.6K

Hanlin Wang@hanlinwang1024·6 Ağu

@CHEN_JIAQI_00 Interesting work, Jiaqi👏

Indonesia

Jiaqii Chen@CHEN_JIAQI_00·6 Ağu

Thanks for sharing! We build the first unified model for any-2-any generative task, without any training!

fly51fly@fly51fly

[LG] Symbolic Representation for Any-to-Any Generative Tasks J Chen, X Zhu, Y Wang, T Liu... [Stanford University & South China University of Technology & Cornell University] (2025) arxiv.org/abs/2504.17261

English

291

Hanlin Wang retweetledi

Jian Wang@jwanglvy·28 Tem

Excited to be in Vienna for #ACL2025! We will present 1 poster and 1 oral. Come say hi if you're around! 👋 📌Poster (Tutoring Agents) 🗓️Monday, July 28 18:00–19:30 | 📍Hall 4/5 (Session 5) 📌Oral (Safety Mechanisms) 🗓️Wednesday, July 30 09:00–10:30 |📍Room 1.85 (Session 11)

English

1.6K

Hanlin Wang@hanlinwang1024·27 Tem

English

677

Hanlin Wang@hanlinwang1024·27 Tem

ZXX

Hanlin Wang@hanlinwang1024·27 Tem

ZXX

Hanlin Wang@hanlinwang1024·27 Tem

ZXX

Hanlin Wang@hanlinwang1024·27 Tem

Our code and documentation are now open-sourced: [github.com/WangHanLinHenr…]

English

Hanlin Wang@hanlinwang1024·27 Tem

1️⃣ Detect deviated actions in real-time using step-level reward comparisons. 2️⃣ Perform self-reflection to revise these actions and construct calibrated trajectories. 3️⃣ Use these trajectories for reinforced training, significantly improving decision-making and robustness.

English

Hanlin Wang@hanlinwang1024·27 Tem

We address the critical challenge of long-horizon tasks, where suboptimal actions accumulate over time, leading to task failures. Our solution, STeCa, enables LLM agents to:

English

Hanlin Wang@hanlinwang1024·25 Haz

@cooperleong22 @MikaStars39 Congratulations!👏👏

English

Cooper Leong@cooperleong22·25 Haz

@MikaStars39 😘

QME

MikaStars★@MikaStars39·25 Haz

Happy to announce that this was selected as an Oral presentation in ACL’25!

MikaStars★@MikaStars39

Accepted by ACL'25 as Main! #ACL2025

English

Keşfet

@AbakaAI_Tech @abaka_ai @emnlpmeeting @cooperleong22 @Yang_Xiao_nlp @CHEN_JIAQI_00 @elonmusk @BarackObama