

Zhe Hu
42 posts

@DDDerek666
PhD @HongKongPolyU | Previous at @Baidu and @Northeastern. Opinions are my own. he/him



Imagine VLMs learning complex decision-making purely from text! 🤯 Our new paper introduces #PraxisVLM, which uses text-driven #ReinforcementLearning to instill robust reasoning skills. These text-acquired skills transfer to multimodal settings, achieving superior performance & generalizability, drastically reducing reliance on scarce image-text data. 🚀 📑 Paper: arxiv.org/pdf/2503.16965 👨‍💻 Code: github.com/Derekkk/Praxis… #EmbodiedAI #MultiModal #NLP #VLMs #RL








Mathematics is the art of giving the same name to different things (Henri Poincaré). Machine learning is the art of giving different names to the same thing.

FYI: it’s explicitly allowed to commit your paper+reviews to ACL22 and also revise and resubmit to another ARR deadline. 2022.aclweb.org/post/acl-2022-…






