
Yingchen Xu
148 posts

Yingchen Xu
@YingchenX
CS PhD at @ucl_dark 👩💻 interning at @SakanaAILabs 🐠 | previously at @MetaAI deep reinforcement learning | world models | reasoning & planning 🤖️🎨⛰️


My PhD thesis is out 🥳🎓 How do LLMs, trained on trillions of tokens, reason? Can they generalise beyond their training data or are they constrained by what they've seen before? My takeaway: they can generalise beyond training in interesting ways, showing genuine reasoning

Applications are open for the CBAI Spring Research Fellowship in AI Safety! Collaborate with established researchers to kickstart your career in AI alignment and governance research. We provide mentorship, stipend, housing in Cambridge, 24/7 office access in Harvard Square, generous APIs & compute, and speaker events with leading researchers.









Almost all agentic pipelines prompt LLMs to explicitly plan before every action (ReAct), but turns out this isn't optimal for Multi-Step RL 🤔 Why? In our new work we highlight a crucial issue with ReAct and show that we should make and follow plans instead🧵




Behavioral Foundation Models (BFMs) trained with RL are secretly more powerful than we think. BFM’s directly output a policy believed to be near-optimal given any reward function. Our new work shows that they can actually do much better:

🗓️ The IBRL Workshop kicks off tomorrow! 🎉 Join us at @RL_Conference @UAlberta to explore how Inductive Biases can boost 🚀 the performance of RL agents. 📄 Accepted papers: sites.google.com/view/ibrl-work… 📅 Full schedule: sites.google.com/view/ibrl-work… #ReinforcementLearning #RLC2025








