Antoine Cully
640 posts

Antoine Cully
@CULLYAntoine
Professor in Machine Learning and Robotics and Director of the Adaptive and Intelligent Robotics Lab (AIRL) at Imperial College London.

Can large language models (LLMs) act as the imagination of a reinforcement learning (RL) agent? We found that if you let an LLM "dream" - not by hallucinating pixels, but by writing executable Python code - it can create an open-ended curriculum that drives progress in complex, long-horizon worlds. Introducing Dreaming in Code (DiCode). 🧵👇

📢 New PhD Position 📢 We (@_rockt, @borruell, and I) are looking for a PhD student to work at the intersection of open-endedness and game design. The student will be part of the @UCL_DARK lab and funded by @iconicgamesio and UCL. See this doc for a more detailed description of the research direction and candidate expectations: docs.google.com/document/d/1Z7… To apply, please complete this form by January 15: docs.google.com/forms/d/16JGfS…






What if your robot was just missing a childhood to walk faster? Introducing SMOL, for Scaling Mechanical Output over Lifetime, tomorrow (Oct 9) at #ALIFE25! with @CULLYAntoine, @hannah_janmo, and D. Labonte from @EvoBiomech

We're presenting URSA today! Feel free to visit @lisa_coiffard and I at Poster 75 ⚡️ #CoRL2025

🤖What if robots could discover diverse abilities—all without simulation nor extensive human tuning? With @lisa_coiffard, Oscar Pang, @maxencefaldor and @CULLYAntoine, we introduce URSA: an efficient skill discovery algorithm applicable directly on real hardware. #CoRL2025 🧵


Proud to release ShinkaEvolve, our open-source framework that evolves programs for scientific discovery with very good sample-efficiency! 🐙 Paper: arxiv.org/abs/2509.19349 Blog: sakana.ai/shinka-evolve/ Project: github.com/SakanaAI/Shink…







Almost all agentic pipelines prompt LLMs to explicitly plan before every action (ReAct), but turns out this isn't optimal for Multi-Step RL 🤔 Why? In our new work we highlight a crucial issue with ReAct and show that we should make and follow plans instead🧵







