
@qiyang_li @zhiyuan_zhou_ @svlevine This policy was trained with ACFQL/QC-FQL on @LeRobotHF @huggingface on top of HIL-SERL github.com/huggingface/le… linkedin.com/posts/jpizarro…
jpizarrom
596 posts


@qiyang_li @zhiyuan_zhou_ @svlevine This policy was trained with ACFQL/QC-FQL on @LeRobotHF @huggingface on top of HIL-SERL github.com/huggingface/le… linkedin.com/posts/jpizarro…



@qiyang_li @zhiyuan_zhou_ @svlevine This policy was trained with ACFQL/QC-FQL on @LeRobotHF @huggingface on top of HIL-SERL github.com/huggingface/le… linkedin.com/posts/jpizarro…



🤖 early-stage experiment finetuning SmolVLA + RECAP-style advantage signals (inspired on π*0.6 paper) as BC actor + one-step flow actor with AWR on SO100 @LeRobotHF linkedin.com/feed/update/ur… previous experiments ACFQL + HIL-SERL x.com/jpizarrom/stat… x.com/jpizarrom/stat…



🤖 early-stage experiment finetuning SmolVLA + RECAP-style advantage signals (inspired on π*0.6 paper) as BC actor + one-step flow actor with AWR on SO100 @LeRobotHF linkedin.com/feed/update/ur… previous experiments ACFQL + HIL-SERL x.com/jpizarrom/stat… x.com/jpizarrom/stat…


🤖 early-stage experiment finetuning SmolVLA + RECAP-style advantage signals (inspired on π*0.6 paper) as BC actor + one-step flow actor with AWR on SO100 @LeRobotHF linkedin.com/feed/update/ur… previous experiments ACFQL + HIL-SERL x.com/jpizarrom/stat… x.com/jpizarrom/stat…

We're open-sourcing our Earth Rover platform with @huggingface & @sigrobotics! 🤖 Integrated hardware (electronics, software, 3D files) with @LeRobotHF 🌎 7,000 hours of driving data from 40+ cities, curated by UC Berkeley researchers Thread ↓




Humanity is at a turning point. I am launching UMA to build general-purpose mobile and humanoid robots from Europe. Proud to start with people I admired for years, and grateful for all your support! Reach out to us @UMA_Robots ❤️



Everyone knows action chunking is great for imitation learning. It turns out that we can extend its success to RL to better leverage prior data for improved exploration and online sample efficiency! colinqiyangli.github.io/qc/ The recipe to achieve this is incredibly simple. 🧵 1/N
