

Yu Lei
423 posts

@_OutofMemory_
PhD student @UTCompSci | Learn to understand ourselves and build intelligence.🤖🧠👁️







An encoder can be frozen or jointly trained with the backbone, the latter can be trained from scratch or finetuned from a different training stage. I don’t see these are huge distinctions. I think the original post is against the naive pixel patches and claiming to be NMMs.

I love having artists on the team

Submit your CoRL workshop proposal! This year @RLioutikov and I wanted to make the workshop more "workshopy". Main changes are: - Half-day events only - Limited speaker slots - Challenge- and participation-driven - A post-workshop artifact (white paper, report, paper, etc.) summarizing the discussions

SONIC is now open-source! Generalist whole-body teleoperation for EVERYONE! Our team has long been building comprehensive pipelines for whole-body control, kinematic planner, and teleoperation, and they will all be shared. This will be a continuous update; inference code + model already there, training code and gr00t integration coming soon! Code: github.com/NVlabs/GR00T-W… Docs: nvlabs.github.io/GR00T-WholeBod… Site: nvlabs.github.io/GEAR-SONIC/

Zero teleoperation. Zero real-world data. ➔ Autonomous humanoid loco-manipulation in reality. Introducing VIRAL: Visual Sim-to-Real at Scale. We achieved 54 autonomous cycles (walk, stand, place, pick, turn) using a simple recipe: 1. RL 2. Simulation 3. GPUs Website: viral-humanoid.github.io Arxiv: arxiv.org/abs/2511.15200 Deep dive with me: 🧵




RISE (3/N) To address this bottleneck, we introduce RISE: Reinforcement learning via Imagination for SElf-improving robots. RISE shifts the learning environment from physical world to a Compositional World Model, which first emulates future observations for proposed actions, then evaluates imagined states to derive advantage for policy improvement.

@tydsh always enjoy your presentations, whether at workshops or podcasts, as well as your insights on post-training, RSI, and even your sci-fi writing. 🥳🥳🥳 ~ recursive-workshop.github.io #RSI #ICLR2026 #破晓之钟





🤖Co-training is everywhere (sim↔real[e.g. GR00T, LBM], human↔robot[e.g. PI, EgoScale], even non-robot data[e.g. PI, LBM). But why does it work? How can we improve it further? Taking sim-and-real imitation learning in diffusion/ flow-based models as the test bed, we performed a rigorous mechanistic analysis, drawing on theoretical insights and multi-layered experiments. 😮Key insight: it’s all about representations. - Alignment → enables transfer - Discernibility → enables adaptation ⚖️Both are necessary — it's better to have more aligned representations, but the model must be able to discern the domains. We term this as structured representation alignment. ⬇️Let’s take a deep dive into that: Paper: arxiv.org/pdf/2604.13645 Website: science-of-co-training.github.io