Wenhao Yu
@Stacormed
Research Scientist @DeepMind
60 posts · 188 Following · 634 Followers
Joined March 2011
Wenhao Yu reposted
Anirudha Majumdar @Majumdar_Ani·
Generalist robots need a generalist evaluator. But how do you test safety without breaking things? 💥 🌎 Introducing our new work from @GoogleDeepMind: Evaluating Gemini Robotics Policies in a Veo World Simulator veo-robotics.github.io 🧵👇
Wenhao Yu reposted
Caden Lu @jyluxx·
Interacting with Gemini Robotics 1.5 is so fun! Our Embodied Reasoning model planned the multi-step task and orchestrated our Vision Language Action model for precise execution!
Wenhao Yu @Stacormed·
Gemini Robotics 1.5 is not only general, but also fairly dexterous! Enjoy some fun videos of the robot doing insertion, zipping, and more (remember, this is the *same checkpoint* that also controls two other, very different robots) 😆
Wenhao Yu @Stacormed·
Excited to share our latest work on Gemini Robotics 1.5! Our model can effectively learn from experience of drastically different robots, think on its own, and act as an agent. It’s an important step towards creating a general, intelligent, and friendly robot!
Google DeepMind@GoogleDeepMind

We’re making robots more capable than ever in the physical world. 🤖 Gemini Robotics 1.5 is a levelled up agentic system that can reason better, plan ahead, use digital tools such as @Google Search, interact with humans and much more. Here’s how it works 🧵

Wenhao Yu @Stacormed·
How can we train and apply world models as a step towards modeling the physical world? Come join us at the ICML 2025 workshop on Building Physically Plausible World Models to learn from top experts and share your own research and insights! physical-world-modeling.github.io
Wenhao Yu reposted
Yixin Lin @yixin_lin_·
Complementary to Gemini Robotics -- the massive vision-language-action (VLA) model released yesterday -- we also investigated how far we can push Gemini for robotics _purely from simulation data_ in Proc4Gem: 🧵
Wenhao Yu reposted
Sundar Pichai @sundarpichai·
We’ve always thought of robotics as a helpful testing ground for translating AI advances into the physical world. Today we’re taking our next step in this journey with our newest Gemini 2.0 robotics models. They show state of the art performance on two important benchmarks - generalization and embodied reasoning - which enable robots to draw from Gemini’s multimodal understanding of the world to make changes on the fly + adapt to their surroundings. This milestone lays the foundation for the next generation of robotics that can be helpful across a range of applications.
Wenhao Yu @Stacormed·
Got ideas for bimanual robots tackling real-world challenges? Check out the WBCD (What Bimanuals Can Do) competition at ICRA 2025! We have physical robots, realistic tasks, and amazing prizes for those who extend the boundary of what robots can do!
Wenhao Yu @Stacormed·
Wow, this is really good! In some ways I'm more impressed that it's teleoperated than if it were autonomous, because it feels very plausible to develop a highly specialized RL-based policy to do this, but being able to teleop it opens up a wide range of data to be collected.
Tesla Optimus@Tesla_Optimus

Got a new hand for Black Friday

Wenhao Yu @Stacormed·
How can we leverage the common-sense knowledge of a VLM to understand the progress (and even quality!) of a robotics trajectory? Check out GVL for a surprisingly simple and elegant way to do that! Awesome work by Jason!
Jason Ma@JasonMa2020

Excited to finally share Generative Value Learning (GVL), my @GoogleDeepMind project on extracting universal value functions from long-context VLMs via in-context learning! We discovered a simple method to generate zero-shot and few-shot values for 300+ robot tasks and 50+ datasets using SOTA VLMs like Gemini (Try out the demo on our website on your robot video today!) I worked a lot on leveraging foundation models as guidance for robots in my PhD, and to me, this result forges a new frontier in how we can use foundation models for robot learning, given its broad applicability independent of embodiment and task types. Quite excited about how we can build on this work as a community!
