

Elgce
@BenQingwei
Ph.D. at MMLab, CUHK https://t.co/WpwzUeEBwi


Introducing Gallant: Voxel Grid-based Humanoid Locomotion and Local-navigation across 3D Constrained Terrains 🤖
Project page: gallantloco.github.io
arXiv: arxiv.org/abs/2511.14625

Gallant is, to our knowledge, the first system to run a single policy that handles full-space constraints on a humanoid robot, including ground-level barriers, lateral clutter, and overhead obstacles.

Instead of elevation maps or depth cameras, Gallant uses a voxel grid built directly from raw LiDAR as its perception representation, giving it inherent 3D coverage of the scene. With our custom LiDAR simulation toolkit (github.com/agent-3154/sim…), we model realistic scans, including returns from the robot’s own moving links, which is crucial for sim-to-real transfer.

On the control side, we use a target-based training scheme rather than standard velocity tracking: the robot is given a goal and learns to discover its own velocities and trajectories along the path, so no external high-frequency command stream is needed during deployment.

The policy itself is intentionally lightweight: just a 3-layer CNN + 3-layer MLP (~0.3M params), running onboard on the Unitree G1’s Orin NX at 50 Hz with no extra compute. Training takes about 6 hours on 8× NVIDIA RTX 4090 GPUs. The resulting policy transfers directly to the real robot and achieves a >90% success rate on most tested terrain types.

Gallant is our “half-way” step toward robust perceptive locomotion, a problem we believe remains fundamental for humanoid robots. We’re now working toward closing the gap to near-100% reliability and expanding the pipeline further. Code will be fully released soon. Discussion, feedback, and collaboration are very welcome! 🙌
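For readers curious what a policy this small looks like, here is a minimal PyTorch sketch at the reported shape (3-layer CNN + 3-layer MLP, roughly 0.3M parameters). The grid size, channel widths, and input dimensions are our own assumptions for illustration, not Gallant’s actual configuration:

```python
import torch
import torch.nn as nn

class VoxelPolicy(nn.Module):
    """Illustrative 3-layer 3D CNN + 3-layer MLP, roughly the reported
    ~0.3M-parameter scale under these assumed sizes (not Gallant's config)."""
    def __init__(self, grid=(24, 24, 24), proprio_dim=48, num_joints=29):
        super().__init__()
        self.cnn = nn.Sequential(          # 3 conv layers over the LiDAR voxel grid
            nn.Conv3d(1, 8, 3, stride=2, padding=1), nn.ELU(),
            nn.Conv3d(8, 16, 3, stride=2, padding=1), nn.ELU(),
            nn.Conv3d(16, 32, 3, stride=2, padding=1), nn.ELU(),
            nn.Flatten(),
        )
        with torch.no_grad():              # infer the flattened feature size
            feat = self.cnn(torch.zeros(1, 1, *grid)).shape[1]
        self.mlp = nn.Sequential(          # 3-layer MLP head -> joint targets
            nn.Linear(feat + proprio_dim, 256), nn.ELU(),
            nn.Linear(256, 128), nn.ELU(),
            nn.Linear(128, num_joints),
        )

    def forward(self, voxels, proprio):
        # voxels: (B, 1, 24, 24, 24) occupancy grid built from raw LiDAR returns
        # proprio: (B, 48) joint states, base orientation, and the goal target
        return self.mlp(torch.cat([self.cnn(voxels), proprio], dim=-1))
```

A forward pass this small is trivially cheap at 50 Hz on an Orin NX, which is consistent with the no-extra-compute claim.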

SONIC is now open-source! Generalist whole-body teleoperation for EVERYONE! Our team has long been building comprehensive pipelines for whole-body control, kinematic planning, and teleoperation, and they will all be shared. This will be a continuous release: the inference code and model are already there, with training code and GR00T integration coming soon!
Code: github.com/NVlabs/GR00T-W…
Docs: nvlabs.github.io/GR00T-WholeBod…
Site: nvlabs.github.io/GEAR-SONIC/

We have seen rapid progress in humanoid control: specialist robots can reliably perform agile, acrobatic, yet preset motions. Our singular focus this year: getting generalist humanoids to do real work. To progress toward this goal, we developed SONIC (nvlabs.github.io/GEAR-SONIC/), a Behavior Foundation Model for real-time, whole-body motion generation that supports teleoperation and VLA inference for loco-manipulation. Today, we’re open-sourcing SONIC on GitHub. We are excited to see what the community builds on SONIC and to collectively push humanoid intelligence toward real-world deployment at scale.
🌐 Paper: arxiv.org/abs/2511.07820
📃 Code: github.com/NVlabs/GR00T-W…

🧐Applying world models to improve real-world policies on challenging manipulation tasks used to be considered out of reach. 😌After sustained effort, we’re now seeing encouraging progress. 🚀Thrilled to introduce RISE: Self-Improving Robot Policy with Compositional World Model
opendrivelab.com/kai0-rl/
arxiv.org/abs/2602.11075
RISE is, to our knowledge, the first work to use a world model as an effective learning environment for challenging real-world manipulation, enabling policy improvement on tasks that demand high dynamics, dexterity, and precision. Incredible teamwork with @lin_kunyang111 @francislee2020 @YueXiangyu @HaoZhao_AIRSUN @smch_1127
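To sketch what “world model as learning environment” means mechanically: roll the policy out in imagination and improve it against imagined returns. A hedged illustration follows; every name here (world_model.step, reward_fn) is an assumption for clarity, and RISE’s actual compositional world model and update rule are in the paper:

```python
import torch

def improve_in_imagination(policy, world_model, reward_fn, optimizer,
                           init_obs, horizon=32, iters=200):
    # Hypothetical sketch: roll the policy out inside a learned world model
    # and ascend the imagined return by backpropagating through the rollout.
    # None of these names are RISE's actual API.
    for _ in range(iters):
        obs, ret = init_obs, 0.0
        for _ in range(horizon):
            action = policy(obs)                 # differentiable action
            obs = world_model.step(obs, action)  # imagined next observation
            ret = ret + reward_fn(obs, action)   # accumulate imagined reward
        loss = -ret.mean()                       # maximize imagined return
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
```

The appeal for real-world manipulation is that the expensive, risky rollouts happen in the learned model, while the real robot is only needed to collect data and verify the improved policy.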



Today, we present a step-change in robotic AI @sundayrobotics. Introducing ACT-1: A frontier robot foundation model trained on zero robot data. - Ultra long-horizon tasks - Zero-shot generalization - Advanced dexterity 🧵->


How do you give a humanoid general motion capability? Not just single motions, but all motion?

Introducing SONIC, our new work on supersizing motion tracking for natural humanoid control. We argue that motion tracking is the scalable foundation task for humanoids, so we “supersized” it: 9k+ GPU hours and 100M+ motion frames.

But tracking alone is not enough; we show how to make a useful control system out of it:
- Universal Kinematic Planner: enables game-like gamepad control and high-level teleoperation, just like controlling a character in a game.
- VR Full-Body Teleop: direct, real-time whole-body control by a human wearing a VR headset.
- VR Keypoint Teleop: control the upper body (hands/head) while our planner handles robust locomotion automatically.
- VLA Integration: we connect the motion tracker to Vision-Language-Action (VLA) models for autonomous task execution!

We use a Universal Token Space to UNIFY this command space, turning our robust tracker into a general-purpose, programmable humanoid brain. This is the generalist “System 1” for humanoids. 🚀

Project: nvlabs.github.io/SONIC/
#Humanoids #Robotics #AI #FoundationModels #NVIDIAResearch 🧠🔥
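To make the Universal Token Space idea concrete, here is a minimal, hypothetical sketch (all dimensions and encoder names are our own assumptions, not SONIC’s actual interface): each command source is projected into one shared token space, so the tracker consumes a fixed-shape conditioning signal no matter who is commanding it.

```python
import torch
import torch.nn as nn

class CommandTokenizer(nn.Module):
    # Hypothetical sketch of a unified command-token interface: every input
    # modality (gamepad, VR keypoints, VLA output) is projected into one
    # shared token space that conditions the low-level motion tracker.
    # Dimensions and names are illustrative, not SONIC's actual design.
    def __init__(self, token_dim=64):
        super().__init__()
        self.gamepad_enc = nn.Linear(6, token_dim)       # stick axes + buttons
        self.keypoint_enc = nn.Linear(3 * 3, token_dim)  # hands + head, xyz each
        self.vla_enc = nn.Linear(128, token_dim)         # VLA action embedding

    def forward(self, source, payload):
        # Whatever the source, the tracker only ever sees one token shape.
        if source == "gamepad":
            return self.gamepad_enc(payload)
        if source == "vr_keypoints":
            return self.keypoint_enc(payload.flatten(-2))
        return self.vla_enc(payload)
```

The design benefit of such a scheme is that the tracker is trained once against tokens, and supporting a new command source only requires a new encoder, not retraining the controller.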

Meet BFM-Zero: A Promptable Humanoid Behavioral Foundation Model w/ Unsupervised RL👉 lecar-lab.github.io/BFM-Zero/
🧩ONE latent space for ALL tasks
⚡Zero-shot goal reaching, tracking, and reward optimization (any reward at test time), from ONE policy
🤖Natural recovery & transitions
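One way to read “any reward at test time”: embed sampled states with the model’s representation map, weight the embeddings by the new task’s reward, and use the result as the latent prompt for the single pretrained policy. A hedged sketch under assumed names (backward_map, policy); BFM-Zero’s actual procedure is in the paper:

```python
import torch

def prompt_with_reward(backward_map, reward_fn, states):
    # Hypothetical sketch of test-time reward prompting: embed sampled
    # states, weight the embeddings by the new task's reward, and use the
    # result as the latent that conditions the single pretrained policy.
    with torch.no_grad():
        b = backward_map(states)             # (N, latent_dim) state embeddings
        r = reward_fn(states).unsqueeze(-1)  # (N, 1) rewards for the new task
        z = (r * b).mean(dim=0)              # reward-weighted latent prompt
        return z / z.norm()

# action = policy(obs, z)  # same policy, new task, zero-shot
```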


We are excited to re-introduce our Behavior Foundation Model for Humanoid Robots, built upon a unified perspective of diverse whole-body control (WBC) tasks, as a promising step toward a foundation model for general humanoid control.
🔗Website: bfm4humanoid.github.io
📜Paper: arxiv.org/abs/2509.13780


🚀 3 steps to ace IROS 2025 Nav Track: Setup · Develop · Submit 🦾 📺 We’ve prepared a Quickstart Guide to help you quickly grasp the task, explore the dataset, and submit your model to the leaderboard. 🥇 Winner prize: $10K 📌 internrobotics.shlab.org.cn/challenge/2025/

🚀 Introducing LeVERB, the first 𝗹𝗮𝘁𝗲𝗻𝘁 𝘄𝗵𝗼𝗹𝗲-𝗯𝗼𝗱𝘆 𝗵𝘂𝗺𝗮𝗻𝗼𝗶𝗱 𝗩𝗟𝗔 (upper- & lower-body), trained on sim data and deployed zero-shot. It addresses interactive tasks such as navigation, sitting, and locomotion from verbal instructions. 🧵 ember-lab-berkeley.github.io/LeVERB-Website/
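A hedged sketch of what a “latent” VLA hierarchy typically means here: a slow vision-language model emits a latent command, and a fast whole-body policy decodes it into joint actions at control rate. All names below are illustrative assumptions, not LeVERB’s actual API:

```python
import torch

class LatentVLA:
    # Hypothetical two-rate hierarchy: the VLM runs slowly on images and
    # language, the whole-body controller runs fast on proprioception, and
    # a latent vector is the only interface between them.
    def __init__(self, vlm, wbc_policy):
        self.vlm = vlm            # slow: image + instruction -> latent command
        self.wbc = wbc_policy     # fast: proprioception + latent -> actions
        self.z = None

    def high_level(self, image, instruction):
        self.z = self.vlm(image, instruction)   # e.g. a (64,) latent "verb"

    def low_level(self, proprio):
        return self.wbc(proprio, self.z)        # whole-body joint targets
```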


After a long time, RoboDuet, the first research project of my life, has finally been accepted to RA-L! This is really inspiring for me. RoboDuet is fully open-sourced, including both the training and deployment code. If you're interested, just give it a try!

HOMIE will be presented at #RSS2025 today!
Spotlight Talks: 4:30pm-5:30pm
Poster: 6:30pm-8:00pm, Board #34
@li_yitang will be there to help us present this paper, and I will be online to introduce and discuss it 🥳
Talk video: drive.google.com/file/d/10uYskZ…