Balakumar Sundaralingam

48 posts

Balakumar Sundaralingam

@balakumar_

Research @NVIDIA | CUDA + Robotics | Robot Manipulation

San Francisco, CA Присоединился Ekim 2018

543 Подписки331 Подписчики

Закреплённый твит

Balakumar Sundaralingam@balakumar_·1 May

cuRobo is now open source under Apache 2.0. cuRoboV2 adds <1 ms GPU-native TSDF/ESDF semantic mapping, whole-body IK/MPC and trajectory optimization for humanoids, ~50 ms torque-limited planning. Report: arxiv.org/abs/2603.05493 Code: nvlabs.github.io/curobo

English

221

23.1K

Balakumar Sundaralingam@balakumar_·23h

Congrats! Truly autonomous robot that cleans messy homes!

Anshuman Kumar@anshuman_builds

@maticrobots shipped its 10,000th robot this week. 🤖 Back in 2018, there was no product - just a small team, a lot of ideas, and an unreasonable amount of optimism. The photos below are from those early days: prototypes, experiments, customer interviews, design reviews, and plenty of things that never made it into the final product. We first started shipping in November 2024. Huge credit to the Engineering and Production teams that made the 10K bots happen! Great engineering gets you to the first unit. Great teams get you to the first 10,000! Still a lot more to be done. 🚀

English

Balakumar Sundaralingam@balakumar_·12 Haz

Open Source Grasp Generation for any gripper, any object. You can pair it with cuRoboV2 & Newton for large scale grasp trajectory generation. Try it out here: github.com/NVlabs/GraspGe…

Adithya Murali@Adithya_Murali_

@NVIDIA is working on one of the hardest problems in Physical AI so you don’t have to: generalist robotic pick-and-place. We are excited to introduce GraspGenX at #CVPR2026—a foundation model for robotic grasping that works out of the box for unknown robots, novel objects, and unseen environments. Unlike Vision-Language-Action (VLA) models or dedicated grasp networks that require expensive, embodiment-specific training, GraspGenX is cross-embodiment and works zero-shot. You simply pass a "robot prompt" alongside an image of the object to generate actions. 🚀 Key Highlights: 1) Scaling: Trained on over 2 Billion 6-DoF grasp rollouts entirely in physics simulation—a dataset size practically impossible to collect via real-world teleoperation. 2) Zero-Shot Transfer: Works out of the box for several common robot grippers widely used across the research community and industry. 3) Built for the Agentic Era: Features native MCP support, client-server architecture, and skills.md, allowing seamless integration into LLM/Agentic robotics workflows. 4) Full Pipeline Integration: Pair it with other open foundation models (like SAM3) and advanced motion solvers like cuRoboV2 for full deployment in entirely unknown environments. If you are currently executing pick-and-place with a VLA or WAM, you can use GraspGenX to generate sim-verified trajectory data and inject it into your pipeline. No need to waste precious real-world engineering hours on data collection for standard manipulation tasks. 🌐Website: graspgenx.github.io 💻Code: github.com/NVlabs/GraspGe… 📄Paper: arxiv.org/abs/2606.00998 📍CVPR Booth: Poster 619 on Jun 6 1:45 session at ExHall F This work was led by the incredible @BeiningH (Princeton), in collaboration with a phenomenal team at NVIDIA: @erwincoumans, @yu_wei_chao, @balakumar_, @clembow, and Stan Birchfield #CVPR2026

English

3.1K

Balakumar Sundaralingam@balakumar_·12 Haz

@wenlong_huang @KDharmarajan123 Congrats Wenlong!

English

113

Wenlong Huang@wenlong_huang·10 Haz

Excited to share that belatedly: - PointWorld (point-world.github.io) received Best Paper Award at CVPR E2E3D Workshop - Dream2Flow (dream2flow.github.io) received Best Paper Award at ICRA SRRA Workshop, led by @KDharmarajan123 - ENACT (enact-embodied-cognition.github.io) received Outstanding Paper Awards at both ICLR World Models Workshop and ICLR Lifelong Agents Workshop, led by @qineng_wang This marks the 7th paper award/finalist among 8 works during my PhD at Stanford so far (some received more than one). Awards are not everything, but it’s intellectually rewarding to pour our best into each work. Feeling extremely grateful and lucky to have amazing collaborators, mentors, and advisors! And heartfelt thanks for the organizers!

Wenlong Huang@wenlong_huang

What if we can simulate an *interactive 3D world*, from a single image, in the wild, in real time? Introducing PointWorld-1B: a large pre-trained 3D world model that predicts env dynamics given RGB-D capture and robot actions. 🌐 point-world.github.io from @Stanford @nvidia

English

204

25.5K

Balakumar Sundaralingam ретвитнул

Bowen Wen@bowenwen_me·3 Haz

Let's meet and chat at #CVPR! Come by our poster or schedule a coffee chat🙌 Fri afternoon: - Fast-FoundationStereo: nvlabs.github.io/Fast-Foundatio… - CARI4D: nvlabs.github.io/CARI4D/ (w/ @XianghuiXie ) Sat morning: - Event6D: chohoonhee.github.io/Event6D/ (w/ Jae-Young)

English

154

8.7K

Balakumar Sundaralingam@balakumar_·6 May

@enguerandvitro Voxels do not have to be slower. We have a very efficient implementation to go from depth to sparse tsdf voxels (includes averaging across views) and dense esdf in curobov2 here: github.com/NVlabs/curobo/…

English

Enguerand / VitroBot@enguerandvitro·26 Nis

Every robotics startup talks about point clouds and neural nets We’re betting on 1 cm³ voxels instead It’s slower, older, and less flashy But on a glass facade with no texture, it’s the only representation that still works Sometimes the best answer isn’t new

English

485

Balakumar Sundaralingam@balakumar_·6 May

@s1wase TIL there is a small baseline stereo rgb global shutter

English

202

Shun Iwase@s1wase·5 May

This is a huge step forward in explicit 3D reasoning in policy learning! I just wish Gemini 305 were used for learning-based stereo depth perception from wrist cameras instead. They share the same camera dimensions, but Gemini 305 supports dual RGB streaming unlike D405 thanks to its hardware design, and it provides this quality of depth maps in real time. orbbec.com/gemini-305/

Jiafei Duan@DJiafei

Most capable generalist robotics models today are closed or at best, open weights. But robotics won’t reach its ChatGPT moment without real openness. That GPT moment was built on years of open tools and datasets such as Python, PyTorch, ImageNet and more, that let researchers inspect, reproduce, and build. Today, we’re introducing MolmoAct 2: a fully open-source action reasoning model for real-world robotics. We rethought and reshaped everything! 🧵👇

English

185

19.4K

Balakumar Sundaralingam@balakumar_·2 May

@s4movar @pablovelagomez1 @s4movar we actually built a tsdf mapper in python in curobov2: x.com/balakumar_/sta…

Balakumar Sundaralingam@balakumar_

@pablovelagomez1 CuRoboV2 implements a TSDF mapper for manipulation (fixed workspace, 5mm voxels, multiple cameras). We designed it for performance (all ops on GPU) and memory efficiency (fp16). This leads to 10x faster rgbd->esdf while using 8x less memory (page 31).

English

eldaniz@s4movar·2 May

@pablovelagomez1 @balakumar_ As far as I know curobo is motion generation library. It also has multiple “world” options (mesh, voxel) and nvblox is one of them.

English

Balakumar Sundaralingam@balakumar_·1 May

English

221

23.1K

Balakumar Sundaralingam@balakumar_·2 May

English

384

Pablo Vela@pablovelagomez1·2 May

@balakumar_ How does this compare to github.com/nvidia-isaac/n… Seems like they’re sort of my going after the same problem?

English

575

Balakumar Sundaralingam@balakumar_·7 Mar

@kevin_zakka Apologies @kevin_zakka, for incorrect mink capabilities. Revision soon with mink changes: Collision, CoM, local IK w/ collision: no → yes Solver: NLLS → QP Out-of-scope capabilities marked "-" instead of "X" Text reframed: retargeting coll. behavior due to GMR's integration.

English

Kevin Zakka@kevin_zakka·6 Mar

(1/3) New motion planning library from NVIDIA (cuRoboV2) just dropped making categorically wrong claims about mink. Their paper says mink has no collision avoidance and no center-of-mass support. Both have shipped since July 2024 (day 1).

English

109

7.4K

Balakumar Sundaralingam@balakumar_·6 Mar

@kevin_zakka We could not find an example in mink that does the human to humanoid link mapping for retargeting. So we had to use GMR's mapping. We use GMR's mapping for all IK solvers (using the same relative weighting across links). Happy to rerun mink if you have better tuned weights.

English

145

Kevin Zakka@kevin_zakka·6 Mar

(2/3) What they actually benchmarked is GMR, a 3rd party retargeting library that uses mink but doesn't enable collision avoidance. They tested someone else's default config, got bad numbers, then concluded the features don't exist. Our G1 humanoid example uses both on the exact robot they test against.

English

Balakumar Sundaralingam ретвитнул

Nathan Ratliff@robot_trainer·17 May

Use both: web.mit.edu/dimitrib/www/L… TO is a Newton step on the Bellman equation. Policies and value functions are "memories" of past solutions; TO should be optimizing over them at inference time. Best of both worlds. Some of the strongest RL methods do this.

Antoine Leeman @ICRA2026@antoine_leeman

Should legged robot use RL or trajectory optimization? #icra2024

English

181

24.3K

Balakumar Sundaralingam ретвитнул

Nathan Ratliff@robot_trainer·14 May

New work on vectorizing geometric fabric controllers for RL workflows at scale. DeXtreme: Fabric Guided Policies (FGP). Policies are hard on hardware. We need low-level controllers at deployment, which means we need them during training. FGPs increase hardware lifetime, enable quick iteration on training and deploying policies, and allow us to inject useful inductive bias into the system.

English

16.1K

Balakumar Sundaralingam@balakumar_·8 Kas

Our code for CUDA accelerated motion generation is out! Supercharge your workflows with fast batched robotics modules, including kinematics, collision queries, optimization, and motion planning. #PyTorch #Nvidia #Robots

NVIDIA AI Developer@NVIDIAAIDev

🤖 cuRobo, a new #CUDA accelerated motion generation toolkit, can solve complex #robotics problems in milliseconds. ⚡ It includes implementations of kinematics, collision checking, numerical and trajectory optimization, and more. 👀 #NVIDIAResearch code nvda.ws/3MxDmNG

English

126

13.7K

Balakumar Sundaralingam ретвитнул

Ankur Handa@ankurhandos·26 Eki

DeXtreme is our new work on scaling sim-to-real for contact-rich manipulation with a vision-based state estimation on a robot hand with the infrastructure we have been developing with Isaac Gym over the past one year. arxiv.org/abs/2210.13702 dextreme.org

English

266

Balakumar Sundaralingam ретвитнул

Yun-Chun Chen@yunchunchen·1 Tem

I'm excited to share our #RSS2022 workshop paper: Neural Motion Fields: Encoding Grasp Trajectories as Implicit Value Functions. w/ @Adithya_Murali_ , @balakumar_ , Wei Yang, @animesh_garg, and Dieter Fox at @nvidia. Paper: arxiv.org/abs/2206.14854 Video: youtu.be/B-pEhT1pi-Q

YouTube

English

117

Balakumar Sundaralingam ретвитнул

Ankur Handa@ankurhandos·31 May

Factory: Fast Contact for Robotic Assembly, our recent work, is a set of simulation methods & robot learning tools for contact-rich interactions for robotic assembly. It will be presented at RSS next month. Paper: arxiv.org/abs/2205.03532 Website: sites.google.com/nvidia.com/fac…

English

347

Balakumar Sundaralingam ретвитнул

Andreas Orthey@andreas_orthey·27 Mar

Update on Robotic Conferences due to Coronavirus: RSS -> virtual conference WAFR -> postponed by one year (to June 2021) ICRA -> decision on April 6th (virtual vs. postpone to end of 2020) IROS -> as planned (Oct 2020) Humanoids -> as planned (Dez 2020)

English

Balakumar Sundaralingam ретвитнул

Siddhartha Srinivasa@siddhss5·8 Oca

I’ve run the Personal Robotics Lab for 15 years now, first @SCSatCMU and now @uwcse. Here’s my personal take on how to build an academic research flywheel. Enjoy! linkedin.com/pulse/academic…

English

Balakumar Sundaralingam ретвитнул

Rodney Brooks@rodneyabrooks·11 Ara

Spent the last two days crisscrossing Mumbai for meetings. All those autonomous miles in Chandler, Arizona are easily going to generalize for AV roll outs here. Yeah, right.

English

108

Открыть

@wenlong_huang @KDharmarajan123 @qineng_wang @XianghuiXie @enguerandvitro @s1wase @s4movar @pablovelagomez1