Balakumar Sundaralingam

48 posts

Balakumar Sundaralingam

Balakumar Sundaralingam

@balakumar_

Research @NVIDIA | CUDA + Robotics | Robot Manipulation

San Francisco, CA Beigetreten Ekim 2018
543 Folgt331 Follower
Balakumar Sundaralingam
Balakumar Sundaralingam@balakumar_·
Congrats! Truly autonomous robot that cleans messy homes!
Anshuman Kumar@anshuman_builds

@maticrobots shipped its 10,000th robot this week. 🤖 Back in 2018, there was no product - just a small team, a lot of ideas, and an unreasonable amount of optimism. The photos below are from those early days: prototypes, experiments, customer interviews, design reviews, and plenty of things that never made it into the final product. We first started shipping in November 2024. Huge credit to the Engineering and Production teams that made the 10K bots happen! Great engineering gets you to the first unit. Great teams get you to the first 10,000! Still a lot more to be done. 🚀

English
0
0
0
63
Balakumar Sundaralingam
Balakumar Sundaralingam@balakumar_·
Open Source Grasp Generation for any gripper, any object. You can pair it with cuRoboV2 & Newton for large scale grasp trajectory generation. Try it out here: github.com/NVlabs/GraspGe…
Adithya Murali@Adithya_Murali_

@NVIDIA is working on one of the hardest problems in Physical AI so you don’t have to: generalist robotic pick-and-place. We are excited to introduce GraspGenX at #CVPR2026—a foundation model for robotic grasping that works out of the box for unknown robots, novel objects, and unseen environments. Unlike Vision-Language-Action (VLA) models or dedicated grasp networks that require expensive, embodiment-specific training, GraspGenX is cross-embodiment and works zero-shot. You simply pass a "robot prompt" alongside an image of the object to generate actions. 🚀 Key Highlights: 1) Scaling: Trained on over 2 Billion 6-DoF grasp rollouts entirely in physics simulation—a dataset size practically impossible to collect via real-world teleoperation. 2) Zero-Shot Transfer: Works out of the box for several common robot grippers widely used across the research community and industry. 3) Built for the Agentic Era: Features native MCP support, client-server architecture, and skills.md, allowing seamless integration into LLM/Agentic robotics workflows. 4) Full Pipeline Integration: Pair it with other open foundation models (like SAM3) and advanced motion solvers like cuRoboV2 for full deployment in entirely unknown environments. If you are currently executing pick-and-place with a VLA or WAM, you can use GraspGenX to generate sim-verified trajectory data and inject it into your pipeline. No need to waste precious real-world engineering hours on data collection for standard manipulation tasks. 🌐Website: graspgenx.github.io 💻Code: github.com/NVlabs/GraspGe… 📄Paper: arxiv.org/abs/2606.00998 📍CVPR Booth: Poster 619 on Jun 6 1:45 session at ExHall F This work was led by the incredible @BeiningH (Princeton), in collaboration with a phenomenal team at NVIDIA: @erwincoumans, @yu_wei_chao, @balakumar_, @clembow, and Stan Birchfield #CVPR2026

English
0
0
26
3.1K
Wenlong Huang
Wenlong Huang@wenlong_huang·
Excited to share that belatedly: - PointWorld (point-world.github.io) received Best Paper Award at CVPR E2E3D Workshop - Dream2Flow (dream2flow.github.io) received Best Paper Award at ICRA SRRA Workshop, led by @KDharmarajan123 - ENACT (enact-embodied-cognition.github.io) received Outstanding Paper Awards at both ICLR World Models Workshop and ICLR Lifelong Agents Workshop, led by @qineng_wang This marks the 7th paper award/finalist among 8 works during my PhD at Stanford so far (some received more than one). Awards are not everything, but it’s intellectually rewarding to pour our best into each work. Feeling extremely grateful and lucky to have amazing collaborators, mentors, and advisors! And heartfelt thanks for the organizers!
Wenlong Huang tweet mediaWenlong Huang tweet mediaWenlong Huang tweet media
Wenlong Huang@wenlong_huang

What if we can simulate an *interactive 3D world*, from a single image, in the wild, in real time? Introducing PointWorld-1B: a large pre-trained 3D world model that predicts env dynamics given RGB-D capture and robot actions. 🌐 point-world.github.io from @Stanford @nvidia

English
14
9
204
25.5K
Enguerand / VitroBot
Enguerand / VitroBot@enguerandvitro·
Every robotics startup talks about point clouds and neural nets We’re betting on 1 cm³ voxels instead It’s slower, older, and less flashy But on a glass facade with no texture, it’s the only representation that still works Sometimes the best answer isn’t new
English
2
0
3
484
Shun Iwase
Shun Iwase@s1wase·
This is a huge step forward in explicit 3D reasoning in policy learning! I just wish Gemini 305 were used for learning-based stereo depth perception from wrist cameras instead. They share the same camera dimensions, but Gemini 305 supports dual RGB streaming unlike D405 thanks to its hardware design, and it provides this quality of depth maps in real time. orbbec.com/gemini-305/
Jiafei Duan@DJiafei

Most capable generalist robotics models today are closed or at best, open weights. But robotics won’t reach its ChatGPT moment without real openness. That GPT moment was built on years of open tools and datasets such as Python, PyTorch, ImageNet and more, that let researchers inspect, reproduce, and build. Today, we’re introducing MolmoAct 2: a fully open-source action reasoning model for real-world robotics. We rethought and reshaped everything! 🧵👇

English
4
16
185
19.4K
eldaniz
eldaniz@s4movar·
@pablovelagomez1 @balakumar_ As far as I know curobo is motion generation library. It also has multiple “world” options (mesh, voxel) and nvblox is one of them.
English
1
0
0
75
Balakumar Sundaralingam
Balakumar Sundaralingam@balakumar_·
@pablovelagomez1 CuRoboV2 implements a TSDF mapper for manipulation (fixed workspace, 5mm voxels, multiple cameras). We designed it for performance (all ops on GPU) and memory efficiency (fp16). This leads to 10x faster rgbd->esdf while using 8x less memory (page 31).
Balakumar Sundaralingam tweet media
English
2
0
7
384
Balakumar Sundaralingam
Balakumar Sundaralingam@balakumar_·
@kevin_zakka Apologies @kevin_zakka, for incorrect mink capabilities. Revision soon with mink changes: Collision, CoM, local IK w/ collision: no → yes Solver: NLLS → QP Out-of-scope capabilities marked "-" instead of "X" Text reframed: retargeting coll. behavior due to GMR's integration.
English
1
0
8
2K
Kevin Zakka
Kevin Zakka@kevin_zakka·
(1/3) New motion planning library from NVIDIA (cuRoboV2) just dropped making categorically wrong claims about mink. Their paper says mink has no collision avoidance and no center-of-mass support. Both have shipped since July 2024 (day 1).
English
5
3
109
7.4K
Balakumar Sundaralingam
Balakumar Sundaralingam@balakumar_·
@kevin_zakka We could not find an example in mink that does the human to humanoid link mapping for retargeting. So we had to use GMR's mapping. We use GMR's mapping for all IK solvers (using the same relative weighting across links). Happy to rerun mink if you have better tuned weights.
English
1
0
1
145
Kevin Zakka
Kevin Zakka@kevin_zakka·
(2/3) What they actually benchmarked is GMR, a 3rd party retargeting library that uses mink but doesn't enable collision avoidance. They tested someone else's default config, got bad numbers, then concluded the features don't exist. Our G1 humanoid example uses both on the exact robot they test against.
English
2
1
28
2K
Balakumar Sundaralingam retweetet
Nathan Ratliff
Nathan Ratliff@robot_trainer·
New work on vectorizing geometric fabric controllers for RL workflows at scale. DeXtreme: Fabric Guided Policies (FGP). Policies are hard on hardware. We need low-level controllers at deployment, which means we need them during training. FGPs increase hardware lifetime, enable quick iteration on training and deploying policies, and allow us to inject useful inductive bias into the system.
English
3
15
71
16.1K
Balakumar Sundaralingam
Balakumar Sundaralingam@balakumar_·
Our code for CUDA accelerated motion generation is out! Supercharge your workflows with fast batched robotics modules, including kinematics, collision queries, optimization, and motion planning. #PyTorch #Nvidia #Robots
NVIDIA AI Developer@NVIDIAAIDev

🤖 cuRobo, a new #CUDA accelerated motion generation toolkit, can solve complex #robotics problems in milliseconds. ⚡ It includes implementations of kinematics, collision checking, numerical and trajectory optimization, and more. 👀 #NVIDIAResearch code nvda.ws/3MxDmNG

English
0
19
126
13.7K
Balakumar Sundaralingam retweetet
Ankur Handa
Ankur Handa@ankurhandos·
DeXtreme is our new work on scaling sim-to-real for contact-rich manipulation with a vision-based state estimation on a robot hand with the infrastructure we have been developing with Isaac Gym over the past one year. arxiv.org/abs/2210.13702 dextreme.org
English
3
40
266
0
Balakumar Sundaralingam retweetet
Ankur Handa
Ankur Handa@ankurhandos·
Factory: Fast Contact for Robotic Assembly, our recent work, is a set of simulation methods & robot learning tools for contact-rich interactions for robotic assembly. It will be presented at RSS next month. Paper: arxiv.org/abs/2205.03532 Website: sites.google.com/nvidia.com/fac…
English
6
62
347
0
Balakumar Sundaralingam retweetet
Andreas Orthey
Andreas Orthey@andreas_orthey·
Update on Robotic Conferences due to Coronavirus: RSS -> virtual conference WAFR -> postponed by one year (to June 2021) ICRA -> decision on April 6th (virtual vs. postpone to end of 2020) IROS -> as planned (Oct 2020) Humanoids -> as planned (Dez 2020)
English
1
10
24
0
Balakumar Sundaralingam retweetet
Rodney Brooks
Rodney Brooks@rodneyabrooks·
Spent the last two days crisscrossing Mumbai for meetings. All those autonomous miles in Chandler, Arizona are easily going to generalize for AV roll outs here. Yeah, right.
English
7
11
108
0