

Vishnu Sharma
149 posts

@VishnuDSharma
Researcher @BellLabs Past: PhD @umdcs @RAASumd, @comcast @AmexIndia @iitKgp








Meet MapAnything – a transformer that directly regresses factored metric 3D scene geometry (from images, calibration, poses, or depth) in an end-to-end way. No pipelines, no extra stages. Just 3D geometry & cameras, straight from any type of input, delivering new state-of-the-art results 🚀 One universal model enables SoTA for: 🔥 Mono Depth Estimation 🔥 Multi-View SfM 🔥 Multi-View Stereo 🔥 Depth Completion 🔥 Registration … and many more possibilities! – plus everything is metric 🎯 We release code for data processing, training, benchmarking & ablations – everything Apache 2.0! Details & Links 👇






🎉 Excited to share that our paper Sketch-to-Skill is accepted to RSS 2025! Congratulations and huge thanks to my amazing collaborators Peihong Yu, @singhanukriti, @zahir_mahammad, and my advisor @PratapTokekar Let me walk you through it. 🧵👇 #RSS2025







How a 40-Year-Old Trick Solves Seamless Image Blending Laplacian pyramid blending is a simple yet effective tool for many applications, including object composition, seamless panorama stitching, and exposure fusion. Let’s learn this classic method that still works so well today.





