ud
@uddupa
co-founder @astrm_labs // deep dabbler // hard-tech & bio-hacking

Becoming successful is not luck. It's math. If your probability of success is 1/100 and you try 100 times, you have about a 63% chance of succeeding at least once; keep trying and the odds only climb.
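Concretely, the chance of at least one success in n independent tries with per-try probability p is 1 − (1 − p)^n. A quick sketch:

```python
# Probability of at least one success in n independent trials,
# each with success probability p: 1 - (1 - p)**n.
def p_at_least_one(p: float, n: int) -> float:
    return 1.0 - (1.0 - p) ** n

print(round(p_at_least_one(0.01, 100), 3))  # -> 0.634
```

So 100 tries at 1/100 gets you to roughly two-thirds odds, not certainty, though the probability does approach 1 as the number of tries grows.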

@uddupa @astrm_labs Crazy, how can I learn and get into SLAM?

micro drones. we cooked a new streaming visual slam @astrm_labs.



I've migrated the old Mast3r-SLAM example I made last year to the latest version of @rerundotio and made a bunch of improvements! I wanted to spend some time with agents to modernize it. Here's an example of me walking around with my iPhone and getting a dense reconstruction at about 10 FPS on a 5090. Here are the improvements I made.

Brought it into the monorepo with proper packaging:
• Using @prefix_dev pixi-build to get rid of all the mast3r/asmk/lietorch vendored code with just a few small patches. This let me remove some 60k lines of code from the repo!
• No longer have to build the lietorch code on my machine, which was taking ~10 minutes to compile (and this also made it work on Blackwell when it previously did not)

Rebuilt the @Gradio interface:
• Fixed incremental updates, .MOV uploads, and stop behavior
• Made the CLI + Gradio interface share the same entry point so updates automatically propagate

Upgraded the @rerundotio integration:
• Switched to a multiprocessing async logging strategy
• Added video/pointmap/confidence logging
• Improved the blueprint layout and hid noisy entities from the 3D view
• The biggest perf win was the async background logger: I documented about a ~2.5x speedup from decoupling logging from tracking

The newest and most interesting part was my attempt to replace the CUDA kernels for Gauss-Newton ray matching with a @Modular Mojo backend. As a Python dev, every time I look at CUDA code I basically shy away, as it's pretty difficult for me to understand. Mojo let me rewrite the matching logic in a syntax I'm more comfortable with while still getting near-CUDA performance. Mojo is now the default matching backend with a CUDA fallback. One major piece that's still missing is the custom PyTorch op path, but I'll eventually do that as well.
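The async-logging win is easy to sketch: the tracking loop pushes records onto a queue and returns immediately, while a background worker drains the queue and does the slow logging. This is a hypothetical illustration of that decoupling, not the project's actual code; the real version uses a separate process and @rerundotio's logging APIs, while a thread keeps this sketch self-contained:

```python
import queue
import threading

log_queue: "queue.Queue" = queue.Queue()
logged = []  # stand-in for actual logging calls (e.g. rr.log(...) in rerun)

def background_logger():
    # Drain the queue until a None sentinel arrives, doing the (slow)
    # logging work off the tracking hot path.
    while True:
        item = log_queue.get()
        if item is None:
            break
        logged.append(item)  # the real worker would log to rerun here

worker = threading.Thread(target=background_logger, daemon=True)
worker.start()

# Tracking loop: enqueue and move on, never blocking on logging.
for frame_id in range(5):
    log_queue.put({"frame": frame_id})

log_queue.put(None)  # signal shutdown
worker.join()
print(len(logged))  # -> 5, all frames logged in the background
```

The tracker only ever pays the cost of a queue put, which is where the reported ~2.5x speedup comes from: frame tracking no longer waits on serialization and transport.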
I heavily leaned on Claude Code to do the CUDA → Mojo migration, and I have no doubt it's not the cleanest or most idiomatic, BUT it's way more readable for me and helps me better understand the underlying algorithm. This was a ton of work, and a large part of why I'm doing it is how the monorepo compounds. This becomes an artifact for the next example I want to build with Claude that I can point to, which will make it even faster to implement. The compounding nature of this is really interesting and part of why I'm spending so much time trying to make things nice and readable.
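For anyone curious what the Gauss-Newton part of the matching backend is actually doing, here's a toy single-parameter version in plain Python. This is purely illustrative (the real solver works over ray-matching residuals in Mojo/CUDA, not a scalar curve fit); it fits a in y = exp(a·x) by iterating the Gauss-Newton update a ← a − (JᵀJ)⁻¹Jᵀr:

```python
import math

# Synthetic data from ground truth a = 0.5
xs = [0.0, 0.5, 1.0, 1.5, 2.0]
ys = [math.exp(0.5 * x) for x in xs]

a = 0.0  # initial guess
for _ in range(20):
    # Residuals r_i = exp(a x_i) - y_i and Jacobian J_i = dr_i/da = x_i exp(a x_i)
    r = [math.exp(a * x) - y for x, y in zip(xs, ys)]
    J = [x * math.exp(a * x) for x in xs]
    # One-parameter Gauss-Newton step: a -= (J^T r) / (J^T J)
    num = sum(j * ri for j, ri in zip(J, r))
    den = sum(j * j for j in J)
    a -= num / den

print(round(a, 6))  # converges to 0.5
```

The same structure, residuals, a Jacobian, and a normal-equations solve, is what the CUDA kernels (and now the Mojo backend) compute per iteration, just over many more parameters and residuals in parallel.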