Tim Field

759 posts

Tim Field banner
Tim Field

Tim Field

@nobbis

Building the deployment stack for humanoid robots Prev: Willow Garage, hedge fund quant NZ → London → NYC → SF → Tahoe

Tahoe Katılım Mayıs 2008
694 Takip Edilen5.3K Takipçiler
Tim Field
Tim Field@nobbis·
Building a software stack for humanoid robots. Realtime 3D mapping working. Next: navigation.
English
1
0
9
678
Chris Paxton
Chris Paxton@chris_j_paxton·
Its honestly great that they're doing this and will make it way easier to learn foundation models for manipulation. Shows the transition unitree is undergoing, and that they're taking the challenge of learning for robot manipulation very seriously.
Chris Paxton tweet media
English
11
13
161
11K
Tim Field
Tim Field@nobbis·
@jon_barron Sure (see video) but this isn't their fine-tuned metric depth model – output's unitless with range close to [0, 1] every frame. IMO fitting the depth image to VIO feature points would be more fruitful than asking a monocular system to recover accurate metric depth.
English
1
0
1
284
Jon Barron
Jon Barron@jon_barron·
@nobbis Can you render a video where the range of depths used for the colormap is fixed across frames, instead of the min/max of the depths of each frame? Or is that what you rendered here, and it's just temporally unstable?
English
1
0
2
1.4K
Tim Field
Tim Field@nobbis·
Trying out realtime video-to-depth on an iPhone 15 using Depth Anything V2.
English
8
12
153
21.9K
Tim Field
Tim Field@nobbis·
@Kellyv_ai Not built-in. I wrote a few lines to filter the depth image through a colormap.
English
2
0
0
122
KellyV
KellyV@Kellyv_ai·
@nobbis I successfully ran it on the iPhone 12 Pro Max, but why is it in black and white? How do I set it to a colored depth map?
English
1
0
0
149
Josh Leverette
Josh Leverette@coder543·
@nobbis @bingyikang @pcuenq So are you using depth anything V1 or V2? The sample app is V1, and I haven’t seen anyone publish a V2 coreml conversion yet. I was thinking about trying to do that conversion soon.
English
1
0
0
82
amit
amit@gravicle·
@jon_barron @karanganesan @vokaysh 3-5x lower per sec than others AND more happens in the seconds Dream Machine generates. AI is about abundance. Until we can offer unlimited, we will try to offer as much as possible.
English
2
0
5
251
Tim Field
Tim Field@nobbis·
@iwamah1 I believe Area Mode is the new "ignoreBoundingBox" processing option in Object Capture. The documentation says: "Ignores any bounding box information embedded in the input images and instead returns all possible geometry that can be automatically estimated using the image set."
English
0
1
3
1.6K
iwamah
iwamah@iwamah1·
今年も...今年もApple純正3Dスキャン機能が強化されてる...! 予想通り物から空間に変わってきましたね Object Captureのエリアモード...一体どんな仕組みなんだ⁉️ #WWDC
iwamah tweet media
日本語
1
16
108
8.4K
Tim Field
Tim Field@nobbis·
@iwamah1 We deliberately keep the price unchanged for continuing Metascan subscribers as a way of thanking early supporters. If you change your subscription, then you'll pay the current price. App Store developers can increase prices by up to 50% once a year (see support.apple.com/en-us/109501)
English
1
0
4
203
iwamah
iwamah@iwamah1·
MetascanとPolycamをサービス初期から永遠とサブスクしている訳なんですが、AppStoreの表示金額が何故か初期金額から変更されていないの謎すぎる 内部処理的にどうなっているんですかね...?
iwamah tweet media
日本語
2
0
11
3.2K
Tim Field
Tim Field@nobbis·
@shlykur @Metascan3D It would be great to see – but can't think of how to do that (short of spamming everyone's email.) Adding a RFS (Request for Scan) feature could be interesting. Not sure how it'd work.
English
0
0
1
79
Eric Schleicher
Eric Schleicher@shlykur·
Is there a way to request a scan from @Metascan3D users network to get a raw scan of an AVP? it would be great to see one up close in 3d.
English
1
0
0
119
Tim Field
Tim Field@nobbis·
@bilawalsidhu I bet someone will replace the SHs in 3DGS with a tiny MLP, keep the rest, and say, "Look, it was always a NeRF!"
English
1
0
1
211
Tim Field
Tim Field@nobbis·
@bilawalsidhu Turns out NeRFs were just a bunch of overly complicated tricks (positional encoding, space warping, coarse-to-fine, hashtables, view-dependent color with tiny neural nets, cone tracing, etc.) that aren't necessary.
English
1
0
4
306
Tim Field
Tim Field@nobbis·
"It's just differentiable rendering. Been doing that since Neural Volumes/NeRF in 2019." - CV folks "It uses seed & diffuse 3D reconstruction. PatchMatch Stereo showed that in 2011." - MVS folks "Hah, it draws Gaussian splats! Try EWA splatting in 2002." - CG folks #3DGS
English
1
1
27
6.8K
Tim Field
Tim Field@nobbis·
@Azadux @BartronPolygon Yeah, 3DGS “the software” must be licensed, but a clean room implementation of the algorithm is fine. In practice, startups basically ignore that, e.g. we know Luma employees are looking at 3DGS because they were digging into Polycam’s fork of it.
English
0
0
1
229
amit
amit@gravicle·
@Azadux Nope! Our implementation built on top of the whole Luma pipeline.
English
1
0
14
749