Daniel DeTone

495 posts

Daniel DeTone banner
Daniel DeTone

Daniel DeTone

@ddetone

Deep Nets and Geometry — what could go wrong?

Long Beach, CA 가입일 Haziran 2009
662 팔로잉2.1K 팔로워
Daniel DeTone
Daniel DeTone@ddetone·
@nickkarpov Feel free to file a GitHub issue if you have any problems! Will do my best to answer them quickly
English
0
0
2
309
Daniel DeTone
Daniel DeTone@ddetone·
Today we release Boxer, a new lightweight approach that lifts open-world 2D bounding boxes to *metric* 3D: facebookresearch.github.io/boxer/ Here we show Boxer in action on an egocentric sequence captured from smart glasses:
English
17
107
824
37.5K
Daniel DeTone
Daniel DeTone@ddetone·
BoxerNet runs FAST 🔥🔥, taking roughly 20ms on a 4090 with bfloat16 for ALL prompts in an image (e.g. 30 boxes in parallel)
English
1
0
5
563
Daniel DeTone
Daniel DeTone@ddetone·
Giving Claude new skills is the closest thing I’ve felt to this Matrix moment
GIF
English
0
0
1
198
Daniel DeTone
Daniel DeTone@ddetone·
For those interested in 3D perception, check out the Sonata pre-trained backbone. I don’t think I’ll ever co-author another paper that triples performance (22% -> 72%) on a commonly used benchmark (linear probe scannet 3D sem seg). Released with a permissive license too
Xiaoyang Wu@XiaoyangWu_

📢Sonata: Self-Supervised Learning of Reliable Point Representations📢 Meet Sonata, our"3D-DINO" pre-trained with Point Transformer V3, accepted at #CVPR2025! 🌍: xywu.me/sonata 📦: github.com/facebookresear… 🚀: github.com/Pointcept/Poin… 🔹Semantic-aware and spatial reasoning representations learned with no label; 🔹3x linear probing accuracy (from 21.8% to 72.5%) on ScanNet; 🔹2x data efficiency performance with only 1% of the data compared to previous approaches; 🔹As always, establish new SOTA results across indoor and outdoor 3D perception tasks. Our author team: @HengshuangZhao, @jstraub6, @rapideRobot, @ddetone, @NinjaDuncan, @TianweiS, @Christopher_Xie, @NanYang719.

English
0
0
17
930
Daniel DeTone 리트윗함
Xiaoyang Wu
Xiaoyang Wu@XiaoyangWu_·
📢Sonata: Self-Supervised Learning of Reliable Point Representations📢 Meet Sonata, our"3D-DINO" pre-trained with Point Transformer V3, accepted at #CVPR2025! 🌍: xywu.me/sonata 📦: github.com/facebookresear… 🚀: github.com/Pointcept/Poin… 🔹Semantic-aware and spatial reasoning representations learned with no label; 🔹3x linear probing accuracy (from 21.8% to 72.5%) on ScanNet; 🔹2x data efficiency performance with only 1% of the data compared to previous approaches; 🔹As always, establish new SOTA results across indoor and outdoor 3D perception tasks. Our author team: @HengshuangZhao, @jstraub6, @rapideRobot, @ddetone, @NinjaDuncan, @TianweiS, @Christopher_Xie, @NanYang719.
Xiaoyang Wu tweet media
English
4
49
192
30.3K
Daniel DeTone 리트윗함
Boz
Boz@boztank·
The first generation of Aria glasses have made a big impact in the research community, can't wait to see all the new possibilities these will unlock meta.com/blog/project-a…
English
30
75
525
41.3K
Daniel DeTone
Daniel DeTone@ddetone·
What is the right benchmark for a 3D Egocentric Foundation model? We recently open-sourced a small, high quality egocentric benchmark consisting of 1) 3D surfaces 2) 3D objects. We released a simple 3D CNN baseline model called EVL: projectaria.com/research/efm3d/ Try to beat our model!
GIF
English
1
40
227
22K
/
/@gazorp5·
@ddetone Do you plan on releasing the EVL model as well?
English
1
0
0
398
Daniel DeTone
Daniel DeTone@ddetone·
Interested in 3D object detection and egocentric vision? We open sourced a small yet challenging dataset called Aria Everyday Objects (AEO). We use it as one of the tasks for benchmarking a new class of Egocentric 3D Foundation Models we are working on: arxiv.org/abs/2406.10224
GIF
English
14
149
775
56.5K