Daniel DeTone (@ddetone) - Twitter 프로필 | Zamantika Mersobahis Locabet

Daniel DeTone@ddetone·9h

@nickkarpov Feel free to file a GitHub issue if you have any problems! Will do my best to answer them quickly

English

0

2

309

Nick Karpov@nickkarpov·10h

@ddetone Awesome work, going to use this

English

1

0

2

338

Daniel DeTone@ddetone·11h

Today we release Boxer, a new lightweight approach that lifts open-world 2D bounding boxes to *metric* 3D: facebookresearch.github.io/boxer/ Here we show Boxer in action on an egocentric sequence captured from smart glasses:

English

17

107

824

37.5K

Daniel DeTone@ddetone·11h

For more details, check out the arxiv paper here: arxiv.org/abs/2604.05212

English

0

2

6

559

Daniel DeTone@ddetone·11h

BoxerNet runs FAST 🔥🔥, taking roughly 20ms on a 4090 with bfloat16 for ALL prompts in an image (e.g. 30 boxes in parallel)

English

1

0

5

563

Daniel DeTone@ddetone·28 Oca

Giving Claude new skills is the closest thing I’ve felt to this Matrix moment

GIF

English

0

1

198

Daniel DeTone@ddetone·19 Oca

Check out our recent work ShapeR, a generative model which gets high quality object 3D meshes from Aria glasses

Yawar Siddiqui@yawarnihal

Introducing ShapeR, a method for robust conditional 3D shape generation from casually captured sequences. ShapeR leverages a rectified flow transformer conditioned on per-object multimodal data to turn casual image sequences into full metric scene reconstructions. Project Page: facebookresearch.github.io/ShapeR Paper: arxiv.org/abs/2601.11514 Links to code and huggingface below ⬇️

English

1

0

18

2K

Daniel DeTone@ddetone·24 Mar

For those interested in 3D perception, check out the Sonata pre-trained backbone. I don’t think I’ll ever co-author another paper that triples performance (22% -> 72%) on a commonly used benchmark (linear probe scannet 3D sem seg). Released with a permissive license too

Xiaoyang Wu@XiaoyangWu_

📢Sonata: Self-Supervised Learning of Reliable Point Representations📢 Meet Sonata, our"3D-DINO" pre-trained with Point Transformer V3, accepted at #CVPR2025! 🌍: xywu.me/sonata 📦: github.com/facebookresear… 🚀: github.com/Pointcept/Poin… 🔹Semantic-aware and spatial reasoning representations learned with no label; 🔹3x linear probing accuracy (from 21.8% to 72.5%) on ScanNet; 🔹2x data efficiency performance with only 1% of the data compared to previous approaches; 🔹As always, establish new SOTA results across indoor and outdoor 3D perception tasks. Our author team: @HengshuangZhao, @jstraub6, @rapideRobot, @ddetone, @NinjaDuncan, @TianweiS, @Christopher_Xie, @NanYang719.

English

0

17

930

Daniel DeTone 리트윗함

Xiaoyang Wu@XiaoyangWu_·21 Mar

📢Sonata: Self-Supervised Learning of Reliable Point Representations📢 Meet Sonata, our"3D-DINO" pre-trained with Point Transformer V3, accepted at #CVPR2025! 🌍: xywu.me/sonata 📦: github.com/facebookresear… 🚀: github.com/Pointcept/Poin… 🔹Semantic-aware and spatial reasoning representations learned with no label; 🔹3x linear probing accuracy (from 21.8% to 72.5%) on ScanNet; 🔹2x data efficiency performance with only 1% of the data compared to previous approaches; 🔹As always, establish new SOTA results across indoor and outdoor 3D perception tasks. Our author team: @HengshuangZhao, @jstraub6, @rapideRobot, @ddetone, @NinjaDuncan, @TianweiS, @Christopher_Xie, @NanYang719.

English

4

49

192

30.3K

Daniel DeTone 리트윗함

Boz@boztank·27 Şub

The first generation of Aria glasses have made a big impact in the research community, can't wait to see all the new possibilities these will unlock meta.com/blog/project-a…

English

30

75

525

41.3K

Daniel DeTone@ddetone·3 Eki

@ZMurez @gazorp5 heres more details on the EVL model you were asking about

English

0

1

556

Daniel DeTone@ddetone·3 Eki

cc @ZMurez remember 3D surfaces? :)

English

1

0

1

587

Daniel DeTone@ddetone·3 Eki

What is the right benchmark for a 3D Egocentric Foundation model? We recently open-sourced a small, high quality egocentric benchmark consisting of 1) 3D surfaces 2) 3D objects. We released a simple 3D CNN baseline model called EVL: projectaria.com/research/efm3d/ Try to beat our model!

GIF

English

1

40

227

22K

Daniel DeTone@ddetone·3 Eki

@gazorp5 yes, will make a post on that today

English

1

0

2

279

/@gazorp5·3 Eki

@ddetone Do you plan on releasing the EVL model as well?

English

1

0

398

Daniel DeTone@ddetone·2 Eki

Interested in 3D object detection and egocentric vision? We open sourced a small yet challenging dataset called Aria Everyday Objects (AEO). We use it as one of the tasks for benchmarking a new class of Egocentric 3D Foundation Models we are working on: arxiv.org/abs/2406.10224