Paul-Edouard Sarlin

409 posts

@pesarlin

Researcher at @Google, 3D computer vision & machine learning. Previously PhD at ETH Zurich, intern at @Google, @Meta, @Microsoft, @magicleap.

Zurich, Switzerland · Joined January 2019
466 Following · 6.6K Followers
Paul-Edouard Sarlin retweeted
Satyam Kumar @_satyam_ai
Meta recently open-sourced Boxer, a model that lifts 2D bounding boxes into 3D oriented bounding boxes (OBBs) for scene understanding. The catch? It was designed for Aria AR glasses, not regular cameras. So I built a pipeline to make it work with any phone video. The hard part: Boxer expects gravity from Aria's IMU. COLMAP doesn't know "up" from "down." Had to estimate gravity from camera poses and rotate the entire reconstruction. @Meta #ComputerVision #3DReconstruction #MetaAI #SceneUnderstanding
Daniel DeTone @ddetone

Today we release Boxer, a new lightweight approach that lifts open-world 2D bounding boxes to *metric* 3D: facebookresearch.github.io/boxer/ Here we show Boxer in action on an egocentric sequence captured from smart glasses:

Paul-Edouard Sarlin @pesarlin
There are papers that I love and could spend hours reviewing (like this one, which I unfortunately didn’t get assigned to in peer review), and others that I would rather not read. I like the bidding system of NeurIPS, but it has never worked for me: I always get random papers!
Dmytro Mishkin 🇺🇦 @ducha_aiki

github.com/zju3dv/Efficie… This is the depth of conversation between @pesarlin and Yifan Wang that one would dream of seeing in peer review. I'd dare say that this is exactly the peer review we want to have.

Paul-Edouard Sarlin retweeted
Dmytro Mishkin 🇺🇦 @ducha_aiki
Scaling Image Geo-Localization to Continent Level. Philipp Lindenberger, @pesarlin, Jan Hosang, Matteo Balice, @mapo1, Simon Lynen, Eduard Trulls. tl;dr: combine ground image with aerial to get cell-prototype. Accuracy similar to ~1.5Tb ground photo retrieval @ 42Gb. arxiv.org/abs/2510.26795
Paul-Edouard Sarlin retweeted
Gabriele Berton @gabriberton
Google and ETH have joined the large-scale localization effort with a banger. I really did not expect this. And now I'm really hoping to make it to NeurIPS, where the paper will be presented. I'll read it and report a summary here in the next few days.
Paul-Edouard Sarlin retweeted
ZurichAI @zurichnlp
ZurichCV#11 will be on November 20th at the @ETH_AI_Center! Paul-Edouard Sarlin (@pesarlin, Google) will talk about 3D Reconstruction for Localization. Nando Metzger (@NandoMetzger, ETH Zurich) will discuss how to create High-Resolution Maps. RSVP below.
Paul-Edouard Sarlin @pesarlin
Classical systems rule because they’re efficient (45min @ 30 FPS!) and combine multiple sensors (IMU, camera rig, GPS) in a principled manner. How to achieve this with end-to-end ML models remains an open question - here is a benchmark to help answer it, and data for training 😉
Paul-Edouard Sarlin @pesarlin
This week at #ICCV2025 we’re presenting LaMAria, a large-scale dataset to push the boundaries of visual-inertial SLAM for wearable sensors. Learn more about it in our dedicated tutorial TODAY at ICCV, 1pm HST, room 325 B. More info: lamaria.ethz.ch 🧵⬇️ 1/