Paul-Edouard Sarlin

409 posts

Paul-Edouard Sarlin

@pesarlin

Researcher at @Google, 3D computer vision & machine learning. Previously PhD at ETH Zurich, intern at @Google, @Meta, @Microsoft, @magicleap.

Zurich, Switzerland شامل ہوئے Ocak 2019

466 فالونگ6.6K فالوورز

پن کیا گیا ٹویٹ

Paul-Edouard Sarlin@pesarlin·24 Eyl

Introducing GeoCalib for camera calibration: gravity & intrinsics from a single image 📸 ➡️Differentiable geometry optimization FTW! ➡️Video youtu.be/uOmTwvKreM4 ➡️Demo veichta-geocalib.hf.space ➡️Paper arxiv.org/pdf/2409.06704 by @veichta with @PhilippCSE @mapo1 for #ECCV2024 1/

YouTube

GIF

English

239

30.7K

Paul-Edouard Sarlin ری ٹویٹ کیا

Chris Sweeney@the_shweenz·2d

If only there were a benchmark for egocentric SLAM and VIO.... lamaria.ethz.ch

FPV Labs@fpv_labs

We are publishing our first deep dive on what we believe is one of the most challenging layers in egocentric data - SLAM and VIO in the context of long-horizon state tracking. We break down how SLAM and VIO fail in egocentric settings - visual features vanish at close range, depth sensors saturate, fast head motion blurs frames, and these failures don't always occur in isolation. They hit at the exact same moment, leading to compounding errors and making the downstream data unusable. We believe the foundation for high-quality egocentric data demands sub-centimeter precision over long episodes ranging from a few minutes to up to an hour.

English

11.8K

Paul-Edouard Sarlin@pesarlin·1d

@ddetone @_satyam_ai Awesome! Congrats on the release of Boxer, neat!

English

Daniel DeTone@ddetone·3d

@_satyam_ai @pesarlin GeoCalib looking solid 💪

English

Satyam Kumar@_satyam_ai·3d

Meta recently open-sourced Boxer, a model that lifts 2D bounding boxes into 3D oriented bounding boxes (OBBs) for scene understanding. The catch? It was designed for Aria AR glasses, not regular cameras. So I built a pipeline to make it work with any phone video. The hard part: Boxer expects gravity from Aria's IMU. COLMAP doesn't know "up" from "down." Had to estimate gravity from camera poses and rotate the entire reconstruction. @Meta #ComputerVision #3DReconstruction #MetaAI #SceneUnderstanding

Daniel DeTone@ddetone

Today we release Boxer, a new lightweight approach that lifts open-world 2D bounding boxes to *metric* 3D: facebookresearch.github.io/boxer/ Here we show Boxer in action on an egocentric sequence captured from smart glasses:

English

583

Paul-Edouard Sarlin ری ٹویٹ کیا

Dmytro Mishkin 🇺🇦@ducha_aiki·9 Şub

RaCo: Ranking and Covariance for Practical Learned Keypoints Abhiram Shenoi @PhilippCSE @pesarlin @mapo1 tl;dr: ALIKED arch + DaD RL training with full 360deg rotaug with detector+covariance heads, separate ranker (sorter) model . No IMC eval #3DV2026 openreview.net/forum?id=BWtdg…

English

7.5K

Paul-Edouard Sarlin@pesarlin·14 Oca

There are papers that I love and could spend hours reviewing (like this one, which I unfortunately didn’t get assigned to in peer-review), and others that I would rather not read. I like the bidding system of NeurIPS but it has never worked for me, I always get random papers!

Dmytro Mishkin 🇺🇦@ducha_aiki

github.com/zju3dv/Efficie… This is the depth of conversations between @pesarlin and Yifan Wang, one would dream to see in peer review. I'd dare to say, that is exactly peer review we want to have.

English

5.1K

Paul-Edouard Sarlin ری ٹویٹ کیا

Philipp Lindenberger@PhilippCSE·3 Ara

At #NeurIPS2025, we're presenting our work on Scaling Image Geo-Localization to Continent Level. Website: scaling-geoloc.github.io Paper: arxiv.org/pdf/2510.26795… If you are at the conference, say 👋: 📍 Poster #4812, Dec 3 (Wed), 4:30–7:30 PM PST

English

1.8K

Paul-Edouard Sarlin ری ٹویٹ کیا

Dmytro Mishkin 🇺🇦@ducha_aiki·3 Kas

Scaling Image Geo-Localization to Continent Level Philipp Lindenberger @pesarlin Jan Hosang, Matteo Balice @mapo1 Simon Lynen, Eduard Trulls tl;dr: combine ground image with aerial to get cell-prototype. Acc similar to ~1.5Tb ground photo retrieval @ 42Gb arxiv.org/abs/2510.26795

English

6.4K

Paul-Edouard Sarlin ری ٹویٹ کیا

Gabriele Berton@gabriberton·2 Kas

Google and ETH have joined the large scale localization effort with a banger I really did not expect this And now I'm really hoping to make it to NeurIPS where the paper will be presented I'll read it and report a summary here in the next few days

English

281

18.5K

Paul-Edouard Sarlin ری ٹویٹ کیا

ZurichAI@zurichnlp·31 Eki

ZurichCV#11 will be on November 20th at the @ETH_AI_Center! Paul-Edouard Sarlin (@pesarlin, Google) will talk about 3D Reconstruction for Localization. Nando Metzger (@NandoMetzger, ETH Zurich) will discuss how to create High-Resolution Maps. RSVP below.

English

10.4K

Paul-Edouard Sarlin ری ٹویٹ کیا

CVG@cvg_ethz·19 Eki

#ICCV2025 is starting today 🌺 you can find us at: 🥽 LaMAria Tutorial: 325B, 1:00PM ➡️lamaria.ethz.ch/tutorial ☀️ OpenSUN3D Workshop: 306 B, 1:30PM ➡️opensun3d.github.io @pesarlin @efedele16 @FrancisEngelman @AlexDelitzas @mapo1 @ICCVConference

English

2.1K

Paul-Edouard Sarlin@pesarlin·19 Eki

Kudos to @nushakrishnan , @B1ueber2y @monge_maurizio @rapideRobot @jajuengel @mapo1 & collaborators at @RealityLabs & @ETH_en @cvg_ethz for the amazing work! Check it out and let us know! 7/7

English

452

Paul-Edouard Sarlin@pesarlin·19 Eki

Classical systems rule because they’re efficient (45min @ 30 FPS!) and combine multiple sensors (IMU, camera rig, GPS) in a principled manner. How to achieve this with end-to-end ML models remains an open question - here is a benchmark to help answer it, and data for training 😉

English

454

Paul-Edouard Sarlin@pesarlin·19 Eki

This week at #ICCV2025 we’re presenting LaMAria, a large-scale dataset to push the boundaries visual-inertial SLAM for wearable sensors. Learn more about it in our dedicated tutorial TODAY at ICCV, 1pm HST room 325 B More info: lamaria.ethz.ch 🧵⬇️ 1/

English

998

Paul-Edouard Sarlin ری ٹویٹ کیا

Andrew Davison@AjdDavison·16 Eki

A reminder that accurate motion estimation sparse visual SLAM has been in the domain of industry for many years now, and what you might often see in academic papers as the "state of the art" is fairly meaningless. (From @pesarlin.bsky.social)

Chris Offner@chrisoffner3d

Industry SLAM systems are far ahead of academic open source systems.

English

169

19.9K

Paul-Edouard Sarlin@pesarlin·15 Eki

@chrisoffner3d We might need more water given the size of the fire in 2025 😅

English

935

Paul-Edouard Sarlin ری ٹویٹ کیا