Marwan Taher (@marwan_ptr) - Twitter-Profil | Zamantika Mersobahis Locabet

Angehefteter Tweet

Marwan Taher@marwan_ptr·19 Ara

How can we run reconstruction models like π³ and Depth Anything 3 in real-time? We present KV-Tracker, a training-free approach, for real-time tracking of scenes and objects. Achieving up to 30 FPS! With @alzugarayign, @makezur, @XinKong_IC and @AjdDavison

English

10

97

704

64.7K

Marwan Taher@marwan_ptr·26 Şub

KV-Tracker has been accepted to #CVPR2026!

Marwan Taher@marwan_ptr

How can we run reconstruction models like π³ and Depth Anything 3 in real-time? We present KV-Tracker, a training-free approach, for real-time tracking of scenes and objects. Achieving up to 30 FPS! With @alzugarayign, @makezur, @XinKong_IC and @AjdDavison

Nederlands

2

19

225

16.8K

Marwan Taher@marwan_ptr·19 Ara

More results and the paper can be found here: marwan99.github.io/kv_tracker Video: youtu.be/ZVNnvZZxhoI

YouTube

English

0

1

29

3.1K

Marwan Taher@marwan_ptr·19 Ara

KV-Tracker enables object-level reconstruction and tracking when provided with an object mask. The KV-cache can be saved and used later without any special initialisation procedure.

English

2

0

19

2K

Marwan Taher@marwan_ptr·19 Ara

How can we run reconstruction models like π³ and Depth Anything 3 in real-time? We present KV-Tracker, a training-free approach, for real-time tracking of scenes and objects. Achieving up to 30 FPS! With @alzugarayign, @makezur, @XinKong_IC and @AjdDavison

English

10

97

704

64.7K

Marwan Taher@marwan_ptr·18 Ara

Per-frame geometry from π³ is split into primitives via segmentation and tracked over time using dense 2D point tracks. With a compact per primitive pose, geometry is densely aligned, stitching primitives to create a complete reconstruction of the observed scene components.

Kirill Mazur@makezur

Introducing 4D Primitive-Mâché (4DPM), a new method for replayable 4D reconstruction from monocular videos. We split dynamic scenes into 3D primitives and recover their motion. 4DPM can infer object positions even after they leave view. Joint work with @marwan_ptr @AjdDavison

English

0

1

9

536

Marwan Taher@marwan_ptr·18 Ara

ACE-SLAM naturally handles loop closure without special treatment, robustly deals with dynamic objects, while remaining lightweight (small MLP) and computationally efficient—making this representation compelling for SLAM.

Ignacio Alzugaray@alzugarayign

Excited to present ACE-SLAM, the first neural SLAM to use Scene Coordinate Regression as an implicit map representation Efficient (real-time from live stream), compressive (neural maps <1MB) and robust to dynamic scenes With @marwan_ptr and @AjdDavison ialzugaray.github.io/ace-slam

English

0

2

11

964

Marwan Taher retweetet

Xin Kong@XinKong_IC·10 Eyl

🚀 Excited to share CausNVS: Autoregressive Multi-view Diffusion for Flexible 3D Novel View Synthesis! Let’s recon 3D world generatively. CausNVS handles any number of input views, synthesizes novel views autoregressively, enables interactive streaming and flexible N-to-M NVS.

English

2

18

106

10.2K

Marwan Taher retweetet

Riku Murai@rmurai0610·16 Ara

Introducing MASt3R-SLAM, the first real-time monocular dense SLAM with MASt3R as a foundation. Easy to use like DUSt3R/MASt3R, from an uncalibrated RGB video it recovers accurate, globally consistent poses & a dense map. With @eric_dexheimer*, @AjdDavison (*Equal Contribution)

English

42

255

1.4K

203.2K

Marwan Taher@marwan_ptr·21 Haz

@_ayoungk @rmurai0610

QAM

0

1

433

Ayoung@_ayoungk·21 Haz

3D Gaussian Splatting SLAM demo!

English

9

35

392

63.5K

Marwan Taher@marwan_ptr·21 Haz

@_Sir_MuEl @_ayoungk rmurai.co.uk/projects/Gauss…

QME

2

0

4

101

Samuel@_Sir_MuEl·21 Haz

@_ayoungk What's the name of this paper please?

English

1

0

1

563

Marwan Taher@marwan_ptr·20 Haz

EscherNet will be presented tomorrow at #CVPR. But *now* you can drop a couple of images into our Hugging Face demo to try it out! huggingface.co/spaces/kxic/Es…

Xin Kong@XinKong_IC

Tired of single image to 3D? Check out EscherNet tomorrow @CVPR that can take flexible number of views for 3D generation! THURSDAY, JUNE 20 ORAL: 9:00-10:30, SUMMIT BALLROOM (TOP FLOOR) POSTER: 10:30-12:00, ARCH 4A-E, #69 Try our @Gradio online demo huggingface.co/spaces/kxic/Es…

English

0

6

420

Marwan Taher@marwan_ptr·14 Haz

Don't miss the real-time demo of SuperPrimitives at #CVPR!!

Kirill Mazur@makezur

SuperPrimitives will be presented at #CVPR next week (Wednesday), along with a 𝗿𝗲𝗮𝗹-𝘁𝗶𝗺𝗲 𝗱𝗲𝗺𝗼 on Friday! Our new representation enables dense monocular 3D reconstruction in real-time. No poses required! Project page: makezur.github.io/SuperPrimitive/

English

0

5

379

Marwan Taher retweetet

Aalok Patwardhan@AalokPat·26 Mar

From RGB images we can estimate camera rotation, *without* knowledge of camera intrinsics. This also leads to some cool downstream applications - it can complement an IMU .. we call it U-ARE-ME! @AjdDavison @DoC_Rhodes94 @BaeGwangbin

Gwangbin Bae@BaeGwangbin

𝗜𝗠𝗨? How about 𝗨-𝗔𝗥𝗘-𝗠𝗘? In this work, we show how monocular surface normal cues can be used for rotation estimation. callum-rhodes.github.io/U-ARE-ME/ collab w/ @AalokPat, Callum Rhodes, @AjdDavison

English

1

6

42

4.4K

Marwan Taher@marwan_ptr·7 Mar

Super impressive reconstruction quality!

Kirill Mazur@makezur

SuperPrimitives got accepted to #CVPR2024! The code will be released soon and see you all in Seattle!

English

0

1

3

261

Marwan Taher@marwan_ptr·21 Şub

This allows highly accurate pose estimation even for challenging small and specular objects, which can enable precise manipulation. @ieee_ras_icra (3/3)

English

0

4

716

Marwan Taher@marwan_ptr·21 Şub

Using *RGB* images, a NeRF is trained via Instant-NGP. A depth map of the object is rendered to obtain an initial coarse position estimate. The object model is then fitted to the reconstructed density field via a multi-hypothesis iterative optimization scheme. (2/3)

English

1

0

10

1K

Marwan Taher@marwan_ptr·21 Şub

Excited to announce Fit-NGP which will be presented in #ICRA2024! Fit-NGP accurately estimates 6-DoF object poses (~ 1.6mm) leveraging Instant-NGP's density field. With @alzugarayign & @AjdDavison. Project page: marwan99.github.io/Fit-NGP Video: youtu.be/KQ7yH_em3Qg (1/3)

YouTube

English

4

34

181

35.3K

Marwan Taher

Entdecken