Adam W. Harley

272 posts

@AdamWHarley

Research Scientist at Meta. I did a postdoc at Stanford and PhD at CMU. I work on computer vision and machine learning.

Redmond, WA · Joined August 2020

128 Following · 2.3K Followers

Pinned Tweet

Adam W. Harley @AdamWHarley · 1 Jul

We made a @gradio demo for AllTracker! AllTracker is the current state-of-the-art for general-purpose point tracking. The demo gives a good sense of the accuracy---try your own videos and see for yourself! 🔗 Demo: huggingface.co/spaces/aharley… 💻 Code: github.com/aharley/alltra…

Adam W. Harley retweeted

Xindi Wu @cindy_x_wu · 20 Jan

New #NVIDIA Paper We introduce Motive, a motion-centric, gradient-based data attribution method that traces which training videos help or hurt video generation. By isolating temporal dynamics from static appearance, Motive identifies which training videos shape motion in video generation. 🔗 research.nvidia.com/labs/sil/proje… 1/10

Adam W. Harley retweeted

Yawar Siddiqui @yawarnihal · 19 Jan

Introducing ShapeR, a method for robust conditional 3D shape generation from casually captured sequences. ShapeR leverages a rectified flow transformer conditioned on per-object multimodal data to turn casual image sequences into full metric scene reconstructions. Project Page: facebookresearch.github.io/ShapeR Paper: arxiv.org/abs/2601.11514 Links to code and huggingface below ⬇️

Adam W. Harley retweeted

Dima Damen @dimadamen · 31 Oct

📢 New Paper PointSt3R: Point Tracking through 3D Grounded Correspondence Can point tracking be reformulated solely as pairwise frame correspondence? We fine-tune MASt3R with dynamic correspondences and a visibility loss and achieve competitive point tracking results 1/3

Adam W. Harley retweeted

mattie ✨ @mattierialgirl · 27 Oct

Generative Point Tracking with Flow Matching My latest project with @AdamWHarley @CSProfKGD @DerekRenderling @chrisjpal Project page: mtesfaldet.net/genpt_projpage/ Paper: arxiv.org/abs/2510.20951 Code: github.com/tesfaldet/genpt

Adam W. Harley retweeted

Kosta Derpanis (sabbatical in Munich 🇩🇪) @CSProfKGD · 24 Oct

Happy 80th birthday to the legend Takeo Kanade! 🐐 Thanks @katerinafrag for sharing!

Adam W. Harley @AdamWHarley · 27 Jul

@CSProfKGD @2prime_PKU Must be a different Adam!

Kosta Derpanis (sabbatical in Munich 🇩🇪) @CSProfKGD · 26 Jul

@2prime_PKU Are you a co-author @AdamWHarley? 😂

Yiping Lu @2prime_PKU · 25 Jul

Anyone knows adam?

Adam W. Harley @AdamWHarley · 12 Jul

Yup! But I wouldn't say "single shot" -- AllTracker, inheriting from CoTracker and PIPs and RAFT, is an iterative approach. In practice, we iterate 4x to get the final answer (and this sums to 11 FPS at 576x1024).

Wildminder @wildmindai

AllTracker. Hi-res, dense point tracking across long video ranges in a single shot. Tracks every pixel. Fast, lightweight (16M params). Sparse tracking is obsolete. alltracker.github.io

Adam W. Harley @AdamWHarley · 5 Jul

Alternate sampling rates.

Adam W. Harley @AdamWHarley · 5 Jul

The visualizer in the AllTracker repo is now GPU-based (much faster), with new options to improve clarity. Very satisfying to see 3D shape "pop out" from the tracking. github.com/aharley/alltra…

Adam W. Harley @AdamWHarley · 4 Jul

@cam_sentinel @_akhaliq @Gradio Real-time version of this should be doable very soon. On-device is possible, but depends on the device of course. The model shown here is only 16M parameters (66 MB), but needs a GPU.

Cameron Sentinel @cam_sentinel · 4 Jul

@AdamWHarley @_akhaliq @Gradio Impressive advance. How do you see point tracking optimizing for real-time, on-device inference, especially where cloud connectivity is intermittent or data locality is critical?

Adam W. Harley @AdamWHarley · 3 Jul

@georgtrof @Gradio Well, it's a great failure case for me to stare at, so thanks for sharing! I think point tracking should eventually be the backbone of pose estimation.

George Trofimov 🚀 @georgtrof · 2 Jul

@AdamWHarley @Gradio I think it looks better now. There are still minor issues but it's more a pose estimation problem than tracking.

Adam W. Harley @AdamWHarley · 2 Jul

@georgtrof @Gradio I was thinking static ROI yes (e.g., just crop the video with ffmpeg). And yes, blur and self-occlusion make it harder for the model.
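
The static-ROI suggestion above (crop the video with ffmpeg, optionally resizing afterwards) can be sketched as follows. This is a minimal illustration, not something taken from the thread: the helper name `build_crop_command`, the file names, and the window coordinates are all hypothetical. It only assembles the argument list, using ffmpeg's `crop=w:h:x:y` and `scale=w:h` video filters, and leaves running it to the reader.

```python
import shlex


def build_crop_command(src, dst, width, height, x, y, scale_to=None):
    """Build an ffmpeg argument list that crops a fixed window from every frame.

    (x, y) is the top-left corner of the window in pixels. If scale_to is a
    (width, height) tuple, the cropped frames are resized to that size.
    All names and values here are illustrative assumptions.
    """
    if width <= 0 or height <= 0:
        raise ValueError("crop width and height must be positive")
    if x < 0 or y < 0:
        raise ValueError("crop offsets must be non-negative")

    # ffmpeg's crop filter takes out_w:out_h:x:y.
    filters = [f"crop={width}:{height}:{x}:{y}"]
    if scale_to is not None:
        out_w, out_h = scale_to
        # Chain a scale filter after the crop to resize the region of interest.
        filters.append(f"scale={out_w}:{out_h}")

    return ["ffmpeg", "-i", src, "-vf", ",".join(filters), dst]


# Hypothetical usage: crop a 640x480 window whose corner is at (200, 100).
cmd = build_crop_command("input.mp4", "roi.mp4", 640, 480, 200, 100)
print(shlex.join(cmd))
```

Passing the list to `subprocess.run(cmd, check=True)` would execute it, assuming ffmpeg is installed and on the PATH.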

George Trofimov 🚀 @georgtrof · 2 Jul

@AdamWHarley @Gradio The camera comes with a fixed focus so cannot move too close. Maybe too many issues here - blurry, dynamic plus self occlusions. Or maybe I shouldn't be tracking the object and just use a static roi?

Adam W. Harley @AdamWHarley · 2 Jul

@mkocab_ @andrew_n_carr @andrew_n_carr I bet you are right that you can classify real/fake from here, but @mkocab_ is right that the visually apparent "cracking" artifact in the background is pretty common overall. This comes from subpixel errors, and maybe subpixel ambiguity in general.

Muhammed Kocabas @mkocab_ · 2 Jul

@andrew_n_carr Those carving artifacts don't only happen with generated videos. They occur in real videos too. Here are results from real videos showing the same patterns. This is a failure of the point tracker. Assuming you used AllTracker, @AdamWHarley could probably speak to this better.

Andrew Carr 🤸 @andrew_n_carr · 2 Jul

synthetically generated videos have fewer and fewer visible artifacts. however, they still have substantial noise. one interesting way to detect this noise is to use dense pixel tracking. here are some veo 3 videos with each pixel tracked over time! look at that pattern

Adam W. Harley @AdamWHarley · 2 Jul

@georgtrof @Gradio Not too bad! A cheap way to improve results here might be to just move the camera closer, or crop and resize the region of interest.

George Trofimov 🚀 @georgtrof · 2 Jul

@AdamWHarley @Gradio Nice work, thanks. The model is slightly confused in my case. But it still looks great on deformable surfaces.

Adam W. Harley @AdamWHarley · 29 Jun

Tricky sample for AllTracker. It has never seen water at training time, so I'm glad it knows to quickly discard the tracks there... The bird that travels right-to-left (behind the wings most of the time) could be handled better.

高橋かずひと@パワポLT職人 @KzhtTkhs

Trying out AllTracker, part 2 👀

Adam W. Harley retweeted

Kwang Moo Yi @kwangmoo_yi · 20 Jun

Preprint of (not) today: Harley et al., "AllTracker: Efficient Dense Point Tracking at High Resolution" -- alltracker.github.io Efficient architecture to track all points/pixels in real-time. Matches multiple frames in real time at high resolution via RNN-based refinement.

Adam W. Harley @AdamWHarley · 26 Jun

Happy to report that AllTracker was accepted to #ICCV2025! The twists and turns and methodical experimentation here took at least 12 months in all. Super hard project, though in retrospect our solution is pretty simple. code: github.com/aharley/alltra… paper: arxiv.org/abs/2506.07310

Adam W. Harley @AdamWHarley · 25 Jun

AllTracker is open source and the paper is on arXiv: 🌐 Project page: alltracker.github.io 💻 Code: github.com/aharley/alltra… 📄 Paper: arxiv.org/abs/2506.07310

Adam W. Harley @AdamWHarley · 25 Jun

AllTracker works by estimating long-range optical flow between a query frame and every other frame in a video, incorporating temporal context in a sliding window. Along with flow, it estimates visibility and confidence, so unreliable tracks can be easily filtered out.
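
To make the filtering step concrete, here is a minimal sketch of dropping unreliable tracks using per-point visibility and confidence. It does not use the AllTracker code or its API: the function name `filter_tracks`, the nested-list data layout, and the 0.5 thresholds are assumptions chosen only for illustration.

```python
def filter_tracks(tracks, visibility, confidence, vis_thresh=0.5, conf_thresh=0.5):
    """Keep, for each frame, only the points that look reliable.

    Assumed (hypothetical) layout:
      tracks[t][n]     -> (x, y) position of point n in frame t
      visibility[t][n] -> estimated visibility in [0, 1]
      confidence[t][n] -> estimated confidence in [0, 1]
    Returns one list per frame of (point_index, (x, y)) pairs.
    """
    kept = []
    for points, vis, conf in zip(tracks, visibility, confidence):
        frame_kept = [
            (n, xy)
            for n, (xy, v, c) in enumerate(zip(points, vis, conf))
            if v >= vis_thresh and c >= conf_thresh  # both scores must pass
        ]
        kept.append(frame_kept)
    return kept


# Toy data: 2 frames, 3 tracked points.
tracks = [
    [(10.0, 20.0), (30.0, 40.0), (50.0, 60.0)],
    [(11.0, 21.0), (31.0, 41.0), (51.0, 61.0)],
]
visibility = [[0.9, 0.2, 0.8], [0.95, 0.7, 0.1]]
confidence = [[0.9, 0.9, 0.3], [0.8, 0.6, 0.9]]

reliable = filter_tracks(tracks, visibility, confidence)
```

In this toy data, point 1 is dropped in the first frame for low visibility and point 2 for low confidence, so only point 0 survives there. The thresholds trade track density against reliability and would need tuning for real outputs.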

Adam W. Harley @AdamWHarley · 25 Jun

AllTracker: Efficient Dense Point Tracking at High Resolution If you're using any point tracker in any project, this is likely a drop-in upgrade—improving speed, accuracy, and density, all at once.
