
Ahan
41 posts

In-2-4D: Inbetweening from Two Single-View Images to 4D Generation

Contributions:
• To the best of our knowledge, In-2-4D is the first method for generative 4D inbetweening over two distant monocular frames spanning arbitrary motions.
• Our novel hierarchical approach breaks the complex inbetweening into a series of simpler motion estimations via video, followed by 4D (i.e., dynamic 3DGS) generation.
• To generate smooth 3D object and motion transitions, we further optimize the 3D trajectories using a bottom-up merging strategy with smoothing regularization.
• We contribute a new 4D interpolation benchmark, I4D-15, on challenging object motions and real-world scenes.


1/n 🚨 New preprint! Our work “Coordinate In and Value Out: Training Flow Transformers in Ambient Space” (arxiv.org/abs/2412.03791) presents a domain-agnostic, end-to-end flow-matching generative model that handles multiple modalities, such as images and point clouds.






