
In-2-4D: Inbetweening from Two Single-View Images to 4D Generation Contributions: • To the best of our knowledge, In-2-4D is the first method for generative 4D inbetweening over two distant monocular frames spanning arbitrary motions. • Our novel hierarchical approach breaks the complex inbetweening into a series of simpler motion estimations via video, followed by 4D (i.e., dynamic 3DGS) generation. • To generate smooth 3D object and motion transitions, we further optimize the 3D trajectories using a bottom-up merging strategy with smoothing regularization. • We contribute a new 4D interpolation benchmark, I4D-15, on challenging object motions and real-world scenes.



























