
Beyond photos/videos, what's the next medium for sharing visual experiences? 🤔 NeRF/3DGS are great but only capture a static 3D world and ignore ambient scene dynamics. In this paper, we show how we can reconstruct/render these essential scene elements even with a single causal video.








