To enable broader exploration of dynamic scenes, our model can generate new video clips of the same scene conditioned on previously generated content and user-provided camera trajectories. This approach preserves scene dynamics, accurate camera control, and scene consistency throughout the extended exploration.
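As a rough illustration of how user-provided camera trajectories might be chained across successive clips, the sketch below expresses each clip's trajectory relative to its own first frame and anchors it at the last camera pose of the previous clip. This is not the model's actual interface: the 4x4 camera-to-world convention, the yaw-plus-forward parameterization, and the names `yaw_pose`, `relative_trajectory`, `chain_clip`, and `explore` are all assumptions made for illustration.

```python
import math

def yaw_pose(angle_deg, forward):
    """Hypothetical 4x4 camera-to-world pose: yaw about the y axis plus a shift along z."""
    a = math.radians(angle_deg)
    c, s = math.cos(a), math.sin(a)
    return [
        [c,   0.0, s,   0.0],
        [0.0, 1.0, 0.0, 0.0],
        [-s,  0.0, c,   forward],
        [0.0, 0.0, 0.0, 1.0],
    ]

def matmul(a, b):
    """Multiply two 4x4 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]

def relative_trajectory(num_frames, total_yaw_deg, total_forward):
    """Poses relative to the clip's first frame; frame 0 is the identity."""
    if num_frames < 2:
        raise ValueError("a trajectory needs at least two frames")
    poses = []
    for i in range(num_frames):
        t = i / (num_frames - 1)  # interpolation factor, 0.0 at the first frame and 1.0 at the last
        poses.append(yaw_pose(t * total_yaw_deg, t * total_forward))
    return poses

def chain_clip(anchor_pose, relative_poses):
    """Express a relative trajectory in world coordinates, starting at anchor_pose."""
    return [matmul(anchor_pose, p) for p in relative_poses]

def explore(segments, frames_per_clip):
    """Chain (yaw_deg, forward) segments so each clip starts where the last one ended."""
    anchor = yaw_pose(0.0, 0.0)  # the first clip starts at the world origin
    clips = []
    for yaw_deg, forward in segments:
        world = chain_clip(anchor, relative_trajectory(frames_per_clip, yaw_deg, forward))
        clips.append(world)
        anchor = world[-1]  # the next clip is anchored at this clip's final pose
    return clips
```

For example, `explore([(90.0, 0.0), (90.0, 0.0)], frames_per_clip=5)` yields two five-frame trajectories whose shared boundary pose is identical, so the second clip continues the camera motion of the first rather than restarting from the origin.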
Our model enables precise camera control across diverse scenarios while preserving dynamic scene elements, e.g.
Our method generates videos with strong 3D consistency, enabling high-quality 3D reconstruction from the camera-controlled videos.