CameraCtrl II: Dynamic Scene Exploration via Camera-controlled
Video Diffusion Models

Hao He1,2 Ceyuan Yang2,✝ Shanchuan Lin2 Yinghao Xu3 Meng Wei4 Liangke Gui2 Qi Zhao2 Gordon Wetzstein3 Lu Jiang2 Hongsheng Li1
1The Chinese University of Hong Kong 2ByteDance Seed 3Stanford University 4ByteDance
Corresponding Author
Research Paper
Dynamic Scenes Exploration

To enable broader exploration of dynamic scenes, our model can generate new video clips of the same scene based on previously generated content and user-provided camera trajectories. This approach maintains dynamic capabilities, accurate camera control, and scene consistency throughout the extended exploration.

Indoor exploration

Hotel Lobby Exploration

Outdoor exploration

Walking in Park
Exploration in different scenarios

Our model enables precise camera control across diverse scenarios while preserving dynamic scene elements, e.g.

City walking

European Road
Mediterranean Market Street

19th Century Foggy London

Foggy London Railway Station
Street View before Dawn

Fantasy Hiking Adventure

Mountain trail with flowing stream
Cliffside path with waterfall rainbow

Minecraft-Style Gaming Environment

Voxel village with windmills sunset
Voxel village with people, sheep

Realistic Indoor Settings

Old cozy library
Abandoned hospital
3D reconstruction of generated videos

Our method can generate videos with strong 3D consistency, which enables high-quality 3D reconstruction using the camera-controlled videos.

Generated Videos
Reconstruction Point Clouds