Hao He 「何 昊」

I am a third-year Ph.D. student in Multimedia Laboratory in the Chinese University of Hong Kong. My supervisors are Prof. Hongsheng Li and Prof. Xiaogang Wang.

Before CUHK, I received my Master's degree from Institute of Automation, Chinese Academy of Sciences in 2022, my supervisor is Prof. Shiming Xiang. I obtained my Bachelor degree from Northwestern Polytechnical University in 2019.

My research interests lie in the area of Unified models and embodied AI.

Email  /  Google Scholar  /  Github

profile photo

Researches

apt2 Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation
Shanchuan Lin, Ceyuan Yang, Hao He, Jianwen Jiang, Yuxi Ren, Xin Xia, Yang Zhao, Xuefeng Xiao, Lu Jiang.
NeurIPS 2025
[Paper]  /  [Project Page]
tar Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations
Jiaming Han, Hao Chen, Yang Zhao, Hanyu Wang, Qi Zhao, Ziyan Yang, Hao He, Xiangyu Yue, Lu Jiang.
NeurIPS 2025
[Paper]  /  [Code]
uigenie UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents
Han Xiao, Guozhi Wang, Yuxiang Chai, Zimu Lu, Weifeng Lin, Hao He, Lue Fan, Liuyang Bian, Rui Hu, Liang Liu, Shuai Ren, Yafei Wen, Xiaoxin Chen, Aojun Zhou, Hongsheng Li.
NeurIPS 2025
[Paper]  /  [Code]
cameractrl2 CameraCtrl II ++: Stable and Dynamic Scene Exploration via Camera-controlled Video Diffusion Models
Hao He, Ceyuan Yang, Meng Wei Yinghao Xu, Jiaming Han, Lu Jiang Hongsheng Li,


cameractrlii CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models
Hao He, Ceyuan Yang, Shanchuan Lin Yinghao Xu, Meng Wei, Liangke Gui, Qi Zhao, Gordon Wetzstein, Lu Jiang Hongsheng Li,
ICCV 2025
[Paper]  /  [Project Page]
cameractrl CameraCtrl: Enabling Camera Control for Video Diffusion Models
Hao He, Yinghao Xu, Yuwei Guo, Gordon Wetzstein, Bo Dai Hongsheng Li, Ceyuan Yang,
ICLR 2025
[Paper]  /  [Project Page]  /  [Code]
scaling_law Scaling Laws For Diffusion Transformers
Zhengyang Liang, Hao He, Ceyuan Yang, Bo Dai
ArXiv 2024
[Paper]
cvd Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control
Zhengfei Kuang, Shengqu Cai, Hao He, Yinghao Xu, Hongsheng Li, Leonidas Guibas, Gordon Wetzstein
NeurIPS 2024
[Paper]  /  [Project Page]  /  [Code]
pgseg Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic Segmentation
Fei Zhang, Tianfei Zhou, Boyang Li, Hao He, Chaofan Ma, Tianjiao Zhang, Jiangchao Yao, Ya Zhang, Yanfeng Wang
NeurIPS 2023
[Paper]  /  [Code]
tpr Improving Video Instance Segmentation via Temporal Pyramid Routing
Xiangtai Li *, Hao He *, Yibo Yang, Henghui Ding, Kuiyuan Yang, Guangliang Cheng, Yunhai Tong, Dacheng Tao
TPAMI 2022
[Paper]  /  [Code]
bss BoundarySqueeze: Image Segmentation as Boundary Squeezing
Hao He, Xiangtai Li, Yibo Yang, Guangliang Cheng, Shiming Xiang Yunhai Tong, Lubin Weng
IJCV 2022
[Paper]  /  [Code]
eblnet Enhanced Boundary Learning for Glass-like Object Segmentation
Hao He *, Xiangtai Li *, Guangliang Cheng, Jianping Shi, Yunhai Tong, Gaofeng Meng, Vésronique Prinet Lubin Weng
ICCV 2021
[Paper]  /  [Code]
pointflow PointFlow: Flowing Semantics Through Points for Aerial Image Segmentation
Xiangtai Li *, Hao He *, Xia Li, Duo Li, Guangliang Cheng, Jianping Shi, Lubin Weng, Yunhai Tong, Zhouchen Lin
CVPR 2021
[Paper]  /  [Code]

Experiences

bytedance

Apr. 2024 - Present ,

Research Intern, Bytedance.

Mentor: Ceyuan Yang

shanghai_ai

Oct. 2022 - Apr. 2024 ,

Research Intern, Shanghai Artificial Intellience Laboratory.

Mentor: Ceyuan Yang and Weidi Xie

sensetime

Mar. 2020 - Mar. 2021 ,

Research Intern, Sensetime Research.

Mentor: Guangliang Cheng and Jianping Shi

Professional Activities

  • Reviewer for CVPR, ICCV, ICML, NeurIPS, ICLR.


© Hao He | Last updated: October 27h, 2025 | Website Template