Portrait of Hao Li

Hao Li (Leo Li)

3D Vision · Embodied AI · Multi-Modal Models

Research Scientist, Ropedia, Inc.
Visiting Student, MMLab@NTU
Advised by Prof. Ziwei Liu

Biography

I am a Research Scientist at Ropedia, Inc., and concurrently a visiting student at MMLab, Nanyang Technological University (NTU), advised by Prof. Ziwei Liu. I am also a Ph.D. student (2022–now) in the School of Automation at Northwestern Polytechnical University (NPU), supervised by Prof. Dingwen Zhang and Prof. Junwei Han (IEEE Fellow).

My research interests lie in 3D Vision, Embodied AI, and Multi-Modal Models.

Before that, I was a Research Intern at the LongCat Group, Meituan, Inc. (北斗人才计划), advised by Dr. Manyuan Zhang. I also worked as a research intern at StepFun Inc., led by Xuanyang Zhang and Dr. Gang Yu. From 2023 to 2024, I worked as a research intern at the Robotics Team, ByteDance AI Lab, under the mentorship of Minghan Qin. I also interned with the VIS at Baidu Inc., guided by Dr. Chenming Wu and Jingdong Wang (IEEE Fellow). In 2022 to 2023, I was a research intern at Zhejiang Lab, leading with Prof. Lechao Cheng.

NPU NTU Meituan StepFun ByteDance Baidu Zhejiang Lab

News

2026/06S-Agent released! 🎉
2026/06Spatial-TTT got accepted by ECCV 2026! 🎉
2026/05Holi-Spatial got accepted by ICML 2026 as Oral! 🎉
2026/03OmniVGGT got accepted by CVPR 2026 as Highlight! 🎉
2026/01IGGT and From Spatial to Actions got accepted by ICLR 2026! 🎉
2026/01Joining Ropedia, Inc. as Research Scientist, advised by Prof. Ziwei Liu!
2025/08Joining LongCat Group, Meituan, Inc. as Research Intern (北斗人才计划), advised by Dr. Manyuan Zhang!
2025/06STRIDER got accepted by NeurIPS 2025! 🎉
2025/06LangScene-X and CityGS-X got accepted by ICCV 2025! 🎉
2025/05Step1X-3D released by StepFun! 🎉
2025/04Visiting student at NTU, supervised by Prof. Ziwei Liu!
2025/03VDG got accepted by IEEE RA-L! 🎉
2025/02DGTR got accepted by ICRA 2025! 🎉
2025/01Joining AIGC Group, StepFun Inc., led by Xuanyang Zhang and Dr. Gang Yu!
2024/10XLD got accepted by 3DV 2025! 🎉
2024/08Joining ByteDance AI Lab, led by Minghan Qin!
2024/08Invited talk on GAMES Webinar.
2024/07GGRt got accepted by ECCV 2024!
2024/02GP-NeRF got accepted by CVPR 2024 and selected as Highlight (top 3.8%)! 🎉
2024/02LTGC got accepted by CVPR 2024 and selected as Oral (top 0.8%)! 🎉
2024/01Invited talk on 3D视觉工坊.
2023/12Joining VIS, Baidu, Inc. as Research Intern, led by Dr. Chenming Wu and Jingdong Wang (IEEE Fellow)!
2023/12Joining Zhejiang Lab as Research Intern, led by Prof. Lechao Cheng!
2023/11ASDT got accepted by TIP 2024! 🎉
2023/11Saliency Prompt got accepted by CVPR 2023! 🎉

Publications

Bold = me  ·  † equal contribution  ·  # corresponding author  ·  Project Lead  ·  Oral / Highlight selected for presentation

arXiv 2026 Project Lead

S-Agent: Spatial Tool Use Elicits Reasoning for Spatial Intelligence

Yalun Dai†, Hao Li†, Shulin Tian, Runmao Yao, Fangzhou Hong, Zhaoxi Chen, Leonardo Guibas, Ziwei Liu

ICML 2026Oral Project Lead

Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence

Yuanyuan Gao†, Hao Li†, Yifei Liu, Xinhao Ji, Yuning Gong, Yiyi Liao, Fangfu Liu, Manyuan Zhang, Yi Yang, Dan Xu

OmniVGGT
CVPR 2026Highlight Project Lead

OmniVGGT: Omni-Modality Driven Visual Geometry Grounded Transformer

Haosong Peng†, Hao Li†, Yalun Dai, Yushi Lan, Yihang Luo, Tianyu Qi, Yufeng Zhan, Junfei Zhang, Wenchao Xu, Ziwei Liu

ICLR 2026

IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction

Hao Li, Zhengyu Zou, Fangfu Liu, Xuanyang Zhang, Fangzhou Hong, Yukang Cao, Yushi Lan, Manyuan Zhang, Gang Yu, Dingwen Zhang, Ziwei Liu

From Spatial to Actions
ICLR 2026 Project Lead

From Spatial to Actions: Grounding Vision-Language-Action Model in Spatial Foundation Priors

Zhengshen Zhang†, Hao Li†, Yalun Dai, Zhengbang Zhu, Lei Zhou, Chenchen Liu, Dong Wang, Francis E. H. Tay, Sijin Chen, Ziwei Liu, Yuxiao Liu, Xinghang Li, Pan Zhou

Spatial-TTT
ECCV 2026

Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training

Fangfu Liu, Di Wu, Jiawei Chi, Yue Cai, Yu-Hsiang Hung, Xiaofeng Yu, Hao Li, Hao Hu, Yongming Rao, Yueqi Duan

IEEE T-PAMI 2026

LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding

Hao Li, Minghan Qin#, Zhengyu Zou, Diqi He, Bohan Li, Bingquan Dai, Dingwen Zhang#, Junwei Han

NeurIPS 2026 · In Preparation

EgoTools: Benchmarking Physical Logic and Tool Affordances in Egocentric Videos

S-Lab Team

Step1X-3D
Tech Report 2025

Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets

Weiyu Li, Xuanyang Zhang, Zheng Sun, Di Qi, Hao Li, Weiwei Cheng, Wanggui Cai, Shun Wu, Jie Liu, Ziwei Wang, Gang Yu

arXiv 2025

ExGS: Extreme 3D Gaussian Compression with Diffusion Priors

Jiaqi Chen, Xinhao Ji, Yuanyuan Gao, Hao Li, Yuning Gong, Yifei Liu, Zhihang Zhong, Dingwen Zhang, Dan Xu, Xiao Sun

STRIDER
NeurIPS 2025

STRIDER: Navigation via Instruction-Aligned Structural Decision Space Optimization

Diqi He, Xuehao Gao, Hao Li, Junwei Han, Dingwen Zhang

LangScene-X
ICCV 2025

LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion

Fangfu Liu†, Hao Li†, Jiawei Chi, Hanyang Wang, Minghui Yang, Fudong Wang, Yueqi Duan

CityGS-X
ICCV 2025

CityGS-X: A Scalable Architecture for Efficient and Geometrically Accurate Large-Scale Scene Reconstruction

Yuanyuan Gao†, Hao Li†, Jiaqi Chen, Zhihang Zhong, Zhengyu Zou, Dingwen Zhang, Xiao Sun, Junwei Han

DGTR
ICRA 2025

DGTR: Distributed Gaussian Turbo-Reconstruction for Sparse-View Vast Scenes

Hao Li, Yuanyuan Gao, Haosong Peng, Chenming Wu, Weicai Ye, Yufeng Zhan, Chen Zhao, Dingwen Zhang, Jingdong Wang, Junwei Han

XLD
3DV 2025

XLD: A Cross-Lane Dataset for Benchmarking Novel Driving View Synthesis

Hao Li, Chenming Wu, Chen Zhao, Haocheng Feng, Errui Ding, Dingwen Zhang#, Jingdong Wang

CoSurfGS
IJCV 2024

CoSurfGS: Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction

Yuanyuan Gao†, Yalun Dai†, Hao Li†, Weicai Ye, Jiaqi Chen, Dingwen Zhang, Tong He, Guofeng Zhang, Junwei Han

CVPR 2024Highlight

GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding

Hao Li, Dingwen Zhang, Yalun Dai, Nian Liu, Lechao Cheng, Jingfeng Li, Jingdong Wang, Junwei Han

LTGC
CVPR 2024Oral

LTGC: Long-Tail Recognition via Leveraging LLMs-driven Generated Content

Qihao Zhao†, Yalun Dai†, Hao Li†, Wei Hu, Fan Zhang, Jun Liu

ECCV 2024

GGRt: Towards Pose-free Generalizable 3D Gaussian Splatting in Real-time

Hao Li, Yuanyuan Gao, Chenming Wu, Dingwen Zhang, Yalun Dai, Chen Zhao, Haocheng Feng, Errui Ding, Jingdong Wang, Junwei Han

IEEE RA-L 2024

VDG: Vision-Only Dynamic Gaussian for Driving Simulation

Hao Li, Jingfeng Li, Dingwen Zhang, Chenming Wu, Jieqi Shi, Chen Zhao, Haocheng Feng, Errui Ding, Jingdong Wang, Junwei Han

ASDT
IEEE TIP 2024

Weakly Supervised Semantic Segmentation via Alternate Self-Dual Teaching

Dingwen Zhang, Hao Li, Wenyuan Zeng, Chaowei Fang, Lechao Cheng, Ming-Ming Cheng, Junwei Han

IEEE T-PAMI 2024

Unsupervised Pre-training with Language-Vision Prompts for Low-Data Instance Segmentation

Dingwen Zhang, Hao Li, Diqi He, Nian Liu, Lechao Cheng, Jingdong Wang, Junwei Han

arXiv 2024

V2A-GS: End to End Reconstruction of Articulated Objects from Video Sequences

Hao Li, Zhengyu Zou, Wenke Xia, Fangcheng Zhong, Cengiz Oztireli, Dingwen Zhang, Junwei Han

Saliency Prompt
CVPR 2023

Boosting Low-Data Instance Segmentation by Unsupervised Pre-training with Saliency Prompt

Hao Li, Dingwen Zhang, Nian Liu, Lechao Cheng, Yalun Dai, Xinggang Wang, Junwei Han

Talks

Honors and Awards