TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels

👥 Authors and Affiliations

1 The Hong Kong University of Science and Technology       2 University of Science and Technology of China

3 The Chinese University of Hong Kong       4 The University of Hong Kong       5 Xiamen University

6 Macau University of Science and Technology


📝 Abstract

TrackingWorld is a novel approach for dense, world-centric 3D tracking from monocular videos. Our method estimates accurate camera poses and disentangles 3D trajectories of both static and dynamic components — not limited to a single foreground object. It supports dense tracking of nearly all pixels, enabling robust 3D scene understanding from monocular inputs.

🌍 World-Centric Pose

Estimates accurate camera poses for consistent 3D world coordinate system anchoring.

🔄 Disentangled Trajectories

Separates 3D motion for static background and dynamic foreground components.

👀 Dense Pixel Coverage

Supports tracking of nearly all pixels, moving beyond sparse keypoints.


🧩 Pipeline and Methodology

Overview

TrackingWorld Pipeline

Figure: Overview of TrackingWorld Framework.


🎥 Teaser and Main Results

TrackingWorld provides dense, world-centric 3D trajectories for almost every visible pixel, enabling complete 4D scene reconstruction with high accuracy and robustness.

TrackingWorld 3D Output Comparison

Figure: TrackingWorld 3D reconstruction and 2D/3D tracking demonstration.


💻 Interactive 3D Demos (Click to select scene)


📄 Citation

If you find TrackingWorld useful for your research or applications, please consider citing our paper:

@inproceedings{
    lu2025trackingworld,
    title={TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels},
    author={Jiahao Lu and Weitao Xiong and Jiacheng Deng and Peng Li and Tianyu Huang and Zhiyang Dou and Cheng Lin and Sai-Kit Yeung and Yuan Liu},
    booktitle={The Thirty-ninth Annual Conference on Neural Information Processing Systems},
    year={2025},
    url={https://openreview.net/forum?id=vDV912fa3t}
}