Shi, DongboCao, ShenWu, BojianGuo, JinhuiFan, LubinChen, RenjieLiu, LigangYe, JiepingBousseau, AdrienDay, Angela2025-05-092025-05-0920251467-8659https://doi.org/10.1111/cgf.70012https://diglib.eg.org/handle/10.1111/cgf70012In this paper, we introduce NoPe-NeRF++, a novel local-to-global optimization algorithm for training Neural Radiance Fields (NeRF) without requiring pose priors. Existing methods, particularly NoPe-NeRF, which focus solely on the local relationships within images, often struggle to recover accurate camera poses in complex scenarios. To overcome the challenges, our approach begins with a relative pose initialization with explicit feature matching, followed by a local joint optimization to enhance the pose estimation for training a more robust NeRF representation. This method significantly improves the quality of initial poses. Additionally, we introduce global optimization phase that incorporates geometric consistency constraints through bundle adjustment, which integrates feature trajectories to further refine poses and collectively boost the quality of NeRF. Notably, our method is the first work that seamlessly combines the local and global cues with NeRF, and outperforms state-of-the-art methods in both pose estimation accuracy and novel view synthesis. Extensive evaluations on benchmark datasets demonstrate our superior performance and robustness, even in challenging scenes, thus validating our design choices.Attribution-NonCommercial 4.0 InternationalCCS Concepts: Computing methodologies → Image-based renderingComputing methodologies → Imagebased renderingNoPe-NeRF++: Local-to-Global Optimization of NeRF with No Pose Prior10.1111/cgf.7001212 pages