Adaptive Joint Optimization for 3D Reconstruction with Differentiable Rendering

TVCG 2022

1City University of Hong Kong,
teaser

Abstract

Due to inevitable noises introduced during scanning and quantization, 3D reconstruction via RGB-D sensors suffers from errors both in geometry and texture, leading to artifacts such as camera drifting, mesh distortion, texture ghosting, and blurriness. Given an imperfect reconstructed 3D model, most previous methods have focused on the refinement of either geometry, texture, or camera pose. Or different optimization schemes and objectives for optimizing each component have been used in previous joint optimization methods, forming a complicated system. In this paper, we propose a novel optimization approach based on differentiable rendering, which integrates the optimization of camera pose, geometry, and texture into a unified framework by enforcing consistency between the rendered results and the corresponding RGB-D inputs. Based on the unified framework, we introduce a joint optimization approach to fully exploit the inter-relationships between geometry, texture, and camera pose, and describe an adaptive interleaving strategy to improve optimization stability and efficiency. Using differentiable rendering, an image-level adversarial loss is applied to further improve the 3D model, making it more photorealistic. Experiments on synthetic and real data using quantitative and qualitative evaluation demonstrated the superiority of our approach in recovering both fine-scale geometry and high-fidelity texture.


Pipeline

method

Overview of our adaptive joint optimization method. The inputs include a set of RGB-D frames, estimated camera poses, and an initial imperfect 3D textured model produced by an existing reconstructing method. A differentiable renderer is adopted to produce rendered frames of the inputted camera poses. Then, a series of losses are employed to measure the consistency of the rendered and inputted frames, and the gradients are back-propagated to adaptively update camera pose, geometry, and texture. Finally, the corrected camera poses, refined geometry, and optimized texture are outputted.


Video

BibTeX

@article{zhang2022adaptive,
      title={Adaptive Joint Optimization for 3D Reconstruction with Differentiable Rendering},
      author={Zhang, Jingbo and Wan, Ziyu and Liao, Jing},
      journal={IEEE Transactions on Visualization and Computer Graphics},
      year={2022},
      publisher={IEEE}
    }