MASR-PSN：低分光度立体图像的高分法向重建深度学习模型

举雅琨; 蹇木伟; 饶源; 张述; 高峰; 董军宇

发布时间： 2023-07-19
摘要点击次数： 640
全文下载次数： 587
DOI: 10.11834/jig.220050
2023 | Volume 28 | Number 7

MASR-PSN：低分光度立体图像的高分法向重建深度学习模型

举雅琨¹, 蹇木伟², 饶源¹, 张述¹, 高峰¹, 董军宇¹(1.中国海洋大学计算机科学与技术学院, 青岛 266100;2.山东财经大学计算机科学与技术学院, 济南 250014)

摘要

目的光度立体算法是一种单视角下的稠密三维重建方法，其利用相同视角下来自不同光照方向的一系列图像恢复像素级的表面法向。拍摄光度立体图像所用的高分辨率线性响应相机的成本十分昂贵且难以获取，很难通过传感器直接获取超高分辨率图像来恢复高分辨率表面法向。因此，提出一种基于深度神经网络的光度立体超分算法，以从低分光度立体图像中恢复出准确的高分表面法向。方法首先，对原始的低分光度立体图像进行归一化预处理操作，以消除剧烈变化的表面反射率影响，并消减过饱和镜面反射的影响。随后，提出多层聚合超分光度立体网络（multi-level aggregation super resolution photometric stereo network，MASR-PSN）。MASR-PSN包含一个新颖的深浅层融合的最大池化聚合框架、权值共享的特征回归器、并行设计的不同尺寸卷积核的并行回归器结构，能够在保留多尺度信息的同时，增强特征表示，防止模式坍塌学习到某一固定尺度相关的非重要特征，以及防止3×3卷积核带来空间域上的过度平滑。结果广泛的消融实验证明了提出的深浅层聚合层和并行权值共享回归器的有效性，能明显减少生成表面法向的平均角度误差（mean angular error，MAE）。本文方法仅需其他方法一半分辨率的光度立体图像，而能准确地恢复出复杂表面的结构。DiLiGenT benchmark数据集的定量实验和Light StageData Gallery数据集、Gourd数据集的定性实验显示，MASR-PSN在预测表面法向精确度方面有明显提升。在DiLiGenT benchmark数据集中，本文方法在仅使用其他方法一半分辨率的光度立体图像的情况下，以96幅图像为输入时，取得7.31°的平均角度误差，比最佳方法提升0.08°，以10幅图像为输入时，取得9.00°的平均角度误差，比最佳方法提升0.43°。结论提出的MASR-PSN方法提升了光度立体任务表面法向重建的准确性，在低分辨率的输入图像下，依然可以恢复出细节清晰的超分辨率表面法向。

关键词

三维重建光度立体表面法向恢复深度学习超分辨率

MASR-PSN: a low-resolution photometric stereo images-relevant deep learning model for high-resolution surface normal reconstruction

Ju Yakun¹, Jian Muwei², Rao Yuan¹, Zhang Shu¹, Gao Feng¹, Dong Junyu¹(1.School of Computer Science and Technology, Ocean University of China, Qingdao 266100, China;2.School of Computer Science and Technology, Shandong University of Finance and Economics, Jinan 250014, China)

Abstract

Objective Three-dimensional（3D）reconstruction is currently focused on in computer vision. To optimize the problem of recovering fine details of the surface and dense reconstruction，a fixed scene-related photometric stereo technique can be used in terms of the pixel-wise surface normal under the circumstance of varying shading cues. It can recover per-pixel dense surface normal and improve weak texture-reconstructed objects to a certain extent beyond binocular and multi-view stereo in triangulate sparse 3D points. Photometric stereo can be used in the commonly-used high-precision 3D reconstruction domains like cultural relic reconstruction and industrial defect detection. To solve the complex threedimensional structure and alleviate the blur problem in the normal reconstruction，high-resolution surface normal can provide richer and more effective 3D information. However，due to the high-resolution linear response cameras are high involved，it is still challenged to recover high-resolution surface normal for photometric stereo images. Therefore，it is urgent to develop the high-resolution surface normal reconstruction in terms of low-resolution photometric stereo images analysis. Method We facilitate deep learning based super-resolution photometric stereo algorithm further to recover accurate high-resolution surface normal from low-resolution photometric stereo images. First，a normalized operation is employed to normalize in situ pixels in completed low-resolution photometric stereo images，which can alleviate the effectscontextual of severely changing surface reflectance and oversaturated specular reflection. This pre-processing method can be used to deal with steep color change-related objects for surfaces-homogeneous training. Furthermore，we develop a multi-level aggregation super resolution photometric stereo network（MASR-PSN）and a novel deep and shallow fusion maxpooling aggregation framework is designed. The proposed deep and shallow fusion max-pooling aggregation framework can be used to enhance feature representation and preserve multi-scale information because of receptive fields-derived deep and shallow features；to optimize effective learning features related to a certain fixed scale，a weight-shared feature regressor is developed as well，which can learn and reconstruct the surface normal from the features in multiple scales. The weightshared feature regressor can be paid attention on multiple scale features as the input，and the 4×4 super-resolution features can be output after that，which are fused in the following step；For the regressor，the parallel network structure of different sizes of convolution kernels are designed in parallel to the smooth transition-spatial preservation of 3×3 convolution kernel. But，due to excessive smoothing in the spatial domain，the loss of resolution details and blur is required to be resolved. To preserve the consistent details of super-resolution surface normal，we develop a paralleled network design， which consists of 3×3 convolution layers and 1×1 layers. Additionally，a joint loss function is demonstrated as well， which can optimize the MASR-PSN on the constraints of the normal gradient and normal angle. The normal angle constraint is melted into the average error value of the predicted normal only，but the details of the surface are sacrificed and the blur is generated. Therefore，the normal gradient constraint is introduced to focus on the adjacent pixels-between changes， which can concern of more details and preserve the clear recovered super-resolution surface normal. Result Extensive ablation experiments are carried out and the effectiveness are demonstrated in terms of our proposed deep and shallow aggregation layer and parallel shared-weight regressor，which can reduce the mean angle error（MAE）of the generated surface normal significantly. It is required of input photometric stereo images according to other related resolution-half methods，and a high-resolution normal map-relevant structure of complex surfaces can be reconstructed accurately. The comparative experiments are carried out on the DiLiGenT benchmark dataset quantitatively，as well as on the light stage data gallery dataset and Gourd dataset in qualitative. For the DiLiGenT benchmark dataset （only using half-resolution photometric stereo images compared with other methods），the proposed MASR-PSN can achieve an average angle error of 7. 31 degrees when 96 dense images are added as input，and 0. 12 degrees are improved，and an average angle error of 9. 00 degrees are optimized as well when 10 sparse images are added as input，which is higher of 0. 43 degrees. The robustness and effectiveness of the proposed MASR-PSN are shown based on more qualitative experiments on the light stage data gallery and gourd datasets. Conclusion To predict the super-resolution surface normal and clarify more details in the low-resolution input photometric stereo image，the photometric stereo task-oriented MASR-PSN is potential to improve the accuracy of surface normal reconstruction further.

Keywords

3D reconstruction photometric stereo surface normal recovery deep learning super resolution

在线采编平台

论文出版

年度会议

下载中心

年度信息