自监督深度残差函数映射网络的3维模型对应关系计算
Correspondence calculation of 3D models by a self-supervised deep residual functional maps network
- 2020年25卷第12期 页码:2603-2613
收稿:2019-11-09,
修回:2020-2-19,
录用:2020-2-26,
纸质出版:2020-12-16
DOI: 10.11834/jig.190568
移动端阅览

浏览全部资源
扫码关注微信
收稿:2019-11-09,
修回:2020-2-19,
录用:2020-2-26,
纸质出版:2020-12-16
移动端阅览
目的
2
针对传统非刚性3维模型的对应关系计算方法需要模型间真实对应关系监督的缺点,提出一种自监督深度残差函数映射网络(self-supervised deep residual functional maps network,SSDRFMN)。
方法
2
首先将局部坐标系与直方图结合以计算3维模型的特征描述符,即方向直方图签名(signature of histograms of orientations,SHOT)描述符;其次将源模型与目标模型的SHOT描述符输入SSDRFMN,利用深度函数映射(deep functional maps,DFM)层计算两个模型间的函数映射矩阵,并通过模糊对应层将函数映射关系转换为点到点的对应关系;最后利用自监督损失函数计算模型间的测地距离误差,对计算出的对应关系进行评估。
结果
2
实验结果表明,在MPI-FAUST数据集上,本文算法相比于有监督的深度函数映射(supervised deep functional maps,SDFM)算法,人体模型对应关系的测地误差减小了1.45;相比于频谱上采样(spectral upsampling,SU)算法减小了1.67。在TOSCA数据集上,本文算法相比于SDFM算法,狗、猫和狼等模型的对应关系的测地误差分别减小了3.13、0.98和1.89;相比于SU算法分别减小了2.81、2.22和1.11,并有效克服了已有深度函数映射方法需要模型间的真实对应关系来监督的缺点,使得该方法可以适用于不同的数据集,可扩展性大幅增强。
结论
2
本文通过自监督深度残差函数映射网络训练模型的方向直方图签名描述符,提升了模型对应关系的准确率。本文方法可以适应于不同的数据集,相比传统方法,普适性较好。
Objective
2
Calculating 3D shapes correspondence is a central problem in the field of geometry processing that plays an important role in shapes reconstruction
object recognition and classification
and other tasks. Therefore
finding a meaningful and accurate correspondence among shapes has important research significance and application value. In recent years
deep-learning-based calculation of the correspondence among 3D shapes has attracted the attention of many scholars in the field of geometry processing. Using neural networks to learn the feature descriptors on the surfaces of 3D shapes can help us obtain accurate and comprehensive feature information and provides a solid foundation for building accurate correspondences among 3D shapes. Deep-learning-based methods for calculating the correspondences can be roughly divided into 1) methods based on the end-to-end depth functional maps network
and 2) methods based on other neural networks. The first method uses deep functional map networks to learn the feature descriptors of 3D shapes and then use these descriptors to analyze the spatial structure characteristics of shapes. Afterward
theory of functional maps is applied to solve the shape matching problem
find out the functional maps matrix among shapes
and determine the correspondence among shapes. Meanwhile
the second method uses neural networks to learn the feature descriptors of shapes
takes the calculation of correspondences as part of the learning process
uses the learned shape spatial geometric structure features for shape matching
and obtains the ideal correspondence among shapes. The feature descriptors and the network of learning descriptors of shapes play a crucial role in calculating 3D shapes correspondence calculation methods ignore the important influence of feature descriptors on the representation of 3D shapes
and the calculated shape descriptors contain relatively few information that cannot solve the problems related to shape symmetry and shape boundary descriptor distortion. Moreover
in the subsequent correspondence calculation process
these methods are unable to generate an accurate functional map of the symmetrical part of shapes
thereby leading to inaccurate correspondence calculations. The existing 3D shape correspondence calculation methods based on deep learning all adopt a supervision mechanism
which limits the universality of these methods to a large extent. To address these problems
this paper proposes a self-supervised deep residual functional maps network (SSDRFMN) to calculate 3D shapes correspondence.
Method
2
The proposed method involves two steps. First
we calculate the feature descriptor of the 3D shape by combining the local coordinate system with a histogram
which is a signature of the histograms of orientations (SHOT) descriptor. We initially establish a local coordinate on the surface of shapes and then enhance the recognition ability of our descriptor by introducing the geometric information of the feature points. Afterward
we calculate the local histogram at a given point and use the calculated geometric information to form a histogram and a signature. Compared with traditional feature descriptors
hybrid feature descriptors can better represent the spatial structure and surface feature information of 3D shapes and provide high-quality inputs for network learning. Second
we use end-to-end SSDRFMN to calculate the correspondence among shapes. The SHOT descriptors of the source and target shapes are inputted into SSDRFMN
and the feature descriptor iteratively trains the neural network. The deep functional maps (DFM) layer is then used to calculate the function mapping matrix between two shapes. The corresponding relationship problem is transformed into solving the function mapping matrix problem
and the functional map relationship is converted into a point-to-point correspondence relationship through a fuzzy correspondence layer. The self-supervised loss function is then used to calculate the geodesic distance error between shapes and to evaluate the corresponding relationship. The loss function minimizes the geodesic distance error between shapes via network training and replaces the real correspondence between the manually labeled models with geometric constraints to achieve a self-supervised learning of the network.
Result
2
Experimental results show that compared with the supervised deep functional maps (SDFM) and spectral upsampling (SU) algorithms
the proposed algorithm reduces the geodesic error of correspondences between the human model and the MPI-FAUST dataset by 1.45% and 1.67%
respectively. Meanwhile
in the TOSCA dataset
the proposed algorithm reduces the geodesic errors of the correspondences of dog
cat
and wolf models by 3.13% (2.81%)
0.98% (2.22%)
and 1.89% (1.11%) compared with SDFM (SU)
respectively. Therefore
apart from its applicability to different datasets and its high scalability
this algorithm effectively overcomes the shortcomings of extant depth functional maps methods that require the true correspondence among shapes to supervise.
Conclusion
2
Experimental results show that the proposed method outperforms the existing methods for calculating the correspondence among 3D shapes. On the one hand
our proposed method trains the neural network through the SHOT descriptor of shapes
there by effectively solving the symmetry and boundary distortion of the model and producing a better representation of the surface features of 3D shapes. On the other hand
the proposed method uses a self-supervised deep neural network to learn the features of descriptors and to accurately calculate the correspondences among shapes. This method also shows excellent universality and accuracy
thereby highlighting its value in shapes matching
model recognition
segmentation
and retrieval.
Abbasi S and Tajeripour F. 2017. Detection of brain tumor in 3D MRI images using local binary patterns and histogram orientation gradient. Neurocomputing, 219:526-535[DOI:10.1016/j.neucom.2016.09.051]
Arbel N Y, Tal A and Zelnik-Manor L. 2019. Partial correspondence of 3D shapes using properties of the nearest-neighbor field. Computers and Graphics, 82:183-192[DOI:10.1016/j.cag.2019.05.011]
Aubry M, Schlickewei U and Cremers D. 2011. The wave kernel signature: a quantum mechanical approach to shape analysis//Proceedings of 2011 IEEE International Conference on Computer Vision Workshops. Barcelona, Spain: IEEE: 1626-1633[ DOI: 10.1109/ICCVW.2011.6130444 http://dx.doi.org/10.1109/ICCVW.2011.6130444 ]
Furuya T and Ohbuchi R. 2015. Diffusion-on-manifold aggregation of local features for shape-based 3D model retrieval//Proceedings of the 5th ACM on International Conference on Multimedia Retrieval. Shanghai, China: ACM: 171-178[ DOI: 10.1145/2671188.2749380 http://dx.doi.org/10.1145/2671188.2749380 ]
Groueix T, Fisher M, Kim V G, Russell B C and Aubry M. 2018. 3D-coded: 3D correspondences by deep deformation//Proceedings of the 15th European Conference on Computer Vision. Munich, Germany: Springer: 230-246[ DOI: 10.1007/978-3-030-01216-8_15 http://dx.doi.org/10.1007/978-3-030-01216-8_15 ]
Halimi O, Litany O, Rodolà E, Bronstein A and Kimmel R. 2019. Self-supervised learning of dense shape correspondence[EB/OL].[2019-07-25] . https://arxiv.org/pdf/1812.02415.pdf https://arxiv.org/pdf/1812.02415.pdf
Kim V G, Lipman Y and Funkhouser T. 2011. Blended intrinsic maps. ACM Transactions on Graphics, 30(4):#79[DOI:10.1145/2010324.1964974]
Litany O, Remez T, RodolàE, Bronstein A and Bronstein M. 2017. Deep functional maps: structured prediction for dense shape correspondence//Proceedings of 2017 IEEE International Conference on Computer Vision. Venice, Italy: IEEE: 5659-5667[ DOI: 10.1109/ICCV.2017.603 http://dx.doi.org/10.1109/ICCV.2017.603 ]
Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu C Y and Berg A C. 2016. SSD: single shot multibox detector//Proceedings of the 14th European Conference on Computer Version. Amsterdam, the Netherlands: Springer: 21-37[ DOI: 10.1007/978-3-319-46448-0_2 http://dx.doi.org/10.1007/978-3-319-46448-0_2 ]
Mahmoudi M and Sapiro G. 2009. Three-dimensional point cloud recognition via distributions of geometric distances. Graphical Models, 71(1):22-31[DOI:10.1016/j.gmod.2008.10.002]
Melzi S, Ren J, Rodolà E, Sharma A, Wonka P and Ovsjanikov M. 2019. ZoomOut:spectral upsampling for efficient shape correspondence. ACM Transactions on Graphics, 38(6):#155[DOI:10.1145/3355089.3356524]
Ovsjanikov M, Ben-Chen M, Solomon J, Butscher A and Guibas L. 2012. Functional maps:a flexible representation of maps between shapes. ACM Transactions on Graphics, 31(4):#30[DOI:10.1145/2185520.2185526]
Pottmann H, Wallner J, Huang Q X and Yang Y L. 2009. Integral invariants for robust geometry processing. Computer Aided Gemetric Design, 26(1):37-60
Rodolà E, Cosmo L, Bronstein M M, Torsello A and Cremers D. 2017. Partial functional correspondence. Computer Graphics Forum, 36(1):222-236[DOI:10.1111/cgf.12797]
Salti S, Tombari F and Di Stefano L. 2014. SHOT:unique signatures of histograms for surface and texture description. Computer Vision and Image Understanding, 125:251-264[DOI:10.1016/j.cviu.2014.04.011]
Su H, Maji S, Kalogerakis E and Learned-Miller E. 2015. Multi-view convolutional neural networks for 3D shape recognition//Proceedings of 2015 IEEE International Conference on Computer Vision. Santiago, Chile: IEEE: 945-953[ DOI: 10.1109/ICCV.2015.114 http://dx.doi.org/10.1109/ICCV.2015.114 ]
Sun J, Ovsjanikov M and Guibas L. 2009. A concise and provably informative multi-scale signature based on heat diffusion. Computer Graphics Forum, 28(5):1383-1392[DOI:10.1111/j.1467-8659.2009.01515.x]
Tombari F, Salti S and Di Stefano L. 2010. Unique signatures of histograms for local surface description//Proceedings of the 11th European Conference on Computer Vision. Heraklion, Greece: Springer: 356-369[ DOI: 10.1007/978-3-642-15558-1_26 http://dx.doi.org/10.1007/978-3-642-15558-1_26 ]
Wang Q, Teng Z, Xing J L, Gao J, Hu W M and Maybank S. 2018. Learning attentions: residual attentional siamese network for high performance online visual tracking//Proceedings of 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City, USA: IEEE: 4854-4863[ DOI: 10.1109/CVPR.2018.00510 http://dx.doi.org/10.1109/CVPR.2018.00510 ]
Yang J and Shi J D. 2018. Coarse-to-fine calculation for 3D isometric shape correspondence. Journal of Chongqing University of Posts and Telecommunications (Natural Science Edition), 30(6):803-811
杨军, 史纪东. 2018.由粗到精的三维等距模型对应关系计算.重庆邮电大学学报(自然科学版), 30(6):803-811[DOI:10.3979/j.issn.1673-825X.2018.06.011]
Yang J and Yan H. 2018. An algorithm for calculating shape correspondences using functional maps by calibrating base matrix of 3D Shapes. Geomatics and Information Science of Wuhan University, 43(10):1518-1525
杨军, 闫寒. 2018.校准三维模型基矩阵的函数映射的对应关系计算.武汉大学学报(信息科学版), 43(10):1518-1525[DOI:10.13203/j.whugis20160493]
Yang J, Li L J, Tian Z H and Wang X P. 2014. Research on shape correspondence of 3D isometric models differing by non-rigid deformations. Journal of Frontiers of Computer Science and Technology, 8(8):1009-1016
杨军, 李龙杰, 田振华, 王小鹏. 2014.非刚性变换的三维等距模型的对应关系研究.计算机科学与探索, 8(8):1009-1016[DOI:10.3778/j.issn.1673-9418.1405013]
Yang J, Yan H and Wang M Z. 2016. Calculation of correspondences between three-dimensional isometric shapes with the use of a fused feature descriptor. Journal of Image and Graphics, 21(5):628-635
杨军, 闫寒, 王茂正. 2016.融合特征描述符约束的3维等距模型对应关系计算.中国图象图形学报, 21(5):628-635[DOI:10.11834/jig.20160510]
Zheng Y Z. 2012. Microsoft Kinect sensor and its effect. IEEE Multimedia, 19(2):4-10[DOI:10.1109/MMUL.2012.24]
相关文章
相关作者
相关机构
京公网安备11010802024621