形状的圆内距离变换
Inside-circle distance transform for shapes
- 2018年23卷第1期 页码:39-51
收稿:2017-07-09,
修回:2017-10-7,
纸质出版:2018-01-16
DOI: 10.11834/jig.170362
移动端阅览

浏览全部资源
扫码关注微信
收稿:2017-07-09,
修回:2017-10-7,
纸质出版:2018-01-16
移动端阅览
目的
2
形状的描述、匹配、相似性判定和检索是计算机视觉和图像识别的基本问题,也是一个开问题。在目前公开的方法中,除了只能应用于简单形状的几何复变换和基于边界的傅里叶描述子外,其他的方法均不能由构建的形状特征描述符重建原形状,因此不能保证所建立的形状特征能客观地描述原形状。本文提出了形状的圆内距离变换,该方法所建立的描述符可用于形状匹配、相似性度量和形状检索。该方法是可逆的,也就是可以从形状描述符重建原形状。
方法
2
形状的圆内距离变换通过在形状的最小外接圆内旋转和切分形状,求出形状相邻切分点之间的距离,并由此构建形状的特征矩阵。对于任意相似的形状,从理论上证明了形状的圆内距离变换具有缩放、旋转和位移不变性。
结果
2
对发生了形变、扭曲和仿射变换的形状,采用圆内距离变换方法进行了形状的相似性度量、检索和重建实验,结果表明,形状的圆内距离变换可以准确地描述形状、度量形状的相似性、检索形状并重建原形状。在形状的相似性度量上,形状的圆内距离变换能给出与人类视觉一致的结果,并且当两个形状相似时,还能计算出它们的尺度缩放和角度旋转。通过与经典的方法,包括形状上下文方法、傅里叶描述子方法、拉东柱状图方法,针对典型的MPEG-7形状库进行对比实验,发现形状的圆内距离变换在形状检索的综合得分上相比这些经典方法提高了近20%。
结论
2
形状的圆内距离变换在形状的描述、相似性判定和检索上是有效和可逆的,具有广泛的可适用性且优于本文比较的其他经典方法。
Objective
2
Description
matching
similarity measurement
and retrieval of shapes are basic tasks in computer vision
image recognition
and machine intelligence
and they remain as open issues. Except for the methods of geometry complex transform and shape contour-based Fourier descriptor
all other methods are not information preserving
which means that the original shapes cannot be reconstructed from their descriptors. Consequently
the descriptors cannot be guaranteed to fully represent the characteristics of original shapes. Although geometry complex transform and shape contour-based Fourier descriptor are information preserving
they are only applicable to simple closed shapes or some other special types of shapes
which limits their applications. We propose a generic shape descriptor
which is named inside-circle distance transform (ICD). The ICD method can be used for matching
similarity measurement
and retrieval of any shape with obvious contours
and it is information preserving.
Method
2
In the ICD method
we initially calculate the minimum circumscribed circle of a shape. Then
we draw a set of equidistant parallel lines that are perpendicular to x-axes
calculate the intersections of each line and shape contour
and compute the distance vector for each line. We finally form a distance matrix with all distance vectors. We yield another distance matrix by rotating the shape anticlockwise around the center of its minimum circumscribed circle by $θ$ degree and repeating the aforementioned process. A set of distance matrices is generated by repeating the process for[360/$θ$] times. With all distance matrices of the shape in hand
we construct the feature matrix of the shape. The feature matrix of a shape is a representation of the original shape with rich information
which can be used to describe the original shape
to measure the similarity of two shapes and reconstruct the original shape. Therefore
ICD is an information-preserving method. This capability of ICD to reconstruct original shapes is useful. We can thoroughly understand the intrinsic of shapes using the ICD method. Feature matrix is a powerful tool for shape representation and shape matching. We prove that ICD is scaling
rotation
and translation invariant
which is an important property in shape description
matching
and retrieval.
Results
2
We construct 40 shapes to verify the capability and test the effectiveness of the ICD method. The shapes are categorized into eight classes
with five shapes in each class. In each of these eight classes
one basic shape exists
and the others are deformations of the basic shape through modifications
such as twisting the contour or adding noise. We further expand the set of shapes by performing affine transformation with random scale factor
random rotation angle
and random translation of position to generate two new shapes for each of the 40 shapes. This expansion results in a total of 120 shapes. We initially perform a similarity measurement between each of the eight basic shapes and each of 12 randomly selected shapes. Experimental result shows that in similarity measurement of shapes
the ICD method generates the same vision results as those of human vision. If two shapes are determined to be similar
then the ICD method can calculate their scaling factor and orientation differences. We also test the effectiveness of retrieval through the well-known method of "Bullseye score." Results show that 38 out of 40 subclasses achieve a score of 100
which is extremely satisfactory. We compare the ICD method with three other classic shape description and matching methods
namely
shape context method
histogram of Radon transform
and generic Fourier descriptor
on the basis of the widely used shape database of MPEG-7. This database consists of 70 classes
with 20 shapes in each class and hence a total of 1400 shapes. The Bullseye score method is adopted. Results indicate that all the methods under evaluation have their own advantages and disadvantages with respect to different shape classes. Moreover
on average
the test score of the ICD method is approximately 20 points higher than those of the other three classic methods. ICD shows significant effects outperforming those of the other methods in all experiments. In the reconstruction experiments
we randomly select four shapes from the MPEG-7 database and reconstruct them by using the ICD method with varying parameters $k$ and $θ$
where $k$=30
50
150 and $θ$=1
6. Experiment results show that the reconstructed shapes become accurate with the increase of parameter $k$ and decrease of parameter $θ$. The reconstruction experiments also imply that shape reconstruction is more sensitive to parameter $k$ than to parameter $θ$. In application scenarios in which the rotational angle is insignificant
$θ$=3 is an optimal recommendation by our experiments.
Conclusion
2
The ICD method and its corresponding feature matrix can be used to represent
match
and retrieve shapes effectively. This method has the prominent feature of being information preserving
thereby assuring that it represents a shape without losing information. When this method is used to compute similarity of shapes
it can generate the same result as that of human vision. For two similar shapes
the ICD method can compute their scale factor and rotation differences. Theoretical analysis
mathematical proofs
and experiments show that ICD is effective
useful
and information preserving
and it outperforms several important classic methods.
Yang M Q, Kpalma K, Ronsin J. A survey of shape feature extraction techniques[M]//Yin P Y. Pattern Recognition, Techniques, Technology and Applications. Vienna, Austria:I-Tech, 2008:43-90.[ DOI:10.5772/6237 http://dx.doi.org/10.5772/6237 ]
Belongie S, Malik J, Puzicha J. Shape matching and object recognition using shape contexts[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(4):509-522.[DOI:10.1109/34.993558]
Ling H B, Jacobs D W. Shape classification using the inner-distance[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2007, 29(2):286-299.[DOI:10.1109/TPAMI.2007.41]
Iivarinen J, Visa A J E. Shape recognition of irregular objects[C]//Proceedings of the Volume 2904, Intelligent Robots and Computer. Boston, MA, United States:SPIE, 1996:25-32.[ DOI:10.1117/12.256280 http://dx.doi.org/10.1117/12.256280 ]
Wu S G, Wang K, Lu L J, et al. GCT transform and similarity determination of geometry shapes[J]. Journal of Image and Graphics, 2016, 21(12):1671-1684.
吴绍根, 王康, 路利军, 等. GCT变换及几何图形形状相似性判定[J].中国图象图形学报, 2016, 21(12):1671-1684.][DOI:10.11834/jig.20161212
Presles B, Debayle J. A distance-based shape descriptor invariant to similitude and its application to shape classification[C]//Proceedings of the 23rd International Conference on Pattern Recognition. Cancun, México:IEEE, 2016:2598-2603.[ DOI:10.1109/ICPR.2016.7900027 http://dx.doi.org/10.1109/ICPR.2016.7900027 ]
Revollo N V, Delrieux C A, González-José R. Set of bilateral and radial symmetry shape descriptor based on contour information[J]. IET Computer Vision, 2017, 11(3):226-236.[DOI:10.1049/iet-cvi.2015.0413]
Zhang D S, Lu G J. A comparative study of fourier descriptors for shape representation and retrieval[C]//Proceedings of the 5th Asian Conference on Computer Vision. Melbourne, Australia:ACCV, 2002:646-651.
Zhang D S, Lu G J. Shape-based image retrieval using generic Fourier descriptor[J]. Signal Processing:Image Communication, 2002, 17(10):825-848.[DOI:10.1016/S0923-5965(02)00084-X]
Tabbone S, Terrades O R, Barrat S. Histogram of radon transform. A useful descriptor for shape retrieval[C]//Proceedings of the 19th International Conference on Pattern Recognition. Tampa, FL, USA:IEEE, 2008:1-4.[ DOI:10.1109/ICPR.2008.4761555 http://dx.doi.org/10.1109/ICPR.2008.4761555 ]
Hoang T V, Tabbone S. The generalization of the $R$-transform for invariant pattern representation[J]. Pattern Recognition, 2012, 45(6):2145-2163.[DOI:10.1016/j.patcog.2011.11.007]
Hasegawa M, Tabbone S. Amplitude-only log Radon transform for geometric invariant shape descriptor[J]. Pattern Recognition, 2014, 47(2):643-658.[DOI:10.1016/j.patcog.2013.07.024]
Hasegawa M, Tabbone S. Histogram of Radon transform with angle correlation matrix for distortion invariant shape descriptor[J]. Neurocomputing, 2016, 173:24-35.[DOI:10.1016/j.neucom.2015.04.100]
Wang B, Gao Y S. Structure integral transform versus radon transform:a 2D mathematical tool for invariant shape recognition[J]. IEEE Transactions on Image Processing, 2016, 25(12):5635-5648.[DOI:10.1109/TIP.2016.2609816]
Mokhtarian F. Silhouette-based object recognition with occlusion through curvature scale space[C]//Proceedings of the 4th European Conference on Computer Vision. Cambridge, UK:Springer, 1996:566-578.[ DOI:10.1007/BFb0015567 http://dx.doi.org/10.1007/BFb0015567 ]
Hong B W, Soatto S. Shape matching using multiscale integral invariants[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(1):151-160.[DOI:10.1109/TPAMI.2014.2342215]
Yang J Y, Wang H X, Yuan J S, et al. Invariant multi-scale descriptor for shape representation, matching and retrieval[J]. Computer Vision and Image Understanding, 2016, 145:43-58.[DOI:10.1016/j.cviu.2016.01.005]
相关作者
相关机构
京公网安备11010802024621