复合域的显著性目标检测方法
Saliency object detection method based on complex domains
- 2018年23卷第6期 页码:846-856
收稿:2017-09-08,
修回:2018-1-2,
纸质出版:2018-06-16
DOI: 10.11834/jig.170488
移动端阅览

浏览全部资源
扫码关注微信
收稿:2017-09-08,
修回:2018-1-2,
纸质出版:2018-06-16
移动端阅览
目的
2
针对显著性目标检测方法生成显著图时存在背景杂乱、检测区域不准确的问题,提出基于复合域的显著性目标检测方法。
方法
2
首先,在空间域用多尺度视网膜增强算法对原图像进行初步处理;然后,在初步处理过的图像上建立无向图并提取节点特征,重构超复数傅里叶变换到频域上得到平滑振幅谱、相位谱和欧拉谱,通过多尺度高斯核的平滑,得到背景抑制图;同时,利用小波变换在小波域上的具有多层级特性对图像提取多特征,并计算出多特征的显著性图;最后,利用提出的自适应阈值选择法将背景抑制图与多特征的显著性图进行融合,选择得到最终的显著图。
结果
2
对标准测试数据集MSRA10K和THUR15K中的图像进行显著性目标检测实验,同目前较流行的6种显著性目标检测方法对比,结果表明上述问题通过本文方法得到了很好地解决,即使在背景复杂的情况下,本文算法的准确率、召回率均高于对比算法,在MSRA10K数据集中,平均绝对误差(MAE)值为0.106,在THUR15K数据集中,平均绝对误差(MAE)值降低至0.068,平均结构性指标S-measure值为0.844 9。
结论
2
基于复合域的显著性目标检测方法,融合多个域的优势,在抑制杂乱的背景的同时提高了准确率,适用于自然景物、生物、建筑以及交通工具等显著性目标图像的检测。
Objective
2
Saliency object detection with development of human visual attention mechanism has been widely studied by computer vision researchers. Visual significance is an important mechanism of human visual system. It simulates the human visual attention mechanism
extracts the most interesting areas of the scene quickly and accurately
and ignores redundant information. Saliency object detection has been widely used in image compression
segmentation
redirection
video coding
target detection
recognition
and many other tasks. Although numerous significant target detection methods are available
problems remain. For example
the detection results look well when the background is simple
but when the background is complex
the results may have some uncertainty as regards the environment
cluttered background in the area around the target
or influence of selection on the significant target detection method. The problem of cluttered background and inaccurate detection area often occurs when the salient object detection method generates significant graphs. To solve these problems
saliency object detection method is proposed based on complex domain. The complex domain combines frequency
spatial
and wavelet domains; takes advantage of the complex domain to combine the advantages on three domains; and suppresses the background to obtain an accurate and clear salient target area.
Method
2
Environmental conditions are one of the key factors that influence saliency object detection; for example
weak light or foggy days can cause unclear images and lead to poor results of significant target detection. Multi-scale retinex is an image enhancement algorithm based on color theory. By introducing multi-scale retinex algorithm
the image restoration is realized by linear weighting in the process of dynamically scaling a picture. First
multi-scale retinex enhancement algorithm is used to preliminarily process the original image in spatial domain and exclude environmental impacts. After image processing
the brightness becomes more appropriate to the real scene brightness
and the foreground and background contrast is also significantly improved. In addition to the environmental impact
the background areas of the non-significant target often occupy most of the image space in the saliency object detection images. These background areas increase the error detection problem and reduce the accuracy rate. Experiments found that most background areas are the sky
trees
grasslands
and buildings
which are beyond the scope of this study. The characteristics of the background areas with repeatability can be suppressed by hyper-complex Fourier transform. Then
undirected graph is established and node features on the images are extracted preliminarily. The hyper-complex Fourier transform in the frequency domain is reconstructed to acquire the smoothing amplitude spectrum
phase spectrum
and Euler spectrum. Then
background suppression graphs are obtained through the smoothness of multi-scale Gaussian kernel. At the same time
the multi-level feature of wavelet transform in the wavelet domain is utilized to extract multiple features in terms of images
and the saliency graph of multiple features is calculated. The saliency graph effectively preserves the details of the image because of the unique localization characteristics of the wavelet domain. Finally
the proposed adaptive threshold selection method is used to fuse the background suppression diagram with the saliency graph of multiple features and the final saliency graph is selected and obtained. The final saliency figure suppresses the background while preserving the details of the image.
Result
2
To make the experimental effect persuasive
saliency object detection experiments in the standard test dataset images MSRA10K and THUR15k are conducted. MSRA10K datasets consist of 10 000 images of hand-annotated and accurate to pixel-level salient target annotations
including images of natural scenery
biology
architecture
and transportation. THUR15K datasets consist of 15 000 web images with five keywords
namely
butterflies
airplanes
giraffes
cups
and dogs
representing significant targets with pixel precision as the former datasets. The two datasets are public standard image databases and are widely used in salient target detection and image segmentation. A total of 300 background-complex pictures are selected from each dataset
under the same experimental conditions
and compared with six popular significant target detection methods. Results show that the problems presented by our method had a good solution. Even in a complex environment
the accuracy and recall rate of the algorithm are higher than those of state-of-the-art contrast algorithms. In MSRA10k datasets
the mean absolute error (MAE) value is 0.106; in THUR15K datasets
the mean absolute error value was reduced to 0.068
and the average structure (s) measure value was 0.844 9. The result of the MAE evaluation reflects the advantage of a saliency object detection method based on complex domain in terms of overall performance
and the s-measure indicates that the detected target is highly similar to the structure of the target of the ground truth graph.
Conclusion
2
Saliency object detection is a promising preprocessing operation in image processing and analysis. In this study
a new saliency object detection method based on complex domain is proposed. Multi-scale retinex algorithm in spatial domain can be used for pretreatment of images; it enhances contrast and prevents images from being affected by environmental factors. Hyper-complex Fourier transform in the frequency domain can suppress complex repetitive background regions
and the significant target detection method in the wavelet domain can completely describe the details of the target. Moreover
the proposed algorithm integrates the advantages of multiple domains and improves the accuracy while suppressing background clutter. Thus
the proposed algorithm is suitable for detecting significant target images
such as natural scenery
biology
architecture
and transportation. To improve the algorithm speed
our next research project aims to reduce the complexity of the algorithm by using the influence of wavelet transform function on time complexity.
Du H. Visual saliency detection research based on multiple domains and features[D]. Changchun: Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, 2016. http://cdmd.cnki.com.cn/Article/CDMD-80139-1017013963.htm .
杜慧. 基于视觉显著性的空频域多特征的目标检测方法研究[D]. 长春: 中国科学院长春光学精密机械与物理研究所, 2016.
Qian K, Li F, Wen L M, et al. Color and space distance based salient region detection using fixed threshold segmentation[J]. Computer Science, 2016, 43(1):103-106, 144.
钱堃, 李芳, 文益民, 等.基于颜色和空间距离的显著性区域固定阈值分割算法[J].计算机科学, 2016, 43(1):103-106, 144. [DOI:10.11896/j.issn.1002-137x.2016.01.024]
Xu W, Tang Z M. Exploiting hierarchical prior estimation for salient object detection[J]. Acta Automatica Sinica, 2015, 41(4):799-812.
徐威, 唐振民.利用层次先验估计的显著性目标检测[J].自动化学报, 2015, 41(4):799-812. [DOI:10.16383/j.aas.2015.c140281]
Liang Y, Yu J, Lang C Y, et al. Research on salient region extraction technology[J]. Computer Science, 2016, 43(3):27-32.
梁晔, 于剑, 郎丛妍, 等.显著区域检测技术研究[J].计算机科学, 2016, 43(3):27-32. [DOI:10.11896/j.issn.1002-137X.2016.03.005]
Lee G, Tai Y W, Kim J. ELD-Net:An efficient deep learning architecture for accurate saliency detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017. [DOI:10.1109/TPAMI.2017.2737631]
Li Y Y, Xu Y L, Ma S P, et al. Saliency detection based on deep convolutional neural network[J]. Journal of Image and Graphics, 2016, 21(1):53-59.
李岳云, 许悦雷, 马时平, 等.深度卷积神经网络的显著性检测[J].中国图象图形学报, 2016, 21(1):53-59. [DOI:10.11834/jig.20160107]
Du Y L, Li J Z, Zhang Y, et al. Saliency detection based on deep cross CNN and non-interaction GrabCut[J]. Computer Engineering and Applications, 2017, 53(3):32-40.
杜玉龙, 李建增, 张岩, 等.基于深度交叉CNN和免交互GrabCut的显著性检测[J].计算机工程与应用, 2017, 53(3):32-40. [DOI:10.3778/j.issn.1002-8331.1607-0134]
Yao Z J, Tan T Z. Saliency detection combining background and foreground prior[J]. Journal of Image and Graphics, 2017, 22(10):1381-1391.
姚钊健, 谭台哲.结合背景和前景先验的显著性检测[J].中国图象图形学报, 2017, 22(10):1381-1391. [DOI:10.11834/jig.170114]
Itti L, Koch C, Niebur E. A model of saliency-based visual attention for rapid scene analysis[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1998, 20(11):1254-1259. [DOI:10.1109/34.730558]
Harel J, Koch C, Perona P. Graph-based visual saliency[C]//Proceedings of 19th International Conference on Neural Information Processing Systems. Cambridge, USA: MIT Press, 2006: 545-552.
Hou X D, Zhang L Q. Saliency detection: a spectral residual approach[C]//Proceedings of 2007 IEEE Conference on Computer Vision and Pattern Recognition. Minneapolis, USA: IEEE, 2007: 1-8. [ DOI:10.1109/CVPR.2007.383267 http://dx.doi.org/10.1109/CVPR.2007.383267 ]
Achanta R, Hemami S, Estrada F, et al. Frequency-tuned salient region detection[C]//Proceedings of 2009 IEEE Conference on Computer Vision and Pattern Recognition. Miami, USA: IEEE, 2009: 1597-1604. [ DOI:10.1109/CVPR.2009.5206596 http://dx.doi.org/10.1109/CVPR.2009.5206596 ]
Cheng M M, Zhang G X, Mitra N J, et al. Global contrast based salient region detection[C]//Proceedings of 2011 IEEE Conference on Computer Vision and Pattern Recognition. Colorado Springs, USA: IEEE, 2011: 409-416. [ DOI:10.1109/CVPR.2011.5995344 http://dx.doi.org/10.1109/CVPR.2011.5995344 ]
LI J, Levine M D, AN X J, et al. Visual saliency based on scale-space analysis in the frequency domain[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2013, 35(4):996-1010. [DOI:10.1109/TPAMI.2012.147]
Imamoglu N, Lin W, Fang Y. A saliency detection model using low-level features based on wavelet transform[J]. IEEE Transactions on Multimedia, 2013, 15(1):96-105. [DOI:10.1109/TMM.2012.2225034]
Peng H W, Li B, Ling H B, et al.Salient object detection via structured matrix decomposition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(4):818-832. [DOI:10.1109/TPAMI.2016.2562626]
Petro A B, Sbert C, Morel J M. Multiscale retinex[J]. Image Processing on Line, 2014, 4:71-88. [DOI:10.5201/ipol.2014.107]
Guo Y C, Feng Y H, Yan G, et al. Image saliency detection in wavelet domain based on the contrast sensitivity function[J]. Journal on Communications, 2015, 36(10):47-55.
郭迎春, 冯艳红, 阎刚, 等.基于对比敏感度的小波域图像显著性检测[J].通信学报, 2015, 36(10):47-55. [DOI:10.11959/j.issn.1000-436x.2015262]
Chen Z X, He C, Liu C Y. Image saliency target detection based on global features and local features[J]. Control and Decision, 2016, 31(10):1899-1902.
陈振学, 贺超, 刘成云.基于局部特征与全局特征的图像显著性目标检测[J].控制与决策, 2016, 31(10):1899-1902. [DOI:10.13195/j.kzyjc.2015.1017]
Fan D P, Cheng M M, Liu Y, et al. Structure-measure: A new way to evaluate foreground maps[C]//Proceedings of 2017 IEEE International Conference on Computer Vision. Venice, Italy: IEEE, 2017: 4558-4567. [ DOI:10.1109/ICCV.2017.487 http://dx.doi.org/10.1109/ICCV.2017.487 ]
相关作者
相关机构
京公网安备11010802024621