反映人的认知习惯的商品检索方法
Commodity retrieval method reflecting people's cognitive habits
- 2019年24卷第4期 页码:573-582
收稿:2018-05-31,
修回:2018-9-28,
纸质出版:2019-04-16
DOI: 10.11834/jig.180348
移动端阅览

浏览全部资源
扫码关注微信
收稿:2018-05-31,
修回:2018-9-28,
纸质出版:2019-04-16
移动端阅览
目的
2
便捷的商品检索是用户网络购物体验良好的关键环节。由于电商对商品描述方式的规范性要求以及用户对商品属性理解差异等问题,基于关键词的检索方法在商品检索的应用并不理想。近年来,以图搜图的检索方式在各大电商平台上得到越来越多的应用,但检索结果往往不尽如人意。为此,提出了一种新的检索思路,从商品外观设计特征出发,将人们对商品的认知模式引入到商品图片的检索过程,从而获得更符合人们预期的检索结果。
方法
2
以时尚女包商品为例,在分析设计师的设计规范的基础上,将外观设计特征分解为形状特征、颜色特征和设计元素特征。利用深度卷积神经网络建模、提取特征,并使用哈希方法和Top3类内检索算法加快检索速度。
结果
2
利用建立的商品数据集构建3个对应的特征模型,并进行分类识别和图像检索实验。结果表明,各个模型Top1的识别准确率均小于95%,而Top3的识别准确率均在98.5%以上;商品检索速度加快了将近3.5倍。实验及用户调查结果表明,本文提出的检索方法与淘宝、百度图片等基于图像的检索工具相比,检索结果更为多样,与原图像相似度更高。
结论
2
本文提出的从商品外观设计规范出发、与人的认知模式相结合的商品检索方法,更能满足用户的检索意图,可用于时尚女包商品检索,对基于图像的其他商品的检索方法的研究具有借鉴意义。
Objective
2
Rapid and convenient product retrieval is the key for excellent user experience in online shopping. The application of keyword-based retrieval in commodity retrieval is ineffective because of problems
such as standardization of the description of goods and the differences in the understanding of the attributes of the goods by the users. In recent years
"search by image" has been increasingly used in e-commerce platforms. Retrieval technology is constantly improving
from text-based image retrieval to content-based image retrieval
and then to utilizing deep learning to achieve image retrieval. However
retrieval results are often unsatisfactory. These methods cannot rapidly and accurately retrieve results that satisfy people's expectations
thereby lacking excellent user experience. Therefore
a new method of commodity retrieval is proposed. From the features of the commodity design
the image feature is obtained using the complete picture information as well as the human cognition of the goods
which is introduced into the retrieval process of the commodity picture to obtain the desired results.
Method
2
Human cognition of commodities is a type of subconsciousness formed by human experience
which corresponds to the designers' norms. We can obtain results that are consistent with human cognitive retrieval results by studying the commodity design specifications and designing commodity features and then using these features for commodity retrieval. We select fashionable women's bags as the research object. Women's bags are a necessity and favorable to women; thus
bags have practical relevance to the study. Moreover
the design elements of women's bags are relatively independent and flexible. Thus
using traditional image retrieval methods is difficult to satisfy user's retrieval intentions. Therefore
studying similar searches of women's bags is necessary. The design features are decomposed into shape
color
and design element features based on the designers' specifications (such as tassel
chain
and zipper). A deep convolution neural network is used to construct classification models for the three features. The features of each picture are then extracted
and three feature sets are established for similarity comparison in retrieval. The shape
color
and design element picture sets are established to construct the feature models that correspond to shape
color
and local design elements
respectively. Each picture set must be marked in advance. The shape picture set is marked by 14 categories
including shell
Boston
and platinum bags. The color picture set is marked by 13 categories
including red
orange
and yellow. The design element picture set is marked by 11 categories
including strip closure
zipper decoration
and diamond grille. Adding a Hashing layer into the deep convolution neural network and extracting Hashing layer data as image features can provide feature binarization and simplify the calculation. At the same time
in the retrieval process
using the proposed Top3 within-class retrieval algorithm can reduce the algorithm complexity. Searching can be according to the classification features
namely
shape
color
and design elements
selected by users in real time. Thus
the retrieval results reflect the users' intention of commodity search. Given a picture of a fashion woman bag image to be retrieved
the corresponding classification model is called after the user selects the classification features. First
the classification of the image under a feature is recognized
and the image feature is then extracted. Subsequently
the Euclidean distance is calculated with all the images in Top3. Finally
the retrieval results are returned in order of similarity.
Result
2
The dataset is currently the only one dedicated to the search of fashionable women's bags. Notably
the design element picture set contains not only the overall picture of bags but also the segmented design element picture. The dataset and feature models are used for classification recognition and image retrieval experiments. Results show that the recognition accuracy of each model of the Top1 algorithm is less than 95%
whereas the recognition accuracy of the Top3 is more than 98.5%. Using Top3 within-class retrieval algorithm can speed up the retrieval and ensure the accuracy of the retrieval results as much as possible. At the same time
the use of Hashing method and Top3 within-class retrieval algorithm results in nearly 3.5 times faster retrieval speed and greatly improves the retrieval efficiency. When multiple features for commodity retrieval are used
the corresponding weights of color
shape
and design elements are 0.6
0.2
and 0.2 respectively. These weights can be defined by the users in real time to reflect the changes of users' attention to different features during the retrieval process.
Conclusion
2
A method of commodity retrieval that is based on the commodity appearance design criterion and is combined with people's cognitive model
is proposed. In comparison with image-based retrieval tools
such as Taobao and Baidu
the retrieval results are more similar to the original image and more in line with people's expectations. At the same time
according to the user's preference
the proposed method can synthetically query according to single and multiple features
and the retrieval results are diversified. In addition
we use the global features of shape and color and the local feature of design elements to conduct a survey of online users' retrieval satisfaction. The survey results show that the user satisfaction of Taobao and Baidu pictures is similar. However
the user satisfaction of women's bag retrieval results obtained by the proposed method is remarkably higher than those of Taobao and Baidu pictures
which is more consistent with human cognition. The proposed method is suitable for the retrieval of fashionable women's bags and can be used for reference in the research of image-based retrieval methods for other goods. At present
for a given bag picture
the design elements are obtained by interactive manual segmentation in the process of similar bag retrieval. In future works
we can study the method of identifying the design elements of women's package to realize the automatic identification and segmentation of design elements
thereby improving the automation of women's package retrieval and the practical value of the proposed method.
Yu J Q, Wu Z B, Wu F, et al. Multimedia technology 2016:advances and trends in image retrieval[J].Journal of Image and Graphics, 2017, 22(11):1467-1485.
于俊清, 吴泽斌, 吴飞, 等.多媒体工程:2016——图像检索研究进展与发展趋势[J].中国图象图形学报, 2017, 22(11):1467-1485. [DOI:10.11834/jig.170503]
Li W, Duan L X, Xu D, et al. Text-based image retrieval using progressive multi-instance learning[C]//Proceedings of 2011 International Conference on Computer Vision. Barcelona, Spain: IEEE, 2011: 2049-2055.[ DOI:10.1109/ICCV.2011.6126478 http://dx.doi.org/10.1109/ICCV.2011.6126478 ]
Baysal S, Kurt M C, Aydogdu G, et al. Content-based image retrieval in store catalogs[C]//Proceedings of the 17th IEEE Signal Processing and Communications Applications Conference. Antalya, Turkey: IEEE, 2009: 808-811.[ DOI:10.1109/SIU.2009.5136519 http://dx.doi.org/10.1109/SIU.2009.5136519 ]
Jain S, Dhar J. Image based search engine using deep learning[C]//Proceedings of the 10th International Conference on Contemporary Computing. Noida, India: IEEE, 2017: 1-7.[ DOI:10.1109/IC3.2017.8284301 http://dx.doi.org/10.1109/IC3.2017.8284301 ]
Juan L, Gwon O. A comparison of SIFT, PCA-SIFT and SURF[J]. International Journal of Image Processing, 2009, 3(4):143-152.
Krizhevsky A, Sutskever I, Hinton G E. ImageNet classification with deep convolutional neural networks[C]//Proceedings of the 25th International Conference on Neural Information Processing Systems. Lake Tahoe, Nevada: ACM, 2012: 1097-1105.
Hinton G, Deng L, Yu D, et al. Deep neural networks for acoustic modeling in speech recognition:the shared views of four research groups[J]. IEEE Signal Processing Magazine, 2012, 29(6):82-97.[DOI:10.1109/MSP.2012.2205597]
Dong Y, Zhang T, Hou X D, et al. A new steganalysis paradigm based on image retrieval of similar image-inherent statistical properties and outlier detection[C]//Proceedings of 2015 International Conference on Wireless Communications & Signal Processing. Nanjing, China: IEEE, 2015: 1-5.[ DOI:10.1109/WCSP.2015.7340977 http://dx.doi.org/10.1109/WCSP.2015.7340977 ]
Liu Z W, Luo P, Qiu S, et al. DeepFashion: powering robust clothes recognition and retrieval with rich annotations[C]//Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas, NV, USA: IEEE, 2016: 1096-1104.[ DOI:10.1109/CVPR.2016.124 http://dx.doi.org/10.1109/CVPR.2016.124 ]
Huang J S, Feris R, Chen Q, et al. Cross-domain image retrieval with a dual attribute-aware ranking network[C]//Proceedings of 2015 IEEE International Conference on Computer Vision. Santiago, Chile: IEEE, 2015: 1062-1070.[ DOI:10.1109/ICCV.2015.127 http://dx.doi.org/10.1109/ICCV.2015.127 ]
Kiapour M H, Han X F, Lazebnik S, et al. Where to buy it: matching street clothing photos in online shops[C]//Proceedings of 2015 IEEE International Conference on Computer Vision. Santiago, Chile: IEEE, 2015: 3343-3351.[ DOI:10.1109/ICCV.2015.382 http://dx.doi.org/10.1109/ICCV.2015.382 ]
Wright J, Ganesh A, Zhou Z H, et al. Demo: robust face recognition via sparse representation[C]//Proceedings of the 8th IEEE International Conference on Automatic Face & Gesture Recognition. Amsterdam, Netherlands: IEEE,2008: 1-2.[ DOI:10.1109/AFGR.2008.4813404 http://dx.doi.org/10.1109/AFGR.2008.4813404 ]
Li T, Cao W Q, Liu J J, et al. Similar face search based on face cognition[J]. Journal of Image and Graphics, 2016, 21(10):1365-1375.
李彤, 曹卫群, 刘晶晶, 等.基于人脸认知模式的相似脸搜索[J].中国图象图形学报, 2016, 21(10):1365-1375. [DOI:10.11834/jig.20161011]
Wang L X. Luggage Making Technology and Production Management[M]. Beijing:Chemical Industry Publishing House, 2008:176-177.
王立新.箱包制作技术与生产经营管理[M].北京:化学工业出版社, 2008:176-177.
Liu Y, Pan Y, Xia R K, et al. FP-CNNH:a fast image hashing algorithm based on deep convolutional neural network[J]. Computer Science, 2016, 43(9):39-46, 51.
刘冶, 潘炎, 夏榕楷, 等. FP-CNNH:一种基于深度卷积神经网络的快速图像哈希算法[J].计算机科学, 2016, 43(9):39-46, 51. [DOI:10.11896/j.issn.1002-137X.2016.09.007]
Wen Z K, Zhu W Z, Ouyang J, et al. A robust and discriminative image perceptual hash algorithm[C]//Proceedings of the 4th International Conference on Genetic and Evolutionary Computing. Shenzhen, China: IEEE, 2011: 709-712.[ DOI:10.1109/ICGEC.2010.180 http://dx.doi.org/10.1109/ICGEC.2010.180 ]
Shi Z Z. Cognitive Science[M]. Hefei:University of Science&Technology of China Press, 2008:15-18.
史忠植.认知科学[M].合肥:中国科学技术大学出版社, 2008:15-18.
Zhong Y P, Ye M L. Cognitive Psychology[M]. Hefei:Anhui Peoples Publishing House, 2010:1-4.
钟毅平, 叶茂林.认知心理学高级教程[M].合肥:安徽人民出版社, 2010:1-4.
Lin K, Yang H F, Hsiao J H, et al. Deep learning of binary hash codes for fast image retrieval[C]//Proceedings of 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops. Boston, MA, USA: IEEE, 2015: 27-35.[ DOI:10.1109/CVPRW.2015.7301269 http://dx.doi.org/10.1109/CVPRW.2015.7301269 ]
Cheng Y Q. Bag Design and Out of Print[M]. Guangzhou:South China University of Technology Press, 2005:6-8.
程远强.包袋设计与出格[M].广州:华南理工大学出版社, 2005:6-8.
Wang L X. Luggage and Art Design[M]. Beijing:Chemical Industry Press, 2006:56-58.
王立新.箱包艺术设计[M].北京:化学工业出版社, 2006:56-58.
Shi H K. Similarity measurement in image retrieval[J]. Fujian Computer, 2010, 26(3):50-51.
时慧琨.图像检索中的相似性度量方法[J].福建电脑, 2010, 26(3):50-51. [DOI:10.3969/j.issn.1673-2782.2010.03.033]
相关作者
相关机构
京公网安备11010802024621