基于自步学习的鲁棒多样性多视角聚类
Robust and diverse multi-view clustering based on self-paced learning
- 2019年24卷第8期 页码:1338-1348
收稿:2018-12-18,
修回:2019-3-7,
纸质出版:2019-08-16
DOI: 10.11834/jig.180669
移动端阅览

浏览全部资源
扫码关注微信
收稿:2018-12-18,
修回:2019-3-7,
纸质出版:2019-08-16
移动端阅览
目的
2
大数据环境下的多视角聚类是一个非常有价值且极具挑战性的问题。现有的适合大规模多视角数据聚类的方法虽然在一定程度上能够克服由于目标函数非凸性导致的局部最小值,但是缺乏对异常点鲁棒性的考虑,且在样本选择过程中忽略了视角多样性。针对以上问题,提出一种基于自步学习的鲁棒多样性多视角聚类模型(RD-MSPL)。
方法
2
1)通过在目标函数中引入结构稀疏范数
$$\mathrm{L}_{2
1}$$
来建模异常点;2)通过在自步正则项中对样本权值矩阵施加反结构稀疏约束来增加在多个视角下所选择样本的多样性。
结果
2
在Extended Yale B、Notting-Hill、COIL-20和Scene15公开数据集上的实验结果表明:1)在4个数据集上,所提出的RD-MSPL均优于现有的2个最相关多视角聚类方法。与鲁棒多视角聚类方法(RMKMC)相比,聚类准确率分别提升4.9%,4.8%,3.3%和1.3%;与MSPL相比,准确率分别提升7.9%,4.2%,7.1%和6.5%。2)通过自对比实验,证实了所提模型考虑鲁棒性和样本多样性的有效性;3)与单视角以及多个视角简单拼接的实验对比表明,RD-MSPL能够更有效地探索视角之间关联关系。
结论
2
本文提出一种基于自步学习的鲁棒多样性多视角聚类模型,并针对该模型设计了一种高效求解算法。所提方法能够有效克服异常点对聚类性能的影响,在聚类过程中逐步加入不同视角下的多样性样本,在避免局部最小值的同时,能更好地获取不同视角的互补信息。实验结果表明,本文方法优于现有的相关方法。
Objective
2
In real-world applications
datasets naturally comprise multiple views. For instance
in computer vision
images can be described by different features
such as color
edge
and texture; a web page can be described by the words appearing on the web page itself and the hyperlinks pointing to them; and a person can be recognized by their face
fingerprint
iris
and signature. Clustering aims to explore meaningful patterns in an unsupervised manner. In the era of big data
with the rapid increase of multi-view data
obtaining better clustering performance than any single view by using complementary information from different views is a valuable and challenging task. Popular multi-view clustering methods can be roughly divided into two categories:spectral clustering based and nonnegative matrix factorization (NMF) based. Multi-view spectral clustering methods can have superior performance in nonlinear separate data partitioning. However
the high computational complexity due to the feature decomposition of Laplacian matrix limits their applications in large-scale data clustering. Conversely
the classical
$$K$$
-means clustering method
which has been proven to be equivalent to NMF
is often used in the big data environment because of its low computational complexity and convenient parallelization. Several studies have extended single-view
$$K$$
-means to a multi-view setting. To a certain extent
multi-view self-paced learning (MSPL) can overcome bad local minima due to non-convex objective functions. However
two drawbacks need to be solved. First
MSPL lacks robustness for the data outliers. Second
MSPL considers only the criterion that samples should be added to the clustering process from easy to more complex sequences while ignoring the diversity in the sample selection process. To solve the above two problems
we propose a robust and diverse multi-view clustering model based on self-paced learning (RD-MSPL).
Method
2
The robust
$$K$$
-means clustering method is needed to achieve a more stable clustering performance with respect to a fixed initialization. To address this problem
we introduce a structural sparsity norm (
$$\mathrm{L}_{2
1}$$
-norm) into the objective function to replace the
$$\mathrm{L}_{2}$$
-norm. The
$$\mathrm{L}_{2
1}$$
-norm-based clustering objective enforces the
$$\mathrm{L}_{1}$$
-norm along the data point direction of data matrix and
$$\mathrm{L}_{2}$$
-norm along the feature direction. Thus
the effect of outlier data points in clustering is reduced by the
$$\mathrm{L}_{1}$$
-norm. In addition
ideal self-paced learning should utilize not only easy but also diverse examples that are sufficiently dissimilar from what has already been learned. To achieve this goal
we apply the negative
$$\mathrm{L}_{2
1}$$
-norm constraints to the sample weight matrix in the self-paced regularization. As discussed above
the
$$\mathrm{L}_{2
1}$$
-norm leads to group-wise sparse representation (i.e.
nonzero entries tend to be concentrated in a small number of groups). By contrast
the negative
$$\mathrm{L}_{2
1}$$
-norm should have a countereffect to groupwise sparsity (i.e.
nonzero entries tend to be scattered across a large number of groups). The anti-structure sparse constraint is expected to realize the diversity of samples selected from multiple views. The difficulty of solving the proposed objective comes from the
$$\mathrm{L}_{2
1}$$
-norm non-smoothness. In this study
we propose an effective algorithm to handle this problem.
Result
2
We perform experiments on four public datasets
namely
extended Yale B
Notting-Hill
COIL-20
and Scene15. The clustering performance is measured using six popular metrics:normalized mutual information (NMI)
accuracy (ACC)
adjusted rank index (AR)
F-score
precision
and recall. Higher metrics correspond to improved performance. Those metrics favor different properties in the clustering such that a comprehensive evaluation can be achieved. In all datasets
the reported final results on those metrics are measured by the average and standard derivation of 20 runs. We highlight the best values in bold in each table. First
we compare our proposal with robust multi-view
$$K$$
-means clustering (RMKMC) and MSPL
which are the most relevant multi-view clustering methods. The experimental results indicate that the proposed RD-MSPL is superior to these two methods in almost all metrics except for the recall metric on the Notting-Hill dataset. Then
we experimentally prove the importance of two key components in the proposed model (i.e.
model robustness and sample diversity). Finally
we compare the proposed RD-MSPL with single view and concatenated multiple views. Its superior performance confirms that RD-MSPL can better capture complementary information and explore the relationship among multiple views. In the proposed model
two self-paced learning parameters influence the clustering performance. These two parameters control the pace at which the model learns new and diverse examples separately
and they usually increase iteratively during optimization. In this study
we conduct further parameter sensitivity analysis to better understand the characteristics of our RD-MSPL model. The experimental results show that although these two parameters play an important role in performance
most results are still better than the single-view baseline.
Conclusion
2
In this paper
a new model called RD-MSPL is proposed to perform large-scale multi-view data clustering. The proposed model can effectively overcome the effect of outliers. In the clustering process
with the gradual addition of diverse samples from different views
our proposed method can better obtain complementary information from different views while avoiding the local minima. We conduct a series of comparative analyses with several existing methods on multiple datasets. The experimental results show that the proposed model is superior to the existing related multi-view clustering methods. Future research will focus on 1) expanding the applicability of the method to a wider range of data with kernel trick because the proposed method is based on the assumption that all the features are on linear manifolds and 2) the importance of the adaptive learning approach for the self-paced learning parameter in such unsupervised setting.
Xie Y, Tao D C, Zhang W S, et al. On unifying multi-view self-representation for clustering by tensor multi-rank minimization[J]. International Journal of Computer Vision, 2018, 126(11):1157-1179.[DOI:10.1007/s11263-018-1086-2]
Chao G Q, Sun S L, Bi J B. A survey on multi-view clustering[J]. arXiv: 1712.06246, 2017.
Zhao Y W, Zhang E H, Lu J W, et al. Gait recognition via multiple features and views information fusion[J]. Journal of Image and Graphics, 2009, 14(3):388-393.
赵永伟, 张二虎, 鲁继文, 等.多特征和多视角信息融合的步态识别[J].中国图象图形学报, 2009, 14(3):388-393. [DOI:10.11834/jig.20090302]
Zong L L. Research on multi-view clustering[D]. Dalian: Dalian University of Technology, 2017. http://cdmd.cnki.com.cn/Article/CDMD-10141-1017188571.htm .
宗林林.多视角聚类研究[D].大连: 大连理工大学, 2017.
Liu X P, Lu J T, Xie W J. Foot plant detection based on spectral clustering algorithm for motion capture data[J]. Journal of Image and Graphics, 2014, 19(9):1306-1315.
刘晓平, 陆劲挺, 谢文军.运动捕捉数据中足迹的谱聚类检测方法[J].中国图象图形学报, 2014, 19(9):1306-1315. [DOI:10.11834/jig.20140907]
Wang X B, Guo X J, Lei Z, et al. Exclusivity-consistency regularized multi-view subspace clustering[C]//Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition. Honolulu, HI, USA: IEEE, 2017: 1-9.[ DOI: 10.1109/CVPR.2017.8 http://dx.doi.org/10.1109/CVPR.2017.8 ]
Wang Y, Wu L, Lin X M, et al. Multiview spectral clustering via structured low-rank matrix factorization[J]. IEEE Transactions on Neural Networks and Learning Systems, 2018, 29(10):4833-4843.[DOI:10.1109/TNNLS.2017.2777489]
Yin M, Gao J B, Xie S L, et al. Multiview subspace clustering via tensorial t-product representation[J]. IEEE Transactions on Neural Networks and Learning Systems, 2019, 30(3):851-864.[DOI:10.1109/TNNLS.2018.2851444]
Zhang C Q, Fu H Z, Liu S, et al. Low-rank tensor constrained multiview subspace clustering[C]//Proceedings of the 15th IEEE International Conference on Computer Vision. Santiago: IEEE, 2015: 1582-1590.[ DOI: 10.1109/ICCV.2015.185 http://dx.doi.org/10.1109/ICCV.2015.185 ]
Li T, Ding C. The relationships among various nonnegative matrix factorization methods for clustering[C]//Proceedings of the 6th IEEE International Conference on Data Mining. Hong Kong, China: IEEE, 2006: 362-371.[ DOI: 10.1109/ICDM.2006.160 http://dx.doi.org/10.1109/ICDM.2006.160 ]
Cai X, Nie F P, Huang H. Multi-view K -means clustering on big data[C]//Proceedings of the 23rd International Joint Conference on Artificial Intelligence. Beijing: AAAI Press, 2013: 2598-2604.
Xu J L, Han J W, Nie F P, et al. Re-weighted discriminatively embedded K -means for multi-view clustering[J]. IEEE Transactions on Image Processing, 2017, 26(6):3016-3027.[DOI:10.1109/TIP.2017.2665976]
Xu C, Tao D C, Xu C. Multi-view self-paced learning for clustering[C]//Proceedings of the 24th International Conference on Artificial Intelligence. Buenos Aires, Argentina: AAAI Press, 2015: 3974-3980.
Ding C, He X F, Simon H D. Nonnegative Lagrangian relaxation of K -means and spectral clustering[C]//Proceedings of the 16th European Conference on Machine Learning. Porto, Portugal: Springer, 2005: 530-538.[ DOI: 10.1007/11564096_51 http://dx.doi.org/10.1007/11564096_51 ].
Jiang L, Meng D Y, Yu S I, et al. Self-paced learning with diversity[C]//Proceedings of the 27th International Conference on Neural Information Processing Systems. Montreal: MTI Press, 2014: 2078-2086.
Zhang D W, Meng D Y, Han J W. Co-saliency detection via a self-paced multiple-instance learning framework[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(5):865-878.[DOI:10.1109/TPAMI.2016.2567393]
Ojala T, Pietikainen M, Maenpaa T. Multiresolution gray-scale and rotation invariant texture classification with local binary patterns[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(7):971-987.[DOI:10.1109/TPAMI.2002.1017623]
Lades M, Vorbruggen J C, Buhmann J, et al. Distortion invariant object recognition in the dynamic link architecture[J]. IEEE Transactions on Computers, 1993, 42(3):300-311.[DOI:10.1109/12.210173]
Zhang Y F, Xu C S, Lu H Q, et al. Character identification in feature-length films using global face-name matching[J]. IEEE Transactions on Multimedia, 2009, 11(7):1276-1288.[DOI:10.1109/TMM.2009.2030629]
Cao X C, Zhang C Q, Fu H Z, et al. Diversity-induced multi-view subspace clustering[C]//Proceedings of the 28th IEEE Conference on Computer Vision and Pattern Recognition. Boston: IEEE, 2015: 586-594.[ DOI: 10.1109/CVPR.2015.7298657 http://dx.doi.org/10.1109/CVPR.2015.7298657 ]
Bosch A, Zisserman A, Munoz X. Image classification using random forests and ferns[C]//Proceedings of the 11th IEEE International Conference on Computer Vision. Rio de Janeiro, Brazil: IEEE, 2007: 1-8.[ DOI: 10.1109/ICCV.2007.4409066 http://dx.doi.org/10.1109/ICCV.2007.4409066 ]
Qi X B, Xiao R, Li C G, et al. Pairwise rotation invariant co-occurrence local binary pattern[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2014, 36(11):2199-2213.[DOI:10.1109/TPAMI.2014.2316826]
Wu J X, Rehg J M. CENTRIST:a visual descriptor for scene categorization[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2011, 33(8):1489-1501.[DOI:10.1109/TPAMI.2010.224]
Cai D, He X F, Han J W. Document clustering using locality preserving indexing[J]. IEEE Transactions on Knowledge and Data Engineering, 2005, 17(12):1624-1637.[DOI:10.1109/TKDE.2005.198]
Manning C D, Raghavan P, Schütze H. Introduction to Information Retrieval[M]. Cambridge:Cambridge University Press, 2008.
相关作者
相关机构
京公网安备11010802024621