» Articles » PMID: 30640600

Multiple Kernel K-Means with Incomplete Kernels

Overview
Authors
Affiliations
Soon will be listed here.
Abstract

Multiple kernel clustering (MKC) algorithms optimally combine a group of pre-specified base kernel matrices to improve clustering performance. However, existing MKC algorithms cannot efficiently address the situation where some rows and columns of base kernel matrices are absent. This paper proposes two simple yet effective algorithms to address this issue. Different from existing approaches where incomplete kernel matrices are first imputed and a standard MKC algorithm is applied to the imputed kernel matrices, our first algorithm integrates imputation and clustering into a unified learning procedure. Specifically, we perform multiple kernel clustering directly with the presence of incomplete kernel matrices, which are treated as auxiliary variables to be jointly optimized. Our algorithm does not require that there be at least one complete base kernel matrix over all the samples. Also, it adaptively imputes incomplete kernel matrices and combines them to best serve clustering. Moreover, we further improve this algorithm by encouraging these incomplete kernel matrices to mutually complete each other. The three-step iterative algorithm is designed to solve the resultant optimization problems. After that, we theoretically study the generalization bound of the proposed algorithms. Extensive experiments are conducted on 13 benchmark data sets to compare the proposed algorithms with existing imputation-based methods. Our algorithms consistently achieve superior performance and the improvement becomes more significant with increasing missing ratio, verifying the effectiveness and advantages of the proposed joint imputation and clustering.

Citing Articles

Exploring feature sparsity for out-of-distribution detection.

Chen Q, Li K, Chen Z, Maul T, Yin J Sci Rep. 2024; 14(1):28444.

PMID: 39558072 PMC: 11574038. DOI: 10.1038/s41598-024-79934-7.


[An MRI multi-sequence feature imputation and fusion mutual-aid model based on sequence deletion for differentiation of high-grade from low-grade glioma].

Wu C, Zhong W, Xie J, Yang R, Wu Y, Xu Y Nan Fang Yi Ke Da Xue Xue Bao. 2024; 44(8):1561-1570.

PMID: 39276052 PMC: 11378041. DOI: 10.12122/j.issn.1673-4254.2024.08.15.


A simulation study on missing data imputation for dichotomous variables using statistical and machine learning methods.

Ge Y, Li Z, Zhang J Sci Rep. 2023; 13(1):9432.

PMID: 37296269 PMC: 10256703. DOI: 10.1038/s41598-023-36509-2.


Community Detection in Semantic Networks: A Multi-View Approach.

Yang H, Liu Q, Zhang J, Ding X, Chen C, Wang L Entropy (Basel). 2022; 24(8).

PMID: 36010804 PMC: 9407108. DOI: 10.3390/e24081141.


Kernel Probabilistic K-Means Clustering.

Liu B, Zhang T, Li Y, Liu Z, Zhang Z Sensors (Basel). 2021; 21(5).

PMID: 33800353 PMC: 7962817. DOI: 10.3390/s21051892.


References
1.
Xu C, Tao D, Xu C . Multi-View Learning With Incomplete Views. IEEE Trans Image Process. 2015; 24(12):5812-25. DOI: 10.1109/TIP.2015.2490539. View

2.
Zhang C, Fu H, Hu Q, Zhu P, Cao X . Flexible Multi-View Dimensionality Co-Reduction. IEEE Trans Image Process. 2016; 26(2):648-659. DOI: 10.1109/TIP.2016.2627806. View

3.
Liu X, Wang L, Yin J, Zhu E, Zhang J . An Efficient Approach to Integrating Radius Information into Multiple Kernel Learning. IEEE Trans Cybern. 2012; 43(2):557-69. DOI: 10.1109/TSMCB.2012.2212243. View

4.
Cao X, Zhang C, Zhou C, Fu H, Foroosh H . Constrained Multi-View Video Face Clustering. IEEE Trans Image Process. 2015; 24(11):4381-93. DOI: 10.1109/TIP.2015.2463223. View

5.
Yu S, Tranchevent L, Liu X, Glanzel W, Suykens J, De Moor B . Optimized data fusion for kernel k-means clustering. IEEE Trans Pattern Anal Mach Intell. 2012; 34(5):1031-9. DOI: 10.1109/TPAMI.2011.255. View