TY - JOUR
T1 - Semi-supervised maximum margin clustering with pairwise constraints
AU - Zeng, Hong
AU - Cheung, Yiu Ming
N1 - Funding Information:
The work described in this paper was supported by the Research Grant Council of Hong Kong SAR under Project HKBU 210306 & HKBU 210309, the Faculty Research Grant of Hong Kong Baptist University with the Project Code: FRG2/08-09/122, the National Natural Science Foundation of China (61105048, 61104206), the Doctoral Fund of Ministry of Education of China (20100092120012, 20110092120034) and by the Open Fund of Jiangsu Province Key Laboratory for Remote Measuring and Control (YCCK201005). Yiu-Ming Cheung is the corresponding author for this paper.
PY - 2012/5
Y1 - 2012/5
N2 - The pairwise constraints specifying whether a pair of samples should be grouped together or not have been successfully incorporated into the conventional clustering methods such as k-means and spectral clustering for the performance enhancement. Nevertheless, the issue of pairwise constraints has not been well studied in the recently proposed maximum margin clustering (MMC), which extends the maximum margin framework in supervised learning for clustering and often shows a promising performance. This paper therefore proposes a pairwise constrained MMC algorithm. Based on the maximum margin idea in MMC, we propose a set of effective loss functions for discouraging the violation of given pairwise constraints. For the resulting optimization problem, we show that the original nonconvex problem in our approach can be decomposed into a sequence of convex quadratic program problems via constrained concave-convex procedure (CCCP). Subsequently, we present an efficient subgradient projection optimization method to solve each convex problem in the CCCP sequence. Experiments on a number of real-world data sets show that the proposed constrained MMC algorithm is scalable and outperforms the existing constrained MMC approach as well as the typical semi-supervised clustering counterparts.
AB - The pairwise constraints specifying whether a pair of samples should be grouped together or not have been successfully incorporated into the conventional clustering methods such as k-means and spectral clustering for the performance enhancement. Nevertheless, the issue of pairwise constraints has not been well studied in the recently proposed maximum margin clustering (MMC), which extends the maximum margin framework in supervised learning for clustering and often shows a promising performance. This paper therefore proposes a pairwise constrained MMC algorithm. Based on the maximum margin idea in MMC, we propose a set of effective loss functions for discouraging the violation of given pairwise constraints. For the resulting optimization problem, we show that the original nonconvex problem in our approach can be decomposed into a sequence of convex quadratic program problems via constrained concave-convex procedure (CCCP). Subsequently, we present an efficient subgradient projection optimization method to solve each convex problem in the CCCP sequence. Experiments on a number of real-world data sets show that the proposed constrained MMC algorithm is scalable and outperforms the existing constrained MMC approach as well as the typical semi-supervised clustering counterparts.
KW - constrained concave-convex procedure.
KW - maximum margin clustering
KW - pairwise constraints
KW - Semi-supervised clustering
UR - http://www.scopus.com/inward/record.url?scp=84859732263&partnerID=8YFLogxK
U2 - 10.1109/TKDE.2011.68
DO - 10.1109/TKDE.2011.68
M3 - Journal article
AN - SCOPUS:84859732263
SN - 1041-4347
VL - 24
SP - 926
EP - 939
JO - IEEE Transactions on Knowledge and Data Engineering
JF - IEEE Transactions on Knowledge and Data Engineering
IS - 5
M1 - 5728817
ER -