TY - JOUR
T1 - Hierarchical Information-Theoretic Co-Clustering for high dimensional data
AU - Wang, Yuanyuan
AU - Ye, Yunming
AU - Li, Xutao
AU - Ng, Kwok Po
AU - Huang, Joshua
N1 - Copyright:
Copyright 2011 Elsevier B.V., All rights reserved.
PY - 2011/1
Y1 - 2011/1
N2 - Hierarchical clustering is an important technique for hierarchical data exploration applications. However, most existing hierarchial methods are based on traditional one-side clustering, which is not effective for handling high dimensional data. In this paper, we develop a partitional hierarchical co-clustering framework and propose a Hierarchical Information-Theoretical Co-Clustering (HITCC) algorithm. The algorithm conducts a series of binary partitions of objects on a data set via the Information- Theoretical Co-Clustering (ITCC) procedure, and generates a hierarchical management of object clusters. Due to simultaneously clustering of features and objects in the process of building a cluster tree, the HITCC algorithm can identify subspace clusters at different-level abstractions and acquire good clustering hierarchies. Compared with the flat ITCC algorithm and six state-of-the-art hierarchical clustering algorithms on various data sets, the new algorithm demonstrated much better performance. ICIC International
AB - Hierarchical clustering is an important technique for hierarchical data exploration applications. However, most existing hierarchial methods are based on traditional one-side clustering, which is not effective for handling high dimensional data. In this paper, we develop a partitional hierarchical co-clustering framework and propose a Hierarchical Information-Theoretical Co-Clustering (HITCC) algorithm. The algorithm conducts a series of binary partitions of objects on a data set via the Information- Theoretical Co-Clustering (ITCC) procedure, and generates a hierarchical management of object clusters. Due to simultaneously clustering of features and objects in the process of building a cluster tree, the HITCC algorithm can identify subspace clusters at different-level abstractions and acquire good clustering hierarchies. Compared with the flat ITCC algorithm and six state-of-the-art hierarchical clustering algorithms on various data sets, the new algorithm demonstrated much better performance. ICIC International
KW - Co-clustering
KW - Hierarchical clustering
KW - Text clustering
UR - http://www.ijicic.org/contents.htm
UR - http://www.scopus.com/inward/record.url?scp=78650921557&partnerID=8YFLogxK
M3 - Journal article
AN - SCOPUS:78650921557
SN - 1349-4198
VL - 7
SP - 487
EP - 500
JO - International Journal of Innovative Computing, Information and Control
JF - International Journal of Innovative Computing, Information and Control
IS - 1
ER -