Abstract
This correspondence describes extensions to the fuzzy k-means algorithm for clustering categorical data. By using a simple matching dissimilarity measure for categorical objects and modes instead of means for clusters, a new approach is developed, which allows the use of the k-means paradigm to efficiently cluster large categorical data sets. A fuzzy k-modes algorithm is presented and the effectiveness of the algorithm is demonstrated with experimental results.
Original language | English |
---|---|
Pages (from-to) | 446-452 |
Number of pages | 7 |
Journal | IEEE Transactions on Fuzzy Systems |
Volume | 7 |
Issue number | 4 |
DOIs | |
Publication status | Published - Aug 1999 |
User-Defined Keywords
- Categorical data
- Clustering
- Data mining
- Fuzzy partitioning
- k-means algorithm