Abstract
This correspondence describes extensions to the fuzzy k-means algorithm for clustering categorical data. By using a simple matching dissimilarity measure for categorical objects and modes instead of means for clusters, a new approach is developed, which allows the use of the k-means paradigm to efficiently cluster large categorical data sets. A fuzzy k-modes algorithm is presented and the effectiveness of the algorithm is demonstrated with experimental results.
| Original language | English |
|---|---|
| Pages (from-to) | 446-452 |
| Number of pages | 7 |
| Journal | IEEE Transactions on Fuzzy Systems |
| Volume | 7 |
| Issue number | 4 |
| DOIs | |
| Publication status | Published - Aug 1999 |
User-Defined Keywords
- Categorical data
- Clustering
- Data mining
- Fuzzy partitioning
- k-means algorithm