A fuzzy k-modes algorithm for clustering categorical data

Zhexue Huang*, Michael K. Ng

*Corresponding author for this work

Research output: Contribution to journalJournal articlepeer-review

433 Citations (Scopus)

Abstract

This correspondence describes extensions to the fuzzy k-means algorithm for clustering categorical data. By using a simple matching dissimilarity measure for categorical objects and modes instead of means for clusters, a new approach is developed, which allows the use of the k-means paradigm to efficiently cluster large categorical data sets. A fuzzy k-modes algorithm is presented and the effectiveness of the algorithm is demonstrated with experimental results.

Original languageEnglish
Pages (from-to)446-452
Number of pages7
JournalIEEE Transactions on Fuzzy Systems
Volume7
Issue number4
DOIs
Publication statusPublished - Aug 1999

User-Defined Keywords

  • Categorical data
  • Clustering
  • Data mining
  • Fuzzy partitioning
  • k-means algorithm

Fingerprint

Dive into the research topics of 'A fuzzy k-modes algorithm for clustering categorical data'. Together they form a unique fingerprint.

Cite this