On the impact of dissimilarity measure in κ-modes clustering algorithm

  • Michael K. Ng*
  • Mark Junjie Li
  • Joshua Zhexue Huang
  • Zengyou He

*Corresponding author for this work

Research output: Contribution to journal (journal article, peer-reviewed)

194 Citations (Scopus)

Abstract

This correspondence describes extensions to the k-modes algorithm for clustering categorical data. By modifying a simple matching dissimilarity measure for categorical objects, a heuristic approach was developed in [4], [12] which allows the use of the k-modes paradigm to obtain clusters with strong intra-similarity and to efficiently cluster large categorical data sets. The main aim of this paper is to rigorously derive the updating formula of the k-modes clustering algorithm with the new dissimilarity measure and to establish the convergence of the algorithm under the optimization framework.

Original language: English
Pages (from-to): 503-507
Number of pages: 5
Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence
Volume: 29
Issue number: 3
Publication status: Published - Mar 2007

User-Defined Keywords

  • κ-modes algorithm
  • Categorical data
  • Clustering
  • Data mining
