Robust Clustering by Pruning Outliers

Jiang She Zhang*, Yiu Wing LEUNG

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

32 Citations (Scopus)

Abstract

In many applications of C-means clustering, the given data set often contains noisy points. These noisy points will affect the resulting clusters, especially if they are far away from the data points. In this paper, we develop a pruning approach for robust C-means clustering. This approach identifies and prunes the outliers based on the sizes and shapes of the clusters so that the resulting clusters are least affected by the outliers. The pruning approach is general, and it can improve the robustness of many existing C-means clustering methods. In particular, we apply the pruning approach to improve the robustness of hard C-means clustering, fuzzy C-means clustering, and deterministic-annealing C-means clustering. As a result, we obtain three clustering algorithms that are the robust versions of the existing ones. In addition, we integrate the pruning approach with the fuzzy approach and the possibilistic approach to design two new algorithms for robust C-means clustering. The numerical results demonstrate that the pruning approach can achieve good robustness.

Original languageEnglish
Pages (from-to)983-999
Number of pages17
JournalIEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Volume33
Issue number6
DOIs
Publication statusPublished - Dec 2003

Scopus Subject Areas

  • Control and Systems Engineering
  • Software
  • Information Systems
  • Human-Computer Interaction
  • Computer Science Applications
  • Electrical and Electronic Engineering

User-Defined Keywords

  • Deterministic annealing
  • Fuzzy
  • Possibility theory
  • Robust clustering

Fingerprint

Dive into the research topics of 'Robust Clustering by Pruning Outliers'. Together they form a unique fingerprint.

Cite this