SDENK: Unbiased Subspace Density-k-Clustering

Rong Zou, Yunfan Zhang, Mingjie Zhao, Zexi Tan, Yiqun Zhang*, Yiu-ming Cheung*

*Corresponding author for this work

Research output: Contribution to journalJournal articlepeer-review

Abstract

Clustering is one of the most important data analysis techniques, as it extracts knowledge without requiring data labels, making it crucial in many unsupervised application scenarios. However, conventional -clustering struggles to detect irregularly distributed clusters owing to its inherent preference for convex clusters, while density-based clustering often lacks the ability to customise an appropriate metric space for different clustering tasks owing to the lack of a task-oriented optimisation process. To simultaneously address these biases in cluster shape and metric space, this paper proposes an unbiased hybrid framework to perform density-based clustering in subspaces. These subspaces are constructed through a newly developed attribute-weighted -clustering paradigm. To exploit the irregularly distributed clusters obtained via density-based clustering for subspace learning, a novel strategy is designed to subdivide the clusters into compact sub-clusters, which are more suitable for evaluating attribute importance through -clustering. As a result, the proposed subspace density -clustering algorithm inherits the shape flexibility of density-based clustering and the metric adaptiveness of -clustering. Moreover, the learnable design enables mutual optimisation between density clusters and subspaces, yielding robust and superior clustering performance across various datasets. Comprehensive evaluations, including comparative clustering performance evaluation, ablation studies, significance tests, noise-robustness evaluation, and hyper-parameter sensitivity studies, are conducted. Among 10 compared methods, SDENK achieves an average rank of 1.75 on 12 datasets in terms of the clustering accuracy metric ARI.
Original languageEnglish
Article number131225
Number of pages12
JournalNeurocomputing
Volume653
Early online date8 Aug 2025
DOIs
Publication statusPublished - 7 Nov 2025

User-Defined Keywords

  • Subspace clustering
  • Attributes weighting
  • Density-based clustering
  • Unsupervised learning

Fingerprint

Dive into the research topics of 'SDENK: Unbiased Subspace Density-k-Clustering'. Together they form a unique fingerprint.

Cite this