Robust Qualitative Data Clustering via Learnable Multi-Metric Space Fusion

Sen Feng, Mingjie Zhao, Zhanpei Huang, Yuzhu Ji, Yiqun Zhang*, Yiu-Ming Cheung

*Corresponding author for this work

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

Abstract

Understanding categorical data with vague qualitative values by forming clusters is crucial in many data-driven AI fields. Compared with numerical data with its quantitative values embedded in well-defined Euclidean distance space, distances of the qualitative values are naturally unknown and are specially defined for certain data types or tasks. This paper, therefore, proposes a distance metric space fusion framework, which learns to fuse multiple distance metrics to form a statistical information-complete and prior knowledge-comprehensive metric for robust and accurate cluster analysis of qualitative data. To better serve various clustering tasks, the metric fusion objective is incorporated into the clustering objective through iterative learning. It turns out that the proposed method stably demonstrates superiority on various challenging real benchmark datasets. Extensive experiments including significance tests, ablation studies, etc. validate its efficacy. Source code of the proposed method is available at https://github.com/Sen-Feng/ICASSP-MSF/tree/main/CODE.
Original languageEnglish
Title of host publicationProceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
EditorsBhaskar D Rao, Isabel Trancoso, Gaurav Sharma, Neelesh B. Mehta
PublisherIEEE
Number of pages5
ISBN (Electronic)9798350368741
ISBN (Print)9798350368758
DOIs
Publication statusPublished - 6 Apr 2025
Event2025 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2025 - Hyderabad, India
Duration: 6 Apr 202511 Apr 2025
https://ieeexplore.ieee.org/xpl/conhome/10887540/proceeding

Publication series

NameProceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
PublisherIEEE

Conference

Conference2025 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2025
Country/TerritoryIndia
CityHyderabad
Period6/04/2511/04/25
Internet address

User-Defined Keywords

  • Cluster analysis
  • categorical data
  • unsupervised learning
  • metric space learning
  • robust and accurate clustering

Fingerprint

Dive into the research topics of 'Robust Qualitative Data Clustering via Learnable Multi-Metric Space Fusion'. Together they form a unique fingerprint.

Cite this