Het2Hom: Representation of Heterogeneous Attributes into Homogeneous Concept Spaces for Categorical-and-Numerical-Attribute Data Clustering

Yiqun Zhang, Yiu-ming Cheung*, An Zeng

*Corresponding author for this work

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

4 Citations (Scopus)

Abstract

Data sets composed of a mixture of categorical and numerical attributes (also called mixed data hereinafter) are common in real-world cluster analysis. However, insightful analysis of such data under an unsupervised scenario using clustering is extremely challenging because the information provided by the two different types of attributes is heterogeneous, being at different concept hierarchies. That is, the values of a categorical attribute represent a set of different concepts (e.g., professor, lawyer, and doctor of the attribute "occupation"), while the values of a numerical attribute describe the tendencies toward two different concepts (e.g., low and high of the attribute "income"). To appropriately use such heterogeneous information in clustering, this paper therefore proposes a novel attribute representation learning method called Het2Hom, which first converts the heterogeneous attributes into a homogeneous form, and then learns attribute representations and data partitions on such a homogeneous basis. Het2Hom features low time complexity and intuitive interpretability. Extensive experiments show that Het2Hom outperforms the state-of-the-art counterparts.
Original languageEnglish
Title of host publicationProceedings of the 31st International Joint Conference on Artificial Intelligence, IJCAI 2022
EditorsLuc De Raedt
PublisherInternational Joint Conferences on Artificial Intelligence
Pages3758-3765
Number of pages8
ISBN (Electronic)9781956792003
DOIs
Publication statusPublished - 23 Jul 2022
Event31th International Joint Conference on Artificial Intelligence, IJCAI 2022 - Messe Wien, Vienna, Austria
Duration: 23 Jul 202229 Jul 2022
https://ijcai-22.org/
https://www.ijcai.org/proceedings/2022/

Publication series

NameIJCAI International Joint Conference on Artificial Intelligence
ISSN (Print)1045-0823

Conference

Conference31th International Joint Conference on Artificial Intelligence, IJCAI 2022
Country/TerritoryAustria
CityMesse Wien, Vienna
Period23/07/2229/07/22
Internet address

Fingerprint

Dive into the research topics of 'Het2Hom: Representation of Heterogeneous Attributes into Homogeneous Concept Spaces for Categorical-and-Numerical-Attribute Data Clustering'. Together they form a unique fingerprint.

Cite this