Abstract
Data summarization has wide applications in the real world, e.g., attribute filtering, image set labeling, and personalized recommendation. In this work, we study a new problem, HSD, which summarizes a dataset using k concepts in a hierarchical taxonomy. Unlike existing work on whole-hierarchy summarization, we focus on accurate coverage of a given query set Q: the objective is to cover more items in Q and fewer items not in Q. To tackle it, we first propose a dynamic-programming-based algorithm for tree hierarchies, which are a simple instance of the HSD problem. We then propose a heuristic for HDAGs that assigns each vertex to one of its in-neighbors and applies the tree algorithm to the resulting tree. The experimental results confirm the quality of our methods on both tree and HDAG datasets.
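The tree case can be illustrated with a minimal sketch. The abstract does not state the exact objective or recurrence, so everything below is an assumption made for illustration: leaves are items, a concept covers all leaves in its subtree, a concept's score is (covered items in Q) minus (covered items not in Q), and chosen concepts may not be ancestors of one another. The function name `summarize_tree` and the toy taxonomy are hypothetical and are not taken from the paper.

```python
def summarize_tree(children, root, query, k):
    """Pick at most k non-nested concepts maximizing
    (#covered items in query) - (#covered items not in query).

    Assumed objective, for illustration only; not the paper's definition.
    Returns (best_score, list_of_chosen_concepts).
    """

    def solve(v):
        # table[j] = (best score, chosen concepts) using at most j concepts
        # inside the subtree rooted at v.
        table = [(0, [])] * (k + 1)
        kids = children.get(v, [])
        if not kids:
            # A leaf is a single item: it is either in the query set or not.
            pos, neg = (1, 0) if v in query else (0, 1)
        else:
            pos = neg = 0
            for c in kids:
                t_c, p_c, n_c = solve(c)
                pos += p_c
                neg += n_c
                # Knapsack-style merge: split the budget j between the
                # children processed so far and the new child c.
                merged = []
                for j in range(k + 1):
                    candidates = (
                        (table[j - i][0] + t_c[i][0], table[j - i][1] + t_c[i][1])
                        for i in range(j + 1)
                    )
                    merged.append(max(candidates, key=lambda x: x[0]))
                table = merged
        # Alternative: spend one concept on v itself, covering its whole subtree.
        own = pos - neg
        table = [table[0]] + [
            (own, [v]) if own > score else (score, chosen)
            for score, chosen in table[1:]
        ]
        return table, pos, neg

    table, _, _ = solve(root)
    return table[k]


# Hypothetical toy taxonomy: two concepts, five items.
children = {"root": ["A", "B"], "A": ["a1", "a2", "a3"], "B": ["b1", "b2"]}
query = {"a1", "a2", "a3", "b1"}
print(summarize_tree(children, "root", query, 2))
```

With a budget of two concepts, this sketch prefers the concept `A` plus the single item `b1` over the root, because the root would also cover `b2`, which is not in Q. The merge step costs O(k^2) per tree edge, so the whole sketch runs in O(n k^2) time for n vertices.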
Original language | English |
---|---|
Title of host publication | SIGMOD '21: Proceedings of the 2021 International Conference on Management of Data |
Publisher | Association for Computing Machinery (ACM) |
Pages | 2914–2916 |
Number of pages | 3 |
ISBN (Print) | 9781450383431 |
Publication status | Published - 18 Jun 2021 |
Event | ACM SIGMOD International Conference on Management of Data, SIGMOD 2021 - Virtual, Online, China. Duration: 20 Jun 2021 → 25 Jun 2021. https://dl.acm.org/doi/proceedings/10.1145/3448016 |
Conference
Conference | ACM SIGMOD International Conference on Management of Data, SIGMOD 2021 |
---|---|
Country/Territory | China |
Period | 20/06/21 → 25/06/21 |