Abstract
An increasing body of literature shows that predicting gene clusters related to human cancer disease using biological networks is significant in bioinformation, it would help to understand disease mechanisms, and benefit the development of diagnostics and therapeutics. However, due to noise and preprocessing of data, a single network or graph generated from one cancer disease is insufficient to cluster genes. As some cancer diseases are correlated with each other in practice, by integrating several gene expression networks generated from those associated cancer diseases, more accurate and robust partition of genes can be obtained. In this paper, we propose a multiple graph spectral clustering method with graph association, it helps us to discover functional modules in each cancer disease more accurately and comprehensively, meanwhile the degree of association among cancer diseases can also be determined. Our idea is to construct a block adjacency matrix to integrate the adjacency matrix of each graph and the degree of association among multiple graphs together, then spectral clustering would be employed to calculate clusters for each graph. The proposed algorithm is based on a self-consistent field iteration such that both the degree of association and gene clusters can be identified during iterations. Moreover, we establish the condition under which convergence of the proposed algorithm is guaranteed with some assumptions. Experimental results on two datasets of human cancer diseases are presented, which demonstrate that the proposed method can not only identify gene functional modules, but also calculate the degree of association among different cancer diseases accurately.
Original language | English |
---|---|
Article number | 20 |
Number of pages | 16 |
Journal | Network Modeling Analysis in Health Informatics and Bioinformatics |
Volume | 11 |
Issue number | 1 |
Early online date | 7 Apr 2022 |
DOIs | |
Publication status | Published - Dec 2022 |
Scopus Subject Areas
- Computer Science (miscellaneous)
- Biochemistry, Genetics and Molecular Biology (miscellaneous)
- Computer Science Applications
- Computer Networks and Communications
- Urology
- Computational Mathematics
User-Defined Keywords
- Gene clusters
- Gene coexpression data
- Nonlinear eigenvalue problem
- Self-consistent field iteration
- Spectral clustering