A new clustering algorithm for genes with multiple cancer diseases by self-consistent field iteration method

Ye Liu*, Michael K. Ng

*Corresponding author for this work

Research output: Contribution to journalJournal articlepeer-review

Abstract

An increasing body of literature shows that predicting gene clusters related to human cancer disease using biological networks is significant in bioinformation, it would help to understand disease mechanisms, and benefit the development of diagnostics and therapeutics. However, due to noise and preprocessing of data, a single network or graph generated from one cancer disease is insufficient to cluster genes. As some cancer diseases are correlated with each other in practice, by integrating several gene expression networks generated from those associated cancer diseases, more accurate and robust partition of genes can be obtained. In this paper, we propose a multiple graph spectral clustering method with graph association, it helps us to discover functional modules in each cancer disease more accurately and comprehensively, meanwhile the degree of association among cancer diseases can also be determined. Our idea is to construct a block adjacency matrix to integrate the adjacency matrix of each graph and the degree of association among multiple graphs together, then spectral clustering would be employed to calculate clusters for each graph. The proposed algorithm is based on a self-consistent field iteration such that both the degree of association and gene clusters can be identified during iterations. Moreover, we establish the condition under which convergence of the proposed algorithm is guaranteed with some assumptions. Experimental results on two datasets of human cancer diseases are presented, which demonstrate that the proposed method can not only identify gene functional modules, but also calculate the degree of association among different cancer diseases accurately.

Original languageEnglish
Article number20
Number of pages16
JournalNetwork Modeling Analysis in Health Informatics and Bioinformatics
Volume11
Issue number1
Early online date7 Apr 2022
DOIs
Publication statusPublished - Dec 2022

Scopus Subject Areas

  • Computer Science (miscellaneous)
  • Biochemistry, Genetics and Molecular Biology (miscellaneous)
  • Computer Science Applications
  • Computer Networks and Communications
  • Urology
  • Computational Mathematics

User-Defined Keywords

  • Gene clusters
  • Gene coexpression data
  • Nonlinear eigenvalue problem
  • Self-consistent field iteration
  • Spectral clustering

Cite this