GD-RDA: A New Regularized Discriminant Analysis for High-Dimensional Data

Yan Zhou, Baoxue Zhang, Gaorong Li, Tiejun TONG, Xiang WAN*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

5 Citations (Scopus)

Abstract

High-throughput techniques bring novel tools and also statistical challenges to genomic research. Identification of which type of diseases a new patient belongs to has been recognized as an important problem. For high-dimensional small sample size data, the classical discriminant methods suffer from the singularity problem and are, therefore, no longer applicable in practice. In this article, we propose a geometric diagonalization method for the regularized discriminant analysis. We then consider a bias correction to further improve the proposed method. Simulation studies show that the proposed method performs better than, or at least as well as, the existing methods in a wide range of settings. A microarray dataset and an RNA-seq dataset are also analyzed and they demonstrate the superiority of the proposed method over the existing competitors, especially when the number of samples is small or the number of genes is large. Finally, we have developed an R package called "GDRDA" which is available upon request.

Original languageEnglish
Pages (from-to)1099-1111
Number of pages13
JournalJournal of Computational Biology
Volume24
Issue number11
DOIs
Publication statusPublished - Nov 2017

Scopus Subject Areas

  • Modelling and Simulation
  • Molecular Biology
  • Genetics
  • Computational Mathematics
  • Computational Theory and Mathematics

User-Defined Keywords

  • bias correction
  • classification
  • diagonalization
  • discriminant
  • geometric
  • microarray
  • RNA-seq

Fingerprint

Dive into the research topics of 'GD-RDA: A New Regularized Discriminant Analysis for High-Dimensional Data'. Together they form a unique fingerprint.

Cite this