TY - JOUR
T1 - Shrinkage-based diagonal Hotelling's tests for high-dimensional small sample size data
AU - Dong, Kai
AU - Pang, Herbert
AU - Tong, Tiejun
AU - Genton, Marc G.
N1 - Funding Information:
Tiejun Tong’s research was supported in part by Hong Kong Research grant HKBU202711 and Hong Kong Baptist University FRG grants FRG1/10-11/031 , FRG2/13-14/062 , and FRG1/14-15/044 . The authors thank the editor, the associate editor and the referees for their constructive comments that led to a substantial improvement of the paper.
PY - 2016/1/1
Y1 - 2016/1/1
N2 - High-throughput expression profiling techniques bring novel tools and also statistical challenges to genetic research. In addition to detecting differentially expressed genes, testing the significance of gene sets or pathway analysis has been recognized as an equally important problem. Owing to the "large p small n" paradigm, the traditional Hotelling's T2 test suffers from the singularity problem and therefore is not valid in this setting. In this paper, we propose a shrinkage-based diagonal Hotelling's test for both one-sample and two-sample cases. We also suggest several different ways to derive the approximate null distribution under different scenarios of p and n for our proposed shrinkage-based test. Simulation studies show that the proposed method performs comparably to existing competitors when n is moderate or large, but it is better when n is small. In addition, we analyze four gene expression data sets and they demonstrate the advantage of our proposed shrinkage-based diagonal Hotelling's test.
AB - High-throughput expression profiling techniques bring novel tools and also statistical challenges to genetic research. In addition to detecting differentially expressed genes, testing the significance of gene sets or pathway analysis has been recognized as an equally important problem. Owing to the "large p small n" paradigm, the traditional Hotelling's T2 test suffers from the singularity problem and therefore is not valid in this setting. In this paper, we propose a shrinkage-based diagonal Hotelling's test for both one-sample and two-sample cases. We also suggest several different ways to derive the approximate null distribution under different scenarios of p and n for our proposed shrinkage-based test. Simulation studies show that the proposed method performs comparably to existing competitors when n is moderate or large, but it is better when n is small. In addition, we analyze four gene expression data sets and they demonstrate the advantage of our proposed shrinkage-based diagonal Hotelling's test.
KW - Diagonal Hotelling's test
KW - High-dimensional data
KW - Microarray data
KW - Null distribution
KW - Optimal variance estimation
UR - http://www.scopus.com/inward/record.url?scp=84943140441&partnerID=8YFLogxK
U2 - 10.1016/j.jmva.2015.08.022
DO - 10.1016/j.jmva.2015.08.022
M3 - Journal article
AN - SCOPUS:84943140441
SN - 0047-259X
VL - 143
SP - 127
EP - 142
JO - Journal of Multivariate Analysis
JF - Journal of Multivariate Analysis
ER -