A Simple Two-Sample Test in High Dimensions Based on L2-Norm

Jin Ting Zhang, Jia Guo, Bu Zhou, Ming Yen Cheng*

*Corresponding author for this work

Research output: Contribution to journalJournal articlepeer-review

41 Citations (Scopus)

Abstract

Testing the equality of two means is a fundamental inference problem. For high-dimensional data, the Hotelling’s T2-test either performs poorly or becomes inapplicable. Several modifications have been proposed to address this issue. However, most of them are based on asymptotic normality of the null distributions of their test statistics which inevitably requires strong assumptions on the covariance. We study this problem thoroughly and propose an L2-norm based test that works under mild conditions and even when there are fewer observations than the dimension. Specially, to cope with general nonnormality of the null distribution we employ the Welch–Satterthwaite χ2-approximation. We derive a sharp upper bound on the approximation error and use it to justify that χ2-approximation is preferred to normal approximation. Simple ratio-consistent estimators for the parameters in the χ2-approximation are given. Importantly, our test can cope with singularity or near singularity of the covariance which is commonly seen in high dimensions and is the main cause of nonnormality. The power of the proposed test is also investigated. Extensive simulation studies and an application show that our test is at least comparable to and often outperforms several competitors in terms of size control, and the powers are comparable when their sizes are. Supplementary materials for this article are available online.

Original languageEnglish
Pages (from-to)1011-1027
Number of pages17
JournalJournal of the American Statistical Association
Volume115
Issue number530
DOIs
Publication statusPublished - 2 Apr 2020

Scopus Subject Areas

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

User-Defined Keywords

  • High-dimensional data
  • Hotelling’s T2-test
  • Welch–Satterthwaite χ2-approximation
  • χ2-type mixtures

Fingerprint

Dive into the research topics of 'A Simple Two-Sample Test in High Dimensions Based on L2-Norm'. Together they form a unique fingerprint.

Cite this