Regularized t distribution: definition, properties and applications

Zongliang Hu, Yiping Yang*, Gaorong Li, Tiejun Tong*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

For gene expression data analysis, an important task is to identify genes that are differentially expressed between two or more groups. Nevertheless, as biological experiments are often measured with a relatively small number of samples, how to accurately estimate the variances of gene expression becomes a challenging issue. To tackle this problem, we introduce a regularized t distribution and derive its statistical properties including the probability density function and the moment generating function. The noncentral regularized t distribution is also introduced for computing the statistical power of hypothesis testing. For practical applications, we apply the regularized t distribution to establish the null distribution of the regularized t statistic, and then formulate it as a regularized t-test for detecting the differentially expressed genes. Simulation studies and real data analysis show that our regularized t-test performs much better than the Bayesian t-test in the “limma” package, in particular when the sample sizes are small.
Original languageEnglish
Pages (from-to)1-29
Number of pages29
JournalScandinavian Journal of Statistics
DOIs
Publication statusE-pub ahead of print - 14 Apr 2023

Scopus Subject Areas

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

User-Defined Keywords

  • Bayesian t-test
  • hypothesis testing
  • noncentral regularized t distribution
  • regularized t distribution
  • regularized t-test

Fingerprint

Dive into the research topics of 'Regularized t distribution: definition, properties and applications'. Together they form a unique fingerprint.

Cite this