TY - JOUR
T1 - Variable selection in high-dimensional partially linear additive models for composite quantile regression
AU - Guo, Jie
AU - Tang, Man Lai
AU - Tian, Maozai
AU - Zhu, Kai
N1 - Funding Information:
The work was partially supported by National Natural Science Foundation of China (No. 11271368 ), Major Project of Humanities Social Science Foundation of Ministry of Education (No. 08JJD910247 ), Key Project of Chinese Ministry of Education (No. 108120 ), Beijing Planning Office of Philosophy and Social Science (No. 12JGB051 ), Fundamental Research Funds for the Central Universities, and the Research Funds of Renmin University of China (No. 10XNL018 ), (No. 10XNK025 ) and China Statistical Research Project (No. 2011LZ031 ).
PY - 2013/9
Y1 - 2013/9
N2 - A new estimation procedure based on the composite quantile regression is proposed for the semiparametric additive partial linear models, of which the nonparametric components are approximated by polynomial splines. The proposed estimation method can simultaneously estimate both the parametric regression coefficients and nonparametric components without any specification of the error distributions. The proposed estimation method is empirically shown to be much more efficient than the popular least-squares-based estimation method for non-normal random errors, especially for Cauchy error, and almost as efficient for normal random errors. To achieve sparsity in high-dimensional and sparse additive partial linear models, of which the number of linear covariates is much larger than the sample size but that of significant covariates is small relative to the sample size, a variable selection procedure based on adaptive Lasso is proposed to conduct estimation and variable selection simultaneously. The procedure is shown to possess the oracle property, and is much superior to the adaptive Lasso penalized least-squares-based method regardless of the random error distributions. In particular, two kinds of weights in the penalty are considered, namely the composite quantile regression estimates and Lasso penalized composite quantile regression estimates. Both types of weights perform very well with the latter performing especially well in terms of precisely selecting significant variables. The simulation results are consistent with the theoretical properties. A real data example is used to illustrate the application of the proposed methods.
AB - A new estimation procedure based on the composite quantile regression is proposed for the semiparametric additive partial linear models, of which the nonparametric components are approximated by polynomial splines. The proposed estimation method can simultaneously estimate both the parametric regression coefficients and nonparametric components without any specification of the error distributions. The proposed estimation method is empirically shown to be much more efficient than the popular least-squares-based estimation method for non-normal random errors, especially for Cauchy error, and almost as efficient for normal random errors. To achieve sparsity in high-dimensional and sparse additive partial linear models, of which the number of linear covariates is much larger than the sample size but that of significant covariates is small relative to the sample size, a variable selection procedure based on adaptive Lasso is proposed to conduct estimation and variable selection simultaneously. The procedure is shown to possess the oracle property, and is much superior to the adaptive Lasso penalized least-squares-based method regardless of the random error distributions. In particular, two kinds of weights in the penalty are considered, namely the composite quantile regression estimates and Lasso penalized composite quantile regression estimates. Both types of weights perform very well with the latter performing especially well in terms of precisely selecting significant variables. The simulation results are consistent with the theoretical properties. A real data example is used to illustrate the application of the proposed methods.
KW - Adaptive Lasso
KW - Composite quantile regression
KW - High-dimension
KW - Semiparametric additive partial linear model
KW - Spline approximation
KW - Variable selection
UR - http://www.scopus.com/inward/record.url?scp=84885018499&partnerID=8YFLogxK
U2 - 10.1016/j.csda.2013.03.017
DO - 10.1016/j.csda.2013.03.017
M3 - Journal article
AN - SCOPUS:84885018499
SN - 0167-9473
VL - 65
SP - 56
EP - 67
JO - Computational Statistics and Data Analysis
JF - Computational Statistics and Data Analysis
ER -