TY - JOUR
T1 - Two-stage iteratively reweighted smoothing splines for baseline correction
AU - Wei, Jiajin
AU - Zhu, Chen
AU - Zhang, Zhi-Min
AU - He, Ping
N1 - Copyright:
© 2022 Elsevier B.V.
Funding Information:
Ping He's research was supported by the Internal Research Grant ( R202010 ) of BNU-HKBU United International College , and the National Natural Science Foundation of China ( 62076029 ).
PY - 2022/8/15
Y1 - 2022/8/15
N2 - This paper reviewed several iteratively reweighted baseline correction methods. We note in the literature that the estimated baselines are susceptible to random noises in a low signal-to-noise signal. When the acquired signals are complex-structured, the estimated baselines may still contain the peak information. This paper proposes a new approach named two-stage iteratively reweighted smoothing splines (RWSS) to cope with those situations. The proposed method estimates the baselines by applying weighted smoothing splines in two stages. The first stage applies the smoothing splines with Tukey's Bisquare weights to estimate the baselines, while the second stage is designed to fine-tune the first stage's result. Specifically, the weighted smoothing splines are applied again to remove the remaining peak information, where the weights for the peak regions are inversely proportional to the error variances. By simulation studies, the performance of the two-stage RWSS algorithm is among the best in terms of the root mean square error. Finally, we conducted three real data studies, i.e., chromatograms, infrared spectra, and Raman spectra, to verify the reliability of our new algorithm in practical tasks by evaluating principal components and classification accuracy. The new algorithm is implemented in R language, where the source code is available at https://github.com/rwss2021/rwss.
AB - This paper reviewed several iteratively reweighted baseline correction methods. We note in the literature that the estimated baselines are susceptible to random noises in a low signal-to-noise signal. When the acquired signals are complex-structured, the estimated baselines may still contain the peak information. This paper proposes a new approach named two-stage iteratively reweighted smoothing splines (RWSS) to cope with those situations. The proposed method estimates the baselines by applying weighted smoothing splines in two stages. The first stage applies the smoothing splines with Tukey's Bisquare weights to estimate the baselines, while the second stage is designed to fine-tune the first stage's result. Specifically, the weighted smoothing splines are applied again to remove the remaining peak information, where the weights for the peak regions are inversely proportional to the error variances. By simulation studies, the performance of the two-stage RWSS algorithm is among the best in terms of the root mean square error. Finally, we conducted three real data studies, i.e., chromatograms, infrared spectra, and Raman spectra, to verify the reliability of our new algorithm in practical tasks by evaluating principal components and classification accuracy. The new algorithm is implemented in R language, where the source code is available at https://github.com/rwss2021/rwss.
KW - Baseline correction
KW - Chromatograms
KW - Raman spectra
KW - Robust weight
KW - Smoothing spline
UR - https://www.scopus.com/record/display.uri?eid=2-s2.0-85133617618&origin=inward
U2 - 10.1016/j.chemolab.2022.104606
DO - 10.1016/j.chemolab.2022.104606
M3 - Journal article
SN - 0169-7439
VL - 227
JO - Chemometrics and Intelligent Laboratory Systems
JF - Chemometrics and Intelligent Laboratory Systems
M1 - 104606
ER -