Prediction of Total Soluble Solids in Apricot Using Adaptive Boosting Ensemble Model Combined with NIR and High-Frequency UVE-Selected Variables

Feng Gao, Yage Xing, Jialong Li, Lin Guo*, Yiye Sun, Wen Shi, Leiming Yuan*

*Corresponding author for this work

Research output: Contribution to journalJournal articlepeer-review

Abstract

Total soluble solids (TSSs) serve as a crucial maturity indicator and quality determinant in apricots, influencing harvest timing and postharvest management decisions. This study develops an advanced framework integrating adaptive boosting (Adaboost) ensemble learning with high-frequency spectral variables selected by uninformative variable elimination (UVE) for the rapid non-destructive detection of fruit quality. Near-infrared (NIR) spectra (1000~2500 nm) were acquired and then preprocessed through robust principal component analysis (ROBPCA) for outlier detection combined with z-score normalization for spectral pretreatment. Subsequent data processes included three steps: (1) 100 continuous runs of UVE identified characteristic wavelengths, which were classified into three levels—high-frequency (≥90 times), medium-frequency (30–90 times), and low-frequency (≤30 times) subsets; (2) the development of the base optimal partial least squares regression (PLSR) models for each wavelength subset; and (3) the execution of adaptive weight optimization through the Adaboost ensemble algorithm. The experimental findings revealed the following: (1) The model established based on high-frequency wavelengths outperformed both full-spectrum model and full-characteristic wavelength model. (2) The optimized UVE-PLS-Adaboost model achieved the peak performance (R = 0.889, RMSEP = 1.267, MAE = 0.994). This research shows that the UVE-Adaboost fusion method enhances model prediction accuracy and generalization ability through multi-dimensional feature optimization and model weight allocation. The proposed framework enables the rapid, non-destructive detection of apricot TSSs and provides a reference for the quality evaluation of other fruits in agricultural applications.

Original languageEnglish
Article number1543
Number of pages12
JournalMolecules
Volume30
Issue number7
Early online date30 Mar 2025
DOIs
Publication statusPublished - 1 Apr 2025

User-Defined Keywords

  • apricot
  • ensemble learning
  • feature selection optimization
  • spectral analysis
  • total soluble solids

Fingerprint

Dive into the research topics of 'Prediction of Total Soluble Solids in Apricot Using Adaptive Boosting Ensemble Model Combined with NIR and High-Frequency UVE-Selected Variables'. Together they form a unique fingerprint.

Cite this