Abstract
In this paper, we investigate early stopping strategies for iterative regularization based on gradient descent of convex loss functions in reproducing kernel Hilbert spaces, without an explicit regularization term. We show that projecting the last iterate at the stopping time produces an estimator with improved generalization ability. Using upper bounds on the generalization error, we establish a close link between iterative regularization and the Tikhonov regularization scheme, and explain theoretically why the two schemes exhibit similar regularization paths in existing numerical simulations. We introduce a data-dependent, cross-validation-based method for selecting the stopping time, and prove that this a posteriori selection achieves generalization errors comparable to those obtained by our stopping rules with a priori parameters.
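As a rough illustration of the scheme the abstract describes (not the paper's implementation), the following Python sketch runs gradient descent on the empirical squared loss for a kernel regression problem with no explicit penalty, and selects the stopping time by hold-out cross-validation. The RBF kernel, step size, data, and the omission of the paper's projection step are all simplifying assumptions.

```python
# A minimal sketch, assuming an RBF kernel and the squared loss; the paper's
# projection of the last iterate is omitted for brevity.
import numpy as np

def rbf_kernel(A, B, gamma=1.0):
    # Gaussian (RBF) kernel matrix between the rows of A and the rows of B.
    sq = np.sum(A**2, axis=1)[:, None] + np.sum(B**2, axis=1)[None, :] - 2.0 * A @ B.T
    return np.exp(-gamma * sq)

def kernel_gd_path(K, y, step, n_iters):
    # Gradient descent on the empirical squared loss in the RKHS,
    # parameterized by coefficients a (f = sum_i a_i k(x_i, .)).
    # No explicit regularization term; the iteration count regularizes.
    n = len(y)
    a = np.zeros(n)
    path = []
    for _ in range(n_iters):
        a = a - step * (K @ a - y) / n  # one gradient step
        path.append(a.copy())
    return path

# Synthetic data for illustration.
rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(200, 1))
y = np.sin(3 * X[:, 0]) + 0.3 * rng.standard_normal(200)
tr, va = np.arange(150), np.arange(150, 200)

K_tr = rbf_kernel(X[tr], X[tr])
K_va = rbf_kernel(X[va], X[tr])
path = kernel_gd_path(K_tr, y[tr], step=1.0, n_iters=500)

# A posteriori stopping rule: pick the iterate with the smallest
# hold-out error along the regularization path.
val_err = [np.mean((K_va @ a - y[va]) ** 2) for a in path]
t_hat = int(np.argmin(val_err))
print(f"selected stopping time t = {t_hat + 1}, hold-out MSE = {val_err[t_hat]:.4f}")
```

Stopping earlier plays the role that a larger Tikhonov penalty would play: sweeping the iteration count traces out a regularization path analogous to sweeping the penalty parameter, which is the correspondence the paper quantifies.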
Original language | English |
---|---|
Number of pages | 36 |
Journal | Journal of Machine Learning Research |
Volume | 23 |
Publication status | Published - Aug 2022 |
User-Defined Keywords
- iterative regularization
- early stopping
- reproducing kernel Hilbert spaces
- stopping rule
- cross-validation