TY - JOUR
T1 - Online outcome weighted learning with general loss functions
AU - Yang, Aoli
AU - Fan, Jun
AU - Xiang, Dao-Hong
N1 - The work by J. Fan is partially supported by the Research Grants Council of Hong Kong [Project No. HKBU 12302923 and 12303220], Guangdong Basic and Applied Basic Research Fund [Project No. 2024A1515011878], and Hong Kong Baptist University [Project No. RC-FNRAIG/22-23/SCI/02]. The work by D. H. Xiang is supported by the National Natural Science Foundation of China (Grant No. 12471487) and Zhejiang Provincial Natural Science Foundation of China (Grant No.LY23A010009).
Publisher Copyright:
© 2025 Elsevier Inc.
PY - 2025/6
Y1 - 2025/6
N2 - The pursuit of individualized treatment rules in precision medicine has generated significant interest due to its potential to optimize clinical outcomes for patients with diverse treatment responses. One approach that has gained attention is outcome weighted learning, which is tailored to estimate optimal individualized treatment rules by leveraging each patient's unique characteristics under a weighted classification framework. However, traditional offline learning algorithms, which process all available data at once, face limitations when applied to high-dimensional electronic health records data due to its sheer volume. Additionally, the dynamic nature of precision medicine requires that learning algorithms can effectively handle streaming data that arrives in a sequential manner. To overcome these challenges, we present a novel framework that combines outcome weighted learning with online gradient descent algorithms, aiming to enhance precision medicine practices. Our framework provides a comprehensive analysis of the learning theory associated with online outcome weighted learning algorithms, taking into account general classification loss functions. We establish the convergence of these algorithms for the first time, providing explicit convergence rates while assuming polynomially decaying step sizes, with (or without) a regularization term. Our findings present a non-trivial extension of online classification to online outcome weighted learning, contributing to the theoretical foundations of learning algorithms tailored for processing streaming input-output-reward type data.
AB - The pursuit of individualized treatment rules in precision medicine has generated significant interest due to its potential to optimize clinical outcomes for patients with diverse treatment responses. One approach that has gained attention is outcome weighted learning, which is tailored to estimate optimal individualized treatment rules by leveraging each patient's unique characteristics under a weighted classification framework. However, traditional offline learning algorithms, which process all available data at once, face limitations when applied to high-dimensional electronic health records data due to its sheer volume. Additionally, the dynamic nature of precision medicine requires that learning algorithms can effectively handle streaming data that arrives in a sequential manner. To overcome these challenges, we present a novel framework that combines outcome weighted learning with online gradient descent algorithms, aiming to enhance precision medicine practices. Our framework provides a comprehensive analysis of the learning theory associated with online outcome weighted learning algorithms, taking into account general classification loss functions. We establish the convergence of these algorithms for the first time, providing explicit convergence rates while assuming polynomially decaying step sizes, with (or without) a regularization term. Our findings present a non-trivial extension of online classification to online outcome weighted learning, contributing to the theoretical foundations of learning algorithms tailored for processing streaming input-output-reward type data.
KW - Learning theory
KW - comparison theorems
KW - convergence rates
KW - online learning
KW - outcome weighted learning
KW - streaming data
UR - http://www.scopus.com/inward/record.url?scp=85218341675&partnerID=8YFLogxK
U2 - 10.1016/j.jco.2025.101931
DO - 10.1016/j.jco.2025.101931
M3 - Journal article
SN - 0885-064X
VL - 88
JO - Journal of Complexity
JF - Journal of Complexity
M1 - 101931
ER -