TY - GEN
T1 - Partially observable reinforcement learning for sustainable active surveillance
AU - Chen, Hechang
AU - Yang, Bo
AU - LIU, Jiming
N1 - Funding Information:
This work was funded by the National Natural Science Foundation of China under grants 61373053 and 61572226, and in part by the Jilin Province Key Scientific and Technological Research and Development Project under Grant 20180201067GX and Grant 20180201044GX.
PY - 2018
Y1 - 2018
N2 - Active surveillance is the most effective strategy in the applications of infectious disease prevention, road network optimization, crime reconnaissance, etc. However, the incomplete data collected from partially monitored regions by active surveillance disables existing models to maintain a sustainable performance in the future. To address this issue, this article presents a sustainable active surveillance framework (SAS), which consists of a predictor, a classifier, and a planner, by developing a novel partially observable reinforcement learning algorithm. The predictor estimates priorities of candidate regions for monitoring. The classifier assigns candidate regions with similar features into the same groups, so that the data collected from monitored regions can be shared with unmonitored regions within the group. The planner determines where and when to allocate limited resources, considering the outcomes of available resources and model sustainability. An empirical case study on infectious disease prevention showed that the proposed SAS method significantly outperforms the state-of-the-art methods.
AB - Active surveillance is the most effective strategy in the applications of infectious disease prevention, road network optimization, crime reconnaissance, etc. However, the incomplete data collected from partially monitored regions by active surveillance disables existing models to maintain a sustainable performance in the future. To address this issue, this article presents a sustainable active surveillance framework (SAS), which consists of a predictor, a classifier, and a planner, by developing a novel partially observable reinforcement learning algorithm. The predictor estimates priorities of candidate regions for monitoring. The classifier assigns candidate regions with similar features into the same groups, so that the data collected from monitored regions can be shared with unmonitored regions within the group. The planner determines where and when to allocate limited resources, considering the outcomes of available resources and model sustainability. An empirical case study on infectious disease prevention showed that the proposed SAS method significantly outperforms the state-of-the-art methods.
KW - Neural networks
KW - Reinforcement learning
KW - Resources allocation
KW - Sustainable active surveillance
UR - http://www.scopus.com/inward/record.url?scp=85052195988&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-99247-1_38
DO - 10.1007/978-3-319-99247-1_38
M3 - Conference proceeding
AN - SCOPUS:85052195988
SN - 9783319992464
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 425
EP - 437
BT - Knowledge Science, Engineering and Management - 11th International Conference, KSEM 2018, Proceedings
A2 - Liu, Weiru
A2 - Giunchiglia, Fausto
A2 - Yang, Bo
PB - Springer Verlag
T2 - 11th International Conference on Knowledge Science, Engineering and Management, KSEM 2018
Y2 - 17 August 2018 through 19 August 2018
ER -