TY - JOUR
T1 - IterMask3D: Unsupervised anomaly detection and segmentation with test-time iterative mask refinement in 3D brain MRI
AU - Liang, Ziyun
AU - Guo, Xiaoqing
AU - Xu, Wentian
AU - Ibrahim, Yasin
AU - Voets, Natalie
AU - Pretorius, Pieter M.
AU - Alzheimer's Disease Neuroimaging Initiative
AU - Noble, J. Alison
AU - Kamnitsas, Konstantinos
N1 - For this work, ZL is supported by a scholarship provided by the EPSRC Doctoral Training Partnerships programme [EP/W524311/1]. YI is supported by the EPSRC Centre for Doctoral Training in Health Data Science (EP/S02428X/1). The authors also acknowledge UKRI grant reference3 [EP/X040186/1] and EPSRC grant [EP/T028572/1]. NLV gratefully acknowledges support from the NIHR Oxford Health Biomedical Research Centre [NIHR203316]. The views expressed are those of the author(s) and not necessarily those of the NIHR or the Department of Health and Social Care. The Wellcome Centre for Integrative Neuroimaging is supported by core funding from the Wellcome Trust (203139/Z/16/Z and 203139/A/16/Z). The authors also acknowledge the use of the University of Oxford Advanced Research Computing (ARC) facility in carrying out this work ( http://dx.doi.org/10.5281/zenodo.22558).
Publisher Copyright:
© 2025 The Authors. Published by Elsevier B.V.
PY - 2025/8/28
Y1 - 2025/8/28
N2 - Unsupervised anomaly detection and segmentation methods train a model to learn the training distribution as ‘normal’. In the testing phase, they identify patterns that deviate from this normal distribution as ‘anomalies’. To learn the ‘normal’ distribution, prevailing methods corrupt the images and train a model to reconstruct them. During testing, the model attempts to reconstruct corrupted inputs based on the learned ‘normal’ distribution. Deviations from this distribution lead to high reconstruction errors, which indicate potential anomalies. However, corrupting an input image inevitably causes information loss even in normal regions, leading to suboptimal reconstruction and an increased risk of false positives. To alleviate this, we propose IterMask3D, an iterative spatial mask-refining strategy designed for 3D brain MRI. We iteratively spatially mask areas of the image as corruption and reconstruct them, then shrink the mask based on reconstruction error. This process iteratively unmasks ‘normal’ areas to the model, whose information further guides reconstruction of ‘normal’ patterns under the mask to be reconstructed accurately, reducing false positives. In addition, to achieve better reconstruction performance, we also propose using high-frequency image content as additional structural information to guide the reconstruction of the masked area. Extensive experiments on the detection of both synthetic and real-world imaging artifacts, as well as segmentation of various pathological lesions across multiple MRI sequences, consistently demonstrate the effectiveness of our proposed method. Code is available at https://github.com/ZiyunLiang/IterMask3D.
AB - Unsupervised anomaly detection and segmentation methods train a model to learn the training distribution as ‘normal’. In the testing phase, they identify patterns that deviate from this normal distribution as ‘anomalies’. To learn the ‘normal’ distribution, prevailing methods corrupt the images and train a model to reconstruct them. During testing, the model attempts to reconstruct corrupted inputs based on the learned ‘normal’ distribution. Deviations from this distribution lead to high reconstruction errors, which indicate potential anomalies. However, corrupting an input image inevitably causes information loss even in normal regions, leading to suboptimal reconstruction and an increased risk of false positives. To alleviate this, we propose IterMask3D, an iterative spatial mask-refining strategy designed for 3D brain MRI. We iteratively spatially mask areas of the image as corruption and reconstruct them, then shrink the mask based on reconstruction error. This process iteratively unmasks ‘normal’ areas to the model, whose information further guides reconstruction of ‘normal’ patterns under the mask to be reconstructed accurately, reducing false positives. In addition, to achieve better reconstruction performance, we also propose using high-frequency image content as additional structural information to guide the reconstruction of the masked area. Extensive experiments on the detection of both synthetic and real-world imaging artifacts, as well as segmentation of various pathological lesions across multiple MRI sequences, consistently demonstrate the effectiveness of our proposed method. Code is available at https://github.com/ZiyunLiang/IterMask3D.
KW - 3D brain MRI
KW - Anomaly detection
KW - Unsupervised anomaly segmentation
UR - http://www.scopus.com/inward/record.url?scp=105015551479&partnerID=8YFLogxK
U2 - 10.1016/j.media.2025.103763
DO - 10.1016/j.media.2025.103763
M3 - Journal article
C2 - 40945172
AN - SCOPUS:105015551479
SN - 1361-8415
VL - 107, Part A
JO - Medical Image Analysis
JF - Medical Image Analysis
M1 - 103763
ER -