TY - JOUR
T1 - Multi-Scale Cascaded with Cross-Attention Network-Based Deformation Vector Field Estimation for Motion-Compensated 4D-CBCT Reconstruction
AU - Yuan, Peng
AU - Lyu, Fei
AU - Gao, Zhiqiang
AU - Yang, Chunfeng
AU - Hu, Dianlin
AU - Zhu, Jian
AU - Wu, Zhan
AU - Lyu, Tianling
AU - Zhao, Wei
AU - Dong, Jianmin
AU - Chen, Yang
N1 - Funding Information:
This work was supported in part by the Tibet Autonomous Region Science and Technology Plan Project under Grant XZ202401JD0009, the State Key Project of Research and Development Plan under Grants 2022YFC2408500, the National Natural Science Foundation of China under Grant T2225025, and the Key Research and Development Programs in Jiangsu Province of China under Grant BE2021703 and BE2022768, and in part by the Natural Science Foundation of Zhejiang Province under Grant LZ23A050002 and National Natural Science Foundation of China under Grant 12175012.
Publisher Copyright:
© 2025 IEEE.
PY - 2025/4/16
Y1 - 2025/4/16
N2 - Four-Dimensional Cone Beam Computed Tomography (4D-CBCT) imaging technology offers enhanced image quality and spatial resolution for intraoperative guidance, facilitating real-time tracking of tumor position changes during radiotherapy. However, this is still a task of great challenges due to insufficient projections at each respiratory phase after phase-sorting, and the image phases reconstructed directly from phase-sorted data are discrete and discontinuous. To generate high-quality 4D-CBCT deformation vector fields (DVFs), this paper leverages the preoperative static prior image to guide intraoperative dynamic sparse-view reconstruction images for reducing anatomical structure differences, ultimately achieving continuous and dynamic 4D-CBCT imaging. In this paper, we propose a Multi-scale Cascaded residual deformable vector field estimation framework based on Cross-attention in Motioncompensated 4D-CBCT reconstruction (MCCM), which combines Multi-Scale Cascaded residual registration network (MSCNet), Cross-Attention Enhanced feature Fusion (CAEF) module and Structure-Enhanced Motion-Compensated (SEMC) module: 1) the MCCM employs a multi-scale cascaded residual network strategy, merging multi-receptive fields and multi-resolution feature maps for large-scale internal changes. 2) the CAEF is embedded into MSC-Net to facilitate effective communication and learning between features and promote the flow of information. 3) the SEMC is developed to reduce artifacts after intraoperative CBCT sparse-view reconstruction, restore global lung motion changes and local details, and enhance structural information through image fusion. The proposed method has been evaluated using simulated and clinical datasets and has shown promising results by comparative experiment. Our approach exhibits significant improvements across various evaluation metrics.
AB - Four-Dimensional Cone Beam Computed Tomography (4D-CBCT) imaging technology offers enhanced image quality and spatial resolution for intraoperative guidance, facilitating real-time tracking of tumor position changes during radiotherapy. However, this is still a task of great challenges due to insufficient projections at each respiratory phase after phase-sorting, and the image phases reconstructed directly from phase-sorted data are discrete and discontinuous. To generate high-quality 4D-CBCT deformation vector fields (DVFs), this paper leverages the preoperative static prior image to guide intraoperative dynamic sparse-view reconstruction images for reducing anatomical structure differences, ultimately achieving continuous and dynamic 4D-CBCT imaging. In this paper, we propose a Multi-scale Cascaded residual deformable vector field estimation framework based on Cross-attention in Motioncompensated 4D-CBCT reconstruction (MCCM), which combines Multi-Scale Cascaded residual registration network (MSCNet), Cross-Attention Enhanced feature Fusion (CAEF) module and Structure-Enhanced Motion-Compensated (SEMC) module: 1) the MCCM employs a multi-scale cascaded residual network strategy, merging multi-receptive fields and multi-resolution feature maps for large-scale internal changes. 2) the CAEF is embedded into MSC-Net to facilitate effective communication and learning between features and promote the flow of information. 3) the SEMC is developed to reduce artifacts after intraoperative CBCT sparse-view reconstruction, restore global lung motion changes and local details, and enhance structural information through image fusion. The proposed method has been evaluated using simulated and clinical datasets and has shown promising results by comparative experiment. Our approach exhibits significant improvements across various evaluation metrics.
KW - motion compensation
KW - fine 4D-CBCT deformation field
KW - multi-scale cascading residual network
KW - cross-attention
KW - prior-based sparse-view reconstruction
KW - tumor motion modeling
UR - http://www.scopus.com/inward/record.url?scp=105002846483&partnerID=8YFLogxK
U2 - 10.1109/TCI.2025.3561660
DO - 10.1109/TCI.2025.3561660
M3 - Journal article
AN - SCOPUS:105002846483
SN - 2573-0436
SP - 1
EP - 14
JO - IEEE Transactions on Computational Imaging
JF - IEEE Transactions on Computational Imaging
ER -