TY - JOUR
T1 - Vectorized building extraction from high-resolution remote sensing images using spatial cognitive graph convolution model
AU - Du, Zhuotong
AU - Sui, Haigang
AU - Zhou, Qiming
AU - Zhou, Mingting
AU - Shi, Weiyue
AU - Wang, Jianxun
AU - Liu, Junyi
N1 - This work was supported by the National Natural Science Foundation of China [grant No. 42271416], the National Natural Science Foundation of China Major Program [grant Nos. 42192580 and 42192583], the Guangxi Science and Technology Major Project [grant No. AA22068072], and the Hong Kong Research Grant Council General Research Fund [HKBU 12301820].
Publisher Copyright:
© 2024 International Society for Photogrammetry and Remote Sensing, Inc. (ISPRS)
PY - 2024/7
Y1 - 2024/7
N2 - Traditional approach from source image to application vectors in building extraction needs additional complex regularization of converted intermediate raster results. While in conversion, the lost detailed artifacts, unnecessary nodes, and messy paths would be labor-intensive to repair errors and topological issues, even aside the inherent problems of blob-like objects and blurry, jagged edges in first-stage extraction. This research explores new graph convolution-driven solution, the spatial-cognitive shaping model (SCShaping), to directly access vectorization form of individual buildings through spatial cognitive approximation to coordinates that form building boundaries. To strengthen graph nodes expressivity, this method enriches topological feature embedding travelling along the model architecture along with features contributed from convolutional neural network (CNN) extractor. To stimulate the neighboring aggregation in graphs, Graph-Encoder-Decoder mechanism is introduced to augment feature reuse integrating complementary graph convolution layers. The strong embedding guarantees effective feature tapping and the robust structure guarantees the feature mining. Comparative studies have been conducted between the proposed approach with five other methods on three challenging datasets. The results demonstrate the proposed approach yields unanimous and significant improvements in mask-wise metrics, which evaluate object integrity and accuracy, as well as edge-wise metrics, which assess contour regularity and precision. The outperformance also indicates better multi-scale object adaptability of SCShaping. The obtain-and-play SCShaping commands a pleasurable implementation way to advance ideal man–machine collaboration.
AB - Traditional approach from source image to application vectors in building extraction needs additional complex regularization of converted intermediate raster results. While in conversion, the lost detailed artifacts, unnecessary nodes, and messy paths would be labor-intensive to repair errors and topological issues, even aside the inherent problems of blob-like objects and blurry, jagged edges in first-stage extraction. This research explores new graph convolution-driven solution, the spatial-cognitive shaping model (SCShaping), to directly access vectorization form of individual buildings through spatial cognitive approximation to coordinates that form building boundaries. To strengthen graph nodes expressivity, this method enriches topological feature embedding travelling along the model architecture along with features contributed from convolutional neural network (CNN) extractor. To stimulate the neighboring aggregation in graphs, Graph-Encoder-Decoder mechanism is introduced to augment feature reuse integrating complementary graph convolution layers. The strong embedding guarantees effective feature tapping and the robust structure guarantees the feature mining. Comparative studies have been conducted between the proposed approach with five other methods on three challenging datasets. The results demonstrate the proposed approach yields unanimous and significant improvements in mask-wise metrics, which evaluate object integrity and accuracy, as well as edge-wise metrics, which assess contour regularity and precision. The outperformance also indicates better multi-scale object adaptability of SCShaping. The obtain-and-play SCShaping commands a pleasurable implementation way to advance ideal man–machine collaboration.
KW - Building extraction
KW - Graph convolutional networks
KW - Graph pooling
KW - High-resolution remote sensing images
KW - Shape encoding
UR - http://www.scopus.com/inward/record.url?scp=85194769203&partnerID=8YFLogxK
U2 - 10.1016/j.isprsjprs.2024.05.015
DO - 10.1016/j.isprsjprs.2024.05.015
M3 - Journal article
AN - SCOPUS:85194769203
SN - 0924-2716
VL - 213
SP - 53
EP - 71
JO - ISPRS Journal of Photogrammetry and Remote Sensing
JF - ISPRS Journal of Photogrammetry and Remote Sensing
ER -