TY - JOUR
T1 - Hyperspectral Image Classification Based on 3-D Octave Convolution with Spatial-Spectral Attention Network
AU - Tang, Xu
AU - Meng, Fanbo
AU - Zhang, Xiangrong
AU - Cheung, Yiu Ming
AU - Ma, Jingjing
AU - Liu, Fang
AU - Jiao, Licheng
N1 - Funding Information:
Manuscript received June 7, 2020; accepted June 22, 2020. Date of publication July 14, 2020; date of current version February 25, 2021. This work was supported in part by the National Natural Science Foundation of China under Grant 61801351, Grant 61802190, Grant 61772400, Grant 61672444, and Grant 61272366, in part by the Key Laboratory of National Defense Science and Technology Foundation Project under Grant 6142113180302, in part by the China Post-Doctoral Science Foundation Funded Project under Grant 2017M620441, in part by Xidian University New Teacher Innovation Fund Project under Grant XJS18032, in part by Hong Kong Baptist University (HKBU), Research Committee, Initiation Grant–Faculty Niche Research Areas (IG-FNRA) 2018/19 under Grant RC-FNRA-IG/18-19/SCI/03, in part by the Innovation and Technology Fund of Innovation and Technology Commission of the Government of the Hong Kong SAR under Grant ITS/339/18, in part by the Faculty Research Grant of HKBU under Project FRG2/17-18/082, and in part by the Shenzhen Science, Technology and Innovation Commission (SZSTI) under Grant JCYJ20160531194006833. (Corresponding authors: Xu Tang; Yiu-Ming Cheung.) Xu Tang is with the Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education, School of Artificial Intelligence, Xidian University, Xi’an 710071, China, and also with the Department of Computer Science, Hong Kong Baptist University, Hong Kong (e-mail: [email protected]).
PY - 2021/3
Y1 - 2021/3
N2 - In recent years, with the development of deep learning (DL), the hyperspectral image (HSI) classification methods based on DL have shown superior performance. Although these DL-based methods have great successes, there is still room to improve their ability to explore spatial-spectral information. In this article, we propose a 3-D octave convolution with the spatial-spectral attention network (3DOC-SSAN) to capture discriminative spatial-spectral features for the classification of HSIs. Especially, we first extend the octave convolution model using 3-D convolution, namely, a 3-D octave convolution model (3D-OCM), in which four 3-D octave convolution blocks are combined to capture spatial-spectral features from HSIs. Not only the spatial information can be mined deeply from the high- and low-frequency aspects but also the spectral information can be taken into account by our 3D-OCM. Second, we introduce two attention models from spatial and spectral dimensions to highlight the important spatial areas and specific spectral bands that consist of significant information for the classification tasks. Finally, in order to integrate spatial and spectral information, we design an information complement model to transmit important information between spatial and spectral attention features. Through the information complement model, the beneficial parts of spatial and spectral attention features for the classification tasks can be fully utilized. Comparing with several existing popular classifiers, our proposed method can achieve competitive performance on four benchmark data sets.
AB - In recent years, with the development of deep learning (DL), the hyperspectral image (HSI) classification methods based on DL have shown superior performance. Although these DL-based methods have great successes, there is still room to improve their ability to explore spatial-spectral information. In this article, we propose a 3-D octave convolution with the spatial-spectral attention network (3DOC-SSAN) to capture discriminative spatial-spectral features for the classification of HSIs. Especially, we first extend the octave convolution model using 3-D convolution, namely, a 3-D octave convolution model (3D-OCM), in which four 3-D octave convolution blocks are combined to capture spatial-spectral features from HSIs. Not only the spatial information can be mined deeply from the high- and low-frequency aspects but also the spectral information can be taken into account by our 3D-OCM. Second, we introduce two attention models from spatial and spectral dimensions to highlight the important spatial areas and specific spectral bands that consist of significant information for the classification tasks. Finally, in order to integrate spatial and spectral information, we design an information complement model to transmit important information between spatial and spectral attention features. Through the information complement model, the beneficial parts of spatial and spectral attention features for the classification tasks can be fully utilized. Comparing with several existing popular classifiers, our proposed method can achieve competitive performance on four benchmark data sets.
KW - Attention mechanism
KW - deep learning (DL)
KW - hyperspectral image (HSI) classification
KW - information complement
KW - spatial-spectral features
UR - http://www.scopus.com/inward/record.url?scp=85101810836&partnerID=8YFLogxK
U2 - 10.1109/TGRS.2020.3005431
DO - 10.1109/TGRS.2020.3005431
M3 - Journal article
AN - SCOPUS:85101810836
SN - 0196-2892
VL - 59
SP - 2430
EP - 2447
JO - IEEE Transactions on Geoscience and Remote Sensing
JF - IEEE Transactions on Geoscience and Remote Sensing
IS - 3
ER -