结合倒置特征金字塔和U-Net的高光谱图像分类
Hyperspectral image classification using an inverted feature pyramid network with U-Net
- 2021年26卷第8期 页码:1994-2008
收稿:2021-03-23,
修回:2021-5-11,
录用:2021-5-18,
纸质出版:2021-08-16
DOI: 10.11834/jig.210194
移动端阅览

浏览全部资源
扫码关注微信
收稿:2021-03-23,
修回:2021-5-11,
录用:2021-5-18,
纸质出版:2021-08-16
移动端阅览
目的
2
地物分类是对地观测研究领域的重要任务。高光谱图像具有丰富的地物光谱信息,可用于提升遥感图像地物分类的准确度。如何对高光谱图像进行有效的特征提取与表示是高光谱图像分类应用的关键问题。为此,本文提出了一种结合倒置特征金字塔和U-Net的高光谱图像分类方法。
方法
2
对高光谱数据进行主成分分析(principal component analysis,PCA)降维,获取作为网络输入的重构图像数据,然后使用U-Net逐层提取高光谱重构图像的空间特征。与此同时,利用倒置的特征金字塔网络抽取相应层级的语义特征;通过特征融合,得到既有丰富的空间信息又有较强烈的语义响应的特征表示。提出的网络利用注意力机制在跳跃连接过程中实现对背景区域的特征响应抑制,最终实现了较高的地物分类精度。
结果
2
分析了PCA降维方法和输入数据尺寸对分类性能的影响,并在Indian Pines、Pavia University、Salinas和Urban数据集上进行了对比实验。本文方法在4个数据集上分别取得了98.91%、99.85%、99.99%和87.43%的总体分类精度,与支持向量机(support vector machine,SVM)等相关算法相比,分类精度高出1%~15%。
结论
2
本文提出一种结合倒置特征金字塔和U-Net的高光谱图像分类方法,可以应用于有限训练样本下的高光谱图像分类任务,并在多个数据集上取得了较高的分类精度。实验结果表明倒置特征金字塔结构与U-Net结合的算法能够高效地实现高光谱图像的特征提取与表示,从而获得更精细的分类结果。
Objective
2
Terrain classification is an important research task in the field of earth observation using remote sensing technology. The hyperspectral image has rich spectral information; thus
it can be applied to the classification of remote sensing image. With the rapid development of the hyperspectral technology
the hyperspectral remote sensing image processing and analyzing technology has attracted wide attention of academia. The hyperspectral images have dozens or even hundreds of continuous narrow spectral bands compared with the traditional panchromatic band and multi-spectral remote sensing image
which provides detailed spectral and spatial feature information. Accordingly
these images have been widely used in various aspects
such as precision agriculture
city planning
and military defense. Hyperspectral images have high dimensional data
and redundancy and noise exist; thus
transformed data must be utilized for image processing. In the application of hyperspectral image classification
the manner by which to effectively represent the features of hyperspectral image is the most critical step in current studies. In this work
we propose an approach for hyperspectral image classification by using an inverted feature pyramid network and U-Net.
Method
2
The dimension of the hyperspectral remote sensing image data is high. Principal component analysis (PCA) method plays a significant role in transforming useful information in the images to the most important
k
characteristic
thus reducing the amount of data and enhancing the data features. After PCA
the data are segmented and collected by means of sliding window. The surrounding area of each pixel is defined as a patch
which is regarded as the input of the proposed network. The category of the pixel is the ground truth label. In the first stage
U-Net is used to extract spatial features of hyperspectral image at the pixel level. The left side of the network is the contraction path
which corresponds to the encoder part of the classic encoder-decoder. The right side of the network is the extension path
which can be regarded as a decoder. The feature maps in the extension path are the result of combining two parts of a feature map along two dimensions
making the acquired features more visible. In the first part
the feature maps from the same layer of contraction path and the feature maps from the upper layer of extension path are simultaneously fed to the attention mechanism. The feature region of this part has a higher weight value. The second part is obtained by deconvolution of the feature graph from the upper layer of the extension path. In a layered way
these feature maps with rich spatial information are fused with feature maps containing rich semantic information obtained by inverted feature pyramid network layers. Therefore
the obtained feature maps have reliable spatial and strong semantic information. Finally
the weight value of the effective features in the image is increased
and the region of irrelevant background is suppressed owing to the attention mechanism. Thus
the classification result of hyperspectral image is acquired.
Result
2
We conduct experiments to evaluate the effectiveness of the proposed method and attempt to investigate the influence of PCA retained principal component number and the size of input data for the performance of classification. We conduct contrast experiments on four publicly available hyperspectral image datasets to demonstrate the performance of the proposed method: Indian Pines
Pavia University
Salinas
and Urban. Experimental results show that the proposed method for hyperspectral image classification is effective
and the best PCA retained principal component numbers are 3
20
10
and 3. Meanwhile
the best input sizes of the proposed model are 64
32
32
and 64. We obtain 98.91%
99.85%
99.99%
and 87.43% overall classification accuracy rates
98.07%
99.39%
99.09%
and 78.30% average classification accuracy rates
and 0.987
0.998
0.999
and 0.831 Kappa values for the four hyperspectral image datasets
respectively
which are higher than those of the other classification algorithms.
Conclusion
2
Hyperspectral images are capable of accurately presenting the rich terrain information contained in the specific region with the help of hundreds of continuous and subdivided spectral bands; however
useless information exists in each spectral band. The mechanism by which to effectively extract the key terrain information from the data of hyperspectral images and utilize them for classification is the most important and difficult problem. We propose to combine U-Net and the inverted pyramid network for hyperspectral image classification. First
we reduce the dimension of hyperspectral image data with the help of PCA method. We adopt the method of sliding window to build patches after the data dimension is reduced. These patches are fed into the model. U-Net is regarded as the backbone of the proposed network
and it aims to extract the characteristics of a hyperspectral image. Then
the rich characteristics of the spatial information are fused with the features from the inverted pyramid network. Subsequently
the abundant spectral and spatial information is obtained. The utilization of attention mechanism allows the model to effectively focus on spectral and spatial information and reduce the influence of signal-to-noise to classification performance. Experimental results show that the proposed method can be applied to hyperspectral image classification tasks with limited training samples and achieve good classification results. The classification accuracy of a hyperspectral image can also be improved by properly handling the input data. In our future work
we will attempt to investigate the manner by which to make the model's structure less complex while maintaining high hyperspectral image classification performance with less training data samples.
Haut J M, Paoletti M E, Plaza J, Plaza A and Li J. 2019. Visual attention-driven hyperspectral image classification. IEEE Transactions on Geoscience and Remote Sensing, 57(10): 8065-8080[DOI:10.1109/TGRS.2019.2918080]
He N J, Paoletti M E, Haut J M, Fang L Y, Li S T, Plaza A and Plaza J. 2019. Feature extraction with multiscale covariance maps for hyperspectral image classification. IEEE Transactions on Geoscience and Remote Sensing, 57(2): 755-769[DOI:10.1109/TGRS.2018.2860464]
Jia S, Lin Z J, Deng B, Zhu J S and Li Q Q. 2020. Cascade superpixel regularized gabor feature fusion for hyperspectral image classification. IEEE Transactions on Neural Networks and Learning Systems, 31(5): 1638-1652[DOI:10.1109/TNNLS.2019.2921564]
Jia S, Shen L L, Zhu J S and Li Q Q. 2018. A 3-D gabor phase-based coding and matching framework for hyperspectral imagery classification.IEEE Transactions on Cybernetics, 48(4): 1176-1188[DOI:10.1109/TCYB.2017.2682846]
Li Y, Zhen C, Shi X and Zhao Q H. 2019. Hyperspectral image classification algorithm based on entropy weighted K-means with global information. Journal of Image and Graphics, 24(4): 630-638
李玉, 甄畅, 石雪, 赵泉华. 2019. 基于熵加权K-means全局信息聚类的高光谱图像分类. 中国图象图形学报, 24(4): 630-638[DOI:10.11834/jig.180502]
Liu J J, Wu Z B, Xiao L, Sun J and Yan H. 2020. Generalized tensor regression for hyperspectral image classification. IEEE Transactions on Geoscience and Remote Sensing, 58(2): 1244-1258[DOI:10.1109/TGRS.2019.2944989]
Liu W J, Yin X, Qu H C and Liu L M. 2019. Dimensionality-varied convolutional neural network for improving the classification performance of hyperspectral images with small-sized labeled samples. Journal of Image and Graphics, 24(9): 1604-1618
刘万军, 尹岫, 曲海成, 刘腊梅. 2019. 提高小样本高光谱图像分类性能的变维卷积神经网络. 中国图象图形学报, 24(9): 1604-1618[DOI:10.11834/jig.180693]
Liu Y, Ji X F and Wang Y Y. 2016. Classification of hyperspectral image based on double L2 sparse coding. Journal of Image and Graphics, 21(12): 1707-1715
刘洋, 姬晓飞, 王杨扬. 2016. 基于双重L2稀疏编码的高光谱图像分类. 中国图象图形学报, 21(12): 1707-1715][DOI:10.11834/jig.20161215]
Luo F L, Du B, Zhang L P and Tao D C. 2019. Feature learning using spatial-spectral hypergraph discriminant analysis for hyperspectral image. IEEE Transactions on Cybernetics, 49(7): 2406-2419[DOI:10.1109/TCYB.2018.2810806]
Paoletti M E, Haut J M, Fernandez-Beltran R, Plaza J, Plaza A, Li J and Pla F. 2019a. Capsule networks for hyperspectral image classification. IEEE Transactions on Geoscience and Remote Sensing, 57(4): 2145-2160[DOI:10.1109/TGRS.2018.2871782]
Paoletti M E, Haut J M, Fernandez-Beltran R, Plaza J, Plaza A J and Pla F. 2019b. Deep pyramidal residual networks for spectral-spatial hyperspectral image classification. IEEE Transactions on Geoscience and Remote Sensing, 57(2): 740-754[DOI:10.1109/TGRS.2018.2860125]
Peng J T, Li L Q and Tang Y Y. 2019. Maximum likelihood estimation-based joint sparse representation for the classification of hyperspectral remote sensing images. IEEE Transactions on Neural Networks and Learning Systems, 30(6): 1790-1802[DOI:10.1109/TNNLS.2018.2874432]
Ran Q, Yu H Y, Gao L R, Li W and Zhang B. 2018. Superpixel and subspace projection-based support vector machines for hyperspectral image classification. Journal of Image and Graphics, 23(1): 95-105
冉琼, 于浩洋, 高连如, 李伟, 张兵. 2018. 结合超像元和子空间投影支持向量机的高光谱图像分类. 中国图象图形学报, 23(1): 95-105[DOI:10.11834/jig.170201]
Wu H and Prasad S. 2018. Semi-supervised deep learning using pseudo labels for hyperspectral image classification. IEEE Transactions on Image Processing, 27(3): 1259-1270[DOI:10.1109/TIP.2017.2772836]
Xiao L, Liu P F and Li H. 2020. Progress and challenges in the fusion of multisource spatial-spectral remote sensing images. Journal of Image and Graphics, 25(5): 851-863
肖亮, 刘鹏飞, 李恒. 2020. 多源空-谱遥感图像融合方法进展与挑战. 中国图象图形学报, 25(5): 851-863[DOI:10.11834/jig.190620]
Yu C Y, Han R, Song M P, Liu C Y and Chang C I. 2021. Feedback attention-based dense CNN for hyperspectral image classification. IEEE Transactions on Geoscience and Remote Sensing: #3058549[DOI:10.1109/TGRS.2021.3058549]
Zhang M M, Li W and Du Q. 2018. Diverse region-based CNN for hyperspectral image classification. IEEE Transactions on Image Processing, 27(6): 2623-2634[DOI:10.1109/TIP.2018.2809606]
Zhang X R, Liang Y L, Zheng Y G, An J L and Jiao L C. 2016. Hierarchical discriminative feature learning for hyperspectral image classification. IEEE Geoscience and Remote Sensing Letters, 13(4): 594-598[DOI:10.1109/LGRS.2016.2528883]
Zhu M H, Jiao L C, Liu F and Yang S Y and Wang J N. 2021. Residual spectral-spatial attention network for hyperspectral image classification. IEEE Transactions on Geoscience and Remote Sensing, 59(1): 449-462[DOI:10.1109/TGRS.2020.2994057]
相关作者
相关机构
京公网安备11010802024621