高光谱图像小样本分类的卷积神经网络方法
Convolution neural network method for small-sample classification of hyperspectral images
- 2021年26卷第8期 页码:2009-2020
纸质出版日期: 2021-08-16 ,
录用日期: 2021-04-30
DOI: 10.11834/jig.210103
移动端阅览
浏览全部资源
扫码关注微信
纸质出版日期: 2021-08-16 ,
录用日期: 2021-04-30
移动端阅览
吴鸿昊, 王立国, 石瑶. 高光谱图像小样本分类的卷积神经网络方法[J]. 中国图象图形学报, 2021,26(8):2009-2020.
Honghao Wu, Liguo Wang, Yao Shi. Convolution neural network method for small-sample classification of hyperspectral images[J]. Journal of Image and Graphics, 2021,26(8):2009-2020.
目的
2
与传统分类方法相比,基于深度学习的高光谱图像分类方法能够提取出高光谱图像更深层次的特征。针对现有深度学习的分类方法网络结构简单、特征提取不够充分的问题,提出一种堆叠像元空间变换信息的数据扩充方法,用于解决训练样本不足的问题,并提出一种基于不同尺度的双通道3维卷积神经网络的高光谱图像分类模型,来提取高光谱图像的本质空谱特征。
方法
2
通过对高光谱图像的每一像元及其邻域像元进行旋转、行列变换等操作,丰富中心像元的潜在空间信息,达到数据集扩充的作用。将扩充之后的像素块输入到不同尺度的双通道3维卷积神经网络学习训练集的深层特征,实现更高精度的分类。
结果
2
5次重复实验后取平均的结果表明,在随机选取了10%训练样本并通过8倍数据扩充的情况下,Indian Pines数据集实现了98.34%的总体分类精度,Pavia University数据集总体分类精度达到99.63%,同时对比了不同算法的运行时间,在保证分类精度的前提下,本文算法的运行时间短于对比算法,保证了分类模型的稳定性、高效性。
结论
2
本文提出的基于双通道卷积神经网络的高光谱图像分类模型,既解决了训练样本不足的问题,又综合了高光谱图像的光谱特征和空间特征,提高了高光谱图像的分类精度。
Objective
2
Classification is a major research task in hyperspectral remote sensing image processing. Extracting appropriate features from hyperspectral remote sensing images is a prerequisite to ensure the accuracy and effectiveness of classification. Traditional feature extraction methods can only extract single
shallow spectral or spatial features and have difficulty extracting deep spatial-spectral features. This limitation results in poor classification accuracy. Compared with traditional classification methods
hyperspectral image classification method based on deep learning model can extract deeper features of hyperspectral image. However
the existing deep learning methods can classify hyperspectral images but have low classification accuracy because of their simple network structure and insufficient feature extraction. Moreover
the parameters of deep network are hundreds of thousands of millions
which need a large number of labeled training samples and longer time to complete the model training. However
manual annotation of hyperspectral images is a time-consuming and laborious project
and only a few labeled samples can usually be used. Therefore
constructing a depth network model for hyperspectral image classification with a small number of training samples is a key research problem. A data expansion method of stacking pixel spatial transformation information is proposed to address the problems of simple network structure and insufficient feature extraction of existing classification methods based on deep learning. A hyperspectral image classification model based on different scales of dual-channel 3D convolutional neural network is also proposed to extract the essential spatial features of hyperspectral image.
Method
2
On the one hand
the stacking space transform information method is used to expand the training samples for solving the problem of insufficient training samples in the process of hyperspectral image classification. On the other hand
the dual-channel convolutional neural network is used
and its overall structure is composed of preprocessing
data expansion
and dual-channel convolutional neural network (DC-CNN). First
the hyperspectral image is preprocessed to transform the pixels into 3×3 and 5×5 pixel blocks containing certain spatial information as the unit to be processed. Then
the unit to be processed uses the data expansion method of stacking spatial transformation information to expand the data. The expanded dataset is transformed into shape to meet the input dimension of dual-channel convolutional neural network. Then
the pixel blocks of different sizes are input into the branch convolution of different scales of DC-CNN
and they are combined with the spatial spectrum features extracted from the two channels for training to improve the generalization ability of the network. The spatial-spectral features of hyperspectral images can be more comprehensively and fully extracted using different convolution branches to process different sizes of pixel blocks. The information loss caused by dimension reduction can be reduced
and the classification accuracy can be improved. The method of stack space transform information enriches the potential space information of the center pixel and expands the dataset by rotating each pixel and its adjacent pixels
row column transform
and other operations. The expanded pixel blocks are input into dual-channel 3D convolutional neural networks with different scales to learn the deep features of the training set and achieve higher accuracy classification. Three popular hyperspectral image classification methods are selected and compared with this method to verify its effectiveness. The three methods are 2D convolutional neural network (2D-CNN)
3D convolutional neural network (3D-CNN)
and siamese neural network (SNN). The network structure of 3D-CNN is the same as that of 3×3 branches of DC-CNN
which can be considered a single-channel network model of DC-CNN.
Result
2
Experimental results show that the Indian Pines and Pavia University datasets achieve 98.34% and 99.63% of the overall classification accuracy
respectively. The running time of different algorithms is also compared. The running time of the proposed algorithm is relatively shorter than that of the comparison algorithm
and this feature ensures the efficiency of the classification model. In addition
this study analyzes the influence of different convolution layers on the classification performance. It verifies that the convolution neural network composed of five convolution layers has the strongest feature extraction ability for hyperspectral images. The proposed method also has good classification performance on two groups of real hyperspectral image datasets. Specifically
the proposed method has certain generalization ability and advantages in running time. In this study
dual-channel convolution neural network is used to complete the deep spatial-spectral feature fusion of hyperspectral image
which fully extracts the hidden information of hyperspectral image and is conducive to the application of deep learning model in hyperspectral image classification. In the follow-up research
we can further explore the effect of multichannel convolution neural network
such as three- and four-channel convolution neural networks in hyperspectral image classification. The application of deep learning method in hyperspectral image can also be promoted.
Conclusion
2
The performance of hyperspectral image classification model with limited training samples is not ideal. Thus
this study proposes a data expansion method of stacking pixel neighborhood spatial information and designs a hyperspectral image classification method based on dual-channel convolutional neural network
which is called DC-CNN. The data expansion method of stacking pixel neighborhood spatial information conducts spatial rotation and transformation on different scale pixel blocks to expand the number of training samples and obtain the hidden spatial information of the center pixel. This method solves the problem of insufficient training samples of depth model and insufficient spatial information extraction of hyperspectral image in feature extraction process. It inputs pixel blocks of different scales through DC-CNN network framework to comprehensively extract more abundant spatial information
completes the deep spatial-spectral information extraction
and inputs the fused spatial spectral features into softmax classifier to achieve more accurate classification.
高光谱图像有监督分类空谱结合卷积神经网络(CNN)深度学习数据扩充
hyperspectral imagesupervised classificationspatial spectrum combinationconvolutional neural network(CNN)deep learningdata expansion
Bandos T V, Bruzzone L and Camps-Valls G. 2009. Classification of hyperspectral images with regularized linear discriminant analysis. IEEE Transactions on Geoscience and Remote Sensing, 47(3): 862-873[DOI:10.1109/TGRS.2008.2005729]
Chen Y S, Jiang H L, Li C Y, Jia X P and Ghamisi P. 2016. Deep feature extraction and classification of hyperspectral images based on convolutional neural networks. IEEE Transactions on Geoscience and Remote Sensing, 54(10): 6232-6251[DOI:10.1109/TGRS.2016.2584107]
Chen Y S, Lin Z H, Zhao X, Wang G and Gu Y F. 2014. Deep learning-based classification of hyperspectral data. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 7(6): 2094-2107[DOI:10.1109/JSTARS.2014.2329330]
Chen Y S, Zhao X and Jia X P. 2015. Spectral-spatial classification of hyperspectral data based on deep belief network. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 8(6): 2381-2392[DOI:10.1109/JSTARS.2015.2388577]
Du P J, Xia J S, Xue Z H, Tan K, Su H J and Bao R. 2016. Review of hyperspectral remote sensing image classification. Journal of Remote Sensing, 20(2): 236-256
杜培军, 夏俊士, 薛朝辉, 谭琨, 苏红军, 鲍蕊. 2016. 高光谱遥感影像分类研究进展. 遥感学报, 20(2): 236-256[DOI:10.11834/jrs.20165022]
He M Y, Li B and Chen H H. 2017. Multi-scale 3D deep convolutional neural network for hyperspectral image classification//Proceedings of 2017 IEEE International Conference on Image Processing (ICIP). Beijing, China: IEEE: 57-61[DOI: 10.1109/ICIP.2017.8297014http://dx.doi.org/10.1109/ICIP.2017.8297014]
Hu W, Huang Y Y, Wei L, Zhang F and Li H C. 2015. Deep convolutional neural networks for hyperspectral image classification. Journal of Sensors, 2015: #258619[DOI:10.1155/2015/258619]
Kingma D P and Ba J. 2015. Adam: a method for stochastic optimization//Proceedings of the 3rd International Conference on Learning Representations. San Diego, USA: [s. n.]: 58-73
Li G D, Zhang C J, Gao F and Zhang X Y. 2019. Doubleconvpool-structured 3D-CNN for hyperspectral remote sensing image classification. Journal of Image and Graphics, 24(4): 639-654
李冠东, 张春菊, 高飞, 张雪英. 2019. 双卷积池化结构的3D-CNN高光谱遥感影像分类方法. 中国图象图形学报, 24(4): 639-654[DOI:10.11834/jig.180422]
Li W, Wu G D, Zhang F and Du Q. 2017b. Hyperspectral image classification using deep pixel-pair features. IEEE Transactions on Geoscience and Remote Sensing, 55(2): 844-853[DOI:10.1109/TGRS.2016.2616355]
Li Y, Zhang H K and Shen Q. 2017a. Spectral-spatial classification of hyperspectral imagery with 3D convolutional neural network. Remote Sensing, 9(1): #67[DOI:10.1080/2150704X.2017.1280200]
Licciardi G, Marpu P R, Chanussot J and Benediktsson J A. 2012. Linear versus nonlinear PCA for the classification of hyperspectral data based on the extended morphological profiles. IEEE Geoscience and Remote Sensing Letters, 9(3): 447-451[DOI:10.1109/LGRS.2011.2172185]
Makantasis K, Karantzalos K, Doulamis A and Doulamis N. 2015. Deep supervised learning for hyperspectral data classification through convolutional neural networks//2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS). Milan, Italy: IEEE: 4959-4962[DOI: 10.1109/IGARSS.2015.7326945http://dx.doi.org/10.1109/IGARSS.2015.7326945]
Qi Y F, Chen J, Huo Y L and Li F Y. 2020. Hyperspectral image classification algorithm based on multiscale convolutional neural network. Infrared Technology, 42(9): 855-862
齐永锋, 陈静, 火元莲, 李发勇. 2020. 基于多尺度卷积神经网络的高光谱图像分类算法. 红外技术, 42(9): 855-862
Villa A, Benediktsson J A, Chanussot J and Jutten C. 2011. Hyperspectral image classification with independent component discriminant analysis. IEEE Transactions on Geoscience and Remote Sensing, 49(12): 4865-4876[DOI:10.1109/TGRS.2011.2153861]
Wang L G, Zhao L and Liu D F. 2018. A review on the application of SVM in hyperspectral image processing. Journal of Harbin Engineering University, 39(6): 973-983
王立国, 赵亮, 刘丹凤. 2018. SVM在高光谱图像处理中的应用综述. 哈尔滨工程大学学报, 39(6): 973-983[DOI:10.11990/jheu.201704074]
Yang G F, Gewali U B, Ientilucci E, Gartley M and Monteiro S T. 2018. Dual-channel densenet for hyperspectral image classification//2018 IEEE International Geoscience and Remote Sensing Symposium. Valencia, Spain: IEEE: 2595-2598[DOI: 10.1109/IGARSS.2018.8517520http://dx.doi.org/10.1109/IGARSS.2018.8517520]
Zhao S Z, Li W, Du Q and Ran Q. 2018. Hyperspectral classification based on Siamese neural network using spectral-spatial feature//2018 IEEE International Geoscience and Remote Sensing Symposium. Valencia, Spain: IEEE: 2567-2570[DOI: 10.1109/IGARSS.2018.8519286http://dx.doi.org/10.1109/IGARSS.2018.8519286]
相关作者
相关机构