Skull removal network for brain magnetic resonance images based on deep iterative fusion

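Yao Fazhan, Li Zhi, Wang Lihui, Cheng Xinyu, Zhang Jian (Key Laboratory of Intelligent Medical Image Analysis and Precise Diagnosis, School of Computer Science and Technology, Guizhou University, Guiyang 550025)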

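Abstract
Objective Skull removal is an important step in the processing and analysis of brain magnetic resonance images. Because brain tissue is structurally complex and acquisition equipment introduces noise, existing methods cannot segment the brain region accurately. We therefore propose a deep iterative fusion convolutional neural network model for accurate skull removal. Method The main structure of the proposed DIFNet (deep iteration fusion net) model consists of an encoder and a decoder, and the skip connections between them are built from multiple upsampling iterative fusions. The encoder is composed of residual convolutions, so that shallow semantic information flows more easily into the deep network and vanishing gradients are avoided. The decoder is composed of double-way upsampling modules: deconvolution operations with different receptive fields are applied, and the resulting feature maps are added to form the module output, which effectively restores more fine detail. A Dice loss function with L2 regularization is used to train the model, and an internal data augmentation method is adopted, which effectively improves the robustness and generalization capability of the model. Result To verify the segmentation performance of the model, it is compared with traditional segmentation algorithms and mainstream deep learning segmentation models on two datasets. On the NFBS (neurofeedback skull-stripped) test set, which comes from the same source as the training set, the proposed method obtains the highest average Dice score and sensitivity, 99.12% and 99.22%, respectively. When the model trained on the NFBS dataset is applied directly to the LPBA40 (LONI probabilistic brain atlas 40) dataset, its Dice score reaches 98.16%. Conclusion The proposed DIFNet model removes the skull quickly and accurately. Compared with mainstream skull segmentation models, its accuracy is considerably higher, and it shows good robustness and generalization capability.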
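The Dice loss with L2 regularization mentioned in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `dice_loss_l2`, the smoothing constant `eps`, and the regularization weight `l2_lambda` are assumptions made for illustration, and the abstract does not state the values used in the paper.

```python
import numpy as np

def dice_loss_l2(pred, target, weights, l2_lambda=1e-4, eps=1e-6):
    """Soft Dice loss plus an L2 penalty on the model weights.

    pred      : predicted foreground probabilities in [0, 1]
    target    : binary ground-truth mask with the same shape as pred
    weights   : iterable of weight arrays to regularize
    l2_lambda : regularization strength (assumed value, not from the paper)
    eps       : smoothing term that avoids division by zero (assumed value)
    """
    pred = np.asarray(pred, dtype=np.float64).ravel()
    target = np.asarray(target, dtype=np.float64).ravel()

    # Soft Dice coefficient: 2 * |P ∩ T| / (|P| + |T|)
    intersection = np.sum(pred * target)
    dice = (2.0 * intersection + eps) / (np.sum(pred) + np.sum(target) + eps)

    # L2 penalty: sum of squared weights over all weight arrays
    l2_penalty = sum(np.sum(np.square(w)) for w in weights)

    return (1.0 - dice) + l2_lambda * l2_penalty
```

The loss is close to 0 when the prediction matches the mask and the weights are small, and it approaches 1 (plus the penalty) when prediction and mask do not overlap.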
Keywords
Deep iterative fusion network on skull removal of brain magnetic resonance images

Yao Fazhan, Li Zhi, Wang Lihui, Cheng Xinyu, Zhang Jian (Key Laboratory of Intelligent Medical Image Analysis and Precise Diagnosis of Guizhou Province, School of Computer Science and Technology, Guizhou University, Guiyang 550025, China)

Abstract
Objective Magnetic resonance imaging (MRI) is frequently used in clinical applications and is a common means of detecting lesions, injuries, and soft tissue variations in diseases of the nervous system. Skull removal is an important preprocessing step in brain magnetic resonance (MR) image analysis. Its purpose is to remove nonbrain tissue from brain MR images, thereby facilitating the subsequent extraction and analysis of brain tissue. MR images acquired with clinical scanners are inevitably blurred or noisy because of the complexity of brain tissue structure and the effects of equipment noise and field offset. The anatomical structure of brain tissue also differs between individuals, which makes skull segmentation in brain MR images difficult. Most traditional skull segmentation methods are not fully automatic and often require the operator to mark the center point of the region of interest with a mouse or other tool and to adjust parameters manually. Current automatic skull segmentation methods do not require human-computer interaction but adapt poorly, and satisfactory segmentation results are difficult to achieve across different MR images. In contrast, deep learning-based methods show advanced performance in many segmentation tasks in computer vision. Therefore, we propose a deep iterative fusion convolutional neural network model (DIFNet) to perform skull segmentation. Method The main structure of DIFNet is composed of an encoder and a decoder. The skip connection between the encoder and decoder is realized by multiple upsampling iterative fusion, which means that the input of a decoder layer comes not only from the encoder layer at the same level but also from deeper encoder layers. The encoder consists of several residual convolution blocks, which allow shallow semantic information to flow into the deep layers and help avoid vanishing gradients.
The decoder is composed of double-way upsampling modules. In each module, deconvolution operations with different receptive field sizes produce feature maps that are added to form the module output. Combining information at multiple scales in this way restores image details effectively. An internal data augmentation method is adopted to improve the generalization capability of the model. First, the image is randomly scaled, with the range of the scaling factor determined by the ratio of the original image size to the output patch size. Then, a center point is randomly selected in the scaled image, and the cropping area is determined. Lastly, the cropped image patches are fed into the network for training. A Dice loss function with an L2 regularization term is used to optimize the model parameters and reduce overfitting. We use two datasets to evaluate the accuracy and robustness of the proposed model. Each dataset includes brain segmentation masks provided by a professional doctor, which serve as the gold standard. One dataset is NFBS (neurofeedback skull-stripped), of which part is held out for testing (the ratio of training to test data is 4:1). The other dataset is LPBA40 (LONI probabilistic brain atlas 40), which is used as an independent dataset to test the generalization of the models. The Dice score, sensitivity, and specificity are used for quantitative analysis. Result On the NFBS dataset, the proposed method obtains the highest average Dice score and sensitivity, 99.12% and 99.22%, respectively, compared with U-Net, U-Net with residual blocks (Res-U-Net), and U-Net with double-way upsampling modules (UP-U-Net). The Dice score is increased by 1.88%, 1.81%, and 0.6%, respectively. The sensitivity and specificity are increased by at least 0.5% compared with the U-Net model.
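The three augmentation steps described in the Method part above (random scaling, random center point, cropping) can be sketched as follows. This is only an illustration under stated assumptions: the function name `random_scale_crop`, the 2D slice input, the nearest-neighbor resampling, the upper bound `max_scale`, and the choice of the patch-to-image size ratio as the lower bound of the scaling factor are all hypothetical, since the abstract does not give these details.

```python
import numpy as np

def random_scale_crop(image, mask, patch_size, rng, max_scale=1.5):
    """Randomly rescale a 2D slice and its mask, then crop one patch.

    The lower bound of the scale factor is set so that the scaled image is
    never smaller than the patch (an assumption); max_scale is also assumed.
    """
    h, w = image.shape
    ph, pw = patch_size

    # Step 1: draw a random scale factor and resample image and mask.
    min_scale = max(ph / h, pw / w)
    scale = rng.uniform(min_scale, max(min_scale, max_scale))
    new_h = max(ph, int(round(h * scale)))
    new_w = max(pw, int(round(w * scale)))
    # Nearest-neighbor index maps keep the mask binary.
    rows = np.minimum((np.arange(new_h) / scale).astype(int), h - 1)
    cols = np.minimum((np.arange(new_w) / scale).astype(int), w - 1)
    scaled_image = image[np.ix_(rows, cols)]
    scaled_mask = mask[np.ix_(rows, cols)]

    # Step 2: pick a random center so the patch lies inside the scaled image.
    cy = rng.integers(ph // 2, new_h - (ph - ph // 2) + 1)
    cx = rng.integers(pw // 2, new_w - (pw - pw // 2) + 1)
    top, left = cy - ph // 2, cx - pw // 2

    # Step 3: crop the same region from image and mask.
    return (scaled_image[top:top + ph, left:left + pw],
            scaled_mask[top:top + ph, left:left + pw])
```

A call such as `random_scale_crop(image, mask, (32, 32), np.random.default_rng(0))` returns an image patch and the matching mask patch, both of shape `(32, 32)`. Because each call draws a new scale and center, one training image yields many different patches.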
The segmentation results of the model are similar to the manual segmentation results of experts. The model trained on the NFBS dataset is applied directly to the LPBA40 dataset to verify its segmentation capability. The Dice value obtained in this test reaches 98.16%. By contrast, the Dice values of U-Net, UP-U-Net, and Res-U-Net are 81.69%, 77.34%, and 76.42%, respectively. Compared with these models, the proposed model is more robust. Conclusion Experiments show that internal data augmentation and deep iterative fusion make the proposed model easy to train and yield the best segmentation results among the compared models. The deep iterative feature fusion helps ensure the robustness of the segmentation model.
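The three evaluation measures reported above (Dice score, sensitivity, specificity) follow the standard definitions for binary masks. The sketch below computes them with NumPy. The function name `segmentation_metrics` is hypothetical, and the code assumes that both the brain and nonbrain classes are present in the ground truth, so that no denominator is zero.

```python
import numpy as np

def segmentation_metrics(pred, truth):
    """Return (dice, sensitivity, specificity) for two binary masks."""
    pred = np.asarray(pred, dtype=bool)
    truth = np.asarray(truth, dtype=bool)

    tp = np.sum(pred & truth)      # brain voxels correctly labeled as brain
    tn = np.sum(~pred & ~truth)    # nonbrain voxels correctly labeled as nonbrain
    fp = np.sum(pred & ~truth)     # nonbrain voxels labeled as brain
    fn = np.sum(~pred & truth)     # brain voxels labeled as nonbrain

    dice = 2.0 * tp / (2.0 * tp + fp + fn)
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return float(dice), float(sensitivity), float(specificity)
```

Sensitivity measures how much of the true brain region is recovered, and specificity measures how much nonbrain tissue is correctly excluded, which is why both are reported alongside the Dice score.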
Keywords
