Current Issue Cover
CACIS2019会议 AT-PGGAN: 基于GAN的车辆型号精细识别方法


摘 要
目的 车型识别在智能交通、智慧安防、自动驾驶等领域具有十分重要的应用前景。而车型识别中,带标签车型数据的数量是影响车型识别的重要因素。本文以“增强数据”为核心,结合PGGAN和Attention机制,提出一种基于对抗网络生成数据再分类的网络模型AT-PGGAN(Attention-progressive growing of GANs),利用模型生成带标签车型图像的数量从而提高车型识别准确率。方法 该模型由生成网络和分类网络组成,利用生成网络对训练数据进行增强扩充,利用注意力机制和标签重嵌入方法对生成网络进行优化使其生成图像细节更加完善,提出标签重标定的方法重新确定生成图像的标签数据并对生成图像做相应的筛选。使用扩充的图像加上原有数据集的图像作为输入训练分类网络。结果 基于本文提出的模型,能够很好的扩充已有的车辆图像,在公开数据集StanfordCars上,相比未使用AT-PGGAN模型进行数据扩充的分类网络均有1%以上的提升,在CompCars上与其他网络进行对比,本方法在同等条件下最高准确率达到96.6%,高于已有方法。实验结果表明该方法能有效提高车辆精细识别的准确率。结论 将生成对抗网络用于对数据的扩充增强具有很大的应用前景,其得到的生成图像能够很好的模拟原图像数据,对原图像数据具有正则的作用。其得到图像数据可以使图像的细粒度识别准确率得到一定的提升。
AT-PGGAN:A fine-grained recognition method of vehicle models based on generative adversarial network

Yang Changdong,Yu Ye,Xu Longdao,Fu Yuanzi,Lu Qiang(School of computer and information, hefei university of technology)

Objective Comprehensive perception of traffic management through computer vision technology is particularly important in Intelligent Transportation Systems. Vehicle recognition is an important part of intelligent transportation systems, and fine-grained car model recognition in vehicle recognition is currently the most challenging subject. However, the traditional method has a high demand for a priori information on the data, while the deep learning method requires large-scale data, and the fitting effect is poor when the amount of data is small. Labelling a large number of vehicle images manually is a very time-consuming task. For vehicle recognition, there are also some deviations in labelling manually due to the strong similarity between different categories. In order to get more abundant data, extract more abundant vehicle image features, the attention-progressive growing of generative adversarial network is proposed. Method The AT-PGGAN model consists of a generation network and a classification network. The generation network is used to augment the training data. For fine-grained vehicle recognition, most of the current work is based on high-resolution images. Since the existing generation network is not ideal for generating high-resolution images, the generated high-resolution images cannot be directly used for fine-grained recognition. In this paper, the attention mechanism and the label re-embedding method are used to optimize the generation network to make the high-resolution image details more perfect, which is more conducive to discriminating the true features of the network extracted from the image. On this basis, this paper proposes a method of label recalibration, which is to re-calibrate the label data of the generated images and filter the generated images accordingly, and remove the generated images that do not meet the requirements, thus solve the problem of poor image quality from another aspect. The re-labeled generated images and the original images are collectively used as input data of the classification network. Since there is no direct connection between the generation network and the classification network, the classification part can use multiple classification networks, thereby improving the universality of the model. Result Based on the model proposed in this paper, the existing vehicle model images can be well augmented and used for fine-grained vehicle model recognition. In the public dataset StanfordCars, there is a 1% improvement over the classification network that does not use the AT-PGGAN model for data augment. Compared with other networks on CompCars, The top1 and top5 accuracy of this method are both higher than the existing method under the same condition. Comparing several different semi-supervised image label calibration methods, the method proposed in this paper shows the best results in different sample sizes. Different numbers of generated images also have a certain influence on the recognition accuracy. When the number of generated images reaches that of original samples, the recognition accuracy is the highest. However, when the generated images continue to increase, the recognition accuracy will decrease. In the comparative experiment, the progressive growth strategy has a basic improvement on the generation algorithm, and since a large number of images that do not meet the standard are screened out in the process of label recalibration, the influence on the feature extraction is removed, and the experimental results prove that the labels have a big impact on the results. Re-labelling is the biggest improvement to the algorithm. Conclusion The generative adversarial network is used for data augment and enhancement, and the generated images can simulate the original image data well. The images generated in the classification task have a regular effect on the original image data, the generated image data can improve the fine-grained recognition accuracy of the image. How to generate clear high-resolution images is the key to the problem. Different label calibration methods have a great influence on the results,effective calibration of image label generation is another way to solve the problem effectively.