基于Faster R-CNN深度网络的遥感影像目标识别 方法研究
作者简介:王金传(1993-),男,硕士,主要从事遥感技术应用、深度学习和WebGIS应用研究。E-mail: wangjnchuan@163.com
收稿日期: 2018-05-12
要求修回日期: 2018-07-25
网络出版日期: 2018-10-17
基金资助
国家重点研发计划项目(2017YFB0504202)
国家自然科学基金项目(41871312)
湖北省自然科学基金项目(2017CFB433)
智能地学信息处理湖北省重点实验室(中国地质大学(武汉))开放基金(KLIGIP-2017A09)
上海航天科技创新基金项目(SAST2016006)
空间数据挖掘与信息共享教育部重点实验室(福州大学)开放基金(2016LSDMIS06)
城市空间信息工程北京市重点实验室经费资助项目(2017209)
Faster R-CNN Deep Learning Network Based Object Recognition of Remote Sensing Image
Received date: 2018-05-12
Request revised date: 2018-07-25
Online published: 2018-10-17
Supported by
National Key Research and Development Program of China under Grant, No.2017YFB0504202
National Natural Science Foundation of China, No.41871312
Hubei Natural Science Foundation, No.2017CFB433
Hubei Key Laboratory of Intelligent Geo-Information Processing (China University of Geosciences (Wuhan)), No. KLIGIP-2017A09
The Key Laboratory of Spatial Data Mining & Information Sharing of the Ministry of Education, Fuzhou University, No.2016LSDMIS06
Shanghai Aerospace Science and Technology Innovation Fund, No.SAST2016006
Beijing Key Laboratory of Urban Spatial Information Engineering, No.2017209.
Copyright
遥感影像目标识别在众多领域中具有极高的理论意义与应用价值,更快速、更精确的目标识别方法研究是目前遥感及图像研究领域的热点与难点。本文将深度学习的方法应用于遥感影像目标识别中,提出基于Faster R-CNN深度学习网络的目标快速精确识别方法。该方法采用了包括基于RPN的建议区域提取方法和VGG16训练卷积网络模型,构建了面向遥感影像目标识别的深度卷积神经网络。为验证该方法的精度及性能,在Caffe深度学习框架上,选取高分辨率遥感影像中飞机、油罐、操场及立交桥目标进行验证实验。结果表明,基于Faster R-CNN的深度学习方法能够实现对遥感影像目标的快速、准确识别,同时具有较好的推广性。通过本文的研究,证明基于Faster R-CNN深度学习的高分遥感影像目标识别方法具有显著优势和潜力,对基于其他深度学习方法的目标识别研究也有一定的参考意义。
关键词: 深度学习; Faster R-CNN网络; 高分辨率; 遥感影像; 目标识别
王金传 , 谭喜成 , 王召海 , 钟燕飞 , 董华萍 , 周松涛 , 成布怡 . 基于Faster R-CNN深度网络的遥感影像目标识别 方法研究[J]. 地球信息科学学报, 2018 , 20(10) : 1500 -1508 . DOI: 10.12082/dqxxkx.2018.180237.
Object recognition of remote sensing image is of great theoretical significance and application value in many fields. Faster and more accurate object identification methods are hot and difficult points in the field of remote sensing and image. In this paper, the method of deep learning is applied to remote sensing image object recognition, and a fast and accurate method of object recognition based on Faster R-CNN deep learning network is proposed. This method uses the proposal region extraction method based on RPN and the VGG16 training convolution network model, and constructs a deep convolutional neural network for the object recognition of remote sensing image. In order to verify the accuracy and performance of the method, the GPU accelerated computing model was used in the Caffe deep learning framework. Firstly, the aircraft target recognition experiment in remote sensing image was designed. The aircraft target recognition accuracy reached 96.67%. Then, after the experiment was successful, we continued to identify other target features, and selected the high-resolution remote sensing image of the oil tank, playground and overpass object for verification experiments. In the same experiment environment, the same good experimental verification results were obtained, the target recognition rate was at a high level, and the cost time of recognition in each picture was less than 0.2 seconds, which fully verified the validity and reliability of the model studied in this paper. After analysis and comparison, the conclusion is that the deep learning method based on Faster R-CNN can realize the fast and accurate recognition of the selected targets, which proves that the method has a good promotion significance in high-resolution remote sensing image target recognition applications. Therefore, the model has great application value, and it also has certain reference significance for target recognition research based on other deep learning methods.
Fig. 1 Faster R-CNN network structure图1 Faster R-CNN网络模型结构图 |
Fig. 2 Proposal regions generation process图2 建议区域生成流程 |
Fig. 3 The VGG16 convolutional neural network图3 VGG16卷积神经网络 |
Fig. 4 Data expansion and data sets图4 数据扩充及数据集 |
Fig. 5 The result of airplane recognition图5 飞机目标识别结果 |
Tab. 1 The test performance comparison in R-CNN, Fast R-CNN and Faster R-CNN表1 R-CNN、Fast R-CNN和Faster R-CNN测试性能对比 |
深度学习方法 | 识别精度/% | 每张图片识别耗时/s |
---|---|---|
R-CNN | 77.10 | 13.40 |
Fast R-CNN | 77.50 | 4.60 |
Faster R-CNN | 96.67 | 0.14 |
Fig. 6 The oil tank, playground and overpass target recognition results图6 油罐、学校操场和立交桥目标识别结果 |
Tab.2 Comparison of recognition accuracy and recognition efficiency between our model and reference表2 本文模型与文献模型识别精度与识别效率对比 |
目标类别 | Faster R-CNN + VGG16 | AlexNet-DR & GoogleNet-DR | |||
---|---|---|---|---|---|
识别精度/% | 识别效率/s | 识别精度/% | 识别效率/s | ||
飞机 | 96.67 | 0.144 | 94.99 | 37.292 | |
油罐 | 97.46 | 0.184 | 94.47 | 38.283 | |
操场 | 97.41 | 0.143 | 97.18 | 11.928 | |
立交桥 | 81.08 | 0.186 | 88.30 | 5.360 |
The authors have declared that no competing interests exist.
[1] |
[
|
[2] |
[
|
[3] |
|
[4] |
|
[5] |
|
[6] |
|
[7] |
[
|
[8] |
[
|
[9] |
[
|
[10] |
[
|
[11] |
[
|
[12] |
[
|
[13] |
[
|
[14] |
[
|
[15] |
|
[16] |
|
[17] |
[
|
[18] |
|
[19] |
[
|
[20] |
|
[21] |
[
|
[22] |
|
[23] |
[
|
[24] |
|
[25] |
[
|
[26] |
|
[27] |
|
/
〈 |
|
〉 |