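Image Retrieval Method Based on Visible Range
About the author: Wu Yong (1981-), male, from Fuzhou, Fujian, Ph.D.; his main research interests are video GIS and mobile GIS. E-mail: wuyong3216@163.com
Received: 2016-05-12
Revision requested: 2016-07-01
Published online: 2016-08-10
Funding
National Natural Science Foundation of China (41201417)
Young Talent Training Fund of the State Key Laboratory of Resources and Environmental Information System (O8R8B6E0YA)
Natural Science Foundation of Fujian Province (2016J01186)
Bureau-University Cooperation Science and Technology Project of the Fujian Provincial Bureau of Surveying, Mapping and Geoinformation (2016JX02)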
A New Image Retrieval Method Based on Image Viewshed
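Images are a natural reproduction of the visible environment. They are intuitive and rich in content, and have become an important visual data source for spatial data acquisition and application in GIS. With the rapid development of hardware terminals, integrated sensors are increasingly diverse, and multi-source sensor information fusion has become the trend in mobile GIS data acquisition. To address the azimuth deviation of mobile terminals, this paper describes in detail an algorithm that derives the lens shooting direction from the terminal's gravity sensor and magnetic sensor, and uses the EXIF image format to collect image metadata, positioning information, and orientation information together into a locatable image. Existing locatable-image models represent an area by a point. To overcome this shortcoming, this paper takes the lens angle of view (AOV) model as its basis and proposes a method for constructing the image viewshed from image metadata, position information, and orientation information. A multi-level grid is used to build a spatial index on the polygonal viewshed features, enabling viewshed-based image retrieval. On this basis, 8022 locatable images collected during field verification of remote sensing in Fujian Province were used to build retrieval image sets in two ways: point modeling and viewshed modeling. A comparison of retrieval speed and retrieval results shows that viewshed-based retrieval returns not only images whose shooting position lies inside the retrieval area, but also images shot from outside the area whose actual scene falls within the retrieval range. In addition, the multi-level spatial grid index improves retrieval speed by a factor of 6.22 to 8.64.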
Wu Yong, Yao Ling, Tong Weimin. Image retrieval method based on visible range[J]. Journal of Geo-information Science, 2016, 18(8): 1030-1036. DOI: 10.3724/SP.J.1047.2016.01030
Images are a natural representation of the real world. They are intuitive and rich in content, and have become an important visual data source in GIS data acquisition and application. With the rapid development of hardware terminals, the sensors integrated in mobile terminals have become more abundant, and multi-sensor data fusion has become a trend in mobile GIS data collection. To address the azimuth deviation of mobile terminals, this paper focuses on an algorithm that acquires the camera shooting direction from the gravity sensor and the magnetic sensor embedded in the terminal. Based on the EXIF format, locatable images are generated by collecting image metadata, location, and direction together. Existing locatable-image models represent an area by a point. To overcome this drawback, this paper proposes a method that builds the image viewshed from image metadata, location, and direction on the basis of the lens angle of view (AOV) model. A multi-level grid is used to build the spatial index for the polygon features of the image viewsheds. A case study was conducted in Fujian Province, in which 8022 locatable images were collected during field verification of remote sensing. Point-based modeling and polygon-based (viewshed) modeling were used to build the image sets for retrieval, and retrieval speed and retrieval results were compared. The comparison shows that viewshed-based retrieval returns not only images whose shooting position lies inside the retrieval area, but also images shot from outside the area whose actual scene falls within it. Meanwhile, the multi-level grid spatial index increases retrieval speed by a factor of 6.22 to 8.64.
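The orientation algorithm itself is not reproduced in this section. As a rough illustration of how a shooting azimuth can be derived from one gravity reading and one magnetic reading, the Python sketch below uses the common cross-product construction of local east and north axes. The function name `camera_azimuth`, the device axis convention (X to the right, Y toward the top of the screen, Z out of the screen, rear camera looking along -Z), and the sample readings are assumptions made for illustration and are not taken from the paper. The result is relative to magnetic north, with no declination correction.

```python
import math


def _cross(a, b):
    """Cross product of two 3-vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def _unit(v):
    """Scale a 3-vector to unit length."""
    norm = math.sqrt(sum(c * c for c in v))
    if norm == 0.0:
        raise ValueError("zero-length vector")
    return tuple(c / norm for c in v)


def camera_azimuth(gravity, magnetic):
    """Azimuth of the rear camera in degrees, clockwise from magnetic north.

    gravity:  accelerometer reading at rest in device axes (points away
              from the ground).
    magnetic: magnetometer reading in device axes.
    """
    up = _unit(gravity)
    east = _unit(_cross(magnetic, up))   # horizontal, perpendicular to the field
    north = _cross(up, east)             # horizontal component of the field
    cam = (0.0, 0.0, -1.0)               # assumption: rear camera looks along -Z
    e = sum(a * b for a, b in zip(east, cam))
    n = sum(a * b for a, b in zip(north, cam))
    if math.hypot(e, n) < 1e-6:
        raise ValueError("camera points straight up or down; azimuth undefined")
    return math.degrees(math.atan2(e, n)) % 360.0


# Hypothetical readings: phone held upright in portrait, camera facing east.
print(camera_azimuth((0.0, 9.8, 0.0), (-20.0, -40.0, 0.0)))
```

Because the camera axis is projected onto the horizontal plane defined by the gravity vector, tilting the terminal up or down does not change the computed azimuth in this sketch.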
Fig. 1 Flowchart of the locatable image acquisition method
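Tab. 1 Parameters of the locatable image storage model
EXIF IFD | Stored parameter | Description and storage location |
---|---|---|
EXIF IFD | Image height | Number of pixels along the vertical axis of the image; stored in the ImageLength tag |
EXIF IFD | Image width | Number of pixels along the horizontal axis of the image; stored in the ImageWidth tag |
EXIF IFD | Capture time | Time at which the image was taken; stored in the DateTime tag |
EXIF IFD | Focal length | Focal length of the lens when the image was taken; stored in the FocalLength tag |
EXIF IFD | Horizontal angle of view | Horizontal angle of view of the lens at the current focal length, less than 180°; stored in the UserComment tag as <水平视角>value</水平视角> (水平视角: horizontal angle of view) |
EXIF IFD | Vertical angle of view | Vertical angle of view of the lens at the current focal length, less than 180°; stored in the UserComment tag as <垂直视角>value</垂直视角> (垂直视角: vertical angle of view) |
EXIF IFD | Zoom factor | Digital zoom factor of the lens when the image was taken; defaults to 0 when no zoom is applied and is a floating-point number greater than 1 when zooming; stored in the DigitalZoom tag |
GPS IFD | Longitude | Longitude of the shooting position; stored in the GPSLongitude tag |
GPS IFD | Longitude reference | "E" for east longitude, "W" for west longitude; stored in the GPSLongitudeRef tag |
GPS IFD | Latitude | Latitude of the shooting position; stored in the GPSLatitude tag |
GPS IFD | Latitude reference | "N" for north latitude, "S" for south latitude; stored in the GPSLatitudeRef tag |
GPS IFD | Shooting azimuth | Direction of the lens when the image was taken, in the range 0~360°; stored in the GPSImgDirection tag |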
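To make the storage model in Tab. 1 concrete, the following Python sketch shows one way the stored parameters might be turned back into numbers once the tag values have been read from a file. It assumes the tags have already been extracted into a plain dictionary by some EXIF reader, with GPS coordinates given as (degrees, minutes, seconds) as in the EXIF GPS IFD. The function names, the dictionary layout, and the sample values are hypothetical and are not taken from the paper.

```python
import re


def parse_aov(user_comment, tag):
    """Return the number wrapped as <tag>value</tag> in UserComment, or None."""
    pattern = r"<{0}>\s*([0-9]+(?:\.[0-9]+)?)\s*</{0}>".format(re.escape(tag))
    match = re.search(pattern, user_comment)
    return float(match.group(1)) if match else None


def dms_to_decimal(dms, ref):
    """Convert (degrees, minutes, seconds) plus N/S/E/W into signed degrees."""
    degrees, minutes, seconds = dms
    value = degrees + minutes / 60.0 + seconds / 3600.0
    return -value if ref in ("S", "W") else value


def read_locatable(record):
    """Collect position, azimuth and angles of view from a tag dictionary."""
    comment = record.get("UserComment", "")
    return {
        "lon": dms_to_decimal(record["GPSLongitude"], record["GPSLongitudeRef"]),
        "lat": dms_to_decimal(record["GPSLatitude"], record["GPSLatitudeRef"]),
        "azimuth": float(record["GPSImgDirection"]) % 360.0,
        "h_aov": parse_aov(comment, "水平视角"),   # horizontal angle of view
        "v_aov": parse_aov(comment, "垂直视角"),   # vertical angle of view
    }


# Hypothetical tag values for illustration only.
sample = {
    "GPSLongitude": (119, 18, 0.0), "GPSLongitudeRef": "E",
    "GPSLatitude": (26, 4, 30.0), "GPSLatitudeRef": "N",
    "GPSImgDirection": 87.5,
    "UserComment": "<水平视角>60.5</水平视角><垂直视角>45.2</垂直视角>",
}
print(read_locatable(sample))
```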
Fig. 2 Schematic diagram of the relationship between gravity and terminal posture
Fig. 3 Sketch map of inclination compensation and angle conversion
Fig. 4 Sketch map of the lens parameters under the pinhole imaging model
Fig. 5 Calculation results of the image viewshed
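The viewshed in Fig. 5 is built from the shooting position, the shooting azimuth, and the horizontal angle of view. A minimal sketch of one possible construction is given below. It approximates the viewshed as a circular sector centred on the shooting azimuth. The visible distance `max_dist_m`, the spherical Earth radius, the locally flat conversion from metres to degrees, and the function names are all assumptions for illustration. The construction actually used in the paper is not reproduced in this section.

```python
import math

EARTH_RADIUS_M = 6371000.0  # assumption: spherical Earth, mean radius


def viewshed_sector(lon, lat, azimuth_deg, h_aov_deg, max_dist_m, arc_steps=8):
    """Return a closed ring [(lon, lat), ...] approximating a sector viewshed.

    The ring starts at the shooting position, follows the far arc from the
    left edge of the field of view to the right edge, and closes at the start.
    """
    if not 0.0 < h_aov_deg < 180.0:
        raise ValueError("horizontal angle of view must be in (0, 180)")
    ring = [(lon, lat)]
    start = azimuth_deg - h_aov_deg / 2.0
    for i in range(arc_steps + 1):
        bearing = math.radians(start + h_aov_deg * i / arc_steps)
        east_m = max_dist_m * math.sin(bearing)
        north_m = max_dist_m * math.cos(bearing)
        # Locally flat approximation: adequate for short visible distances.
        dlat = math.degrees(north_m / EARTH_RADIUS_M)
        dlon = math.degrees(east_m / (EARTH_RADIUS_M * math.cos(math.radians(lat))))
        ring.append((lon + dlon, lat + dlat))
    ring.append((lon, lat))
    return ring


def bounding_box(ring):
    """Axis-aligned bounding box (min_lon, min_lat, max_lon, max_lat)."""
    lons = [p[0] for p in ring]
    lats = [p[1] for p in ring]
    return (min(lons), min(lats), max(lons), max(lats))


# Hypothetical image: camera facing north with a 60 degree field of view.
ring = viewshed_sector(119.3, 26.075, 0.0, 60.0, 1000.0)
print(bounding_box(ring))
```

Under the point model the image is represented only by the first vertex of this ring. Under the viewshed model the whole ring takes part in the spatial query, which is why an image shot from outside a query region can still be returned.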
Fig. 6 Sketch map of space division and object location
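The space division and object location shown in Fig. 6 can be illustrated with a small multi-level grid. The sketch below is a generic scheme rather than the paper's exact index. Each object is assigned to the finest level whose cell size is at least as large as its bounding box and is registered in every cell it overlaps. A query visits the overlapped cells on each level and keeps the objects whose bounding boxes intersect the query box. The class name, the cell sizes in degrees, and the sample boxes are assumptions for illustration.

```python
import math
from collections import defaultdict


class MultiLevelGrid:
    """Multi-level grid over (min_x, min_y, max_x, max_y) bounding boxes."""

    def __init__(self, cell_sizes=(0.01, 0.1, 1.0)):
        self.cell_sizes = sorted(cell_sizes)   # finest level first
        self.cells = defaultdict(set)          # (level, ix, iy) -> object ids
        self.boxes = {}                        # object id -> bounding box

    def _cell_ranges(self, level, box):
        size = self.cell_sizes[level]
        xs = range(math.floor(box[0] / size), math.floor(box[2] / size) + 1)
        ys = range(math.floor(box[1] / size), math.floor(box[3] / size) + 1)
        return xs, ys

    def _level_for(self, box):
        extent = max(box[2] - box[0], box[3] - box[1])
        for level, size in enumerate(self.cell_sizes):
            if extent <= size:
                return level
        return len(self.cell_sizes) - 1        # very large objects: coarsest level

    def insert(self, obj_id, box):
        level = self._level_for(box)
        xs, ys = self._cell_ranges(level, box)
        for ix in xs:
            for iy in ys:
                self.cells[(level, ix, iy)].add(obj_id)
        self.boxes[obj_id] = box

    def query(self, box):
        """Ids whose bounding box intersects the query box (candidate set)."""
        hits = set()
        for level in range(len(self.cell_sizes)):
            xs, ys = self._cell_ranges(level, box)
            for ix in xs:
                for iy in ys:
                    for obj_id in self.cells.get((level, ix, iy), ()):
                        b = self.boxes[obj_id]
                        if (b[0] <= box[2] and b[2] >= box[0]
                                and b[1] <= box[3] and b[3] >= box[1]):
                            hits.add(obj_id)
        return hits


# Hypothetical viewshed bounding boxes in degrees.
grid = MultiLevelGrid()
grid.insert("img_a", (119.300, 26.070, 119.304, 26.074))
grid.insert("img_b", (118.00, 25.00, 118.05, 25.03))
print(grid.query((119.302, 26.072, 119.310, 26.080)))
```

The query returns candidates only. An exact intersection test between the viewshed polygon and the query region would still be needed afterwards. A point-model image can be indexed in the same structure as a zero-size box.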
Tab. 2 Test results of locatable image retrieval
Group | Dataset model | Retrieval area (km²) | Retrieval time without index (ms) | Retrieval time with multi-level grid index (ms) | Time ratio (no index / index) | Images retrieved |
---|---|---|---|---|---|---|
1 | Point model | 100.1 | 136.7 | 15.8 | 8.65 | 1 |
1 | Polygon model | 100.1 | 148.2 | 22.5 | 6.58 | 1 |
2 | Point model | 119.1 | 137.6 | 16.0 | 8.60 | 3 |
2 | Polygon model | 119.1 | 152.8 | 22.2 | 6.88 | 4 |
3 | Point model | 121.6 | 133.5 | 18.0 | 7.41 | 5 |
3 | Polygon model | 121.6 | 155.4 | 25.0 | 6.22 | 5 |
4 | Point model | 133.3 | 135.9 | 16.4 | 8.29 | 4 |
4 | Polygon model | 133.3 | 154.7 | 22.2 | 6.97 | 6 |
5 | Point model | 165.4 | 134.5 | 18.8 | 7.15 | 13 |
5 | Polygon model | 165.4 | 162 | 23.4 | 6.92 | 15 |
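The time-ratio column of Tab. 2 is the retrieval time without an index divided by the retrieval time with the multi-level grid index. The short Python sketch below recomputes it from the two timing columns as a consistency check. The variable and function names are illustrative, and the numbers are copied from the table.

```python
# (group, model, time without index in ms, time with index in ms, tabulated ratio)
ROWS = [
    (1, "point", 136.7, 15.8, 8.65),
    (1, "polygon", 148.2, 22.5, 6.58),
    (2, "point", 137.6, 16.0, 8.60),
    (2, "polygon", 152.8, 22.2, 6.88),
    (3, "point", 133.5, 18.0, 7.41),
    (3, "polygon", 155.4, 25.0, 6.22),
    (4, "point", 135.9, 16.4, 8.29),
    (4, "polygon", 154.7, 22.2, 6.97),
    (5, "point", 134.5, 18.8, 7.15),
    (5, "polygon", 162.0, 23.4, 6.92),
]


def time_ratio(no_index_ms, indexed_ms):
    """Ratio of retrieval time without index to retrieval time with index."""
    return no_index_ms / indexed_ms


ratios = [time_ratio(a, b) for _, _, a, b, _ in ROWS]
for (group, model, _, _, tabulated), ratio in zip(ROWS, ratios):
    print(f"group {group} {model:8s} recomputed {ratio:.2f} tabulated {tabulated:.2f}")
```

The recomputed ratios agree with the tabulated ones to within 0.01. The polygon model is slower than the point model in every group, with or without the index, while returning the same number of images or more.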
Fig. 7 Retrieval results of the locatable images
The authors have declared that no competing interests exist.