地球信息科学学报 ›› 2015, Vol. 17 ›› Issue (9): 1014-1021.doi: 10.3724/SP.J.1047.2015.01014

• • 上一篇    下一篇

一个可定位视频对象的地理空间表达框架

韩志刚(), 孔云峰*(), 秦耀辰, 秦奋   

  1. 1. 黄河中下游数字地理技术教育部重点实验室,开封 475004
    2. 河南大学环境与规划学院,开封 475004
  • 收稿日期:2014-12-18 修回日期:2015-02-23 出版日期:2015-09-10 发布日期:2015-09-07
  • 通讯作者: 孔云峰 E-mail:zghan@henu.edu.cn;yfkong@henu.edu.cn
  • 作者简介:

    作者简介:韩志刚(1981-),男,博士,副教授,研究方向为GIS开发与空间分析。E-mail: zghan@henu.edu.cn

  • 基金资助:
    国家自然科学基金项目(41201402);中国博士后科学基金项目(2013M531666);河南大学优秀青年科研人才培育基金项目

A Geospatial Representation Framework for Geo-tagged Video Objects

HAN Zhigang(), KONG Yunfeng*(), QIN Yaochen, QIN Fen   

  1. 1. Key Laboratory of Geospatial Technology for the Middle and Lower Yellow River Regions, Ministry of Education, Kaifeng 475004, China
    2. College of Environmental & Planning, Henan University, Kaifeng 475004, China
  • Received:2014-12-18 Revised:2015-02-23 Online:2015-09-10 Published:2015-09-07
  • Contact: KONG Yunfeng E-mail:zghan@henu.edu.cn;yfkong@henu.edu.cn
  • About author:

    *The author: SHEN Jingwei, E-mail:jingweigis@163.com

摘要:

可定位视频是包含位置信息的视频,它的地理空间表达是集成视频与GIS的关键问题。针对视频添加位置标签空间语义单一的不足,设计了一个可定位视频的地理空间表达框架。该框架包括视频帧与视频片段2个层次,在扩展OGC空间数据库几何要素标准基础上,分别定义了3类共7种对象描述可定位视频的空间信息。(1)视频位置(点),描述视频帧及视频片段拍摄位置及相机姿态;(2)视频轨迹(线),描述视频片段拍摄轨迹;(3)视频平面视域(面)与立体视域(体),分别描述视频帧及视频片段拍摄场景二维与三维空间范围。该框架包含了主要空间对象类型,空间表达范围较为完备,同时支持视频帧、视频片段等不同层次数据,在不改变现有数据结构基础上,实现视频数据与GIS松散耦合与有机集成。本文讨论了视频对象空间数据获取方法,定义了逻辑模型,并以可视化与检索为例进行应用分析。结果表明,该框架扩展了现有空间数据标准,易于实现,在地理可视化、视频检索及分析挖掘方面具有应用价值。

关键词: 可定位视频, 地理表达, GIS, 数据模型, 空间数据库

Abstract:

Geo-tagged video contains location information, and it is critical for true geographic representation. The geospatial representation of geo-tagged video is the key feature for the integration of video and GIS. Regarding to the disadvantage of geo-tagged representation methods for video objects with monotone spatial semantic information, a geographic representation framework for geo-tagged video objects is proposed. On the basis of extending OGC specifications for geographic information, this paper defined the respective objects in 7 types from 3 categories to describe the spatial information on two levels, including the video frame and video clip. The 3 categories include: (1) the video positions (point) to represent the location and attitude as the camera taking shoots; (2) the video trajectories (line) to portray the track of the video clip; and (3) the video field of view in plain view (polygon) or 3D (solid) space to describe the spatial extent of the video scene. The framework consists of the main spatial objects including the point, line, polygon and solid. It is more competent for demonstrating video spatial information. Meanwhile, the framework supports different levels of video data, such as the video frame and video clip. It achieves the loosely-coupled and perfectly-integrated integration of video and GIS, which does not need to alter the data structures. This paper discussed the data acquisition methods for the spatial information of video frames or clips in detail, which take use of the GPS receiver and 3D digital compass. We also developed 9 tables and defined their relations for the logical model to realize the geographic representation of geo-tagged video objects, and we analyzed the data visualization and retrieval methods by taking them as the application cases. The results show that the geographic representation framework for geo-tagged video extends the current spatial database standard. It is easy to implement and applicable in geographic visualization, video retrieval and spatial analysis or data mining.

Key words: geo-tagged video, geographic representation, GIS, data model, spatial database