地球信息科学学报 ›› 2016, Vol. 18 ›› Issue (12): 1588-1596.doi: 10.3724/SP.J.1047.2016.01588

• 地球信息科学理论与方法 • 上一篇    下一篇

分布式环境下矢栅一体化数据组织模型设计

徐道柱1,2,3(), 罗斌4,*(), 周岩2,3, 金澄1,2,3   

  1. 1. 信息工程大学地理空间信息学院,郑州 450052
    2. 地理信息工程国家重点实验室,西安 710054
    3. 西安测绘研究所,西安 710054
    4. 中国科学院地理科学与资源研究所,北京 100101
  • 收稿日期:2016-04-28 修回日期:2016-10-18 出版日期:2016-12-27 发布日期:2016-12-20
  • 通讯作者: 罗斌 E-mail:confutao@163.com;luob@igsnrr.ac.cn
  • 作者简介:

    作者简介:徐道柱(1982-),男,江苏洪泽人,博士,助理研究员,研究方向为地理信息智能获取与应用。E-mail:confutao@163.com

  • 基金资助:
    国家自然科学基金项目(41201469)

Model Design of Integrated Vector and Raster Data Organization Under the Distributed Environment

XU Daozhu1,2,3(), LUO Bin4,*(), ZHOU Yan2,3, JIN Cheng1,2,3   

  1. 1. Institute of Surveying and Mapping, Information Engineering University, Zhengzhou 450052, China
    2. State Key Laboratory of Geo-information Engineering , Xi′an 710054, China;
    3. Xi′an Institute of Surveying and Mapping, Xi′an 710054, China;
    4. Institute of Geographic Sciences and Natural Resources Research, Beijing 100101, China
  • Received:2016-04-28 Revised:2016-10-18 Online:2016-12-27 Published:2016-12-20
  • Contact: LUO Bin E-mail:confutao@163.com;luob@igsnrr.ac.cn

摘要:

随着地理信息技术与计算机网络技术的结合和发展,基于全球框架的地理信息服务对海量数据管理的要求越来越高,传统的单中心的关系数据库的管理模式已经不能满足要求。分布式文件系统、半结构化数据库和关系数据库技术优势互补,为海量数据高效管理提供了新的技术思路。本文提出了分布式环境下空间数据一体化存储管理架构,设计了矢量和栅格数据的数据逻辑组织和物理存储模型,通过统一的分层+分块的数据划分规则,实现了矢、栅数据分布式环境下的一体化管理。在该模型中,利用关系数据库和半结构化数据库的特点,对空间索引和实体数据分开管理,有效地提高了数据处理和访问效率。实验表明,该模型具有更高的数据管理能力,可为分布式环境下数据服务中心构建提供一个有效的解决方案。

关键词: 分布式, 关系数据库, 图层对象模型, 矢量数据模型, 栅格数据模型

Abstract:

With the combination and development of geographic information technology and computer network technology, geographic information services based on global framework demand for more efficient massive data management, the traditional single center relational database management mode is unable to meet the requirements. Since the distributed file systems, semi-structured databases and relational database technology have complementary advantages to each other, a new technical method for efficient management of massive data is developed. In order to achieve high effective geospatial data management, this paper presented an integrated architecture oriented to the storage and management of geospatial data in distributed environment, designed user - oriented massive geospatial data integration model and distributed storage organization model . In this model, a technical route combining the NoSQL database and relational database is adopted, and a layered + partitioned data model and multi-level index mechanism for the rapid access of massive data is designed, so it can realizes the integrated management of vector and raster data in distributed environment. Because the model has taken advantages of relational database and semi-structured database, structured geographic information, spatial index and entity data can be managed separately and the efficiency of data processing and access is improved effectively. Vector data and raster data is the largest and most widely used geospatial data, In this paper, an experiment system is set up in the experiment environment, which realizes vector data and raster data management model. TB-level data are used to conduct experiments of data loading, index (pyramid) creation and concurrent data access efficiency, compared with the traditional data model, the model in the data management capacity, processing speed and access efficiency have greatly improved. The results show that the model can support the parallel operation in distributed environment, with a higher data management capability will offer an effective solution to massive data management in distributed environment.

Key words: distributed, relational database, layer object model, vector data model, raster data model, raster data model