ARTICLES

Design and Implement of Parallel Spatial Analysis System Based on MySQL & MPI

Expand
  • 1. State Key Laboratory of Resources and Environmental Information System (LREIS), Institute of Geographic Sciences and Natural Resources Research, CAS, Beijing 100101, China;
    2. University of Chinese Academy of Sciences, Beijing 100049, China

Received date: 2012-03-17

  Revised date: 2012-07-26

  Online published: 2012-08-22

Supported by

null

Abstract

With the rapid development of space survey technology, GIS is facing a challenge of fast growing size on spatial data and complexity of spatial analysis algorithm. Traditional serial spatial analysis method isn't able to deal with this condition well. High performance computer and new computing methods provide an innovative way for spatial data processing and analysing problem. Remote sensing data processing is data-intensive and an ideal domain to use parallel computing, but vector data operation is computing-intensive which needs more computing ability. In this paper, a distributed spatial analysis framework based on MySQL spatial and MPI is described. Parallel spatial vector data mean is explored in kind of cluster way. This framework uses MySQL spatial cluster to store and manage GIS data which can resolve the problem about fault-tolerant and concurrent access for the same data block. MPI is good at passing messages in distributed network nodes, so it's not necessary to control telecom between nodes manually. Task management and distribution use prior queue to achieve load balance and fault-tolerant through monitoring the status of cluster. Finally, a parallel polygon overlay operation is experimented on this distributed system to test the performance of the cluster. The strategy of parallel Overlay operation is in a pipeline way, each node gets a part set of the polygons in the overlaid layers. And this method got relative better speedup than the serial overlay operation.

Cite this article

ZHOU Yuke, MA Ting, ZHOU Chenghu, GAO Xizhang, Fan Junfu . Design and Implement of Parallel Spatial Analysis System Based on MySQL & MPI[J]. Journal of Geo-information Science, 2012 , 14(4) : 448 -453 . DOI: 10.3724/SP.J.1047.2012.00448

References

[1] 赵春宇.高性能并行GIS中矢量空间数据存取与处理关键技术研究.武汉:武汉大学,2006.

[2] Mineter M J, Dowers S and Gittings B M. Towards a HPC framework for integrated processing of geographical data: Encapsulating the complexity of parallel algorithms[J]. Transactions in GIS, 2000(4): 245-261.

[3] 薛勇,万伟,艾建文. 高性能地学计算进展[J]. 世界科技研究与发展,2008(3):314-319.

[4] 王结臣,王豹,胡玮,等. 并行空间分析算法研究进展及评述[J]. 地理与地理信息科学,2011(6):1-5.

[5] 罗英伟,汪小林.空间信息合作与并行处理[J].计算机辅助设计与图形学学报,2003,15(10):1307-1314.

[6] 方裕,邬伦,谢昆青,等.分布式协同计算的GIS技术研究[J].地理与地理信息科学,2006,22(3):9-12,54.

[7] MySQL Replication. http://dev.MySQL.com/doc/refman/5.5/en/replication.html

[8] 朱江,张立立.海量影像数据的发布集群系统与应用[J].地球信息科学,2006,8(2):101-105.

[9] Torque. http://www.clusterresources.com/torquedocs21.

[10] 吴亮,谢忠,陈占龙,等.分布式空间分析运算关键技术[J].地球科学(中国地质大学学报), 2010(3).

[11] 陈国良,孙广中,徐云,等. 并行算法研究方法学[J].计算机学报, 2008,12(9):1493-1502.

[12] Bentley J L, Ottmann T A. Algorithms for reporting and counting geometric intersections[J]. IEEE Trans. Comput., 1979, C-28:643-647.

[13] 王璟,张云泉,李玉成.基于MPI和MySQL的并行数据库系统搭建[J].计算机科学,2003,31(10):418-421.

Outlines

/