空间扫描统计量方法中候选聚集区域生成的快速算法

李小洲, 王劲峰

doi:10.3724/SP.J.1047.2013.00505

地球信息科学学报 >

2013 , Vol. 15 >Issue 4: 505 - 511

DOI: https://doi.org/10.3724/SP.J.1047.2013.00505

本期要文(可全文下载)

空间扫描统计量方法中候选聚集区域生成的快速算法

展开

1. 武汉科技大学医学院公共卫生学院, 武汉 430065;
2. 中国科学院地理科学与资源研究所资源与环境信息系统国家重点实验室, 北京 100101

李小洲(1974- ),男,湖北麻城人,讲师,主要研究方向为空间数据分析及空间流行病学。E-mail:lixiaozhou@wust.edu.cn

收稿日期: 2013-01-14

修回日期: 2013-03-20

网络出版日期: 2013-08-08

基金资助

国家科技重大专项子课题“艾滋病和病毒性肝炎等重大传染病防治/传染病病原谱时空信息分析和预报”(2012ZX10004-201);卫生行业科研专项项目“传染病时空预警模型及关键参数研究”(201202006)。

收起

A Fast Method for Making Candidate Clusters in Spatial Scan Statistic Method

Expand

1. Medical School, Wuhan University of Science and Technology, Wuhan 430065, China;
2. State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, CAS, Beijing 100101, China

Received date: 2013-01-14

Revised date: 2013-03-20

Online published: 2013-08-08

Fold

摘要

空间扫描统计量方法是公共卫生监测领域应用非常广泛的空间聚集探测快速算法。其利用传染病监测数据可探测到病例异常增多的局部区域,对可能的传染病暴发做出早期预警。候选聚集区域的预先生成是该方法的一个关键步骤。将现有的候选聚集区域生成方法应用到包含子区域较多的大区域时,可能导致大量候选聚集区域的遗漏,影响探测结果的准确性;或可能生成大量重复的候选聚集区域,导致随后空间扫描计算时间的延长。本文在原有候选聚集区域生成方法的基础上,提出了一种新的快速算法。它以格网点间隔的优化选择,可减少对可能候选聚集区域的遗漏;同时,基于多重排序算法可在较短的时间之内,删除掉原始候选聚集区域集合中的大量重复。通过山东省西南部608个乡镇点的候选聚集区域生成测试,改进的方法可减少候选聚集区域的遗漏,并在较短的时间内删除掉所有的重复候选聚集。

关键词： 格网; 多重排序; 候选聚集区域; 空间扫描统计量

本文引用格式

李小洲, 王劲峰 . 空间扫描统计量方法中候选聚集区域生成的快速算法[J]. 地球信息科学学报, 2013 , 15(4) : 505 -511 . DOI: 10.3724/SP.J.1047.2013.00505

Abstract

Spatial scan statistic method is a widely adopted spatial cluster detection method in the field of public health surveillance. It can detect a sub-zone where the number of disease cases rises abnormally, based on infectious disease surveillance data, and thus is able to make early warning on possible outbreak of infectious disease. Chinese Center for Disease Control and Prevention (China CDC) launched China Infectious Disease Automated-alert and Response System (CIDARS) in 2004, which handles the infectious disease surveillance data of all of the counties of China to detect possible case clusters. The making of candidate clusters is a key step to this method, which to some extent determines the accuracy and time efficiency of the spatial scan statistic method. There are two deficiencies if the existing candidate clusters making method is applied to a very big research area with a lot of sub-regions. The first is that, the inappropriate separation distance of grid points might miss a lot of possible candidate clusters, which affects the accuracy of detected result. The second is that, the existing method might duplicate a great number of candidate clusters, which could prolong the computing time of subsequent spatial scan operation. In this paper a new efficient method is proposed according to the former existing candidate clusters making method. Based on the correct setting to the separation distance of grid points, the new method could greatly reduce the possibility of missing of some possible candidate clusters. At the same time, applying multiple-sort arithmetic, the proposed new method could find and delete a great number of duplicate clusters in the original-making candidate clusters in a shorter time. Finally, the paper applies and tests the proposed method for the making of candidate clusters in 608 counties in southwest Shandong Province and proves that the method works satisfactorily in both two aims, that is, it reduced the computing time and reduced the missing of candidate clusters.

Key words： candidate cluster; grid; multiple-sort; spatial scan statistic

参考文献

[1] Kulldorff M, Nagarwalla N. Spatial disease clusters: Detection and Inference[J]. Statistics in Medicine, 1995(14):799-810.

[2] Kulldorff M. A spatial scan statistic[J]. Communications in Statistics: Theory and Methods, 1997(26):1481-1496.

[3] Lawson A B, Kleinman K. Spatial and syndromic surveillance for public health[M]. New York: Wiley, 2005,115-131.

[4] Pfeiffer D U, Robinson T P, Stevenson M, et al. Spatial Analysis in Epidemiology[M]. Oxford: Oxford University Press, 2008,51-56.

[5] Jung I, Kulldorff M, Klassen A. A spatial scan statistic for ordinal data[J]. Statistics in Medicine, 2007(26):1594-1607.

[6] Huang L, Tiwari R, Tiwari R, et al. Weighted normal spatial scan statistic for heterogeneous population data[J]. Journal of the American Statistical Association, 2009(32):1034-1042.

[7] Kulldorff M, Mostashari F, Duczmal L, et al. Multivariate spatial scan statistics for disease surveillance[J]. Statistics in Medicine, 2007(26):1824-1833.

[8] Li X Z, Wang J F, Yang W Z, et al. A spatial scan statistic for nonisotropic two-level risk cluster[J], Statistics in Medicine, 2012,31(2):177-187.

[9] Li X Z, Wang J F, Yang W Z, et al. A spatial scan statistic for multiple clusters[J]. Mathematical Biosciences, 2011,233(2):135-142.

[10] Mostashari F, Kulldorff M, Hartman J J, et al. Dead bird clustering: A potential early warning system for West Nile virus activity[J]. Emerging Infectious Diseases, 2003(9):641-646.

[11] Ghebreyesus T A, Byass P, Witten K H, et al. Appropriate tools and methods for tropical microepidemiology: A case-study of malaria clustering in Ethiopia[J]. Ethiopian Journal of Health Development, 2003(17):1-8.

[12] 杨维中,兰亚佳,李中杰,等.国家传染病自动预警系统的设计与应用[J].中华流行病医学杂志,2010,31(11):13-18.

[13] Yang W Z, Li Z J, Lan Y J, et al. A nationwide web-based automated system for outbreak early detection and rapid response in China[J]. Western Pacific Surveillance and Response, 2011,2(1):1-6.

[14] Kulldorff M. Spatial scan statistics: Models, calculations and applications[M].//Balakrishnan N and Glaz J (eds). Recent Advances on Scan Statistics and Applications. Boston, USA: Birkhäuser, 1999,303-324.

[15] 严蔚敏,吴伟民.数据结构[M].北京:清华大学出版社,2007,284-286.

Options

文章导航

模态框（Modal）标题

摘要

本文引用格式

Abstract

参考文献