Journal of Geo-information Science >
Study and Application of the Method of Multi-scale Outliers Detection of Natural Disaster Investigation Data
Received date: 2017-07-10
Request revised date: 2017-09-06
Online published: 2017-12-25
Copyright
"Natural disaster" is the phenomenon of the losses of life and property, which is caused by the interaction of human society and natural environment. It’s also the product of the disaster environment, disaster-causing factors and disaster-bearing body. In order to study the processes, mechanisms and impacts of natural disasters as well as the reduction of the losses caused by natural disasters, it is necessary to conduct surveys of basic data and natural disaster events on a large scale of which the authenticity and consistency are much significant for ensuring the reliability and validity of the research results. However, the large number of organizations and investigators participating in the survey and evaluation process, large regional differences and large spatial scale create challenges in data quality control and validating the consistency of data from various survey units. To ensure the correctness and consistency of the data, it is necessary to carry out manual inspection. However, for the massive survey data, it is unrealistic to totally rely on manual work to effectively identify the abnormities. As a result, we design a multi-scale anomaly detection method for natural disaster survey data by using the single-element detection method of outliers based on normal distribution and spatial clustering method of Anselin Local Moran's I to detect the abnormal values and abnormal spatial distribution patterns of the massive survey data. It can effectively extracts the abnormalities and abnormal investigation units at all levels of scale and gains the reasons for abnormal data. It provides the support for the manual checking of survey data. In this paper, taking the project of flash flood disaster investigation and evaluation in mainland of China as an example, this method is used to audit the events of historical flash flood disaster and the areas of the towns which are in the prevention zones. Also, it quickly extract the anomaly units of flash flood disaster point density and township units with exceptional area values. Further analysis found that the reasons for these abnormalities were due to the inconsistency of filling methods, unit errors, and repetition of records and so on. The method resolved the inconsistency in massive amounts of flash flood survey data. This method is an effective approach of checking the quality of various other large-scale disaster datasets. Although the data validation approach used in this study is very effective, there are still some problems, i.e. the outlier checking only considers the outliers between survey units based on the administrative divisions. Regions are not divided according to their economic development and natural conditions. Finally, we analyze the applicable conditions of this method in the large-scale natural disaster investigations.
LIU Yesen , ZHANG Xiaolei , GUO Liang . Study and Application of the Method of Multi-scale Outliers Detection of Natural Disaster Investigation Data[J]. Journal of Geo-information Science, 2017 , 19(12) : 1653 -1660 . DOI: 10.3724/SP.J.1047.2017.01653
Fig. 1 Flowchart of the abnormal data detection图1 异常检测流程 |
Fig. 2 Auditing results of the point data of historical flash flood disaster图2 历史山洪灾害点数据审核结果 |
Fig. 3 land area of township图3 乡镇土地面积统计结果 |
Fig. 4 Spatial distribution of outliers of the rural land area图4 乡镇土地面积异常检测结果 |
The authors have declared that no competing interests exist.
[1] |
[
|
[2] |
[
|
[3] |
[
|
[4] |
[
|
[5] |
[
|
[6] |
[
|
[7] |
|
[8] |
[
|
[9] |
[
|
[10] |
[
|
[11] |
|
[12] |
[
|
[13] |
|
[14] |
[
|
[15] |
[
|
[16] |
[
|
[17] |
[
|
[18] |
[
|
[19] |
[
|
[20] |
[
|
[21] |
[
|
[22] |
[
|
[23] |
[
|
[24] |
[
|
[25] |
|
[26] |
|
[27] |
[
|
[28] |
[
|
[29] |
[
|
[30] |
[
|
/
〈 | 〉 |