Journal of Geo-information Science >
Analysis of Classification Methods and Activity Characteristics of Urban Population based on Social Media Data
Received date: 2017-04-30
Request revised date: 2017-07-20
Online published: 2017-10-09
Copyright
With the rapid development of spatial information technology, the concept of Pan-spatial Information System has been proposed. It extends the scope of spatial information system from the traditional mapping space to the space, interior space, microscopic space and other measurable space. Location data is one of the important research objects of Pan-spatial Information System and it has become a way of studying people's social life and urban dynamics. In this paper, we propose a new crowd classification method based on check-in data which is different from the traditional method based on socioeconomic attributes. Firstly, using the time series of check-in data, we build a matrix model. Then, we analyze the temporal characteristics of residents’ check-in activities. The analytical process starts from spatial-temporal profiles, learns the different behaviors, and returns annotated profiles. In the analytical process, we use the K-means clustering algorithm and K-NN algorithm to learn how to annotate profiles with a city user category (resident, dynamic resident, commuter, or visitor). Finally, according to the classification results of the population, we analyze the temporal and spatial behavior of different city user category and find their differences and potential regularity of spatial behavior. Our method can be applied to a new research perspective for characterizing the composition and characteristics of the urban population and studying urban spatiotemporal structure.
ZHOU Yan , LI Yanxi , HUANG Yueying , GENG Erhui . Analysis of Classification Methods and Activity Characteristics of Urban Population based on Social Media Data[J]. Journal of Geo-information Science, 2017 , 19(9) : 1238 -1244 . DOI: 10.3724/SP.J.1047.2017.01238
Fig. 1 The construction method of time series matrix图1 时序矩阵构造方法 |
Fig. 2 Examples of time series matrix of each type of urban population categories图2 每类城市人口类别的时序矩阵示例 |
Fig. 3 Individual check-in profiles’ categories and classification quality: the result of the semi-automatic labeling over the reduced time windows of one, two, three and four months, respectively图3 月份组合数为1、2、3、4的城市人口类别及总体的分类结果质量比较 |
Fig. 4 The check-in law of change of different types of people in different time period图4 不同类型人群在不同时间段签到变化规律 |
Fig. 5 Community discovery of check-in trajectory networks of different types of people图5 不同类型人群签到轨迹网络的社区发现结果 |
The authors have declared that no competing interests exist.
[1] |
[
|
[2] |
[
|
[3] |
|
[4] |
|
[5] |
|
[6] |
|
[7] |
|
[8] |
[
|
[9] |
[
|
[10] |
|
[11] |
|
[12] |
[
|
[13] |
|
[14] |
|
[15] |
|
[16] |
|
[17] |
|
[18] |
|
/
〈 |
|
〉 |