Journal of Geo-information Science

Select

A Super-Resolution Enhancement Algorithm for Remote Sensing Images Using Conditional Controlled Diffusion Models

FU Yibo, XIE Donghai, WANG Zhibo, YI Chang, GUO Liuyan, WU Yu

Journal of Geo-information Science. 2024, 26(10): 2384-2393. https://doi.org/10.12082/dqxxkx.2024.240315

Download PDF (144) HTML (1280)

Knowledge map

Save

Image super-resolution technology enhances image clarity and enriches image detail by improving image spatial resolution, enabling quality enhancement without changing hardware conditions. Given the large size, complex target features, and abundant details of remote sensing images, along with the need for efficient information acquisition, we propose a Diffusion Super-Resolution (DSR) algorithm based on a conditional diffusion model. This approach uses low-resolution remote sensing images from the same region as conditioning inputs to the diffusion model, while high-resolution images with added noise are concatenated as inputs. A deep noise training network was constructed with U-Net as the backbone, incorporating residual connections and self-attention mechanisms. The loss function was also improved for better super-resolution results. The DSR method was tested using high-resolution remote sensing images from multiple periods of the domestic Gaofen and SuperView satellite series. The super-resolution results demonstrated pixel dimension expansion from 32 to 128. Comparative experiments with Bicubic, SRGAN, Real-ESRGAN, and SwinIR super-resolution algorithms showed that the DSR method outperforms these algorithms in both PSNR and SSIM metrics. Additionally, the DSR method significantly improves the quality of multispectral remote sensing images. By leveraging the conditional diffusion model, it successfully preserves rich detail and enhances spatial resolution without compromising image clarity. This method offers an efficient solution for super-resolution reconstruction, ensuring effective information acquisition in remote sensing applications and fulfilling the requirements of various domains such as land use classification, environmental monitoring, and urban planning. Moreover, the DSR method also opens new avenues for future research by demonstrating the potential of diffusion models in remote sensing image processing. It overcomes the limitations of simple convolutional networks, which extract only shallow features, and avoids the convergence issues commonly seen in adversarial neural networks during training, ultimately improving the restoration of rich details in remote sensing images.

Select

Opportunities and Challenges in Developing Geographic Information Science and Technology in the Era of the Low-Altitude Economy

LIAO Xiaohan, HUANG Yaohuan, LIU Xia

Journal of Geo-information Science. 2025, 27(1): 1-9. https://doi.org/10.12082/dqxxkx.2025.250028

Download PDF (201) HTML (957)

Knowledge map

Save

[Significance] As a representative of new-quality productivity, the low-altitude economy is gradually emerging as a new engine for economic growth. This economy is based on the development and utilization of low-altitude airspace resources. While bringing development opportunities to geospatial information technology, it also poses entirely new challenges. [Progress and Analysis] In this paper, we introduce the division of low-altitude airspace resources and highlight typical drone application scenarios in the context of the low-altitude economy. Subsequently, we analyze the broad application prospects of geospatial information technology in key areas of the low-altitude economy, including the refined utilization of airspace resources, the construction of low-altitude environments, the planning, construction, and operation of new air traffic infrastructure, as well as the safe and efficient operation and regulatory oversight of drones. We emphasize that the geospatial information industry will benefit from development opportunities such as the integration and innovation of emerging scientific and technological advancements, growing market demand, policy support, industrial guidance, and industrial upgrading and transformation. [Prospect] Finally, we briefly address the challenges geospatial information technology must overcome to meet the development needs of the low-altitude economy. These include advancements in spatio-temporal dimension elevation, map and location-based services, high-frequency and rapid data acquisition systems, all-time and all-domain capabilities, and ubiquitous intelligent technologies. These areas will also serve as future directions for development and breakthroughs in geospatial information technology.

Select

Research and Application of Text-to-Image Technology Based on Al Foundation Models

ZHANG Xinchang, ZHAO Yuan, QI Ji, FENG Weiming

Journal of Geo-information Science. 2025, 27(1): 10-26. https://doi.org/10.12082/dqxxkx.2025.240657

Download PDF (150) HTML (934)

Knowledge map

Save

[Objectives] To systematically review recent advancements in text-to-image generation technology driven by large-scale AI models and explore its potential applications in urban and rural planning. [Discussion] This study provides a comprehensive review of the development of text-to-image generation technology from the perspectives of training datasets, model architectures, and evaluation methods, highlighting the key factors contributing to its success. While this technology has achieved remarkable progress in general computer science, its application in urban and rural planning remains constrained by several critical challenges. These include the lack of high-quality domain-specific data, limited controllability and reliability of generated content, and the absence of constraints informed by geoscience expertise. To address these challenges, this paper proposes several research strategies, including domain-specific data augmentation techniques, text-to-image generation models enhanced with spatial information through instruction-based extensions, and locally editable models guided by induced layouts. Furthermore, through multiple case studies, the paper demonstrates the value and potential of text-to-image generation technology in facilitating innovative practices in urban and rural planning and design. [Prospect] With continued technological advancements and interdisciplinary integration, text-to-image generation technology holds promise as a significant driver of innovation in urban and rural planning and design. It is expected to support more efficient and intelligent design practices, paving the way for groundbreaking applications in this field.

Select

Assessment of Green Space Accessibility Incorporating Sentiment Analysis: An Improved 2SFCA Method

CHANG Wanxuan, ZHANG Yongqi, FU Xiao

Journal of Geo-information Science. 2024, 26(10): 2243-2253. https://doi.org/10.12082/dqxxkx.2024.240096

Download PDF (175) HTML (1205)

Knowledge map

Save

With the increasing improvement of the living standard of the residents in urban areas and their pursuit of quality of life, urban green spaces have become the main places of leisure and recreation for residents. Under this background, how to fairly evaluate the rationality of the layout of urban green spaces and put forward suggestions for improvement has become an important part of urban transportation and land use planning. Urban green space accessibility is a key indicator for evaluating the layout of urban green spaces. In response to the limitations of assessing attractiveness based solely on urban green space area in the past, this paper takes Suzhou urban area as an example. In addition to calculating accessibility using objective attributes in the traditional framework, the paper delves into social media data to incorporate urban residents' subjective sentiment towards urban green space quality indicators into the consideration scope of attractiveness. Through this innovative integration, the paper improves the Two-Step Floating Catchment Area (2SFCA) method, analyzing in-depth the accessibility of urban residents to urban green spaces and the dynamic changes in accessibility before and after public health emergencies. The improved 2SFCA method, combined with Sentiment Knowledge Enhanced Pre-training (SKEP) model, incorporates residents' emotional evaluations of urban green spaces to measure their subjective attractiveness. Meanwhile, considering the skewness characteristic of area indicators, the paper innovatively proposes the Scale Index (SI) as an objective attractiveness evaluation indicator for urban green spaces, providing more scientific and robust support for urban green space planning. The research findings reveal that during public health emergencies, individuals tend to prefer urban green spaces that offer convenient access, such as community parks. However, as daily life gradually resumes, there is a greater preference for urban green spaces equipped with high-quality facilities, such as specialized parks. Only considering objective area as the attractiveness of urban green space leads to overestimation of the accessibility of large-area and underestimation of small-area urban green space. Moreover, solely based on visitors' subjective quality perception of urban green space may underestimate the accessibility of communities around large urban green spaces. The improved 2SFCA method, considering both visitors' subjective perception and objective attributes of urban green space attractiveness, can more accurately assess urban green space accessibility, broadening the perspective of traditional urban green space accessibility assessment. This method can not only be applied to urban green space planning, but also provides a new idea and computational framework for the accessibility analysis of public service facilities.

Select

Remote Sensing Data Intelligence: Progress and Perspectives

HE Guojin, LIU Huichan, YANG Ruiqing, ZHANG Zhaoming, XUE Yuan, AN Shihao, YUAN Mingruo, WANG Guizhou, LONG Tengfei, PENG Yan, YIN Ranyu

Journal of Geo-information Science. 2025, 27(2): 273-284. https://doi.org/10.12082/dqxxkx.2025.240630

Download PDF (120) HTML (737)

Knowledge map

Save

[Significance] Data resources have become pivotal in modern production, evolving in close synergy with advancements in artificial intelligence (AI) technologies, which continuously cultivate new, high-quality productive forces. Remote sensing data intelligence has naturally emerged as a result of the rapid expansion of remote sensing big data and AI. This integration significantly enhances the efficiency and accuracy of remote sensing data processing while bolstering the ability to address emergencies and adapt to complex environmental changes. Remote sensing data intelligence represents a transformative approach, leveraging state-of-the-art technological advancements and redefining traditional paradigms of remote sensing information engineering and its applications. [Analysis] This paper delves into the technological background and foundations that have facilitated the emergence of remote sensing data intelligence. The rapid development of technology has provided robust support for remote sensing data intelligence, primarily in three areas: the advent of the big data era in remote sensing, significant advancements in remote sensing data processing capabilities, and the flourishing research on remote sensing large models. Furthermore, a comprehensive technical framework is proposed, outlining the critical elements and methodologies required for implementing remote sensing data intelligence effectively. To demonstrate the practical applications of remote sensing data intelligence, the paper presents a case study on applying these techniques to extract ultra-high-resolution centralized and distributed photovoltaic information in China. [Results] By integrating large models with remote sensing data, the study demonstrates how remote sensing data intelligence enables precise identification and mapping of centralized and distributed photovoltaic installations, offering valuable insights for energy management and planning. The effectiveness of remote sensing data intelligence in addressing challenges associated with large-scale photovoltaic extraction underscores its potential for application in critical fields. [Prospect] Finally, the paper provides an outlook on areas requiring further study in remote sensing data intelligence. It emphasizes that high-quality data serves as the foundation for remote sensing data intelligence and highlights the importance of constructing AI-ready knowledge bases and recognizing the value of small datasets. Developing targeted and efficient algorithms is essential for achieving remote sensing intelligence, making the advancement of practical data intelligence methods an urgent research priority. Furthermore, promoting multi-level services for remote sensing data, information, and knowledge through data intelligence should be prioritized. This research provides a comprehensive technical framework and forward-looking insights for remote sensing data intelligence, offering valuable references for further exploration and implementation in critical fields.

Select

Spatiotemporal Analysis and Prediction of Landslide Deformation Combining Time-Series InSAR and LSTM

LIN Na, TAN Libing, ZHANG Di, DING Kai, LI Shuangtao, XIAO Maochi, ZHANG Jingping, WANG Xiaohua

Journal of Geo-information Science. 2024, 26(12): 2772-2787. https://doi.org/10.12082/dqxxkx.2024.240409

Download PDF (148) HTML (1109)

Knowledge map

Save

China is one of the countries most severely affected by geological disasters. Researching high-precision and highly reliable methods for monitoring and predicting landslide deformation holds practical significance for disaster prevention and mitigation efforts. Using the massive Outang landslide in the Three Gorges Reservoir Area as a case study, this paper addresses the issue of the atmospheric interference in extracting landslide deformation using time-series InSAR technology. To correct for atmospheric effects, the GACOS model is introduced and validated against GNSS observation data. To address the often-overlooked temporal-spatial analysis before landslide deformation prediction, the Moran index and Hurst index are calculated to analyze the spatiotemporal characteristics of landslide deformation. Recognizing that landslide deformation is influenced not only by historical deformation but also by various external factors, this paper proposes coupling landslide influencing factors with deformation data for prediction. A Long Short-Term Memory (LSTM) model, optimized by Variational Mode Decomposition (VMD) and the Sparrow Search Algorithm (SSA), is employed for the prediction. By decomposing landslide displacement data into trend, periodic, and random components using VMD, the LSTM network structure is constructed. SSA is used to determine the optimal number of hidden units, maximum training periods, and the initial learning rate of the LSTM model. Additionally, methods such as data normalization, regularization, and model evaluation are employed to enhance the performance and stability of the LSTM model. Finally, the model is trained using the influencing factors and decomposed displacement data to predict landslide deformation. The results indicate that: (1) From January 2021 to June 2023, the maximum and minimum deformation rates of the Outang landslide were -72.75 mm/a and 50.74 mm/a, respectively; (2) The deformation in the study area exhibits positive spatial autocorrelation, with the landslide in the settlement area showing a persistent trend; (3) The prediction error of the LSTM model optimized by VMD and SSA is only 0.37 mm, representing an 11.004% accuracy improvement compared to the standard LSTM model. Based on time-series InSAR technology and spatiotemporal analysis results, this paper constructs a high-precision prediction model for landslide deformation, incorporating multiple influencing factors. This model provides a valuable reference for the prevention and control of landslide disasters.

Select

GeoAI-driven Spatiotemporal Prediction: Progress and Prospects

WANG Peixiao, ZHANG Hengcai, ZHANG Yan, CHENG Shifen, ZHANG Tong, LU Feng

Journal of Geo-information Science. 2025, 27(1): 60-82. https://doi.org/10.12082/dqxxkx.2025.240718

Download PDF (148) HTML (878)

Knowledge map

Save

[Objectives] Forecasting is a key research direction in Geospatial Artificial Intelligence (GeoAI), playing a central role in integrating surveying, mapping, geographic information technologies, and artificial intelligence. It drives intelligent innovation and facilitates the application of spatial intelligence technologies across diverse real-world scenarios. [Progress] This study reviews the historical development of GeoAI-driven spatiotemporal forecasting, providing an overview of prediction models based on statistical learning, deep learning, and generative large models. In addition, it explores the mechanisms of spatiotemporal dependence embedding within these models and decouples general computational operators used for modeling temporal, spatial, and spatiotemporal relationships. [Prospect] The challenges faced by intelligent prediction models include sparse labeled data, lack of explainability, limited generalizability, insufficient model compression and lightweight design, and low model reliability. Furthermore, we discuss and propose four future trends and research directions for advancing geospatial intelligent prediction technologies: a generalized spatial intelligent prediction platform incorporating multiple operators, generative prediction models integrating multimodal knowledge, prior-guided deep learning-based intelligent prediction models, and the expansion of geospatial intelligent prediction models into deep predictive applications for Earth system analysis.

Select

An Intelligent Site Selection Approach for Public Service Facilities Coupled with Improved Graph Attention Network and Deep Reinforcement Learning

WANG Zhong, CAO Kai

Journal of Geo-information Science. 2024, 26(11): 2452-2464. https://doi.org/10.12082/dqxxkx.2024.240044

Download PDF (244) HTML (1117)

Knowledge map

Save

In the context of the rapid development of urbanization, the reasonable selection of locations for public service facilities is critical for delivering efficient services and enhancing the quality of urban residents' lives. However, prevailing approaches for allocation of public service facilities often fall short of meeting the demands on their performance and efficiency in complex and large-scale real-world scenarios. To address these issues, this article proposed a novel Graph-Deep-Reinforcement-Learning Facility Location Allocation Model (GDRL-FLAM), coupling a Facility Location Allocation Graph Attention Network (FLA-GAT) with a Deep Reinforcement Learning (DRL) algorithm. This proposed model tackled the location allocation problem for public service facilities based on graph representation and the REINFORCE algorithm. To assess the performance and efficiency of the proposed model, this study conducted experiments based on randomly generated datasets with 20, 50, and 100 points. The experimental results indicated that: (1) For the tests with 20, 50, and 100 points, the GDRL-FLAM model exhibited a significant improvement ranging from 11.79% to 14.49% compared to the Genetic Algorithm (GA) which is one of the commonly used heuristic algorithms for addressing location allocation problems. For the tests with 150 and 200 points, the improvement ranged from 1.52% to 9.35%. Moreover, with the increase in the size of the training set, the model also demonstrated enhanced generalizability on large-scale datasets; (2) The GDRL-FLAM model showed strong transfer learning ability to obtain the location allocation strategies in simple scenarios and adapt them to more complex scenarios; (3) In the case study of Singapore, the GDRL-FLAM model outperformed GA significantly, achieving obvious improvements ranging from 1.01% to 10.75%; (4) In all these abovementioned tests and experiments, the GDRL-FLAM model showed substantial improvement in efficiency compared to GA. In short, this study demonstrated the potential of the proposed GDRL-FLAM model in addressing the location allocation issues for public service facilities, due to its generalization and transfer learning abilities. The proposed GDRL-FLAM could also be adapted to solve other spatial optimization problems. Finally, the article discussed the limitations of the model and outlined potential directions for future research.

Select

Remote Sensing Image Segmentation Based on Multi-modal Feature Extraction and Hierarchical Perception

ZHANG Yinsheng, SHAN Mengjiao, CHEN Xin, CHEN Ge, TONG Junyi, JI Ru, SHAN Huilin

Journal of Geo-information Science. 2024, 26(12): 2741-2758. https://doi.org/10.12082/dqxxkx.2024.240488

Download PDF (142) HTML (1085)

Knowledge map

Save

In high-resolution remote sensing images, challenges such as blurred visual features of objects and different spectra for the same object arise. Segmenting similar ground objects and shaded ground objects in a single mode is difficult. Therefore, this paper proposes a remote sensing image segmentation model based on multi-modal feature extraction and hierarchical perception. The proposed model introduces a multi-modal feature extraction module to capture feature information from different modalities. Using the complementary information of IRRG and DSM, accurate pixel positions in the feature map are obtained, improving semantic segmentation of high-resolution remote sensing images. The coordinate attention mechanism fully fuses the features from different modalities to address issues of blurred visual features and different object spectra during image segmentation. The abstract feature extraction module uses MobileNetV3 with dual-path bottleneck blocks as the backbone network, reducing the number of parameters while maintaining model accuracy. The hierarchical perception network is introduced to extract deep abstract features, and the attention mechanism is improved by embedding scene perception of pixels. Leveraging the inherent spatial correlation of ground objects in remote sensing images, efficient and accurate class-level context modeling is achieved, minimizing excessive background noise interference and significantly improving the semantic segmentation performance. In the decoding module, the model uses multi-scale aggregation dual fusion for feature recovery, strengthening the connection between the encoder and the decoder. This combines low-level features with high-level abstract semantic features, enabling effective spatial and detailed feature fusion. Progressive upsampling is used for feature recovery, resolving the issue of blurred visual features and improving segmentation accuracy. Based on high-resolution remote sensing images from the ISPRS Vaihingen and Potsdam datasets, the experimental results demonstrate that MFEHPNet outperforms a series of comparison models, including C3Net, AMM-FuseNet, MMFNet, CMFet, CIMFNet, and EDGNet, across various performance indicators. In the ISPRS Vaihingen and Potsdam datasets, MFEHPNet achieves an overall accuracy of 92.21% and 93.45%, an average intersection ratio of 83.24 % and 83.94 %, and a Kappa coefficient of 0.85. The frequency-weighted intersection ratio is 89.24 % and 90.12%, respectively, significantly improving the semantic segmentation performance of remote sensing images and effectively addressing the issues of blurred feature boundaries and different spectra during segmentation.

Select

Characterizing the Evolution of Territorial Space Patterns and Driving Mechanisms in the Countries Along the Central Asia-West Asia Economic Corridor

ZHANG Ke, YIN Li, WEI Wei, LI Hongrui, ZHAO Lang, BO Liming

Journal of Geo-information Science. 2024, 26(11): 2529-2551. https://doi.org/10.12082/dqxxkx.2024.240439

Download PDF (124) HTML (940)

Knowledge map

Save

Scientific knowledge of the spatio-temporal evolution processes and formation mechanisms of territorial space in countries along the Central Asia-West Asia Economic Corridor holds significant scientific value and practical importance for supporting the current "Going Global" strategy and the "Belt and Road" initiative. Based on the dominant functions of the territories, the Central Asia-West Asia Economic Corridor is divided into three major types of territorial space: urban and rural construction, agricultural production, and ecological protection. A long-term analysis base map of territorial space from 2002 to 2022 was constructed by integrating multi-source spatio-temporal data. The spatio-temporal cube model was employed to depict the spatio-temporal evolution processes and typical patterns, while the integrated spatial transformation intensity model analyzed the characteristics of spatial structural transformation across three dimensions: scale, location, and intensity. The VIVI-SHAP framework of an interpretable machine learning model was used to analyze the evolution mechanisms, focusing on the importance of driving factors, interaction intensity, and non-linear dependencies. The results show that: (1) Approximately 6.14% of the territorial space in countries along the corridor underwent structural transformation over the past 20 years. The proportion of urban and rural construction space, though small, increased steadily by 0.17%, while agricultural production space decreased by 19.04% overall, with significant structural changes within the ecological protection space. (2) The dynamic interchange between green and other ecological spaces within the ecological protection space is predominant, with a systematic tendency for green ecological space to convert into agricultural production space, while the main source of urban and rural construction space expansion was green ecological space, accounting for 56.36% of the total converted area. 3) The territorial spatial pattern of the corridor is shaped by multiple processes of territorial space transformation, each with different magnitudes, intensities, and driving mechanisms. Natural geographic factors and transportation location factors played decisive roles, while the global influence of population growth and socio-economic development on territorial space structural transformation was less pronounced. This study provides new perspectives and methods to reveal the patterns and mechanisms of changes in land spatial types in the Central Asia-West Asia region. It further provides data support for decision-making departments to formulate reasonable land spatial planning, and demonstrates its application value in achieving greater spatial comprehensive benefits and promoting coordinated regional economic development.

Select

Exploring Urban Renewal Consumption: A Rapid Estimation Method for Annual Urban Construction Waste Production Based on High-Resolution Remote Sensing Images

HUANG Lei, LIN Shaofu, LIU Xiliang, WANG Shaohua, CHEN Guihong, MEI Qiang

Journal of Geo-information Science. 2024, 26(9): 2192-2212. https://doi.org/10.12082/dqxxkx.2024.240199

Download PDF (131) HTML (939)

Knowledge map

Save

Construction waste is an inevitable byproduct of urban renewal processes, causing serious environmental pollution and ecological pressure. Precisely quantifying the annual production of urban construction waste and the resource conversion rate is crucial for assessing the cost of urban renewal. Traditional manual methods of estimating construction waste production rely heavily on statistical data and historical experience, which are inflexible, time-consuming, and labor-intensive in practical application, and need improvement in terms of accuracy and timeliness. Existing deep learning models have relatively poor capabilities in extracting and integrating small targets and multi-scale features, making it difficult to handle irregular shapes and fragmented detection areas. This paper proposes a Multi-Scale Feature Fusion and Attention-Enhanced Network (MS-FF-AENet) based on High-resolution Remote Sensing Images (HRSIs) to dynamically track and detect changes in buildings and construction waste disposal sites. This paper introduces a novel encoder-decoder structure, utilizing ResNet-101 to extract deeper features to enhance classification accuracy and effectively mitigate the gradient vanishing problem caused by increasing the depth of convolutional neural networks. The Depthwise Separable-Atrous Spatial Pyramid Pooling (DS-ASPP) with different dilation rates is constructed to address insufficient receptive fields, resolving the issue of discontinuous holes when extracting large targets. The Dual Attention Mechanism Module (DAMM) is employed to better preserve spatial details, enriching feature extraction. In the decoder, Multi-Scale Feature Fusion (MS-FF) is utilized to capture contextual information, integrating shallow and intermediate features of the backbone network, thereby enhancing extraction capabilities in complex scenes. MS-FF-AENet is employed to extract and analyze changes in building areas at different time periods, calculating the engineering waste from new constructions and demolition waste from demolished buildings, thereby obtaining the annual production of urban construction waste. Furthermore, MS-FF-AENet is utilized to extract construction waste disposal sites at different time periods, estimating the amount of construction waste landfill based on changes in landfill waste, indirectly assessing the resource conversion rate of urban construction waste. Based on HRSIs of Changping District, Beijing from 2019 to 2020, experimental results demonstrate: (1) Among a series of baseline models including UNet, SegNet, PSPNet, DeepLabV3+, DSAT-Net、ConvLSR-Net and SDSC-UNet, MS-FF-AENet exhibits advantages in terms of precision and efficiency in extracting buildings and construction waste; (2) During the period from 2019 to 2020, the annual production of construction waste in the study area due to urban renewal is approximately 4 101 156.5 tons, with approximately 2 251 855.872 tons being landfill waste and approximately 1 849, 300.628 tons being resource conversion waste, resulting in a construction waste resource conversion rate of 45.09%, further corroborating government statistical reports. This paper provides a convenient and effective analysis approach for accurate measurement of the cost of urban renewal.

Select

High-Precision Identification and Spatial Feature Analysis of Green Space in a Mega-City Based on Street View and High-Resolution Remote Sensing Images

CHEN Hong, TANG Jun, GONG Yangchun, CHEN Zhijie, WANG Wenda, WANG Shaohua

Journal of Geo-information Science. 2024, 26(12): 2818-2830. https://doi.org/10.12082/dqxxkx.2024.230504

Download PDF (190) HTML (898)

Knowledge map

Save

Urban green spaces are critical components of urban ecosystems, playing an irreplaceable role in improving the ecological environment and enhancing quality of life. High-precision identification of urban green spaces is fundamental for urban renewal and optimizing green infrastructure. However, research on the identification and spatial heterogeneity of green spaces in megacities remains relatively limited. This study, taking Xi'an as an example, integrates urban street view images and GF-2 (Gaofen-2) satellite imagery, employing methods such as ISODATA classification, K-Means classification, and convolutional neural networks to achieve multi-dimensional, downscaled, and high-precision identification and analysis of green spaces. The results indicate the following: (1) The K-Means classification method demonstrates significantly higher accuracy (84.5%) compared to the ISODATA classification method (62.4%) and more accurately maps the spatial characteristics and heterogeneity patterns of green spaces. The green space coverage identified by the K-Means method is 0.277 0, which is lower than the 0.360 7 identified by ISODATA. (2) The average Green View Index (GVI) of streets in Xi'an's main urban area is 0.156 0, indicating a generally good level of street greening. However, there is notable polarization across different roads, with 30% of sampling points having a GVI below 0.080 0. Overall, the GVI of higher-grade roads is greater than that of lower-grade roads, following the trend: primary roads > secondary roads > trunk roads > tertiary roads. (3) There is a positive correlation between the GVI of streets and the vegetation coverage in their surrounding areas in Xi'an's main urban area. However, this correlation weakens in certain road sections, reflecting differences between vertical cross-sections and overhead views of the streets. Combining these perspectives provides a more accurate assessment and quantification of urban green spaces. This study provides a reference for green space planning, green infrastructure construction, and smart management in Xi'an, as well as technical guidance for high-precision identification and spatial analysis of urban green spaces in other cities.

Select

A Method of Road Network Matching Using Graph Embedding via Improved Neighbor Aggregations

YANG Ming, YANG Jian, HOU Yang, FANG Li, ZHANG Meng, ZHANG Bianying, ZHANG Jingru

Journal of Geo-information Science. 2024, 26(10): 2335-2351. https://doi.org/10.12082/dqxxkx.2024.240005

Download PDF (103) HTML (916)

Knowledge map

Save

As an important transportation infrastructure, the timely updating of road network data is of great significance in the fields of traffic management, emergency response, and urban planning. Road network matching that determines the correspondence between the features of road network data from different sources serves this purpose. It also provides technical support for tasks such as the quality assessment of crowdsourced road network data, which has attracted a lot of attention in the field of geographic information. However, traditional road network matching methods mainly measure the similarity of road network structure through the geometric and topological attributes of road network data to determine the matching relationship of road network elements. Such methods with manually designed features and thresholds are easily limited by experts' experience, which degrades their performance under complex road network structures. In recent years, road network data modeling based on graph neural networks has become a research hotspot and has achieved excellent performance in several road network modeling tasks. However, most of the existing methods use direct neighborhood aggregation on the graph topology to learn the embedded representation of the road network structure, without considering the spatial relationship of road network features in this key step, and failing to make full use of the representation learning capability of graph neural networks. For this reason, this study proposes an improved neighborhood aggregation that performs a spatially explicit graph-based embedding learning method for road network matching. First, a road graph model of the road network data is constructed, and geometric, semantic, and location features are extracted. Then, based on the GraphSAGE framework, three kinds of neighborhood aggregation operators (i.e., spatial, classified, and hybrid) are proposed, and the computation of spatial relationships and attribute types of road network features is introduced in the neighborhood aggregation operations. Finally, the similarity of graph node embedding is utilized to determine the matching relationship of road network features. To verify the effectiveness of the proposed method, extensive experiments are carried out using real-world road network data. The proposed method achieves the optimal performance in all metrics on the test data of the study region, which improves the matching correctness rate by more than 11% and the recall rate by more than 6.8% compared to the baseline graph neural network method. Furthermore, the road network graph embedding features are analyzed from the aspects of graph embedding structure and embedded road network structure, which helps explore the role of improved neighborhood aggregation on the graph embedding representation capability and provides a new perspective for further improving the graph neural network road network modeling.

Select

Outdoor Hiking Navigation Road Network Map Construction Using Crowd-Source Trajectory Data

TANG Jianbo, XIA Heyan, PENG Ju, HU Zhiyuan, DING Junjie, ZHANG Yuyu

Journal of Geo-information Science. 2025, 27(1): 151-166. https://doi.org/10.12082/dqxxkx.2025.240479

Download PDF (79) HTML (744)

Knowledge map

Save

[Objectives] The outdoor pedestrian navigation road network is a vital component of maps and a crucial basis for outdoor activity route planning and navigation. It plays a significant role in promoting outdoor travel development and ensuring safety management. However, existing research on road network generation mainly focuses on the construction of urban vehicular navigation networks, with relatively less emphasis on hiking navigation road networks in complex outdoor environments. Moreover, existing methods primarily emphasize the extraction of two-dimensional geometric information of roads, while the reconstruction of real three-dimensional geometric and topological structures remains underdeveloped. [Methods] To address these limitations, this study proposes a method for constructing the three-dimensional outdoor pedestrian navigation road network maps using crowdsourced trajectory data. This approach leverages a road network generation layer and an elevation extraction layer to extract the two-dimensional structure and three-dimensional elevation information of the road network. In the road network generation layer, a trajectory density stratification strategy is adopted to construct the two-dimensional vector road network. In the elevation extraction layer, elevation estimation and optimization are performed to generate an elevation grid raster map, which is then matched with the two-dimensional road network to produce the three-dimensional hiking navigation road network. [Results] To demonstrate the effectiveness of the proposed approach, experiments were conducted using 1 170 outdoor trajectories collected in 2021 from Yuelu Mountain Scenic Area in Changsha through an online outdoor website. The constructed outdoor three-dimensional hiking road network map achieved an average positional offset of 4.201 meters in two-dimensional space and an average elevation estimation error of 7.656 meters. The results demonstrate that the proposed method effectively handles outdoor trajectory data with high noise and varied trajectory density distribution differences, generating high-quality three-dimensional hiking road network maps. [Conclusions] Compared to traditional outdoor two-dimensional road networks, the three-dimensional navigation road networks constructed this study provide more comprehensive and accurate map information, facilitating improved pedestrian path planning and navigation services in complex outdoor environments.

Select

Time Series Variation Pattern Recognition of Spatiotemporal Distribution Patterns of Ride-hailing

CHEN Zhiju, LIU Kai, WANG Jiangbo

Journal of Geo-information Science. 2024, 26(10): 2229-2242. https://doi.org/10.12082/dqxxkx.2024.230406

Download PDF (171) HTML (859)

Knowledge map

Save

The rapid development of information and communication technologies and mobile computing has generated a variety of mobility big data, providing new opportunities for understanding and exploring the spatiotemporal distribution and mobility characteristics of resident travel, and further contributing to the construction of smart cities. However, the emerging mobile data have experienced significant growth in both scale and complexity compared to traditional data, posing challenges for its structural characteristic analysis. To address these issues, this paper proposes an analytical framework to deal with the spatiotemporal distribution characteristics of high-dimensional ride-hailing travel pattern. Compared to traditional square partitions, a regular hexagon is closer to a circle, and the six adjacent hexagons connected to its edges are symmetrically equivalent, which can be more advantageous in aggregating demands with similar travel characteristics into the same partition. Therefore, hexagonal partition is selected as the basic clustering unit, and different spatiotemporal patterns are identified by clustering homogeneous travel distribution groups. Firstly, the spatiotemporal characteristics of travel distribution aggregated in the hexagonal partition are summarized into three main components: the departure demand distribution at the origin partition, the spatial distribution at the destination partition, and the arrival demand distribution at the destination partition. The spatiotemporal similarity between two partitions can be expressed as the product of these three types of distribution similarity. Furthermore, a Clustering Algorithm with Fast Search and Find of Spatiotemporal Density Peaks (CFSFSTDP) is proposed to identify the spatiotemporal patterns of ride-hailing travel distribution in each partition. The spatiotemporal distances between different partitions are obtained through the calculation of spatiotemporal similarity. Finally, affinity propagation clustering algorithm is used to perform clustering analysis on the time series variation pattern of spatiotemporal pattern of travel distribution in each partition. The time series similarity of spatiotemporal patterns between different partitions is represented by the sum of Euclidean distances between time series of each interval, and the model converges through continuous updates of attractiveness and affiliation indices. Through the empirical analysis of Didi Chuxing order data in Chengdu for one month, the validity of the method is verified. Based on the identified seven spatiotemporal distribution patterns, the differences of spatiotemporal patterns in the size, location, and time of demand are analyzed, and the functional types of ride-hailing travel in different partitions are discussed. The identified six time series patterns better grasp the time continuity of spatiotemporal patterns of ride-hailing travel distribution and help to better build the corresponding spatiotemporal evolution digital.

Select

A Method for Evaluating Urban Vitality Using the RAGA-PPM Model: A Case Study of Central Nanjing

YU Lei, SHENG Yehua, LIU Xingyu

Journal of Geo-information Science. 2024, 26(11): 2567-2582. https://doi.org/10.12082/dqxxkx.2024.240380

Download PDF (109) HTML (809)

Knowledge map

Save

Enhancing and sustaining urban competitiveness is contingent upon the presence of urban vitality. Urban planners and managers face increasing pressure to find more accurate and logical ways to manage urban development due to the growing challenges associated with municipal government. This study focuses on the central region of Nanjing. A detailed framework for evaluating urban vitality is proposed from three perspectives, human activities, network interactions, and the physical environment. This framework uses foundational road networks and building footprints from the World Map, Baidu heatmap, Dianping restaurant data, social media check-in data, Baidu Map POI, and innovation data. To create a comprehensive vitality evaluation framework, nine urban vitality indicators were reduced in dimensionality using the real-coded accelerated genetic algorithm based on the Projection Pursuit Model (RAGA-PPM). An analysis was also conducted on the differences with EWM and the spatial distribution patterns of both unidimensional and comprehensive vitality in Nanjing. The conclusions can be divided into three parts. First, the spatial distribution pattern of vitality in Nanjing's central urban area is successfully reflected by the comprehensive evaluation technique based on multi-source big data. The validity of the proposed evaluation system was confirmed by analyzing vitality cluster sample locations. Second, similar spatial features may be seen in Nanjing's unidimensional vitality, revealing a monocentric urban structure, with high-value areas gradually decreasing outward from the Xinjiekou commercial district. Commercial districts and metro stations are the focal points of population activity vitality, with each district exhibiting strong central values and secondary vitality clusters. Urban vitality values decline, with the Xinjiekou commercial area and Nanjing South Station serving as hubs of network interaction vitality. Urban vitality ratings decrease concentrically, with the Xinjiekou commercial area and Nanjing South Station serving as the hubs of network interaction vitality. Physical building vitality is geographically scattered, with high and relatively high values spread across most areas. Third, unidimensional vitality is not unfamiliar to comprehensive vitality. Additional viable centers for vitality were identified, with each district having a vitality hub. Xuanwu, Gulou, Jianye, and Qinhuai districts, which comprise the old city, form the core of Nanjing's vibrancy and serve as significant hubs. Liuhe and Yuhuatai districts have the lowest vitality, while Xuanwu and Qinhuai districts show the highest vitality. Most districts with above-average comprehensive vitality scores are located near transportation hubs, university areas, industrial parks, pedestrian streets, and commercial centers. According to the study, urban designers may benefit from a more thorough and multifaceted understanding of urban vitality patterns.

Select

Estimation of Potential Charging Demand for Taxis and Spatiotemporal Feature Decomposition Based on Trajectory Data

MA Ruichen, WANG Pinxi, HUANG Ailing, QI Geqi, XU Xiaohan

Journal of Geo-information Science. 2024, 26(10): 2282-2299. https://doi.org/10.12082/dqxxkx.2024.240079

Download PDF (136) HTML (800)

Knowledge map

Save

In the transition of urban taxi fleets toward electrification in large cities, the charging demand of taxis exhibits characteristics of high charging load and strong spatiotemporal randomness, with a noticeable mismatch between charging supply and demand in time and space. To accurately estimate the potential charging demand post- the full electrification of the taxi fleet, this study introduces a bottom-up conceptual framework and a binary tree algorithm based on trajectory map matching, leveraging fuel taxi trajectory data exclusively. This approach provides a new paradigm for estimating charging demands in regions lagging behind in the electrification transition. The model and algorithm are validated using trajectory data from 890 electric taxis with battery status fields (State of Charge, SOC). Results show that the estimation errors of indicators such as the number of charging segments and charging amount are less than 6.5%. Moreover, the model also exhibits high spatiotemporal distribution estimation accuracy under different parameter settings (battery depletion threshold θ) and spatial scales (with grid sizes of 500 m, 1 000 m, and 10 000 m), ensuring their applicability in real-world scenarios. Specifically, the temporal distribution error of charging amount is less than 8.5% in the best-case scenario, and over half of the charging amount within 500 m grids has a spatial distribution error less than 0.3, with 59% of the 500 m grids having an estimated error of charging segment count less than 0.3. Building upon this, the Singular Value Decomposition (SVD) algorithm is used to decompose and reduce the dimensionality of the spatiotemporal matrix of charging demands, identifying spatiotemporal patterns of potential charging demands within the real road network of Beijing's Sixth Ring Road at road level. Finally, a case study is conducted using trajectory data from 1 913 taxis in Beijing over three consecutive days from March 9th (Monday) to March 11th (Wednesday) in 2019, and the results indicate that the spatial distribution of potential charging demands for taxis in Beijing exhibits prominent clustering features in key areas and critical corridors, corresponding to high-density charging demands associated with residents' high activity levels and long-distance travel. The decomposed charging demands reveal a spatiotemporal structural pattern dominated by regular charging demands, with supplementary heterogeneity in charging demands between morning and afternoon, as well as during working and non-working hours. This analysis method assists in uncovering the spatial distribution structural characteristics of potential charging demand and spatiotemporal coupling relationships, providing decision-making references for long-term planning of charging infrastructure, grid load scheduling, and charging demand management in the electrification transformation of taxi fleets.

Select

Reasoning on Geographical Knowledge Graph: Advances and Perspectives

DUAN Yuxi, CHEN Biyu, LI Yan, ZHANG Xueying, LIN Li

Journal of Geo-information Science. 2025, 27(1): 41-59. https://doi.org/10.12082/dqxxkx.2025.240460

Download PDF (171) HTML (665)

Knowledge map

Save

[Objectives] With the application of knowledge graph techniques in the field of Geographical Information Science (GIS), the Geographical Knowledge Graph (GeoKG) has become a key research direction. GeoKGs often lack sufficient geographic knowledge coverage, which can negatively impact downstream applications. Therefore, reasoning techniques are essential for GeoKG to complete missing knowledge, identify inconsistencies, and predict trends in geographic phenomena. Unlike reasoning techniques applied to general knowledge graphs, reasoning on GeoKGs must handle the unique and complex spatial and temporal characteristics of geographic phenomena. This paper comprehensively introduces and summarizes recent advances in GeoKG reasoning. [Analysis] First, it introduces the relevant concepts and problem definitions of GeoKG reasoning. Second, it analyzes the two core tasks of GeoKG reasoning: knowledge completion and prediction. The reasoning model for knowledge completion primarily fills gaps in the graph to ensure knowledge integrity, while the reasoning model for prediction aims to forecast future trends based on existing geographic data. These two models are optimized for different application scenarios, with different focuses in processing geographic data. [Prospect] Finally, the paper explores future development trends in GeoKG reasoning, highlighting areas such as processing complex relationships in spatiotemporal data, reasoning with multi-scale geographic knowledge, fusing multimodal data, and enhancing the interpretability and intelligence of reasoning models. Additionally, the integration of GeoKGs with large-scale pre-trained models is expected to become a key area of focus.

Select

Spatial Prediction Based on Cross-area Knowledge Transfer: Comparison of Two Methods

LUAN Yupeng, HE Rixing, JIANG Chao, DENG Yue, ZHU Mengzhen, WANG Yitong, TANG Zongdi

Journal of Geo-information Science. 2024, 26(11): 2465-2482. https://doi.org/10.12082/dqxxkx.2024.240365

Download PDF (95) HTML (757)

Knowledge map

Save

Due to the imbalanced regional development, data scarcity exists in some regions, which to some extent restricts the progress of spatial prediction research. The introduction of cross-area knowledge transfer offers a valuable method for mitigating the impact of data scarcity in areas with limited samples and for conducting spatial prediction. With technological advancements, spatial prediction methods based on transfer learning and the Third Law of Geography have become mainstream in the fields of computer science and geography. Transfer learning techniques leverage knowledge from a source domain with abundant data to solve related tasks in a target domain with limited data. Meanwhile, the proposal and application of the Third Law of Geography show that by comparing the similarity of geographical environmental variables between sampled regions and unsampled regions (rather than relying solely on traditional spatial distance or quantitative relationships), it is possible to predict target information in unsampled regions using a small amount of sample data. This provides a theoretical basis and methodological reference for selecting the source domain and target domain in cross-regional knowledge transfer. This paper conducts a literature review of cross-regional spatial prediction research based on these two major methods since 2018, focusing on the following key tasks: (1) Comparing and analyzing the basic principles of spatial prediction based on geographical similarity and transfer learning, and identifying differences in their technical procedures; (2) Summarizing the differences in similarity representation indicators and measurement methods between the two approaches; (3) Examining differences in commonly used auxiliary data, spatial analysis units, modeling methods, and evaluation indicators between the two prediction methods; (4) Discussing the challenges and limitations faced by these cross-regional knowledge transfer methods. The study shows that while the technical principles of both methods are basically consistent, they have specific limitations regarding their scope of application, similarity representation and measurement, relevant auxiliary variables, and parameter selection. The research offers useful insights for optimizing and improving these methods, integrating them effectively, innovating cross-regional prediction approaches, and expanding their application fields.

Select

Evaluation Method for Supply and Demand Matching in Subway-Bus Transfers: A Case Study of Shenzhen City

LEI Jiexuan, BIAN Mengyuan, GU Zhihui

Journal of Geo-information Science. 2024, 26(10): 2419-2432. https://doi.org/10.12082/dqxxkx.2024.240280

Download PDF (191) HTML (723)

Knowledge map

Save

Realizing convenient transfers between subway and regular bus systems is fundamental to advancing the integration and development of these two transportation networks, which is crucial for constructing a multi-modal and accessible public transportation system. This paper takes Shenzhen as a case study and innovatively combines mobile phone signal data with IC card data to identify the transfer characteristics between subway and regular bus systems. These characteristics include temporal and distance aspects, which effectively illustrate the daily travel patterns of transfer passengers. Through a detailed analysis of the overall transfer characteristics, this study establishes a distance threshold to estimate potential transfer demand and the gap in transfer demand at each subway station. Furthermore, this paper uses the Entropy Weight-TOPSIS Model to conduct a preliminary evaluation of the transfer supply conditions at various subway stations. Based on the evaluation results of the matching between transfer supply and demand, as well as the size of the transfer demand gap, this study proposes corresponding optimization strategies for subway stations, providing an effective method for identifying inefficient stations. The research findings indicate that, in Shenzhen, the subway stations with high potential demand for transfers to regular buses are mainly located near densely populated residential areas. The central urban area exhibits a high degree of matching between transfer supply and demand, with some old urban areas experiencing an oversupply due to the well-developed public transportation infrastructure. However, peripheral stations commonly face a situation where demand exceeds supply, necessitating focused attention on improving transfer supply conditions at these sites. Regarding the transfer demand gap, even among subway stations with the same level of transfer supply, variations in the size of the demand gap exist. Stations with insufficient transfer supply but efficient operations offer valuable lessons, while stations with large demand gaps and inefficient operations should be targeted for specific improvements based on their individual supply and demand matching situations. The results demonstrate that evaluating the alignment between potential subway-bus transfer demand and the level of transfer supply using multi-source data, and formulating optimization strategies in conjunction with the transfer demand gap, is of significant importance for enhancing the refined management level of subway and bus transfer services. Overall, the theory and calculation methods of transfer potential demand and transfer demand gap proposed in this study provide a new perspective and reference for transfer research, public transport planning, and urban planning in the field of public transportation.

Select

From Geographic Information System to Geographic Intelligent Agent

LUO Bin, LIU Wenhao, WU Jin, HAN Jiafu, WU Wenzhou, LI Hongsheng

Journal of Geo-information Science. 2025, 27(1): 83-99. https://doi.org/10.12082/dqxxkx.2025.240658

Download PDF (135) HTML (541)

Knowledge map

Save

[Objectives] The geographic system is an integrated framework encompassing natural and human phenomena and their interrelationships on the Earth's surface. While Geographic Information Systems (GIS) can digitally process these geographic elements, they face challenges in addressing rapidly changing geographic contexts with complex 3D structures. This is primarily due to the lack of bi-directional interactions between physical and informational spaces, as well as their reliance on predefined rules and historical data. In this paper, we propose the concept of a “Geographic Intelligent Agent” as an advanced form of GIS, which integrates embodied intelligence, self-supervised learning, and multimodal language modeling to improve environmental perception, spatial understanding, and autonomous decision-making. [Methods] The architecture of the geographic intelligent agent consists of three core components: multimodal perception, an intelligent hub, and an action manipulation module. These components collectively acquire comprehensive environmental information through sensor networks, perform complex situatio reasoning using knowledge graphs and generative models, and enable real-time control and multilevel planning of the physical environment. To adapt to differences between virtual and real environments, the geographic intelligent agent is tested using the earth simulator and a test field platform, equipping it with stronger autonomous capabilities in complex and dynamic geographic contexts. [Results] This paper also demonstrates the implementation of geographic intelligent agent in spatial intelligence applications using the virtual digital human “EarthSage” as an example. [Conclusion] As a prototype of the geographic intelligent agent, "EarthSage" integrates modules such as the spatiotemporal Knowledge Ggraph (GeoKG) and a Cognitive Map Generation Model (GeoGPT), assisting users in obtaining intelligent spatial decision-making support in fields such as emergency management, urban planning, and ecological monitoring. This work exemplifies the transformation of GIS from a traditional information processing tool to an autonomous spatial intelligent system, marking a significant advancement in the field.

Select

Mesh Simplification Algorithm for Photorealistic 3D Models Based on Edge Collapse

JIN He, LIU Tao, DU Ping, ZHANG Zhao, DING Nannan, CHEN Chen, JIA Yandang, LIU Changxin

Journal of Geo-information Science. 2024, 26(10): 2254-2267. https://doi.org/10.12082/dqxxkx.2024.230665

Download PDF (118) HTML (740)

Knowledge map

Save

The lightweighting of three-dimensional models is a pressing need in the construction of realistic three-dimensional environments in China. Simplifying three-dimensional models to a certain extent is a suitable approach. In the realm of model simplification algorithms, the Quadric Error Metrics (QEM) algorithm is considered a classic method. However, traditional QEM algorithms lack a dedicated mechanism to preserve crucial details during the simplification process, and the post-simplification mesh quality needs further optimization. To address these challenges, this paper proposes a real-world three-dimensional model simplification algorithm based on edge collapsing. The aim is to better preserve key details and enhance the mesh quality of the simplified model in the process of three-dimensional model lightweighting. The simplification algorithm in this method introduces vertex approximation curvature and volume error as constraints, altering the edge collapsing cost to maintain important details while achieving effective simplification. By incorporating boundary preservation conditions while modifying edge collapsing costs, the algorithm effectively safeguards the model's boundaries. Finally, post-simplification mesh optimization is performed using the Laplacian mesh optimization method to optimize triangle shapes and preserve features. Experiments were conducted to verify the proposed algorithm and compare it with the traditional QEM algorithm. The research results indicate that our proposed method retains regions with rich details better than the traditional QEM algorithm, significantly reducing the number of elongated triangles and improving mesh quality. This algorithm is applicable to the lightweighting of real-world three-dimensional models, contributing to enhanced model performance and visualization effects.

Select

Critical Technologies and Service Approaches for Cloud-Native Geospatial Knowledge Base Management

ZHONG Teng, ZHANG Xueying, XU Pei, CAO Min, CHEN Biyu, LIU Qiliang, WANG Shu, YANG Yizhou

Journal of Geo-information Science. 2024, 26(9): 2013-2025. https://doi.org/10.12082/dqxxkx.2024.240184

Download PDF (192) HTML (725)

Knowledge map

Save

The essence of geospatial knowledge lies in unveiling the spatiotemporal distribution, dynamics of change, and interaction patterns of geographical entities and phenomena. However, existing knowledge base management platforms often overlook the specific needs of geospatial knowledge representation and lack the capability to handle the unique attributes of geospatial data, making it challenging to meet the requirements for constructing and applying geospatial knowledge graphs. The Geospatial Knowledge Base Management System (GeoKGMS) is designed on the basis of an integrated geospatial knowledge base engine that efficiently aggregates geospatial knowledge resources across various modalities—'Image-Text-Number'—automates the construction of geospatial knowledge graphs, and facilitates a one-stop geospatial knowledge engineering process. This paper elucidates four key technologies for managing geospatial knowledge bases. First, the cloud-native geospatial knowledge base microservice unified scheduling technology decomposes the large geospatial knowledge base management system into fine-grained, independently operable, and deployable microservices. By comprehensively managing the lifecycle of the geospatial knowledge base, service classification and orchestration methods are determined to achieve unified scheduling of these microservices. Second, a human-computer collaborative geospatial knowledge graph construction method is proposed, supporting the sustainable, collaborative construction of geospatial knowledge graph engineering. Third, the spatiotemporal hybrid encoding technology of the geospatial knowledge graph achieves unified representation of geospatial knowledge by integrating multimodal geospatial data and spatiotemporal information. Fourth, a multimodal geospatial knowledge integrated storage and large-scale spatiotemporal graph partitioning technology is proposed to address the challenges of efficiently managing complex structured geospatial knowledge and retrieving large-scale spatiotemporal knowledge tuples. Based on these key technologies, an application service framework for GeoKGMS has been designed, featuring six functional modules: geospatial knowledge base management, multimodal geospatial knowledge extraction, human-computer collaborative construction of geospatial knowledge graphs, geospatial knowledge reasoning, geospatial knowledge graph quality assessment, and geospatial knowledge visualization. To demonstrate GeoKGMS's capabilities, the Karst landform knowledge graph is used as a case study. The Karst landform knowledge graph is an integrated 'Image-Text-Number' geospatial knowledge graph, constructed based on geospatial knowledge extracted from the texts, schematic diagrams, and related maps in geomorphology textbooks. Through a collaborative pipeline, geomorphology experts and computers jointly perform tasks such as mapping, alignment, supplementation, and conflict resolution of geospatial knowledge. This collaboration ultimately leads to the automated construction of the Karst landform knowledge graph by GeoKGMS. The resulting graph is highly consistent with expert knowledge models, ensuring the interpretability of knowledge-driven geocomputation and reasoning in practical applications.

Select

Tunnel Point Cloud Section Extraction Method Based on Orthogonal Rotation of the Central Axis

WANG Tiexing, WEI Guanjun, WANG Yongxin

Journal of Geo-information Science. 2024, 26(12): 2759-2771. https://doi.org/10.12082/dqxxkx.2024.240505

Download PDF (63) HTML (680)

Knowledge map

Save

The accurate extraction of tunnel sections is a pivotal step in tunnel deformation analysis. However, due to inadequate illumination, the reflection and occlusion caused by dust and structural elements generate noise and erroneous points in the point cloud data, complicating data processing. Additionally, the intricate geometry of tunnel features, such as curved surfaces, corners, and cracks, renders traditional section extraction algorithms ineffective for point cloud data. Consequently, there is an urgent need for more efficient and robust algorithms. To address this issue, this paper proposes a method for continuous tunnel section extraction based on laser point cloud data. First, a combination filtering method is introduced, integrating Random Sample Consensus (RANSAC) cylindrical fitting and radius filtering to effectively remove scattered outliers and noise points adhering to the tunnel walls with sparse density. Next, the tunnel central axis is obtained via bidirectional projection, and a mathematical function model is established in line with the principle that ‘a straight line intersecting the tunnel central axis intersects the tunnel wall’, enabling the continuous extraction of tunnel section point clouds. Finally, the fitting center coordinates of the section points and the tunnel design radius are used as parameters to calculate the radial deviation of the tunnel points, representing the shape variables. The tunnel point cloud is visually rendered using these shape variables to display the overall deformation of the tunnel. In this paper, laser point cloud data from three sections of a subway tunnel in Chengdu are used as the experimental data. The results show that the mean values of Class I error, Class II error, and total error are 1.48%, 1.03%, and 1.21%, respectively, with the Kappa coefficient reaching 97.45% when using this method for noise filtering. Compared to traditional methods such as least squares, density clustering, and normal deviation algorithms, this method reduces cumulative errors by 9.34%, 10.61%, and 4.41%, respectively, while increasing the Kappa coefficient by 5.36%, 6.38%, and 3.65%. This demonstrates the enhanced robustness and accuracy of the proposed method. Moreover, the mean deviation between the tunnel section fitting radius obtained through this method and the design radius is merely 1.36 mm, compared to deviations of 1.60 mm and 6.00 mm with existing methods, achieving reductions of 2.5 mm and 2.7 mm, respectively. The range of the tunnel shape variable is between 0 and 18 mm, and the overall deformation of the tunnel is visually displayed through point cloud rendering. The method provides a reliable foundation and essential support for tunnel safety monitoring.

Select

YOLOv8l-FMSC-Spatial: A Retrieval Model for We-Map Geographical Elements

HOU Yuhao, YANG Weifang, YAN Haowen, LI Jingzhong, ZHU Xinyu, YAN Xiangrong, PENG Yibo

Journal of Geo-information Science. 2025, 27(2): 461-478. https://doi.org/10.12082/dqxxkx.2025.240327

Download PDF (37) HTML (524)

Knowledge map

Save

[Objectives]Currently, systematic research in content retrieval for We-maps is lacking. To address this gap, this paper proposes an approach for geographic feature extraction and retrieval in hand-drawn map scenes using the YOLOv8l-FMSC-Spatial model (You Only Look Once v8l - Fewer Multi-Scale Convolution-Spatial). [Methods]First, different YOLO models were compared to select the optimal YOLOv8l model. The C2f-FMSC module was introduced to improve this model, resulting in the YOLOv8l-FMSC training model specifically designed for We-maps. This model was applied to extract geographic features from raster maps. Next, to meet the retrieval needs of geographic features, a spatial relationship database for these features was established. A spatial computation and retrieval module, Spatial, was designed to process geographic feature information by transmitting and filtering it. The module further calculates spatial correlations between user queries and the geographic feature information in the database. Based on the degree of spatial relationship association, the model indexes maps containing relevant geographic feature information from the We-maps database, enabling the construction of a spatial relationship-based geographic feature retrieval model. The method was validated using hand-drawn campus map retrieval scenarios. The experimental dataset comprised publicly available maps from schools and maps freely created by students, totaling 493 hand-drawn campus maps. These maps were used to study the retrieval of representative geographical elements such as water bodies, sports fields, and unique architectural structures associated with schools nationwide. The focus was on accurately identifying and retrieving these characteristic elements to ensure the model’s practical applicability. [Results] The experimental results indicate: (1) The trained YOLOv8l model effectively identifies geographical elements in self-made maps, with its effectiveness and robustness verified on the proposed dataset; (2)The YOLOv8l model, enhanced with the FMSC module, achieved a precision of 0.8 and a recall of 0.764, making it the optimal choice for practical comparisons; (3)The Spatial calculation model effectively captures the spatial information of relevant geographical elements, narrowing the gap with orthographic map retrieval. By applying this method, the retrieval of geographical elements from hand-drawn campus maps, while considering spatial relationships, becomes achievable. [Conclusions] The proposed model can quickly and accurately retrieve content-relevant hand-drawn maps based on geographic feature conditions, effectively filling the research gap in content retrieval for We-maps.

Select

Extraction of Building Outlines from Airborne LiDAR Point Clouds Using Line-CNN Based on Deep Network

HUANG Yishu, ZANG Yufu, JIANG Qihan, MI Wenhan

Journal of Geo-information Science. 2024, 26(9): 2161-2176. https://doi.org/10.12082/dqxxkx.2024.230503

Download PDF (112) HTML (676)

Knowledge map

Save

Urban 3D modeling is indispensable for digital twinning and the development of smart cities. The effective extraction of building outlines is a critical step in achieving high-precision urban modeling and 3D mapping. At present, the extraction of building outlines from airborne point cloud data still faces challenges, such as low efficiency and accuracy with conventional methods and limited calibration samples. In response to these challenges, this paper introduces a deep learning method for extracting building outlines from 3D airborne point clouds. The airborne LiDAR point clouds are the primary data input. First, through vertical projection to the XOY plane, point clouds of buildings with the application of progressive morphological filtering are converted to rasterized elevation that characterizes spatial variation of terrain and visible light raster images that depict texture differences. Then, the deep learning model based on Lines-Convolutional Neural Networks (Line-CNN) is employed to preliminarily extract line features from raster images, encompassing stages of feature extraction, node prediction, route generation, and others. To enhance the quality of the primary straight-line extraction, an optimization strategy is introduced, which incorporates a range of comprehensive trimming and completion operations, aligning with information extracted from both the elevation and visible light raster images. Simultaneously, false line segments are eliminated, and missing lines are added, resulting in the regular and complete building outline features. To verify the proposed model, the airborne point cloud data from NUIST campus and the ISPRS H3D 2019 datasets are utilized in the experiment. Our results show that the proposed method accurately and comprehensively extracts building outline features from LiDAR images, achieving an impressive average accuracy and completeness rate, both up to 90%. Furthermore, the proposed method is highly efficient and effectively addresses the challenge of insufficient 3D calibration samples in traditional methods, making it suitable for various applications, particularly large-scale urban 3D modeling and cadastral surveying. To sum up, the proposed method constitutes a significant stride in advancing urban modeling and 3D mapping. It provides a novel solution to address the challenges associated with building outline extraction, particularly within the context of smart cities and digital twins. Due to the model's high accuracy, completeness, and efficiency, our method is highly helpful for a wide range of applications in the urban planning and geospatial information fields.

Select

Classification of Mangrove Species Using Coupled Sentinel-1/2 Multi-Source Time-Series Remote Sensing Image Features

XUE Yufei, ZHANG Shenghan, BAI Nana, YUAN Feng, LIU Jie, CHEN Ye, HUANG Xiaohui, XIONG Lanlan, FU Yingchun

Journal of Geo-information Science. 2024, 26(11): 2626-2642. https://doi.org/10.12082/dqxxkx.2024.240334

Download PDF (128) HTML (642)

Knowledge map

Save

Scientific and accurate monitoring of mangroves is the basis and premise for protecting marine coastal wetland ecosystems. Multi-source remote sensing data can be used to classify mangrove species effectively, but challenges remain in applying optical and SAR image features along with their time-varying information. In this paper, based on Sentinel-1/2 image data, we propose a mangrove species classification framework using Multi-source Features-coupled and Ensemble Learning algorithm (MFEL). The framework analyzs the classification advantages of spectral index features, SAR polarization features, and their temporal harmonic spectral features in feature selection and coupling. It then stacks the Random Forest (RF) and eXtreme Gradient Boosting (XGBoost) models to construct an Ensemble Learning model for mangrove species classification. Comparing the RF classification model and XGBoost models based on feature optimization, we evaluated the classification accuracy and feature application differences of the MFEL classification method. Zhanjiang Mangrove Forest National Nature Reserve was selected as the experimental area. The results show that: ① compared to using only spectral index features, classification accuracy improves by 6% and 8% with the addition of SAR polarization features or temporal harmonic spectral features, respectively. Adding both SAR polarization features and temporal harmonic spectral features simultaneously improves classification accuracy by 12%, making it more effective for mangrove species classification. ② The MFEL method achieves the highest classification accuracy, with an overall accuracy of 88.03% and a Kappa coefficient of 0.86. When the MFEL model trained on samples from the experimental area was applied to other areas, the classification accuracies were 83.94% and 82.77%, respectively. ③ This study verifies the potential application of SAR polarization features and time-sequence harmonic spectral features in mangrove species classification, significantly improving the accuracy for five mangrove species, with accuracies ranging from 76% to 91%. The study results provide valuable insights for expanding the use of medium-resolution remote sensing satellite imagery in monitoring mangrove species.

Select

Improved K-means Clustering Method Based on Spectral Clustering and Particle Swarm Optimization for Individual Tree Segmentation of Airborne LiDAR Point Clouds

QIAN Yuhang, WANG Jingxue, ZHENG Xuetao

Journal of Geo-information Science. 2024, 26(9): 2177-2191. https://doi.org/10.12082/dqxxkx.2024.240243

Download PDF (70) HTML (661)

Knowledge map

Save

The precision of individual tree segmentation is important for survey of forest resources. However, traditional individual tree segmentation algorithms suffer from issues such as near tree confusion and low computational efficiency when processing large-scale point cloud data. To address these issues, this paper introduces an improved K-means clustering method that combines spectral clustering and particle swarm optimization for individual tree segmentation of airborne LiDAR point clouds. The proposed method is designed to overcome the limitations of conventional methods by increasing the accuracy of tree segmentation and optimizing the processing of large and complex point cloud data. By combining advanced techniques in spectral clustering and particle swarm optimization, the proposed method significantly improves the precision and efficiency of individual tree segmentation. Firstly, the voxelization of the point cloud data is performed using the Mean Shift algorithm, where the adaptive bandwidth and Gaussian kernel function compute the similarity between voxels, resulting a Gaussian similarity graph reflecting the properties of voxels. This graph not only encapsulates the space structure of the forest but also improves the accuracy of the subsequent data analysis and processing. After voxelization, the Nyström method is applied to efficiently manage the Gaussian similarity graph. This method uses K-nearest neighbor search to select representative samples, effectively reducing the computational burden associated with spectral clustering when dealing with large-scale datasets. By selecting representative samples, the algorithm ensures that the main features of the data are retained, facilitating a more manageable and accurate clustering process. This method optimizes the processing of large amounts of point cloud data by balancing computational efficiency with the requirement to maintain data integrity and accuracy, thus providing a robust foundation for accurate tree segmentation. Using the Nyström approximation, approximate eigenvectors of the similarity graph are obtained, facilitating an effective mapping from the high-dimensional space to a low-dimensional feature space. Finally, the particle swarm optimization algorithm is introduced to enhance the K-means clustering process. This optimization algorithm first randomly initializes a set of particles, each representing a set of potential cluster centers. In each iteration, the particles update their clustering speed and position based on the best historical position of the individual and the best historical position of the group, adjusting the clustering centers to minimize the internal cluster distance. In this paper, publicly available point cloud data from NEWFOR is selected for experiments. The experimental results show that the segmentation results obtained by the proposed algorithm are 5.3% higher in accuracy and 23 times more efficient than the comparison algorithm.

Select

Intelligent Purification of Natural Resource Element Change Polygons Driven by Remote Sensing Spatiotemporal Knowledge Graphs

LI Yansheng, ZHONG Zhenyu, MENG Qingxiang, MAO Zhidian, DANG Bo, WANG Tao, FENG Yuanjun, ZHANG Yongjun

Journal of Geo-information Science. 2025, 27(2): 350-366. https://doi.org/10.12082/dqxxkx.2025.240571

Download PDF (86) HTML (412)

Knowledge map

Save

[Objectives] With the development of deep learning technology, the ability to monitor changes in natural resource elements using remote sensing images has significantly improved. While deep learning change detection models excel at extracting low-level semantic information from remote sensing images, they face challenges in distinguishing land-use type changes from non-land-use type changes, such as crop rotation, natural fluctuations in water levels, and forest degradation. To ensure a high recall rate in change detection, these models often generate a large number of false positive change polygons, requiring substantial manual effort to eliminate these false alarms. [Methods] To address this issue, this paper proposes a natural resource element change polygon purification algorithm driven by remote sensing spatiotemporal knowledge graph. The algorithm aims to minimize the false positive rate while maintaining a high recall rate, thereby improving the efficiency of natural resource element change monitoring. To support the intelligent construction and effective reasoning of the spatiotemporal knowledge graph, this study designed a remote sensing spatiotemporal knowledge graph ontology model taking into account spatiotemporal characteristics and developed a GraphGIS toolkit that integrates graph database storage and computation. This paper also introduces a vector knowledge extraction method based on the native spatial analysis of the GraphGIS graph database, a remote sensing image knowledge extraction method based on efficient fine-tuning of the SkySense visual large model, and a polygon purification knowledge extraction method based on the SeqGPT large language model. Under the constraints of the spatiotemporal ontology model, vector, image, and text knowledge converge to form a remote sensing spatiotemporal knowledge graph. Inspired by the manual operation methods for change polygon purification, this paper developed an automatic purification method of change polygons based on first-order logical reasoning within the knowledge graph. To improve the concurrent processing and human-computer interaction, this paper developed a remote sensing spatiotemporal knowledge graph management and service system. [Results] For the task of purifying natural resource element change polygons in Guangdong Province from March to June 2024, the proposed method achieved a true-preserved rate of 95.37% and a false-removed rate of 21.82%. [Conclusions] The intelligent purification algorithm and system for natural resource element change polygons proposed in this study effectively reduce false positives while preserving real change polygons. This approach significantly enhances the efficiency of natural resource element change monitoring.

Select

Research Progress in Spatio-temporal Big Data Mining and Knowledge Services for Natural Resources

DENG Min, WANG Da

Journal of Geo-information Science. 2025, 27(1): 27-40. https://doi.org/10.12082/dqxxkx.2025.240625

Download PDF (124) HTML (483)

Knowledge map

Save

[Significance] As a comprehensive observation of natural resource development and utilization, spatio-temporal big data on natural resources contains valuable knowledge about resource distribution, spatio-temporal process evolution, and interrelationships. [Progress] This paper examines spatio-temporal big data mining and knowledge services for natural resources, highlighting key data mining techniques and their critical applications in knowledge services. First, it introduces the core concepts, technical frameworks, and methodological processes of spatio-temporal clustering analysis, association mining, anomaly detection, predictive modeling, and geographic risk assessment, along with their applications in natural resource management and land-use decision-making. Second, a four-tier natural resource spatio-temporal knowledge service system is proposed, encompassing descriptive, diagnostic, predictive, and decision-making knowledge services, which provide essential support for applications such as resource status monitoring, land-use regulation, and disaster prevention and mitigation. Finally, the paper indicates that current natural resource management is transitioning from data aggregation and analysis to knowledge-driven intelligent services, forming an emerging research and application paradigm of big data, big analysis, big knowledge, and big services. [Prospect] Future efforts will focus on advancing collaborative data and knowledge mining technologies, addressing the standardization challenges in spatio-temporal knowledge bases and services, and exploring the potential of cutting-edge technologies such as generative large models in the natural resource domain to drive the information and intelligent transformation of natural resource management.