感知物理先验的机器学习及其在地理空间智能中的研究前景
张 彤(1979— ),男,福建武夷山人,博士,教授,研究方向为时空机器学习、高分遥感解译、交通地理信息系统。E-mail: zhangt@whu.edu.cn |
收稿日期: 2022-10-17
修回日期: 2022-11-04
网络出版日期: 2023-06-30
基金资助
国家重点研发计划“政府间国际科技创新合作/港澳台科技创新合作”重点专项(2019YFE0106500)
国家自然科学基金项目(41871308)
Physics-informed Machine Learning and Its Research Prospects in GeoAI
Received date: 2022-10-17
Revised date: 2022-11-04
Online published: 2023-06-30
Supported by
National Key R&D Program of China (International Scientific & Technological Cooperation Program)(2019YFE0106500)
National Natural Science Foundation of China under Grant(41871308)
许多复杂的物理现象和动态系统尚未为科学家所完全理解和解释,因此不能构建确定的数学方程来描述,不能直接使用紧凑的物理机理模型来进行分析和预测。随着观测数据的日益丰富,数据驱动的机器学习方法可以较好地描述复杂非线性现象,但是纯数据驱动模型在表征能力、可解释、泛化能力、样本利用效率方面还存在很多不足。常规机器学习方法在地学领域的应用还面临时空异质性、样本稀疏等带来的挑战。近年来感知物理先验的机器学习方法可以在物理原理不太明确的情况下更有效地利用观测数据描述和分析动态系统,受到了广泛关注,并在物理、计算机、生物、医学和地学等领域得到了一定的应用。近年来人工智能和机器学习技术已经大量应用于地理学尤其是地理信息和遥感领域,受到地理学者广泛重视,被称为地理空间智能,并已成为一个热门的研究方向。感知物理先验的机器学习方法融合了模型驱动和数据驱动思想,为地理空间智能研究带来新的研究范式,促进各种复杂地理现象的精细描述和预测。本文首先分别从物理先验的表达形式和如何在机器学习中集成物理先验两方面概述了该领域的进展。物理先验是在机器学习建模之前已经存在、独立于机器学习方法的知识。本文从增广的数据和定制特征、物理定律和约束规则、支配方程、几何特性等方面总结物理先验知识的表达形式。主要从机器模型约束建模、辅助任务设计和模型的训练推理角度总结如何在机器学习模型中有效集成各种物理先验。本文基于以上的综述框架,结合地学机器学习和地理空间智能的发展和前沿问题,探讨了地理时空先验与其他物理先验的关系,简要总结和分析了目前感知时空先验的地理空间智能方法研究案例,探讨了时空先验表征以及集成时空先验地理空间智能的未来研究规划和应用前景。随着感知物理先验的机器学习方法研究的快速发展,我们相信感知时空先验的地理空间智能研究未来将逐步构建起跨多时空尺度的通用地理表征、分析、预测和解释框架,不仅能更好地解决地理信息科学的传统问题,还将鼓励地理学者与其他相关学科一起建立交叉研究的前沿机会,探索解决人类未来面临的共同挑战。
张彤 , 刘仁宇 , 王培晓 , 高楚林 , 刘杰 , 王望舒 . 感知物理先验的机器学习及其在地理空间智能中的研究前景[J]. 地球信息科学学报, 2023 , 25(7) : 1297 -1311 . DOI: 10.12082/dqxxkx.2023.220795
Scientists still cannot fully understand and explain many complex physical phenomena and dynamic systems, which cannot be described by deterministic mathematic equations and be analyzed and predicted through compact physical mechanistic models. With the ever-increasing of observational data, data-driven machine learning methods can effectively describe many complex non-linear phenomena. Nevertheless, pure data-driven models still have shortcomings in representation, interpretation, generalization capabilities, and sample efficiency. Conventional machine learning methods are confronted with challenges brought by spatiotemporal heterogeneity and sample sparsity. Recently, Physics-Informed Machine Learning (PIML) can effectively leverage observation data to describe and analyze dynamical systems when physical principles are uncertain. PIML has gain wide attention and been extensively applied in physics, computer science, biology, medical science, and geosciences. In recent years, artificial intelligence and machine learning technologies have been widely applied in geography, especially in GIScience and remote sensing, attracting wide research interests of geographers. This line of research is termed GeoAI and has become a cutting-edge research frontier in geography. PIML methods integrate the ideas of model-driven and data-driven methods, introducing new research paradigms for GeoAI and improving the description and prediction of complex geographical phenomena. This survey first summarizes recent progress in this domain from the perspectives of the representation of physical priors and the integration of physical priors in machine learning methods. Physical prior refers to existing independent knowledge that is already available before building machine learning models. This survey reviews the representation of physical priors from the aspects of augmented data and customized features, physical laws and constraints, governing equations as well as geometric properties. We also review how physical priors are integrated into various machine learning models, including constraint modeling, auxiliary task design as well as model training and inference. Based on the PIML survey framework, we explore the relationships between spatiotemporal priors and other physical priors, before briefly reviewing and summarizing typical case studies of spatiotemporal prior-informed GeoAI research. We also discuss the research agenda and future prospects of spatiotemporal prior representation and the spatiotemporal prior-informed GeoAI in the context of geo-machine learning and GeoAI frontiers. In light of fast progress of PIML, we contend that GeoAI studies that are well informed by spatiotemporal priors can gradually establish a generic geographical representation, analysis, prediction, and interpretation framework, which not only helps handle many classical problems in GIScience but also addresses future profound challenges of human being by encouraging geographers to explore more research opportunities when collaborating with researchers from other disciplines.
表1 感知时空先验的机器学习方法分类和案例Tab. 1 Types and case studies of spatio-temporal prior-informed machine learning |
集成方法类型 | 优点 | 缺点 | 参考文献 |
---|---|---|---|
损失函数定制 | 设计和实现方便、可以增加多个正则项表达各种先验 | 作为软约束不能保证时空先验的严格实现 | [103]、[104] |
网络架构定制 | 严格满足具体先验、设计灵活、扩展性好 | 技术难度高,需要对深度学习和研究问题有很深的理解 | [95]、[96] |
描述几何特性 | 引入合理的几何特性作为归纳偏好,可以显著改善泛化能力和训练效率 | 现有不变性、等变性定义过于粗略,特定动态系统的缺乏精细化几何特性定义 | [34] |
描述动态系统 | 精细化描述局部和远程时空相干和关联,灵活表征复杂动态系统 | 缺少明确的支配方程,时空先验的定义和集成实现技术难度大 | [80]、[104] |
辅助任务设计 | 可在元学习等框架下设计辅助任务,更充分地利用任务间关联集成先验知识 | 辅助任务的合理设计难度较高;如果相关性不强,辅助任务有可能影响主任务 | [60]、[103] |
描述时空异质 | 通过位置、方向的表征体现时空异质更好地描述地理动态系统 | 时空异质模式复杂,非地学领域研究较少,可参考借鉴的方法比较缺乏 | [103]、[105-107] |
[1] |
|
[2] |
|
[3] |
|
[4] |
|
[5] |
|
[6] |
李野, 陈松灿. 基于物理信息的神经网络:最新进展与展望[J]. 计算机科学, 2022, 49(4):254-262.
[
|
[7] |
|
[8] |
|
[9] |
|
[10] |
|
[11] |
|
[12] |
|
[13] |
|
[14] |
|
[15] |
|
[16] |
|
[17] |
|
[18] |
|
[19] |
|
[20] |
|
[21] |
|
[22] |
|
[23] |
|
[24] |
|
[25] |
|
[26] |
|
[27] |
|
[28] |
|
[29] |
|
[30] |
|
[31] |
|
[32] |
|
[33] |
|
[34] |
|
[35] |
|
[36] |
|
[37] |
|
[38] |
|
[39] |
|
[40] |
|
[41] |
|
[42] |
|
[43] |
|
[44] |
|
[45] |
|
[46] |
|
[47] |
|
[48] |
|
[49] |
|
[50] |
|
[51] |
|
[52] |
|
[53] |
|
[54] |
|
[55] |
|
[56] |
|
[57] |
|
[58] |
|
[59] |
|
[60] |
|
[61] |
|
[62] |
|
[63] |
Desai,
|
[64] |
|
[65] |
|
[66] |
|
[67] |
|
[68] |
|
[69] |
|
[70] |
|
[71] |
|
[72] |
|
[73] |
|
[74] |
|
[75] |
|
[76] |
|
[77] |
|
[78] |
|
[79] |
|
[80] |
|
[81] |
|
[82] |
|
[83] |
|
[84] |
|
[85] |
|
[86] |
艾廷华. 深度学习赋能地图制图的若干思考[J]. 测绘学报, 2021, 50(9):1170-1182.
[
|
[87] |
|
[88] |
|
[89] |
张永生, 张振超, 童晓冲, 等. 地理空间智能研究进展和面临的若干挑战[J]. 测绘学报, 2021, 50(9):1137-1146.
[
|
[90] |
高松. 地理空间人工智能的近期研究总结与思考[J]. 武汉大学学报·信息科学版, 2020, 45(12):1865-1874.
[
|
[91] |
刘瑜, 郭浩, 李海峰, 等. 从地理规律到地理空间人工智能[J]. 测绘学报, 2022, 51(6):1062-1069.
[
|
[92] |
|
[93] |
|
[94] |
|
[95] |
|
[96] |
|
[97] |
|
[98] |
|
[99] |
|
[100] |
|
[101] |
|
[102] |
|
[103] |
|
[104] |
|
[105] |
|
[106] |
|
[107] |
|
[108] |
|
[109] |
|
[110] |
|
[111] |
|
[112] |
|
[113] |
|
/
〈 |
|
〉 |