Change Detection Approach for High Resolution Remotely Sensed Images Based on Superpixel and Active Learning

  • WANG Chengjun 1, 2, 3,
  • MAO Zhengyuan 1, 2, 3, *,
  • XU Weiming 1, 2, 3,
  • WENG Qian 1, 2, 3, 4
  • 1. Key Lab of Spatial Data Mining & Information Sharing of Ministry of Education, Fuzhou University, Fuzhou 350116, China;
  • 2. National Engineering Research Centre of Geospatial Information Technology, Fuzhou University, Fuzhou 350116, China;
  • 3. Research Centre of Spatial Information Engineering in Fujian Province, Fuzhou University, Fuzhou 350116, China;
  • 4. College of Mathematics and Computer Science, Fuzhou University, Fuzhou 350116, China
*Corresponding author: MAO Zhengyuan, E-mail:

Received date: 2017-07-22

Revised date: 2017-09-24

Online published: 2018-03-02

Supported by

The Pilot Project of Fujian Provincial Science and Technology Department, No.2017Y01010103

The Natural Science Fund Project of Fujian Province, No.2017J01464

The Education Research Project for Young and Middle-aged Teachers of Fujian Province, No.JAT160087


Copyright: Editorial Office of Journal of Geo-information Science


Change detection with high resolution remote sensing images still faces unresolved problems: output patches that are scattered and have ragged boundaries, susceptibility to "salt-and-pepper" noise, the high cost of manual annotation in supervised training, redundancy among training samples, and underuse of the information in unlabeled samples. To address these problems, this paper proposes a change detection method for high resolution remote sensing images that combines superpixel segmentation with Active Learning (AL). The method consists of the following steps. First, the difference image is derived from the two temporal remote sensing images. Next, lattice-like homogeneous superpixels are obtained by applying the Simple Linear Iterative Clustering (SLIC) algorithm; SLIC is also compared with entropy-rate-based and modified-watershed-based superpixel algorithms in terms of superpixel homogeneity and adherence to image object boundaries. Then the means and standard deviations of three bands are computed for each superpixel object as spectral features, and entropy, energy and angular second moment are extracted from the Gray-Level Co-occurrence Matrix (GLCM) as texture features. After that, initial training samples are randomly selected and labeled, and further samples are chosen with the Margin Sampling (MS) strategy, an SVM-based AL algorithm that exploits the geometric properties of the SVM and suits binary classification problems. A cosine-distance-based sample similarity measure called Angle-Based Diversity (ABD) is introduced to reduce redundancy and ensure diversity among the selected samples. Finally, change detection is carried out using the information learned from the training samples.
The proposed algorithms (SLIC-MS and SLIC-MS+ABD) are applied to WorldView Ⅱ multispectral remote sensing data of urban and suburban scenes, and their detection results are compared with those of random sampling to assess detection accuracy. To illustrate efficiency, we compare the number of iterations the three techniques need to reach the same detection accuracy. Experimental results confirm that both SLIC-MS and SLIC-MS+ABD reduce manual labeling cost and achieve better change detection quality than random sampling. They also indicate that the two proposed methods can mine the unlabeled sample pool for highly uncertain samples, which are then labeled by the user. Compared with the other two methods, SLIC-MS+ABD is more accurate on identical data sets (the two remote sensing data sets mentioned above) with the same number of labeled samples, because the diversity of newly selected samples is considered during learning. In addition, SLIC-MS+ABD needs clearly fewer iterations than the other two approaches to reach the same detection accuracy. It can be concluded from the experiments that the proposed methods greatly reduce the amount of user labeling while achieving good change detection performance on high resolution remote sensing data sets, and that they avoid training sample redundancy when reaching the same change detection quality on the same data set.

Cite this article

WANG Chengjun, MAO Zhengyuan, XU Weiming, WENG Qian. Change Detection Approach for High Resolution Remotely Sensed Images Based on Superpixel and Active Learning[J]. Journal of Geo-information Science, 2018, 20(2): 235-245. DOI: 10.12082/dqxxkx.2018.170336

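1 Introduction

Change detection plays an important role in land use/land cover change analysis, urban expansion monitoring and disaster monitoring [1]; it is the process of extracting change information by observing the state of an object or phenomenon at different times [2]. In recent years, a series of successfully launched high-resolution commercial satellites have been supplying increasingly abundant high spatial resolution remote sensing imagery (hereafter "high resolution remote sensing images" or "high resolution images") at ever lower prices, which makes finer detection of land surface change possible. At the same time, higher spatial resolution markedly increases the spectral heterogeneity of such images. Excessive ground-object detail, the widespread occurrence of "same object, different spectra" and "same spectrum, different objects", and adverse factors such as shadows pose new challenges for change detection based on high resolution images. A large number of change detection methods have been proposed in the literature [3,4,5], each with its own characteristics, but applying them directly to high resolution images still runs into the following problems: ① traditional pixel-based change detection methods are prone to the "salt-and-pepper" effect, and the detected patches have fragmented edges; ② object-based change detection generally merges pixels into image objects with some segmentation algorithm, but existing segmentation algorithms have difficulty extracting local image features that properly express the spatial structure of the image; ③ change detection methods based on supervised learning require a large number of training samples, so manual labeling is costly and the efficiency of change detection is reduced.
By integrating the advantages of object-oriented and supervised learning methods, this paper proposes a change detection method for high resolution remote sensing images that combines superpixels with Active Learning (AL). To address the inability of existing segmentation algorithms to extract local image features effectively and express the spatial structure of the image properly, image objects are segmented at the superpixel scale. To address the low efficiency of supervised change detection, an active learning classifier is introduced to improve both the accuracy and the efficiency of change detection.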

2 Change detection method combining superpixel segmentation and active learning

Fig. 1 Flowchart of the proposed change detection method


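2.1 SLIC superpixel segmentation algorithm

In 2003, Ren et al. [4] first proposed the concept of the superpixel: a sub-region of an image that is locally consistent and preserves the local structural features of the image. Superpixel segmentation is the process of merging pixels into superpixels [6]. Graph-based and gradient-ascent-based approaches are the two basic routes to superpixel segmentation [7]. Normalized Cuts [8], Superpixel Lattices [9] and Entropy Rate Superpixel Segmentation [10] belong to the former; Watershed [11], Turbopixel [12] and Simple Linear Iterative Clustering (SLIC) [13] belong to the latter. This paper obtains superpixels with the SLIC algorithm proposed by Achanta et al. [14,15], which jointly considers the spectral and spatial characteristics of pixels and offers fast running speed, high memory efficiency and good preservation of edge information.
SLIC is an improvement of the K-means algorithm and requires only one parameter, K, the desired number of superpixels. For an image with N pixels, SLIC first initializes K cluster centers on a regular grid with interval $S = \sqrt{N/K}$. To avoid placing an initial center on an edge, the pixel with the smallest gradient in the 3×3 neighborhood of each initial center is found and the center is moved to it. Then, within the 2S×2S neighborhood of each cluster center, pixels are iteratively assigned to the nearest center according to the distance D (Eq. (1)) until convergence, which completes the segmentation [16,17].
$$D = \sqrt{\left(\frac{d_c}{m}\right)^2 + \left(\frac{d_s}{S}\right)^2} \tag{1}$$
where
$$d_c = \sqrt{\sum_{s_p} \left( I(x_i, y_i, s_p) - I(x_j, y_j, s_p) \right)^2} \tag{2}$$
$$d_s = \sqrt{(x_j - x_i)^2 + (y_j - y_i)^2} \tag{3}$$
Here $d_c$ is the spectral distance between pixels $i$ and $j$ accumulated over the bands $s_p$, $d_s$ is their spatial distance, $I(x_i, y_i, s_p)$ is the value of the pixel at $(x_i, y_i)$ in band $s_p$, and $m$ weights spatial compactness against spectral similarity.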
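The distance in Eqs. (1)-(3) can be illustrated with a short Python sketch. It is only a minimal illustration, not the authors' implementation: the pixel layout `(x, y, bands)`, the function names and the example value of the compactness weight `m` are assumptions made here.

```python
import math


def grid_interval(num_pixels, num_superpixels):
    """Grid step S = sqrt(N / K) used to seed the K cluster centers."""
    return math.sqrt(num_pixels / num_superpixels)


def slic_distance(pixel_i, pixel_j, m, S):
    """Combined distance D of Eq. (1) between two pixels.

    Each pixel is a tuple (x, y, bands), where bands holds one value per
    spectral band. This layout is an assumption of the sketch.
    """
    xi, yi, bands_i = pixel_i
    xj, yj, bands_j = pixel_j
    # Eq. (2): spectral distance accumulated over all bands
    d_c = math.sqrt(sum((a - b) ** 2 for a, b in zip(bands_i, bands_j)))
    # Eq. (3): Euclidean distance in image coordinates
    d_s = math.hypot(xj - xi, yj - yi)
    # Eq. (1): normalize each term, then combine
    return math.sqrt((d_c / m) ** 2 + (d_s / S) ** 2)


# Hypothetical example: a 100 x 100 image split into about 100 superpixels
S = grid_interval(100 * 100, 100)
center = (50, 50, (120.0, 80.0, 60.0))
pixel = (53, 54, (120.0, 80.0, 60.0))
print(slic_distance(center, pixel, m=10.0, S=S))
```

A larger `m` shrinks the spectral term relative to the spatial one, so superpixels become more compact and grid-like; a smaller `m` lets them follow spectral boundaries more closely.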

2.2 Active learning algorithm

Fig. 2 A typical framework of active learning


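The sample selection strategy is the core of an active learning algorithm. Margin Sampling (MS), MultiClass Level Uncertainty (MCLU), Query By Committee (QBC) and Entropy Query by Bagging (EQB) are four commonly used sample selection strategies [22,23]. This paper adopts MS, an active learning algorithm built on the SVM classifier that suits binary classification problems and makes good use of the geometric properties of the SVM. MS selects samples according to Eq. (4).
$$\hat{x} = \arg\min_{q_i \in Q} \left| f(q_i) \right| \tag{4}$$
$$f(q_i) = \sum_{j=1}^{n} y_j \alpha_j K(x_j, q_i) \tag{5}$$
where $f(q_i)$ denotes the distance of sample $q_i$ to the separating hyperplane (its predicted class is $\operatorname{sign} f(q_i)$); $K(x_j, q_i)$ is the kernel function, which defines the similarity between candidate sample $q_i$ and the $j$-th support vector; $\alpha_j$ is the coefficient of the $j$-th support vector; and the class label $y_j$ is +1 or -1 [24].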
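As a rough illustration of Eqs. (4)-(5), the following Python sketch ranks unlabeled candidates by the absolute value of the SVM kernel expansion and returns those closest to the hyperplane. The RBF kernel, its `gamma` value, the toy support vectors and all function names are assumptions of this sketch; the bias term is omitted, as in Eq. (5).

```python
import math


def rbf_kernel(x, z, gamma=0.5):
    """RBF kernel K(x, z); the kernel choice and gamma are assumptions."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(x, z)))


def decision_value(q, support_vectors, labels, alphas, kernel=rbf_kernel):
    """Kernel expansion of Eq. (5): sum_j y_j * alpha_j * K(x_j, q)."""
    return sum(y * a * kernel(x, q)
               for x, y, a in zip(support_vectors, labels, alphas))


def margin_sampling(pool, support_vectors, labels, alphas, batch_size=1):
    """Eq. (4): indices of the pool samples with the smallest |f(q)|."""
    scores = [abs(decision_value(q, support_vectors, labels, alphas))
              for q in pool]
    ranked = sorted(range(len(pool)), key=lambda i: scores[i])
    return ranked[:batch_size]


# Toy model: one support vector per class (hypothetical values)
support_vectors = [(0.0, 0.0), (2.0, 0.0)]
labels = [-1, +1]
alphas = [1.0, 1.0]
pool = [(0.1, 0.0), (1.0, 0.0), (1.9, 0.0), (1.2, 0.0)]
print(margin_sampling(pool, support_vectors, labels, alphas, batch_size=2))
```

In this toy setting the candidates at x = 1.0 and x = 1.2 lie nearest the boundary between the two support vectors, so they are the ones a user would be asked to label.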
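Sample selection needs to take both the uncertainty and the diversity of samples (the similarity between samples) into account. Although the samples chosen by margin sampling are highly informative, the multiple samples picked in each round all lie close to the decision boundary, so they carry a great deal of redundant information, do little to improve the classifier, and needlessly increase the manual labeling cost. Therefore, the sample selection strategy of SLIC-MS is further improved by introducing a cosine-angle-distance criterion for measuring similarity between samples, Angle-Based Diversity (ABD) [26], together with a change detection method that applies this criterion to keep the selected samples diverse (SLIC-MS+ABD). The cosine angle distance between samples is computed by Eq. (6) [26].
$$\left| \cos\left( \angle (x_i, x_j) \right) \right| = \frac{\left| \phi(x_i) \cdot \phi(x_j) \right|}{\lVert \phi(x_i) \rVert \, \lVert \phi(x_j) \rVert} = \frac{\left| K(x_i, x_j) \right|}{\sqrt{K(x_i, x_i) \, K(x_j, x_j)}} \tag{6}$$
where $x_i$ and $x_j$ denote samples $i$ and $j$, $\phi$ is the nonlinear mapping function and $K$ is the kernel function. The similarity between selected samples is measured by their cosine angle distance: the smaller the angle between two samples (that is, the larger the cosine value), the more similar they are.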
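The kernel-space cosine of Eq. (6) and one possible way of using it to diversify a batch can be sketched as follows. The text does not state how uncertainty and diversity are weighted against each other, so the greedy rule and the trade-off weight `lam` below are assumptions in the spirit of angle-based diversity, not the authors' exact procedure; the RBF kernel and all names are likewise hypothetical.

```python
import math


def rbf_kernel(x, z, gamma=0.5):
    """RBF kernel K(x, z); the kernel choice and gamma are assumptions."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(x, z)))


def kernel_cosine(x_i, x_j, kernel=rbf_kernel):
    """Eq. (6): |cos| of the angle between phi(x_i) and phi(x_j)."""
    return abs(kernel(x_i, x_j)) / math.sqrt(kernel(x_i, x_i) * kernel(x_j, x_j))


def select_diverse(candidates, margins, batch_size, lam=0.5, kernel=rbf_kernel):
    """Greedy batch selection balancing uncertainty and diversity.

    Assumed score (lower is better):
        lam * |f(x)| + (1 - lam) * max cosine to already chosen samples
    """
    chosen = []
    remaining = list(range(len(candidates)))
    while remaining and len(chosen) < batch_size:
        def score(i):
            # Redundancy is 0 for the first pick, since nothing is chosen yet
            redundancy = max(
                (kernel_cosine(candidates[i], candidates[j], kernel)
                 for j in chosen),
                default=0.0,
            )
            return lam * abs(margins[i]) + (1 - lam) * redundancy
        best = min(remaining, key=score)
        chosen.append(best)
        remaining.remove(best)
    return chosen


# Two near-duplicate uncertain samples and one distinct sample (toy values)
candidates = [(0.0, 0.0), (0.05, 0.0), (3.0, 0.0)]
margins = [0.10, 0.11, 0.30]  # |f(x)| for each candidate
print(select_diverse(candidates, margins, batch_size=2))
```

Plain margin sampling would pick the two near-duplicates here; with the diversity term the second pick moves to the distinct sample, which is the redundancy reduction the paragraph above describes.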

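3 Experimental results and analysis

3.1 Experimental data

Two different scenes, an urban area and a mountainous area, are taken as study areas. Multi-temporal WorldView Ⅱ multispectral images (red, green, blue and near-infrared bands; 1.8 m spatial resolution) of part of Beijing (337×510) and part of Fuzhou (689×1045) are used as data sources. The images were preprocessed by radiometric correction, geometric correction and registration; the results are shown in Fig. 3. In the Fuzhou test area the vegetation types changed to some extent between the two dates, but since this paper focuses on detecting changes in man-made objects, those changes are not considered.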
Fig. 3 True color composite(RGB) images of case study area at different times


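3.2 Evaluation metrics

$$\mathrm{TPR} = \frac{FN}{P'} \tag{7}$$
$$\mathrm{FPR} = \frac{FP}{P} \tag{8}$$
where FN, FP, P and P′ are entries of the error matrix in Table 1. Eq. (7) gives the missed detection rate and Eq. (8) the false detection rate reported in Section 3.3.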
Tab. 1 Error matrix of change detection

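| | Actually changed pixels | Actually unchanged pixels | Row total |
|---|---|---|---|
| Detected as changed | TP | FP | P |
| Detected as unchanged | FN | TN | N |
| Column total | P′ | N′ | T |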
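For illustration, the metrics can be computed from the four cells of the error matrix as in the sketch below. The missed and false detection rates follow Eqs. (7) and (8). The Kappa coefficient uses the standard Cohen's kappa formula, which is assumed here to be the definition behind the values reported in Section 3.3. The function name and the example counts are hypothetical.

```python
def change_detection_metrics(tp, fp, fn, tn):
    """Missed detection rate, false detection rate and Kappa from Table 1."""
    total = tp + fp + fn + tn      # T
    detected_changed = tp + fp     # P  (row total)
    detected_unchanged = fn + tn   # N  (row total)
    actual_changed = tp + fn       # P' (column total)
    actual_unchanged = fp + tn     # N' (column total)

    missed_rate = fn / actual_changed    # Eq. (7)
    false_rate = fp / detected_changed   # Eq. (8)

    # Cohen's kappa: observed agreement corrected for chance agreement
    p_o = (tp + tn) / total
    p_e = (detected_changed * actual_changed
           + detected_unchanged * actual_unchanged) / total ** 2
    kappa = (p_o - p_e) / (1 - p_e)

    return {"missed_rate": missed_rate,
            "false_rate": false_rate,
            "kappa": kappa}


# Hypothetical pixel counts, not taken from the experiments
print(change_detection_metrics(tp=80, fp=20, fn=20, tn=880))
```

Because changed pixels are usually a small share of the image, Kappa is more informative than overall accuracy here: it discounts the agreement that the large unchanged class would produce by chance.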

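3.3 Results and analysis

Following the workflow of the proposed method, the two images are first differenced to obtain the difference image (Fig. 4). The SLIC algorithm is then run on the difference image several times with different segmentation parameters, and the best superpixel segmentation result is kept. Fig. 5 shows the segmentation results of three methods for the same area: SLIC, entropy-rate-based superpixel segmentation and improved watershed superpixel segmentation [26].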
Fig. 4 Difference images


Fig. 5 The segmentation results of different superpixel segmentation algorithms in Beijing


Fig. 6 The segmentation results of different superpixel segmentation algorithms in Fuzhou


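As Fig. 7 shows, the Beijing study area has dense buildings and complex land-cover types. In the early iterations, the Kappa coefficient and the missed detection rate, which reflect the classification performance of each method, differ little among the methods. This is because the number of samples is small and the trained classifier is not very accurate, so sample selection is close to random. At 250 samples (15 iterations), the advantage of SLIC-MS+ABD appears: its Kappa coefficient rises from 0.4246 to 0.7030 and its missed detection rate falls from 0.5018 to 0.3286, indicating that at the 15th iteration the MS+ABD strategy selected more informative samples and thereby improved classifier accuracy considerably. The small fluctuations during the rise are caused by selected samples with low information content. As the number of samples grows, change detection accuracy improves steadily, whereas the accuracy of SLIC-MS only shows a large gain after 20 iterations. Notably, the false detection rates of all three methods are low and decline along similar trends. Although the false detection rate of SLIC-MS+ABD is higher than that of the other two methods during the first 30 iterations, it drops sharply afterwards and approaches convergence at 50 iterations (700 labeled samples).
The Fuzhou area has simpler land cover, mostly vegetation, and its changed regions are distinct, so the accuracy of both proposed methods rises quickly. SLIC-MS+ABD nevertheless shows the clearer upward trend and converges faster: it converges at the 40th iteration (500 labeled samples) with accuracy clearly higher than that of the other two methods, and its learning curve is stable with small fluctuations during iteration. Fig. 8 shows the change detection maps of the three methods at the point where SLIC-MS+ABD converges, and Table 2 lists the corresponding detection accuracy of each method.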
Fig. 7 Accuracy comparison of the change detection


Fig. 8 Change detection results of different algorithms

Tab. 2 The change detection accuracy of different methods

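| Method | Beijing: Kappa coefficient | Beijing: missed detection rate | Beijing: false detection rate | Fuzhou: Kappa coefficient | Fuzhou: missed detection rate | Fuzhou: false detection rate |
|---|---|---|---|---|---|---|
| SLIC-MS+ABD | 0.8306 | 0.1630 | 0.1263 | 0.8763 | 0.0719 | 0.0642 |
| SLIC-MS | 0.8008 | 0.2106 | 0.1439 | 0.8017 | 0.2018 | 0.1153 |
| SLIC-RS | 0.6660 | 0.4106 | 0.1625 | 0.7157 | 0.3098 | 0.1190 |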
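As Fig. 8 shows, with the same number of iterations SLIC-MS+ABD yields better detection results, whereas SLIC-MS and SLIC-RS cannot preserve the completeness and boundary features of the changed regions well: their results are fragmented, they converge more slowly and their accuracy is lower. With few training samples, SLIC-RS selects samples at random and SLIC-MS considers only sample uncertainty, not diversity, when adding training samples. The samples added to the training set are therefore not informative enough and may even be redundant, the learning capacity of the classifier is not fully exploited, and the accuracy and efficiency of change detection are limited. SLIC-MS+ABD detects most of the changed regions with little noise, good edge preservation (red circles in Fig. 8(b)), structurally complete changed regions, and low missed and false detection rates. This is because, with only a small number of training samples, SLIC-MS+ABD can select from the unlabeled samples those most valuable for the classifier to learn from, so that the classifier attains good performance, the manual labeling burden is reduced and change detection becomes more efficient. To show how much SLIC-MS+ABD reduces the labeling burden, the number of iterations is used as a proxy for manual labeling cost: the more iterations, the higher the cost. Table 3 lists the number of iterations each of the three methods needs to reach the same detection accuracy (a Kappa coefficient of 0.7, for example).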
Tab. 3 The numbers of iteration rounds required by different methods for achieving the same classification accuracy on the two research areas

| Method | Beijing | Fuzhou |
|---|---|---|
| SLIC-MS | 20 | 30 |
| SLIC-RS | 90 | 50 |

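4 Conclusions

Superpixel objects produced by SLIC segmentation preserve edge features well and are internally homogeneous. The MS active learning method selects highly uncertain samples from the unlabeled set for the user to label, but it ignores sample diversity and therefore improves the classifier only to a limited extent. The MS+ABD active learning method considers both uncertainty and diversity when choosing samples: it submits the most informative and valuable samples to the user for manual labeling and then adds them to the training set, which clearly speeds up learning. The proposed SLIC-MS+ABD change detection method integrates the relative advantages of the two and improves the accuracy and efficiency of change detection at the same time. The experimental results show that: ① the superpixel objects generated by SLIC adhere well to edges, are compact and are highly homogeneous internally, which avoids the loss of ground-object detail; ② with active learning, good detection results can be achieved by labeling only a small number of training samples, which reduces the manual labeling cost of training and effectively improves change detection efficiency. Future work will address how to choose the initial samples and the number of initial samples per class sensibly, and how to combine the method with other semi-supervised classification methods to further reduce the labeling burden and improve the generalization ability and robustness of the algorithm.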

