Orginal Article

An Overview of Quantitative Experimental Methods for Segmentation Evaluation of High Spatial Remote Sensing Images

  • CHEN Yangyang ,
  • MING Dongping , * ,
  • XU Lu ,
  • ZHAO Lu
Expand
  • School of Information Engineering, China University of Geosciences (Beijing), Beijing 100083, China
*Corresponding author: MING Dongping, E-mail:

Received date: 2017-03-15

  Request revised date: 2017-04-19

  Online published: 2017-06-20

Copyright

《地球信息科学学报》编辑部 所有

Abstract

Geographic Object-Based Image Analysis (GEOBIA) is much better than traditional pixel-based method of high spatial resolution remote sensing image analysis. Since image segmentation is the key technique in GEOBIA, scholars and researchers have already conducted extensive research and proposed a number of segmentation algorithms. In order to compare different segmentation methods and evaluate its own performance, segmentation results need to be evaluated. Therefore, the study of segmentation evaluation is equally important to segmentation algorithm. We could choose the applicable segmentation method and set appropriate parameters for specific images and applied the segmentation evaluation. The aim of image segmentation is to enable the automation of image analysis. However, the evaluation methods which cannot provide quantitative indexes are not applicable in automatic real-time image analysis system. Moreover, research in segmentation evaluation is less than segmentation itself. Thus, it will be significant to study segmentation and review the quantitative evaluation method. In this paper, based on summarizing the evaluation methods, the hierarchy of segmentation evaluation method is presented. In spite of describing quantitative empirical methods, we discussed their range of application. Their advantages and shortcomings were also analyzed. Finally, possible future direction and potential application prospect for high spatial remote sensing image segmentation evaluation were proposed.

Cite this article

CHEN Yangyang , MING Dongping , XU Lu , ZHAO Lu . An Overview of Quantitative Experimental Methods for Segmentation Evaluation of High Spatial Remote Sensing Images[J]. Journal of Geo-information Science, 2017 , 19(6) : 818 -830 . DOI: 10.3724/SP.J.1047.2017.00818

1 引言

近年来随着遥感成像技术的迅猛发展,人们可以十分便捷、高效地获取卫星和飞行器所采集的各种高空间分辨率遥感影像(简称高分影像)。高分影像目前已经广泛应用于土地资源管理、城市及道路交通规划、灾害监测与评估、军事目标检测等领域。与遥感成像技术所取得的进展相比,高分影像的处理与分析技术发展较慢,高分影像在应用中的潜力没有得到充分发挥和释放。因此,对高分影像进行高效、自动化的信息提取与分析是当前遥感科学研究的重点和亟待突破的瓶颈。
高分影像较中低空间分辨率影像包含了大量的地物空间细节信息,对其进行空间分析及信息提取的技术难度显著增加。针对高分影像使用基于像素的传统图像分析方法只能提取和使用单个像元的光谱统计信息,而忽视了目标之间的空间信息[1]。GEOBIA(Geographic Object-Based Image Analysis)技术因其能充分利用高分影像中影像对象丰富的大小、形状和纹理等信息[2],针对高分影像分析的效果和精度远优于基于像元的传统方法[3]。影像分割作为高分影像信息提取的关键步骤,是GEOBIA技术中的基础环节,分割之后得到的影像对象可替代传统方法中的像元进行后续的图像分析处理(如图像分类、目标提取等)[4-6]。基于影像分割的特征提取和目标表达能将原始图像转化为更抽象更紧凑的形式,使得更高层的图像分析和理解成为可能[7]
迄今为止,学者们已经提出大量针对高分影像的分割算法,在实验中需根据影像的特性和需求选择合适的分割算法并为其设定一个或者多个分割参数,但如何比较2种分割算法或对同一种分割方法比较不同参数设定结果来辅助实现分割参数优选是分割的难点之一,而且影像分割效果的好坏会直接影响影像后续分析处理的结果和精度[8]。因此,设计出一套对影像分割结果进行评价的方法和体系具有十分重要的意义。现今对分割算法的评价研究还远远落后于对分割算法本身的研究,一定程度上制约了影像分割技术的应用和发展,因此本文对高分影像分割评价方法进行了系统的总结,并指出了分割评价方法未来的改进方向和应用前景。

2 现有分割评价方法体系

虽然分割评价的研究相对于分割算法本身较为落后,但还是涌现出很多具有一定适用性的分割评价算法。这些方法各不相同,可以划分为5大类别(图1[9]
Fig. 1 The hierarchy of segmentation evaluation methods

图1 分割评价方法体系

根据是否需要人工通过目视评判对分割结果进行评价,可将整个分割评价方法划分为主观评价法和客观评价法。主观评价法是目前在影像分割领域应用最为广泛的分割评价方法,该方法以人类自身的视觉感知效果为评判标准,针对同一分割结果综合多名评价者的打分,进而对某种分割方法进行定性的评价。客观评价法包括系统级评价法和直接评价法,系统级评价法将分割作为整个系统的一部分,依据分割结果进行后续信息分析提取的效果和精度(如分类和目标提取精度)间接对分割方法进行评价。直接评价法可以对分割算法进行评价,也可以对分割算法产生的结果进行评价,所以直接评价法可以分为分析法(定性、定量指标)和实验法(定量指标)。分析评价法在无需进行分割实验的前提下,直接对算法本身进行评价,通过对分割算法的基本思想和理论的分析推理得到算法的性能和适用性,评价指标可分为定性和定量2类,其意义在于发现算法的实质性缺陷,并明确算法的改进方向。实验法通过分割实验,对分割结果进行评价,根据是否需要使用真实地表分割数据可划分为监督度评价法(差异实验法)和非监督评价法(优度实验法)。需要说明的是,上述评价方法并非相互独立或排斥,每种方法都有不同的特点和局限性。针对不同影像,对其分割结果进行评价可能会用到上述方法中的一种或多种的组合。
主观评价法给出的评价结果定性且主观性强,而系统级评价法和分析评价法需要结合特定的分割算法和应用目,缺乏普适性,在实时、自动化的影像信息分析中仍存在一定的局限性。定量的实验评价法是分割评价研究领域关注的重点,目前国内外还未系统和全面的总结。本文从监督评价和非监督评价2个方面对高分影像分割算法进行了系统地综述。

3 监督评价法

监督评价法又称差异实验法,该方法将分割结果与手工选取的分割参考数据(黄金标准图像[31])进行对比,以二者之间的差异或不相似度作为评判标准。在精确地确定地表真实地物范围和影像分割对象之后,监督评价法能有效地克服主观性,客观定量地对分割算法的性能进行评价,是最佳的评价方法[32-33]。近年来,监督评价法已逐步取代主观评价法,成为较常用的分割评价方法。监督评价法进行分割评价有3个主要步骤:建立分割参考数据、对象匹配、差异(相似度)计算。

3.1 建立分割参考数据

分割参考数据是代表研究区域最理想分割结果的参考基准,由若干个参考对象构成,每个参考对象是一个矢量多边形。针对高分影像的参考数据通常通过目视解译或者实地数据采集的方式数字化得到,在本文中表示为参考对象集 R = r i ; i = 1 , , n 。参考数据的获取费时费力。对于整幅影像,特别是高分影像,地物细节十分丰富,参考数据中分割对象的数量庞大,建立一个数字化的分割参考图具有一定的主观性且难度大[34],一定程度上限制了监督评价法的运用范围。

3.2 对象匹配

在分割参考影像建立之后,需要从分割方法所对应的分割对象集 S = s j : i = 1 , , m 中提取出与每一个参考对象 r i 构成空间重叠关系的重叠对象集 S i = s j : area ( s j r i ) 0 。重叠对象集的建立是对参考对象 r i 和分割对象 s j 的粗匹配,仅考虑了空间拓扑关联。由于实际分割结果与参考数据不可避免会存在偏差,重叠对象集中的每个分割对象可能与多个参考对象存在重叠关系,而在最佳理想状态下每一个参考对象和它的匹配对象应当是一对一的关系(图2(b))。因此需要对重叠对象集 S i 进行筛选,得到匹配对象集 S i * = s j : j = 1 , , v 。对重叠对象进行筛选的条件主要有“对象面积占比”和“对象空间位置”。
Fig. 2 Arithmetic relationship between the reference object and the corresponding object

图2 参考对象和匹配对象数量关系

Lucieer等[35]提出区域重叠面积最大法,将与参考对象 r i 重叠面积最大的重叠对象 s j 定为对应的匹配对象,每个参考对象匹配了一个重叠对象。该方法应用广泛,原理简单,计算复杂低,但当重叠面积占匹配对象面积较小时,匹配对象属于参考对象的欠分割对象,影响匹配结果的可靠性。
S i * Max = s j : max a rea r i s j , s j S i (1)
赵磊等[36]对区域重叠面积最大法进行了改进,定义了反映欠分割程度的参数,即欠分割比例(Under Segmentation Ratio,USR)。通过该参数限制欠分割对象参与对象匹配,基于匹配结果可以更加客观的评价相应分割方法。
USR = 1 - max ( r i s j ) s j (2)
Liu等[37]提出单向50%法(One-Sided 50%):重叠面积大于参考对象 r i 或重叠对象 s j 二者之一面积的50%的重叠对象为匹配对象。单向50%法的匹配原则较区域重叠面积最大法更加严格,得到的匹配结果相对可靠,但每个参考对象往往存在多个匹配对象,且在许多分割关系下并不适用。例如当一个参考对象 r i 完全包含多个重叠对象 s j 时,既不存在过分割现象,也不存在欠分割现象。
S i * 1 = s j area r i s j area s j > 50 % area r i s j area r i > 50 % , s j S i (3)
Yang等[38]对单向50%方法进行了改进,提出了双向50%法(Two-Sided 50%):重叠面积均大于参考对象 r i 和重叠对象 s j 二者面积的50%的重叠对象为匹配对象。该方法较单向50%法,可以从多个单向50%匹配对象中,所选出最能代表真实分割情况的匹配对象。
S i * 2 = s j : area r i s j area s j > 50 % area r i s j area r i > 50 % , s j S i (4)
E Schöpfer等[39]提出Object-Fate匹配方法。将重叠对象按照参考对象和重叠对象的面积占比和空间位置划分为优良(Good)匹配对象、扩张(Expanding)匹配对象和侵入(Invading)匹配对象3类(图3)。优良匹配对象和扩张匹配对象共同组成匹配对象;优良匹配对象完全落入参考对象的范围当中;扩张匹配对象的范围超出参考对象,但是其质点位于参考对象范围内,且重叠面积大于扩张匹配对象的50%;侵入匹配对象的范围超出参考对象,其质点位于参考对象范围之外,且重叠面积小于扩张匹配对象的50%;优良匹配对象和扩张匹配对象的合并区域即所需的匹配对象。
Goo d i = s j : area r i s j = area ( s j ) , s j S i (5)
Expandin g i = s k : area r i s k area s k > 50 % s k 的质点在 r i 内部 , s k S i (6)
Invadin g i = s l : area r i s l area s l < 50 % s l 的质点不在 r i 内部 , s l S i (7)
Fig. 3 The object-fate matching method

图3 Object-Fate匹配方法

3.3 差异计算

匹配完成之后就可以对匹配对象进行差异计算,学者们提出了多种评价指标对分割对象和分割参考对象的差异度进行度量,差异评价指标可以基于对象的形状、大小、位置、边界、灰度和分割、参考对象数量。差异的度量结果数值越大,分割结果和理想的标准分割结果偏差越大,说明在此类影像中该分割算法的性能较差。本文将目前常用的评价指标分为几何关系指标、数量指标、混合指标。
3.3.1 几何关系指标
3.3.1.1 基于区域
基于区域的测度指标主要是对参考对象和匹配对象之间的几何关系差异(Geometric Discrepancy)进行度量。几何关系包含3种基本类型,分别是重叠 r i s j 、过分割 r i - s j 、欠分割 s j - r i 图4)。文献[37]、[40]-[42]对这3种几何关系进行了详细的定义和阐述。
Fig. 4 Geometric relationship between the reference object and the corresponding object

图4 参考对象和匹配对象几何关系

(1)基于重叠区域
Fram等[43]和陈秋晓等[44]基于区域重叠面积最大法选定与参考对象重叠面积最大的重叠对象进行匹配,通过将对象重叠区域定义为参考对象被正确分割的区域,定义了正确分割的百分数(Fraction of Correctly Segmented Pixels,FCSP)。该方法仅对参考对象的过分割现象进行了描述,但未对匹配对象的欠分割程度进行评价。
FCS P i = area r i s j area ( r i ) , s j S i * max (8)
FCSP的取值与参考对象和分割对象之间的重叠面积呈正相关,取值范围为[0, 1]。FCSP值为1时,对应的影像分割效果最好。
FCSP类似,Lucieer等[35]基于区域重叠面积最大法,针对对象之间的面积差异提出了面积匹配指数(Area Fit Index,AFI)。
AF I i = area r i - area s j area r i , s j S i * max (9)
式中:AFI的取值范围[-1,1],理想取值为0。AFI大于0表示分割对象分割过度,AFI小于0表示分割对象未分割完全。但当重叠面积占参考对象或匹配对象面积比较小时,对欠分割和过分割现象的评价并不可靠。
Zhan等[45]定义了相似尺寸(SimSize)指标,对参考对象和匹配对象的尺寸(面积)相似程度进行评价。该方法仅考虑了面积相似度,在一对多的匹配关系下,当参考对象和多个匹配对象面积相似度相同但重叠面积不同时,显然重叠面积大的匹配对象分割结果更理想,但通过SimSize值无法判断。
SimSiz e ij = min area r i , area s j max area r i , area s j , s j S i * 1 (10)
式中:SimSize的取值在[0, 1]之间,最佳取值为1。
Moller等[46]根据重叠区域面积占参考对象和重叠对象面积的比例,定义了相关区域指标(Relative Area in Sub-Object,RAsub)(等价于FCSP),(Relatice Area in Super-Object,RAsuper),对过分割和欠分割现象进行评价。
R A su b ij = area r i s j area ( r i ) , s j S i (11)
R A supe r ij = area r i s j area ( s j ) , s j S i (12)
式中:RAsub和RAsuper的取值范围为[0, 1],取值越高分割结果越理想。
Weidner[47]定义了质量合格率(Quality Rate,QR)。当参考对象和匹配对象的重叠面积等于对象并集的面积时二者完全重合,面积相似度达到最大,分割效果最好。与FCSPAFI相比,QR的计算不仅考虑了参考对象和匹配对象的重合面积,还考虑了二者面积的相似度,能更加客观的对二者的几何关系进行评定。
Q R ij = 1 - area r i s j area r i s j , s j S i * 1 (13)
式中:QR取值范围为[0, 1],理想取值为0。
基于重叠区域的测度指标,计算简单,定义直观,是目前较为常用的评测指标。
(2)基于过分割、欠分割区域
Clinton等[48]提出过分割比例(Over Segmentation,OS),对RAsubRAsuper指标进行改进,将重叠对象替换为匹配对象,使用欠分割区域和过分割区域替代重叠区域,欠分割比例(Under Segmentation,US),通过过分割和欠分割区域面积占参考对象和匹配对象的面积的比例,对分割对象的的过分割欠分割程度进行度量。
O S ij = 1 - area r i s j area r i = area r i - s j area r i , s j S i * 1 (14)
U S ij = 1 - area r i s j area s j = area s j - r i area s j , s j S i * 1 (15)
OSUS通常组合为ED1使用,取值范围均为[0,1],取最佳值0时分别参考对象不存在过分割现象和匹配对象不存在欠分割现象。
分割后续的影像分类应用中,在分类器设计合理的情况下,过分割对象依旧可以被正确分类,不会对最终分类结果造成影响。但欠分割无可避免的会直接影响到分类精度[47,42]。Liu等[37]基于此特性对US指标进行改进,提出潜在分割误差 (Potential Segmentation Error,PSE)。通过错误分类比例(欠分割区域面积/参考对象面积)间接对分割结果进行评价。
PSE = area s j - r i area r i , s j S i * 1 (16)
式中:PSE的取值范围均为[0, + ],取最佳值0时参考对象不存在欠分割现象。
PSENSR通常组合为ED2(表1)使用,NSRPSE为同一量级时ED2指数最可靠。ED2指数同时对分割结果的几何差异和数量差异进行了度量,但当大量参考对象被过分割时,NSRPSE的量级难以一致。此外,一对多过分割和多对一欠分割的共存也会导致NSR指标的无效。因此,Yang 等[49]NSRPSEED2改进,基于局部欠分割比例和过分割比例的平均值,提出了过分割指标2 (OverSegmentation 2,OS2)、欠分割指标2 (Under-Segmentation 2,US2)。
OS 2 = 1 - area r i s j area ( r i ) ¯ , s j S i * 1 (17)
US 2 = 1 - area r i s j area ( s j ) ¯ , s j S i * 1 (18)
式中:OS2和US2为归一化指标,取值范围均为[0, 1]。OS2和US3通常组合为ED3(表1)使用,可以对全局的几何差异和数量差异进行评价。
Tab. 1 Typical combined measures of supervised evaluation method

表1 典型监督评价混合指标

指标 公式 组合 文献
ED1 ED1=OS2+US22 欠分割和过分割指标 Clinton[48]
ED2 ED2=PSE2+NSR2 欠分割和数量指标 Liu[37]
ED3 ED3=1-arearisjarea(ri)2+1-arearisjarea(sj)22¯ 局部欠分割和过分割指标 Yang[49]
SEI SEIlo cali=1-arearisjareari2+1-arearisjareasj22,sjSi*21, sjSi*2
SEI=1ni=1nSEIlo cali
基于双向匹配的局部欠分割和过分割指标 Yang[38]
M M=1-RAsubij2+1-RAsuperij2+RAsubij2+RPij24 区域和距离指标 Moller[46]
ADI ADI=OE2+CE2 欠分割和过分割指标 Cheng J[50]
Cheng等[50]基于Object-Fate匹配方法,定义Omission Error(OE)和Comission Error(CE)指标对参考对象和匹配对象的差异进行评价。OE为侵入对象与参考对象重叠区域(过分割区域)与参考对象面积的比值。CE为不与参考对象重叠的扩张对象面积(欠分割区域)与参考对象面积的比值。从计算原理的角度,OE等价于OS指标,CE等价于PSE指标。因此OECE为基于Object-Fate匹配方法的过分割指标和欠分割指标,并可与基于Object-Fate的数量差异指标组合使用。
O E i = area r i s l area ( r i ) , s l Invadin g i (19)
C E i = area s k - area r i s k area ( r i ) , s k Expandin g i (20)
式中:OECE的理想取值为1。
基于过分割欠分割区域的评价测度可以直接对分割结果的过分割欠分割现象进行定量的反映。该指标不仅可以通过评价结果对分割方法的性能进行评价,比较不同分割方法的优劣,还可以依据分割结果欠分割和过分割程度对分割尺度进行调整,选取最优分割尺度。正因如此基于过分割区域和欠分割区域的测度指标逐步取代基于重叠区域的测度指标成为主流。
3.3.1.2 基于位置和边界
基于位置和边界的测度指标,是对参考对象和匹配对象之间的几何关系差异进行度量。不同于基于区域的测度指标通过重合、过分割、欠分割程度进行评价,基于位置和边界的指标通过对象间位置相似度和判断对象间边界吻合程度来评价分割结果的优劣。
(1)基于位置
Zhan等[45]提出(Quality of Object Location,qLoc),以参考对象和匹配对象的质心距离作评价对象之间的位置相似程度。
qLo c ij = dist r i 的质心 , s j 的质心 , s j S i * 1 (21)
式中:dist(a,b)为a,b之间的欧氏距离。qLoc的最小值即最佳取值为0,最大值取决于输入图像和所采用的分割方法和匹配方法。
Moller等[46]基于qLoc指标进行改进,提出相关位置指标 (Relative Poistion,RP),对qLoc的结果进行归一化。
R P ij = dist r i 的质心 , s j 的质心 max j qLo c ij , s j S i * 1 (22)
式中:RP的取值范围为[0,1],取值与位置相似度成正相关。
Cheng等[50]提出(Position Discrepancy Index,PDI)距离差异指数,通过计算扩张对向和优良对象与参考对象距离的平均值,评价基于Object-Fate匹配方法对象的位置相似度。
PD I i = j = 1 N dist r i 的质心 , s j 的质心 + k = 1 M dist r i 的质心 , s k 的质心 N + M , s j Goo d i s k Expandin g i (23)
式中:NM为优良对象和扩张对向的数量;PDI的最小值即最佳取值为0。
基于位置的评测指标,原理简单,算法复杂度低。但该指标只对位置相似度进行评价,对分割结果的评价并不充分,在位置相似度完全一致时依然可能存在严重的过分割和欠分割现象。因此,该指标不能单独使用,需要与其他评测指标组合使用。
(2)基于边界
理想的匹配分割对象应当与参考对象在边界上完全重合,因此基于边界的评测指标可以单独使用,通过边界重合度和形状相似度直接对分割效果进行评价。该算法复杂计算量较大,且不适用于一对多或多对一的匹配情况。
Lucieer等[35]提出基于基于距离指标(Distance-Based Measure,D),通过计算参考对象矢量边界上像素与匹配对象矢量边界上像元的最短欧氏距离来反应边界重合度。PRPS分别为参考对象矢量边界和匹配对象矢量边界上的NM个像元。D的值越小边界重合度越高。
D ij = n = 0 N min dist P R i n , P S j N , s j S i * 1 (24)
于欢等[51]定义矢量距离指数(Vector Distance,VD):参考对象矢量边界和匹配对象矢量边界在横纵2个方向上的距离之和。
V D ij = a = 1 n 1 H ij a n 1 + b = 1 n 2 V ij b n 2 , s j S i * 1 (25)
式中: H a 为第a条横向距离线的长度; n 1 为横向距离线总数; V b 为第b条纵向距离线的长度; n 2 为纵向距离线总数。距离线的间隔均为等距。VD的大小与分割效果负相关,当VD为0时,参考对象与匹配对象边界完全重合,分割效果最好。
刘大伟等[52]提出形状相似度指标(Shape Similarity, SS),通过比较对象质心向边界所引射线长度差异的方式计算形状相似度,并进行了归一化处理。
S S ij = k = 0 N - 1 f r i θ k - f s j θ k 2 max k = 0 N - 1 f r i θ k , k = 0 N - 1 f s j θ k , s j S i * 1 (26)
式中: θ 为射线角度间隔,发出的总射线数为 2 π θ ,记为N f θ k 是质心以旋转角度 θ k 发出射线和边界交点与对象质心之间的距离。SS的范围为[0, 1],值越低参考对象和匹配对象的差异度越低,分割效果越好。
3.3.2 数量指标
基于数量的指标是对参考对象和匹配对象的数量差异(Arithmetic Discrepancy)进行度量。对象的数量关系包括一对多、一对一和多对一,文献[37]对这3种数量关系进行了详细的定义和阐述。在分割结果理想的情况下,所有参考对象和匹配对象都应当是一对一的关系。
Strasters等[53]提出破碎度(Fragmentation,FRAG)。
FRAG = 1 1 + p m - v q (27)
式中:mv分别是参考对象和匹配对象的数量;pq是尺度参数,需要根据实际情况和应用进行设定。FRAG的取值范围为[0, 1],取1时分割效果 最好。
Liu等[37]提出分割比率数量(Number of Segments Ratio,NSR)。
NSR = m - v m (28)
式中:mv分别是参考对象和匹配对象的数量。NSR值为0时,所有参考对象和匹配对象均一一对应,分割效果最好。NSR值越大,一对多或多对一的匹配数量关系越多,间接说明过分割或欠分割现象愈发严重。
E Schöpfer等[39]基于Object-Fate匹配方法,提出后代忠诚度(Offspring Loyalty, OL)和干扰度(Interference, I),分割质量越好,优良对象占匹配对象的比重越高,入侵对象占全部分割对象的比重越低。
OL = n good n good + n expanding (29)
I = n invading n good + n expanding + n invading (30)
式中:n为对应对象的数量;OLI的理想取值为 1和0。
在分割评价中,对数量关系差异的评价和对几何差异的评价同样重要。对于理想的分割结果,对象间几何差异一定很小,但较小的几何差异并不能保证理想的分割结果。在一些极端情况下,如所有匹配对象的大小均为一个像元时,匹配对象均与参考对象完全重叠,根据几何差异的定义并不存在过分割和欠分割现象,这显然是不合理的。因此基于数量的测度指标目前受到学者们广泛的重视,将其与基于区域的测度指标联合共同对分割结果评价是未来的趋势。
3.3.3 混合指标
研究将多种不同类型的指标综合运用到监督分割评价当中,结合二者的优点,弥补各自的局限性,更加客观全面的对分割结果进行评价(表1)。若指标没有严格的定义范围,或者范围与其它的指标处于不同量级,在组合使用之前需要进行规范化、标准化或归一化处理。

3.4 存在的问题与分析

与主观评价法、系统级评价法和分析评价法相比,监督评价法一定程度上克服了人为因素带来误差,通过与真实分割结果作最直接的对比,客观定量地提供了更加精确的分割评价结果。但是监督评价法需要人工建立分割参考数据,面向GEOBIA多尺度分割进行监督评价,虽然有学者提出了针对多尺度的监督评价方法[5],但为整幅影像建立完整的数字化参考数据费时费力,且存在一定的主观性。面向典型目标识别的单一尺度分割结果进行监督评价,不用为所有地物建立参考数据,工作量相对较小,同时典型地物的对象范围一般很明确,克服了参考数据建立主观性强的缺点。此外,监督评价法在对象匹配和差异计算的过程中计算复杂,且匹配方法和差异评价测度的选择都会对最终的评价结果产生影响。因此,如何针对不同的应用或影像选择合适或设计出具有普适性的匹配方法和评价指标,是日后监督评价法研究的重点。

4 非监督评价法

非监督评价法又称为优度实验法或独立评价法(Stand Alone),该方法不需要参考影像,而是根据人类对于理想分割结果特点的感知,建立特征优度,从而对分割方法进行评价。May[54]对理想的分割结果的定义得到了广泛认可,具体为:
(1)区域内针对某些特性是均质的;
(2)相邻区域间针对区域内均质特性应当有显著差异;
(3)区域内部无空洞;
(4)区域的边界简单不破碎且准确;
但是对于具有清晰纹理结构的自然图像,特别是高分影像,只有前2个准则符合实际应用,好的分割结果应该有较大的区域内均质性和区域间异质性。因此,大多数非监督评价方法都是先计算每个分割对象的区域内均质性和区域间异质性,然后将计算后的结果复合成一个针对区域内分割对象的全局指标,最后将2个指标联合起来得到整体优度评分,对影像的分割结果进行评价。下文将对常用的均质性和异质性优度指标进行系统归纳,并对优度的复合方法进行总结。

4.1 均质性优度

均质性优度是基于理想分割第一条原则所构建的优度指标。均质性优度通过对区域内均质度的计算,可以直观有效地对分割结果进行评价。目前,评价均质性的优度主要有基于光谱差、基于光谱方差、基于纹理、基于熵4种。Weszka[55]基于阈值分割后分割对象像素与原始像素的光谱差值,提出了Dwr,该优度用于评价基于阈值的前景、背景分离分割算法。Zeboudj[56]基于分割对象区域内像素与邻接像元最大的光谱差值提出最大区域内对比度(Max Within-Region Contrast,MWC)指标。Chen等[57]基于分割对象像元与对象内平均光谱值的差值提出区域内视觉误差(Intra-Region Visual Error,Eintra);基于光谱方差的优度较基于光谱差的优度能够更加合理的反映区域内光谱值的均匀程度,Zhang[4]以区域内部光谱方差为度量提出了区域内部非均质性(Non-Uniformity within region measure,UM)指标,UM值越小区域内的均质性越强。Sahoo等[58]提出的归一化均质性指标(Normalized uniformity measure,NU),是UM的归一化版本,NU值越大区域内的均质性越强。Otsu[59]基于分割对象和邻接对象光谱方差的加权和定义了类内方差(Within-Class Variance,WV),不仅考虑了对象内的均质性,还考虑了邻接对象的均质性;针对纹理特征,Weszka等[55]依据理想分割结果中区域内部不能有强纹理且形状简单紧凑,提出繁忙性(Busyness)指数对均质性进行评价。Levine等[60]提出PV指标,利用纹理测度R描述区域内部纹理异质性,PVBusyness都是针对全局的优度指数。此外也可以将UM指标中的光谱方差替换为纹理值方差对纹理的异质性进行评价; PAL等[61]提出高阶局部熵(Higher Order Local-Entropy),通过最大化分割对象和背景的二阶局部熵对均质性进行评价。
在上述指标中基于光谱差、纹理和熵的指标在遥感领域应用较少,目前最常用的均质性优度是基于光谱方差的优度,特别是局部方差(Local Variance,LV)[62]指标,即上文提到的区域内部非均质性指标(UM)。
V i = f x , y - 1 A i f x , y 2 A i (31)
式中: V i , R i , A i 分别为是标号为i的分割对象的内部均质性,范围和面积大小。 f x , y 为横纵坐标为x,y的像素的灰度值。但该指标不是对内部均质性的直接评价,取值大小与区域内部均质性呈负 相关。
王志华等[63]LV指标进行了改进,利用邻接边长和面积为权重,提出了加权局部方差(WLV),对多个邻接分割对象的均质性进行评价,实现了最佳尺度的选择。
为了将均质性的评价范围从单个分割对象迁移到整幅影像,可以通过面积加权的方式将局部方差转换为适用于全局评价的局部方差指标。
V = i = 1 n A i V i i = 1 n A i (32)
式中:n是整幅影像分割区域的数量。

4.2 异质性优度

为了依据理想分割第二条准则进行分割质量评价,学者们提出了异质性优度。基于光谱差的异质性优度可以分为基于区域间光谱差和基于边界的局部光谱差2种。Otsu[59]通过计算分割对象与临界对象平均光谱值差的平方,提出类间平方差(Within-Class Variance σ 2 w )。张俊等[64]以邻接边长为权重,通过计算分割对象与所有邻接对象平均光谱值差值度量异质性,提出△CL指标,该指标通常和LV指标共同组成RMAS指标使用,但仅凭光谱差并不能说明对象间灰度的差异程度。Levine等[60]提出区域间灰度对比度(Grey Level Contrast Measure,GC),该优度基于相邻区域平均光谱值的差与和之比描述区域间的异质性,并通过设定面积权重的方式计算和所有邻接区域的复合异质性,可靠性较 σ 2 w 大大提升。Chen等[57]提出区域间视觉误差(Inter-Region Visual Error, Einter)指标,通过设定阈值的方式对区域之间光谱平均值的差异进行约束,并基于分割对象与邻接区域重合边界长度设定权重计算分割区域与所有邻接区域的复合异质性;基于边界的局部光谱差优度主要有区域边界梯度(Edge Gradient Measure,EG)和最大边界对比度(Max Border Contrast,MBC)[56],临界区域边界上像素的光谱梯度或光谱差越大,说明区域间的异质性越强,但该指标在各区域均质度较低时并不可靠;除了使用光谱差,还可以使用光谱方差对异质性进行评价,明冬萍等[7]基于区域间光谱方差区域间散度对比(Variance Contrast Measure),以2个邻接区域光谱方差之差和邻接区域合并后的光谱方差之比作为评价区域间异质性的优度,散度对比越大,说明区域间异质性越强;
目前,最常用的异质性测度指标是Moran′s I指数,该指数是通过区域间空间自相关程度对区域间的异质性进行评价。空间自相关由Fotheringham等[65]提出,目的是定量判定同一空间中对象与其相邻对象的相似度和依赖程度。学者们发现空间自相关指数可以很好地对区域间异质性进行度量,因此Moran′s I作为较常见的空间自相关指数被广泛用作全局性空间异质性优度,通常与全局的局部方差指标组合使用,其计算方法如下:
MI = n i = 1 n j = 1 n w ij ( y i - y ¯ ) ( y j - y ¯ ) i = 1 n ( y i - y ¯ ) 2 w ij (33)
式中:n为影像内分割对象的个数;wij是判定空间邻接的指标。若参与计算的分割对象SiSj相邻,则wij等于1,反之wij等于0。 y ¯ 为影像平均光谱值,yj为分割对象Si的平均光谱值。Moran′s I指数越小,区域间异质性越强。
由于Moran′s I是全局优度,无法对单一分割对象及其邻接对象的均质度进行评价,Johnson等[34]使用局部Moran′s I指数计算局部异质性,但局部Moran′s I指数计算使用整幅影像的平均光谱值,结果并不可靠。张建廷等[66]仅考虑邻接像元的光谱信息,对局部Moran′s I指数进行改进,提出Geary指数对局部均质性进行度量:
LG I i = j = 1 , j i k w ij ( y i - y j ) (34)
式中:wijyj的定义与Moran′s I指数相同。Geary指数越大对象间的异质性越强。

4.3 复合优度

均质性和异质性优度大多是基于局部对象的优度,为了使这些局部优度可以应用到整幅影像,我们需要将多个局部优度复合为全局优度。全局优度的复合可以将所有局部优度直接相加,比如DwrEInterEIntra等。大多数针对基于区域内光谱方差的指标还可以采用面积加权相加的方式进行复合,如NUWVLV指标等,基于光谱差的MWC也可以使用面积加权进行复合。基于边界的异质性优度可以使用边长加权对局部测度复合。
优度的复合除了从局部到全局,还可以将均质性优度和异质性优度或者其他优度复合成一个综合性优度对分割质量进行全面评价。Espindola 等[67],对LV指数和Moran′s I指数进行标准化并相加,得到综合分割测度。何敏等[68]对Espindola的方法进行改进,引入了均质性和异质性的权重指标。需要说明的是,不同类型优度的复合不仅局限于均质性与异质性指标,还可以使用间接指标,明冬萍等[7]使用宏观像元运行时间、被去除的无意义区域数目与均质性异质性优度,通过专家经验为权重进行相加,得到综合分割测度。但面向多个测度复合使用专家经验复合测度主观性较强。张仙等[69]对明冬萍的复合方法进行改进,提出使用熵权法为各优度赋予权重,取得了较好的效果。此外,还可以采用层次分析法(Analytic Hierarchy Process)对多种优度进行分层赋权。

4.4 存在的问题与分析

非监督评价法可以广泛应用于不同类型影像、不同分割方法所得到分割结果的评价当中。在不了解分割区域影像内容的情况下可对分割结果进行客观定量的评价。非监督评价法不仅可应用于分割方法性能比较\选择和参数设定当中,还可以实现分割参数的自适应调整,这是其他分割评价方法都不具备的。在对实验影像进行分割之前,需要将一系列测试影像中取得最好分割结果的参数设定为该分割方法的最佳参数,最佳参数的选取需要通过大量实验或者依赖研究人员的经验。监督评价法虽然在有测试影像参考数据的基础上可以对最佳分割参数进行自动的选择,但所获取的最佳参数并不一定适用于其他影像数据,方法缺乏普适性。因此,非监督评价法因其不需要参考数据的特性,有着极强的适应性和参数的自我调节能力,不需要人为的主观干预,特别适用于的分割信息自动化提取与分析系统当中。非监督方法的可靠性取决于所选用优度的合理性,优度之间的相关性和权重分配;此外为了避免有偏评价,所选优度的属性和基于的原理应当不同于所采用的分割算法。因此如何客观的建立具有普适性的评价优度并为测度设置合理的权重是未来研究的重点。

5 总结

对高分影像的分割结果进行评价是GEOBIA技术领域的难点之一,也是分割流程自动化必不可少的一项关键技术。本文对常用分割评价方法进行了系统总结。目前最常用的分割评价方法依旧是主观评价法。间接评价法和分析评价法是常用的辅助评价方法。但这3种评价方法都无法给出定量、客观、全面的评价指标,难以应用到自动化的分割系统当中;随着研究的逐步深入,监督评价和非监督评价法因能提供客观定量的评价指标,逐步替代主观评价法成为常用的分割评价方法。
在精确建立参考数据的前提下,监督评价法的评价结果最为牢靠。但为整幅影像建立参考数据较为困难且存在一定主观性,所以更适用于使用较少参考数据的面向目标识别的单一尺度分割结果评价当中。此外,该方法的部署严重依赖于参考数据,所以难以应用在自动化的分割系统当中。
非监督评价法的最大优点就是无需人为干预,不需要通过人工获取参考数据,一定程度上降低了评价的主观性。分割评价最终的目的是实现分割流程的自动化,但在面向目标提取的单尺度分割评价中,确定目标所在的分割区域依旧需要人工干预,降低了自动化程度,无法发挥非监督评价法的优势。因此,非监督评价法更适用于面向GEOBIA的多尺度分割结果评价,是最适合自动化分割流程的分割评价方法。
虽然目前监督评价法和非监督评价法还存在各种问题和不足,但这对这2种评价方法进行进一步深入的研究依然有重要的意义。
今后关于分割评价方法的研究可以集中在以下5个方面:
(1)针对监督评价法,提出具有普适性的匹配方法和差异指标。
(2)非监督评价法的优度指标还不够全面,可以采用更高层次的信息建立优度指标,如先验知识和语义。
(3)建立非监督评价法的优度选择体系,研究如何自动化选取优度指标并分配权重。
(4)联合使用监督评价和非监督评价法进行评价,利用各自的优势,建立综合性的评价指标和体系。
(5)随着人工智能技术的发展,应考虑将人工智能技术与分割评价方法结合,提出基于人工智能的监督差异指标和非监督优度指标。

The authors have declared that no competing interests exist.

[1]
Blaschke T, Strobl J.What’s wrong with pixels? Some recent developments interfacing remote sensing and GIS[J]. 2001,14:12-17.

[2]
Blaschke T, Burnett C, Pekkarinen A.New Contextual Approaches Using Image Segmentation for Object-based Classification[C]//De Jong S, van der Meer F, eds. Remote Sensing Image Analysis: Including the Spatial Domain. Dordrecht: Kluwer Academic Publishers, 2004:211-236.

[3]
Blaschke T, Hay G J, Kelly M, et al.Geographic object-based image analysis-towards a new paradigm[J]. Isprs Journal of Photogrammetry & Remote Sensing, 2014,87(100):180-191.The amount of scientific literature on (Geographic) Object-based Image Analysis – GEOBIA has been and still is sharply increasing. These approaches to analysing imagery have antecedents in earlier research on image segmentation and use GIS-like spatial analysis within classification and feature extraction approaches. This article investigates these development and its implications and asks whether or not this is a new paradigm in remote sensing and Geographic Information Science (GIScience). We first discuss several limitations of prevailing per-pixel methods when applied to high resolution images. Then we explore the paradigm concept developed by Kuhn (1962) and discuss whether GEOBIA can be regarded as a paradigm according to this definition. We crystallize core concepts of GEOBIA, including the role of objects, of ontologies and the multiplicity of scales and we discuss how these conceptual developments support important methods in remote sensing such as change detection and accuracy assessment. The ramifications of the different theoretical foundations between the ‘ per-pixel paradigm ’ and GEOBIA are analysed, as are some of the challenges along this path from pixels, to objects, to geo-intelligence. Based on several paradigm indications as defined by Kuhn and based on an analysis of peer-reviewed scientific literature we conclude that GEOBIA is a new and evolving paradigm.

DOI PMID

[4]
Zhang Y J.A survey on evaluation methods for image segmentation[J]. Pattern Recognition, 1996,29(8):1335-1346.This paper studies different methods proposed so far for segmentation evaluation. Most methods can be classified into three groups: the analytical, the empirical goodness and the empirical discrepancy groups. Each group has its own characteristics. After a brief description of each method in every group, some comparative discussions about different method groups are first carried out. An experimental comparison for some empirical (goodness and discrepancy) methods commonly used is then performed to provide a rank of their evaluation abilities. In addition, some special methods are also discussed. This study is helpful for an appropriate use of existing evaluation methods and for improving their performance as well as for systematically designing new evalution methods.

DOI

[5]
Zhang X, Xiao P, Feng X, et al.Toward evaluating multiscale segmentations of high spatial resolution remote sensing images[J]. IEEE Transactions on Geoscience & Remote Sensing, 2015,53(7):3694-3706.Object-based analysis of high spatial resolution remote sensing images addresses the matter of multiscale segmentation. However, existing segmentation evaluation methods mainly focus on single-scale segmentation. In this paper, we examine the issue of supervised multiscale segmentation evaluation and propose two discrepancy measures to determine the manner in which geographic objects are delineated by multiscale segmentations. A QuickBird scene in Hangzhou, China, is used to conduct the evaluation. The results reveal the effectiveness of the proposed measures, in terms of method comparison and parameter optimization, for multiscale segmentation of high spatial resolution images. Moreover, meaningful indications for selecting suitable multiple segmentation scales are presented. The proposed measures are applicable to performance evaluation and parameter optimization for multiscale segmentation algorithms.

DOI

[6]
Benz U C, Hofmann P, Willhauck G, et al.Multi-resolution, object-oriented fuzzy analysis of remote sensing data for GIS-ready information[J]. International Journal of Photogrammetry & Remote Sensing, 2004,58(3-4):239-258.Remote sensing from airborne and spaceborne platforms provides valuable data for mapping, environmental monitoring, disaster management and civil and military intelligence. However, to explore the full value of these data, the appropriate information has to be extracted and presented in standard format to import it into geo-information systems and thus allow efficient decision processes. The object-oriented approach can contribute to powerful automatic and semi-automatic analysis for most remote sensing applications. Synergetic use to pixel-based or statistical signal processing methods explores the rich information contents. Here, we explain principal strategies of object-oriented analysis, discuss how the combination with fuzzy methods allows implementing expert knowledge and describe a representative example for the proposed workflow from remote sensing imagery to GIS. The strategies are demonstrated using the first object-oriented image analysis software on the market, eCognition, which provides an appropriate link between remote sensing imagery and GIS.

DOI

[7]
明冬萍,骆剑承,周成虎,等.高分辨率遥感影像特征分割及算法评价分析[J].地球信息科学学报,2006,8(1):103-109.图像分割一直是图像处理和计算 机视觉领域中的一项关键技术。本文首先从遥感影像地学处理与应用的角度阐述了影像分割技术对于遥感信息提取和目标识别的重要性,然后提出了基于特征的高分 辨率遥感影像信息提取技术框架,建立了一套基于特征的遥感影像分割方法及分类体系。同时,鉴于遥感影像分割方法评价的重要性, 阐述了一种高分辨率遥感影像分割方法评价的思路,并对几种典型的基于特征的遥感影像分割方法进行定性和定量的试验和评价,对其各自的性能和适用面进行对比 分析。最后,指出了遥感影像特征分割方法所存在的问题及其发展趋势。

DOI

[ Ming D P, Luo J C, Zhou C H, et al.Research on high resolution remote sensing image segmentation methods based on features and evaluation of algorithms[J]. Geo-Information Science, 2006,8(1):103-109. ]

[8]
Lka D, Maier B, Seijmonsbergen A C.Improved Landsat-based forest mapping in steep mountainous terrain using object-based classification[J]. Forest Ecology & Management, 2003,183(1-3):31-46.The accuracy of forest stand type maps derived from a Landsat Thematic Mapper (Landsat TM) image of a heterogeneous forest covering rugged terrain is generally low. Therefore, the first objective of this study was to assess whether topographic correction of TM bands and adding the digital elevation model (DEM) as additional band improves the accuracy of Landsat TM-based forest stand type mapping in steep mountainous terrain. The second objective of this study was to compare object-based classification with per-pixel classification on the basis of the accuracy and the applicability of the derived forest stand type maps. To fulfil these objectives different classification schemes were applied to both topographically corrected and uncorrected Landsat TM images, both with and without the DEM as additional band. All the classification results were compared on the basis of confusion matrices and kappa statistics. It is found that both topographic correction and classification with the DEM as additional band increase the accuracy of Landsat TM-based forest stand type maps in steep mountainous terrain. Further it was found that the accuracies of per-pixel classifications were slightly higher, but object-based classification seemed to provide better overall results according to local foresters. It is concluded that Landsat TM images could provide basic information at regional scale for compiling forest stand type maps especially if they are classified with an object-based technique.

DOI

[9]
Zhang H, Fritts J E, Goldman S A.Image segmentation evaluation: A survey of unsupervised methods[J]. Computer Vision & Image Understanding, 2008,110(2):260-280.In this paper, we examine the unsupervised objective evaluation methods that have been proposed in the literature. An extensive evaluation of these methods are presented. The advantages and shortcomings of the underlying design mechanisms in these methods are discussed and analyzed through analytical evaluation and empirical evaluation. Finally, possible future directions for research in unsupervised evaluation are proposed.

DOI

[10]
Neubert M, Meinel G.Evaluation of segmentation programs for high resolution remote sensing applications[J]. 2003.Methods of image segmentation become more and more important in the field of remote sensing image analysis – in particular due to the increasing spatial resolution of imagery. The most important factor for using segmentation techniques is segmentation quality. Thus, a method for evaluating segmentation quality is presented and used to compare results of presently available segmentation programs. Firstly, an overview of the softwares used is given. Moreover the quality of the individual segmentation results is evaluated based on pan-sharpened multi-spectral IKONOS data. This is done by visual comparison, which is supplemented by a detailed investigation using visual interpreted reference areas. Geometrical segment properties are in the focus of this quantitative evaluation. The results are assessed and discussed. They show the suitability of the tested programs for segmenting very high resolution imagery. KURZFASSUNG: Die Methoden der Bildsegmentierung gewinnen in der Fernerkundung – insbesondere durch die steigende geometrische Aufl02sung der Bilddaten – zunehmend an Bedeutung. Vor diesem Hintergrund wird die Segmentierungsqualit01t derzeit verfügbarer Segmentierungssoftwares gegenübergestellt. Dabei erfolgt zun01chst eine allgemeine Darstellung der benutzten Programme. Anschlie08end wird die Qualit01t der auf Basis panchromatisch gesch01rfter IKONOS-Multispektraldaten erzielten Segmentierungsergebnisse verglichen. Eine überblicksartige visuelle Untersuchung wird um einen detaillierten Vergleich mit unterschiedlichen, visuell kartierten Referenzfl01chen erg01nzt. Gro08e Beachtung finden die für die Segmentierungsqualit01t ausschlaggebenden geometrischen Eigenschaften der Segmente. Die unterschiedlichen Ergebnisse werden bewertet und diskutiert. Sie dokumentieren die Eignung der Programme zur Segmentierung sehr hochaufl02sender Fernerkundungsdaten.

[11]
Neubert M, Herold H.Assessment of remote sensing image segmentation quality[J]. Journal of Remote Sensing, 2008.ABSTRACT Image segmentation is a crucial step within the object-based remote sensing information retrieval process. As a step prior to classification the quality assessment of the segmentation result is of fundamental significance for the recognition process as well as for choosing the appropriate approach and parameters for a given segmentation task. Thus, this research is also related to the topic of object based accuracy assessment. In this paper we present some methodical extensions of the segmentation quality evaluation process based on our previous studies. The main focus is set to increasing automation, new metrics and higher regard to spatially explicit metrics. Object differences have been analyzed as topological and geometric relationships between the segment and the according reference object. Thus, the overlapping area was calculated (absolute and percentage) to describe the area concurrence. Furthermore, the accordance of the outlines was evaluated using buffer zones around reference objects by means of proportion inside specific buffer zones. This makes it possible to draw conclusions about the geometrical correctness of the segmented outlines. In addition to that we investigated other published assessment metrics such as the Area-Fit-Index. Results of several segmentation programs have been assessed and compared using identical imagery. The software tested is: ENVI Feature Extraction Module 4.4, BerkleyImgseg 0.54, EDISON, EWS 1.0, Definiens Developer 7 and InfoPack 2.0. Some newly available programs point out new possibilities for object-based image analysis. Conclusions from a methodical and users point of view will be given. In combination with the previous studies, in total 24 segmentation programs or its releases have been evaluated. The results of all segmentations are displayed at the website www.ioer.de/segmentation-evaluation.

[12]
Gelasca E D, Ebrahimi T, Farias M C Q, et al. Towards perceptually driven segmentation evaluation metrics[C]. Conference on Computer Vision and Pattern Recognition Workshop. IEEE Computer Society, 2004:52.

[13]
Paglieroni D W.Design considerations for image segmentation quality assessment measures[J]. Pattern Recognition, 2004,37(8):1607-1617.Proximity-based association between two boundary pixels is discussed in the context of association distance. Motivated by the concept of phase-modulated signals, a penalty factor on the degree of association is then introduced as some non-negative power (phase modulation order) of the cosine of disparity in phase (boundary direction) between two boundary pixels. Families of matching measures between maps of region boundaries are defined as functions of associations between many pairs of boundary pixels. The measures are characterized as one-way (reflecting relationships in one direction between region boundaries from two segmentations) vs. two-way (reflecting relationships in both directions). Measures of inconsistency between perceived and computed matches of computer and manually generated region boundaries are developed and exercised so that effects of association distance, phase modulation, and choice of matching measure on image segmentation quality assessment can be quantified. It is quantitatively established that consistency can be significantly improved by using two-way measures in conjunction with high-order phase modulation and moderate association distances.

DOI

[14]
Baatz M, Schäpe A.An optimization approach for high quality multi-scale image segmentation[C]. Beiträge zum AGIT-Symposium, 2000:12-23.

[15]
Hay G J, Castilla G, Wulder M A, et al.An automated object-based approach for the multiscale image segmentation of forest scenes[J]. International Journal of Applied Earth Observation & Geoinformation, 2005,7(4):339-359.Over the last decade the analysis of Earth observation data has evolved from what were predominantly per-pixel multispectral-based approaches, to the development and application of multiscale object-based methods. To empower users with these emerging object-based approaches, methods need to be intuitive, easy to use, require little user intervention, and provide results closely matching those generated by human interpreters. In an attempt to facilitate this, we present multiscale object-specific segmentation (MOSS) as an integrative object-based approach for automatically delineating image-objects (i.e., segments) at multiple scales from a high-spatial resolution remotely sensed forest scene. We further illustrate that these segments cognitively correspond to individual tree crowns, ranging up to forest stands, and describe how such a tool may be used in computer-assisted forest inventory mapping. MOSS is composed of three primary components: object-specific analysis (OSA), object-specific upscaling (OSU), and a new segmentation algorithm referred to as size constrained region merging (SCRM). The rationale for integrating these methods is that the first two provide the third with object-size parameters that otherwise would need to be specified by a user. Analysis is performed on an IKONOS-2 panchromatic image that represents a highly fragmented forested landscape in the Sooke Watershed on southern Vancouver Island, BC, Canada.

DOI

[16]
Zhang X, Feng X, Xiao P, et al.Segmentation quality evaluation using region-based precision and recall measures for remote sensing images[J]. Isprs Journal of Photogrammetry & Remote Sensing, 2015,102:73-84.Segmentation of remote sensing images is a critical step in geographic object-based image analysis. Evaluating the performance of segmentation algorithms is essential to identify effective segmentation methods and optimize their parameters. In this study, we propose region-based precision and recall measures and use them to compare two image partitions for the purpose of evaluating segmentation quality. The two measures are calculated based on region overlapping and presented as a point or a curve in a precision ecall space, which can indicate segmentation quality in both geometric and arithmetic respects. Furthermore, the precision and recall measures are combined by using four different methods. We examine and compare the effectiveness of the combined indicators through geometric illustration, in an effort to reveal segmentation quality clearly and capture the trade-off between the two measures. In the experiments, we adopted the multiresolution segmentation (MRS) method for evaluation. The proposed measures are compared with four existing discrepancy measures to further confirm their capabilities. Finally, we suggest using a combination of the region-based precision ecall curve and the F -measure for supervised segmentation evaluation.

DOI

[17]
Shin M C, Goldgof D B, Bowyer K W.Comparison of edge detector performance through use in an object recognition task[J]. Computer Vision & Image Understanding, 2001,84(1):160-178.This paper presents an empirical evaluation methodology for edge detectors. Edge detector performance is measured using a particular edge-based object recognition algorithm as a “higher-level” task. A detector's performance is ranked according to the object recognition performance that it generates. We have used a challenging train and test dataset containing 110 images of jeep-like images. Six edge detectors are compared and results suggest that (1) the SUSAN edge detector performs best and (2) the ranking of various edge detectors is different from that found in other evaluations.

DOI

[18]
Li P, Guo J, Song B, et al.A multilevel hierarchical image segmentation method for urban impervious surface mapping using very high resolution imagery[J]. IEEE Journal of Selected Topics in Applied Earth Observations & Remote Sensing, 2011,4(1):103-116.This paper presents a hierarchical image segmentation method that combines multichannel watershed transformation and dynamics of watershed contours for the segmentation of very high resolution (VHR) multispectral imagery. The image gradient was first extracted from a multispectral image using a multichannel morphological method, followed by classical watershed transformation to produce an initial segmentation result. The resulting watershed contours were then analyzed according to their relevance relative to the minima of the adjacent basins to construct an image containing information about their dynamics. By thresholding the image of the contour dynamics at different levels, multilevel hierarchical segmentation results with different levels of detail were achieved. The proposed method was evaluated by comparing with existing methods through visual inspection, quantitative measures and applications in urban impervious surface mapping, using two sets of VHR image data. The experimental results showed that the proposed method produced more accurate segmentation results compared to an existing single-level segmentation method, in terms of visual and quantitative evaluations. While used for urban impervious surface mapping, the proposed method achieved an overall accuracy significantly higher than the pixel based classification method, and also higher than the object based classification using a single-level segmentation result. Compared with the most widely used segmentation method implemented in the eCognition, the proposed method achieved a comparable performance, although they have different segmentation details. The proposed segmentation method can be used in relevant VHR image processing and applications.

DOI

[19]
Hofmann P, Lettmayer P, Blaschke T, et al.Towards a framework for agent-based image analysis of remote-sensing data[J]. International Journal of Image and Data Fusion, 2015,6(2):115-137.Object-based image analysis (OBIA) as a paradigm for analysing remotely sensed image data has in many cases led to spatially and thematically improved classification results in comparison to pixel-based approaches. Nevertheless, robust and transferable object-based solutions for automated image analysis capable of analysing sets of images or even large image archives without any human interaction are still rare. A major reason for this lack of robustness and transferability is the high complexity of image contents: Especially in very high resolution (VHR) remote-sensing data with varying imaging conditions or sensor characteristics, the variability of the objects properties in these varying images is hardly predictable. The work described in this article builds on so-called rule sets. While earlier work has demonstrated that OBIA rule sets bear a high potential of transferability, they need to be adapted manually, or classification results need to be adjusted manually in a post-processing step. In order to automate these adaptation and adjustment procedures, we investigate the coupling, extension and integration of OBIA with the agent-based paradigm, which is exhaustively investigated in software engineering. The aims of such integration are (a) autonomously adapting rule sets and (b) image objects that can adopt and adjust themselves according to different imaging conditions and sensor characteristics. This article focuses on self-adapting image objects and therefore introduces a framework for agent-based image analysis (ABIA).

DOI

[20]
Dronova I, Gong P, Clinton N E, et al.Landscape analysis of wetland plant functional types: The effects of image segmentation scale, vegetation classes and classification methods[J]. Remote Sensing of Environment, 2012,127(140):357-369.Remote sensing-based analyses of vegetation function such as photosynthesis and productivity are challenging in wetlands with complex cover and difficult field access. Recent advances in object-based image analysis (OBIA) and machine-learning algorithms offer new image classification tools; however, few comparisons of different approaches have been discussed to date. We applied OBIA to delineate wetland plant functional types (PFTs) for Poyang Lake, the largest freshwater lake in China and Ramsar wetland conservation site, from a spring 2008 Landsat TM image. We targeted major PFTs that represent dominant vegetation groups along wetland inundation gradients and affect ecosystem biogeochemical cycles and ecological habitats. Classification results were compared among: a) several “small” object segmentation scales (with average object sizes 1350–900002m 2 ); b) algorithms from six families of statistical machine-learning classifiers (Bayesian, Logistic, Neural Network, Decision Trees, K-Nearest Neighbors and Support Vector Machines) and c) two hierarchical levels of vegetation classification, a generalized 3-class set and a more specific 6-class set. We also examined the response of classification accuracy to four basic object-level texture metrics. The highest accuracies (>0285–90%) and best agreement among algorithms occurred at coarser object scales rather than close-to-pixel scales. No single machine-learning algorithm was consistently superior at all scales, although support vector machine, k-nearest neighbor and artificial neural network most frequently provided the highest overall and PFT-specific accuracies. Including texture metrics had both positive and negative low-magnitude effects on classification accuracy that were not consistent among scale values, algorithms or PFT classes. Individual PFTs differed in scales at which they were best discriminated from others, reflecting their unique landscape positions, ecology of dominant species and disturbance agents. There was a 29–35% disagreement between mapped areas of generalized PFTs and their respective subclasses, suggesting potential mismatches between the ecological classification scheme and PFT landscape patch structure, and raising concern on error propagation in multi-scale classifications. We conclude that OBIA with machine-learning classifiers is useful for landscape vegetation analyses, however, considerations of spatial scale and image segmentation outcomes are critical in mapping PFTs and should be more thoroughly investigated in future work.

DOI

[21]
Laliberte A S, Rango A.Texture and scale in Object-Based analysis of subdecimeter resolution unmanned aerial vehicle (UAV) Imagery[J]. IEEE Transactions on Geoscience & Remote Sensing, 2009,47(3):761-770.

[22]
Johnson B, Xie Z.Classifying a high resolution image of an urban area using super-object information[J]. Isprs Journal of Photogrammetry & Remote Sensing, 2013,83(3):40-49.In this study, a multi-scale approach was used for classifying land cover in a high resolution image of an urban area. Pixels and image segments were assigned the spectral, texture, size, and shape information of their super-objects (i.e. the segments that they are located within) from coarser segmentations of the same scene, and this set of super-object information was used as additional input data for image classification. The accuracies of classifications that included super-object variables were compared with the classification accuracies of image segmentations that did not include super-object information. The highest overall accuracy and kappa coefficient achieved without super-object information was 78.11% and 0.727%, respectively. When single pixels or fine-scale image segments were assigned the statistics of their super-objects prior to classification, overall accuracy increased to 84.42% and the kappa coefficient increased to 0.804.

DOI

[23]
章毓晋. 图象分割评价技术分类和比较[J].中国图象图形学报,1996,1(2):151-158.本文对图象分割评价,特别是评价方法和评价准则的进展,作了一个综述,现已提出的多数分割评价方法可分为直接法和间接法,直接法研究分割算法本身,面间接法根据分割图象来评价算法的性能。直接法和间接法都需要借助一定的评价准则来进行。本文对已提出的各种准则进行了分类与分析比较,并对常用的评价准则进行了实验比较,通过分析其性能表现排出了它们的优劣次序。本研究为在实际中正确地应用不同的评价方法提出了依据并指出了若干进一步研究的方向。

DOI

[ Zhang Y J.A classification and comparison of evaluation techniques for image segmentation[J]. China Journal of Image and Graphics, 1996,1(2):151-158. ]

[24]
Liedtke C E, Gahm T, Kappei F, et al.Segmentation of microscopic cell scenes[J]. Analytical and quantitative cytology and histology / the International Academy of Cytology and American Society of Cytology, 1987,9(3):197-211.Different methods for the automated of microscopic scenes are presented with examples. The techniques discussed include edge detection by thresholding, "blob" detection by split-and-merge algorithm, global thresholding using gray-level histograms, hierarchic thresholding using color information, global thresholding using two-dimensional histograms and by "blob" labeling. Methods are more robust against insignificant changes in the scene and perform more reliably as more a priori knowledge about the scene is incorporated in the algorithm. The inclusion of both photometric and geometric a priori knowledge can result in a high level of correct , the cost of which is increased computation time.

DOI PMID

[25]
Ming D, Wang Q, Luo J, et al.Evaluation of high spatial resolution remote sensing image segmentation algorithms[C]. Image and Signal Processing, 2009. CISP '09. 2nd International Congress on, 2009:1-5.

[26]
Gerbrands J J.Segmentation of noisy images[J]. Thesis Technische Univ, 1988.

[27]
Abdou I E, Pratt W.Quantitative design and evaluation of enhancement/thresholding edge detectors[J]. Proceedings of the IEEE, 1979,67(5):753-763.Quantitative design and performance evaluation techniques are developed for the enhancement/thresholding class of image edge detectors. The design techniques are based on statistical detection theory and deterministic pattern-recognition classification procedures. The performance evaluation methods developed include: a)deterministic measurement of the edge gradient amplitude; b)comparison of the probabilities of correct and false edge detection; and c) figure of merit computation. The design techniques developed are used to optimally design a variety of small and large mask edge detectors. Theoretical and experimental comparisons of edge detectors are presented.

DOI

[28]
张石,董建威,佘黎煌.医学图像分割算法的评价方法[J].中国图象图形学报,2009,14(9):1872-1880.对医学图像分割算法的客观评价是推进算法在临床上得到应用的关键。针对目前对医学图像分割方法的研究较多,而对分割算法的评价方法的研究却很少的问题,提出了一种判断和比较医学图像分割算法优劣的评价方法。首先对现有的几种评价方法进行了综述,并总结出了一套评价系统。可靠性、精确性、区域统计特性和效率是评价一个分割方法的4个重要参数,结合医学图像分割分别对它们的定义进行了说明。这些参数互相影响,评价分割算法时必须权衡这些指标,根据不同的应用背景赋予它们不同的权重。此外,还介绍了如何建立医学图像分割金标准数据库的方法。最后,通过Insight Toolkit(ITK)软件包中的两个算法例子,结合脑白质分割的医学背景,演示了如何利用本文评价系统来对这两种分割算法做出比较。实验结果表明,该评价方法可行,比较结果具有合理性。该研究为医学图像分割算法的评价提供了科学合理的方法,同时也指出了推动医学图像分割算法在临床上应用所应解决的问题。

DOI

[ Zhang S, Dong J W, She L H.The methodology of evaluating segmentation algorithms on medical image[J]. Journal of Image and Graphics, 2009,14(9):1872-1880. ]

[29]
Zhang Y J.A review of recent evaluation methods for image segmentation[C]. Signal Processing and ITS Applications, Sixth International, Symposium on. IEEE, 2001:148-151.

[30]
Cardoso J S, Corte-Real L.Toward a generic evaluation of image segmentation[J]. IEEE Transactions on Image Processing A Publication of the IEEE Signal Processing Society, 2005,14(11):1773-1782.Image segmentation plays a major role in a broad range of applications. Evaluating the adequacy of a segmentation algorithm for a given application is a requisite both to allow the appropriate selection of segmentation algorithms as well as to tune their parameters for optimal performance. However, objective segmentation quality evaluation is far from being a solved problem. In this paper, a generic framework for segmentation evaluation is introduced after a brief review of previous work. A metric based on the distance between segmentation partitions is proposed to overcome some of the limitations of existing approaches. Symmetric and asymmetric distance metric alternatives are presented to meet the specificities of a wide class of applications. Experimental results confirm the potential of the proposed measures.

DOI PMID

[31]
Graaf C N D, Koster A S E, Vincken K L, et al. Validation of the interleaved pyramid for the segmentation of 3D vector images[J]. Pattern Recognition Letters, 1994,15(5):469-475.Summary: A multiresolution pyramid with double scale space sampling (compared to the Burt \& Hong scheme) for the segmentation of 3D images, of which the elements are multiple valued, is described. Evaluation is carried out by quality constrained cost analysis (QCCA).

DOI

[32]
Hoover A, Jean-Baptiste G, Jiang X, et al.An experimental comparison of range image segmentation algorithms[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 1996,18(7):673-689.Abstract A methodology for evaluating range image segmentation algorithms is proposed. This methodology involves (a) a common set of 40 laser range finder images and 40 structured light scanner images that have manually specified ground truth and (b) a set of defined performance metrics for instances of correctly segmented, missed and noise regions, over- and under-segmentation, and accuracy of the recovered geometry. A tool is used to objectively compare a machine generated segmentation against the specified ground truth. Four research groups have contributed to evaluate their own algorithm for segmenting a range image into planar patches. Key words: experimental comparison of algorithms, range image segmentation, low level processing, performance evaluation In general, standardized segmentation error metrics are needed to help advance the stateof -the-art. No quantitative metrics are measured on standard test images in most of today's research environments. ---NSF Range Image Unde...

DOI

[33]
Mccane B.On the Evaluation of Image Segmentation Algorithms[C]. Dicta97 & Ivcnz. 1997:455-459.

[34]
Johnson B, Xie Z.Unsupervised image segmentation evaluation and refinement using a multi-scale approach[J]. Isprs Journal of Photogrammetry & Remote Sensing, 2011,66(4):473-483.In this study, a multi-scale approach is used to improve the segmentation of a high spatial resolution (30cm) color infrared image of a residential area. First, a series of 25image segmentations are performed in Definiens Professional 5 using different scale parameters. The optimal image segmentation is identified using an unsupervised evaluation method of segmentation quality that takes into account global intra-segment and inter-segment heterogeneity measures (weighted variance and Moran I, respectively). Once the optimal segmentation is determined, under-segmented and over-segmented regions in this segmentation are identified using local heterogeneity measures (variance and Local Moran I). The under- and over-segmented regions are refined by (1) further segmenting under-segmented regions at finer scales, and (2) merging over-segmented regions with spectrally similar neighbors. This process leads to the creation of several segmentations consisting of segments generated at three different segmentation scales. Comparison of single- and multi-scale segmentations shows that identifying and refining under- and over-segmented regions using local statistics can improve global segmentation results.

DOI

[35]
Lucieer A, Stein A.Existential uncertainty of spatial objects segmented from satellite sensor imagery[J]. IEEE Transactions on Geoscience & Remote Sensing, 2002,40(11):2518-2521.This research addresses existential uncertainty of spatial objects derived from satellite sensor imagery. An image segmentation technique is applied at various values of splitting and merging thresholds. We test the hypothesis that objects occurring at many segmentation steps have less existential uncertainty than those occurring at only a few steps

DOI

[36]
赵磊,陈尔学,李增元,等.基于均值漂移和谱图分割的极化SAR影像分割方法及其评价[J].武汉大学学报·信息科学版,2015,40(8):1061-1068.提出了一种基于均值漂移和谱图分割的极化SAR(PolSAR)影像分割方法。首先,通过均值漂移算法对PolSAR影像进行过分割处理,并基于Wishart统计分布和假设检验的方法构建边缘检测器,充分利用了PolSAR影像的全极化信息提取边缘信息;然后,在过分割和边缘信息的基础上构建相似性度量矩阵,并采用归一化割准则实现PolSAR影像的分割。该算法充分利用了均值漂移算法过分割的特点,降低了谱图分割算法的运算代价,并结合谱图分割算法全局优化的优点改善了PolSAR影像的分割结果;最后,利用Radarsat-2全极化影像进行了实验,并采用改进的分割效果评价方法实现了精度评价。实验表明,该算法有效地实现了PolSAR影像的分割,显著提高了谱图分割算法的效率,分割结果优良,分割精度优于eCognition软件中的多尺度分割方法。

DOI

[ Zhao L, Chen E X, Li Z Y, et al.Segmentation of PolSAR data based on mean-shift and spectral graph partitioning and its evaluation[J]. Geomatics and Information Science of Wuhan University, 2015,40(8):1061-1068. ]

[37]
Liu Y, Bian L, Meng Y, et al.Discrepancy measures for selecting optimal combination of parameter values in object-based image analysis[J]. Isprs Journal of Photogrammetry & Remote Sensing, 2012,68(1):144-156.Most object-based image analysis use parameters to control the size, shape, and homogeneity of segments. Because each parameter may take a range of possible values, different combinations of value between parameters may produce different segmentation results. Assessment of segmentation quality, such as the discrepancy between reference polygons and corresponding image segments, can be used to identify the optimal combination of parameter values. In this research, we (1) evaluate four existing indices that describe the discrepancy between reference polygons and corresponding segments, (2) propose three new indices to evaluate both geometric and arithmetic discrepancies, and (3) compare the effectiveness of the existing and proposed indices in identifying optimal combinations of parameter values for image segmentation through a case study. A Landsat 5 Thematic Mapper (TM) image and an ALOS image of arid Northwestern China were used in the case study. The four existing indices include Quality Rate ( QR ), Over-segmentation Rate ( OR ), Under-segmentation Rate ( UR ), and Euclidean Distance 1 ( ED 1). The three proposed discrepancy indices include Potential Segmentation Error ( PSE ), Number-of-Segments Ratio ( NSR ), and Euclidean Distance 2 ( ED 2). These indices measure overlap, over-segmentation, and under-segmentation between reference polygons and corresponding image segments. Results show that the three proposed indices PSE , NSR , and ED 2 are more effective than the four existing indices QR , OR , UR , and ED 1 in their ability to identify optimal combinations of parameter values. ED 2 that represents both geometric ( PSE ) and arithmetic ( NSR ) discrepancies is most effective.

DOI

[38]
Yang J, He Y, Caspersen J, et al.A discrepancy measure for segmentation evaluation from the perspective of object recognition[J]. Isprs Journal of Photogrammetry & Remote Sensing, 2015,101:186-192.Within the framework of geographic object-based image analysis (GEOBIA), segmentation evaluation is one of the most important components and thus plays a critical role in controlling the quality of GEOBIA workflow. Among a variety of segmentation evaluation methods and criteria, discrepancy measurement is believed to be the most useful and is therefore one of the most commonly employed techniques in many applications. Existing measures have largely ignored the importance of object recognition in segmentation evaluation. In this study, a new discrepancy measure of segmentation evaluation index (SEI) redefines the corresponding segment using a two-sided 50% overlap instead of one-sided 50% overlap that has been commonly used. The effectiveness of SEI is further investigated using the schematic segmentation cases and remote sensing images. Results demonstrate that the proposed SEI outperforms the other two existing discrepancy measures, Euclidean Distance 2 (ED2) and Euclidean Distance 3 (ED3), both in terms of object recognition accuracy and identification of detailed segmentation differences.

DOI

[39]
E. Schöpfer A, S. Lang A.Object fate analysis - A virtual overlay method for the categorization of object transition and object-based accuracy assessment[J]. International Archives of Photogrammetry Remote Sensing & Spatial Information Sciences, 2012.Abstract Land use often changes considerably due to shifts in political and societal systems. The border zone between Austria and Hungary (the former Iron Curtain zone) is an outstanding example for these socio-politically driven changes of land use patterns. This paper discusses a methodology to assess temporal changes, which occurred in three 2500 ha test sites near Lake Fert between 1985 (t0) and 2000 (t1), regarding altered arrangements of agricultural fields. By using Landsat TM and ETM+ imagery, changes were evaluated and quantified field-specific, i.e. comparing the spatial characteristics of t0 and t1 fields and categorizing their 'fate' through time. Object fate analysis has been used to examine spatial changes of the t0 fields by investigating the topological relationships between t0 and t1 fields. Two indices, object loyalty (OL) and interference (I), were introduced as field-specific measures to characterise field stability.

[40]
Blaschke T, Lang S, Hay G J.Object-Based Image Analysis: Spatial Concepts for Knowledge-Driven Remote Sensing Applications[M]. Springer Publishing Company, Incorporated, 2008.

[41]
Bowyer K W.Validation of medical image analysis techniques[J]. Handbook of Medical Imaging, 2000.

[42]
Marpu P.R., Neubert M., Herold H., et al. Enhanced evaluation of image segmentation results[J]. Journal of Spatial Science, 2010,55(1):55-68.Geographic object-based image analysis (GEOBIA) is a promising methodology for image analysis, in which images are first segmented into image segments (or objects) and then analysed based on shape, texture, context and spectral features. The extra dimension of data offered by the objects yields a more enhanced image analysis. The first and most important step is thus the segmentation of images. The effectiveness of the object-based image analysis depends entirely on the quality of the segmentation result. There exist several types of image segmentation algorithms developed for a variety of applications ranging from medical imaging to remote sensing image analysis. It is, therefore, necessary to have an evaluation measure to decide which algorithm can be better for a particular task. Like segmentation itself, there is no standard way of evaluating segmentation results. In this article, we provide an easy way to analyse segmentation results by defining what qualifies as under-segmentation and over-segmentation while analysing the segmentation results of user-selected reference regions. The evaluation criteria are designed to handle the results of multi-level segmentation algorithms, which are commonly used in GEOBIA.

DOI

[43]
Fram J R, Deutsch E S. On the quantitative evaluation of edge detection schemes and their comparison with human performance[J]. IEEE Transactions on Computers, 1975,C-24(6):616-628.Summary: A technique for the quantitative evaluation of edge detection schemes is presented. It is used to assess the performance of three such schemes using a specially-generated set of images containing noise. The ability of human subjects to distinguish the edges in the presence of noise is also measured and compared with that of the edge detection schemes. The edge detection schemes are used on a high-resolution satellite photograph with varying degrees of noise added in order to relate the quantitative comparison to real-life imagery.

DOI

[44]
陈秋晓,陈述彭,周成虎.基于局域同质性梯度的遥感图像分割方法及其评价[J].遥感学报,2006,10(3):357-365.提出了一种针对多波段遥感图像的快速分割方法.该方法首先对遥感图像进行量化,然后提取局域同质性梯度图像,进而利用快速分水岭变换进行初始分割,最后利用改进的区域合并方法获得最终的分割结果.利用Quickbird图像进行了相关的实验,并在像素数量误差准则的指导下进行了分割评价,结果表明所提出的方法是一种有效的遥感图像分割方法,在分割速度和精度等方面优于多分辨率分割方法.

DOI

[ Chen Q X, Chen S P, Zhou C H.Segmentation approach for remote sensing images based on local homogeneity gradient and its evaluation[J]. Journal of Remote Sensing, 2006,10(3):357-365. ]

[45]
Zhan Q M, Molenaar M, Tempfli K, et al.Quality assessment for geo-spatial objects derived from remotely sensed data[J]. International Journal of Remote Sensing, 2007,26(14):2953-2974.Airborne laser scanners and multi‐spectral scanners provide information on height and spectra that offer exciting possibilities for extracting features in complicated urban areas. We apply an object‐based approach to building extraction from image data in an approach that differs from conventional per‐pixel approaches. Since image objects are extracted based on the thematic and geometric components of objects, quality assessments will have to be made object‐based with respect to these components. The known per‐pixel‐based methods for assessing quality have been examined in the new situation as well as their limitations. A new framework for carrying out quality assessments by measuring the similarity between the results of feature extraction and reference data is proposed in this paper. The proposed framework consists of both per‐object and per‐pixel measures of quality, thus providing measures pertaining to qualitative and quantitative measurements of object quality from thematic and geometric aspects. The proposed framework and measures of quality have been applied to an assessment of the results of object‐based building extraction using high‐resolution laser data and multi‐spectral data in two test cases. The results show that the per‐object‐based method of assessing quality gives additional information to conventional per‐pixel, attribute‐only assessment methods.

DOI

[46]
Möller M, Lymburner L, Volk M.The comparison index: A tool for assessing the accuracy of image segmentation[J]. International Journal of Applied Earth Observation & Geoinformation, 2007,9(3):311-321.Segmentation algorithms applied to remote sensing data provide valuable information about the size, distribution and context of landscape objects at a range of scales. However, there is a need for well-defined and robust validation tools to assessing the reliability of segmentation results. Such tools are required to assess whether image segments are based on 'real' objects, such as field boundaries, or on artefacts of the image segmentation algorithm. These tools can be used to improve the reliability of any land-use/land-cover classifications or landscape analyses that is based on the image segments. The validation algorithm developed in this paper aims to: (a) localize and quantify segmentation inaccuracies; and (b) allow the assessment of segmentation results on the whole. The first aim is achieved using object metrics that enable the quantification of topological and geometric object differences. The second aim is achieved by combining these object metrics into a 'Comparison Index', which allows a relative comparison of different segmentation results. The approach demonstrates how the Comparison Index CI can be used to guide trial-and-error techniques, enabling the identification of a segmentation scale H that is close to optimal. Once this scale has been identified a more detailed examination of the CI- H- diagrams can be used to identify precisely what H value and associated parameter settings will yield the most accurate image segmentation results. The procedure is applied to segmented Landsat scenes in an agricultural area in Saxony-Anhalt, Germany. The segmentations were generated using the 'Fractal Net Evolution Approach', which is implemented in the eCognition software.

DOI

[47]
Weidner U.Contribution to the assessment of segmentation quality for remote sensing applications[C]. Proceedings of the 21st Congress for the International Society for Photogrametry and Remote sensing, 2008.

[48]
Clinton N, Holt A, Scarborough J, et al.Accuracy assessment measures for object-based image segmentation goodness[J]. Photogrammetric Engineering & Remote Sensing, 2010,76(3):289-299.To select an image segmentation from sets of segmentation results, measures for ranking the segmentations relative to a set of reference objects are needed. We review selected vector-based measures designed to compare the results of object-based image segmentation with sets of training objects extracted from the image of interest. We describe and compare area-based and location-based measures that measure the shape similarity between segments and training objects. By implementing the measures in two object-based image processing software packages, we illustrate their use in terms of automatically identifying parsimonious parameter combinations from arbitrarily large sets of segmentation results. The results show that the measures have divergent performance in terms of the identification of parameter combinations, Clustering of the results in measure space narrows the search. We illustrate combination schemes for the measures for generating rankings of segmentation results. The ranked segmentation results are illustrated and described.

DOI

[49]
Yang J, Li P, He Y.A multi-band approach to unsupervised scale parameter selection for multi-scale image segmentation[J]. Isprs Journal of Photogrammetry & Remote Sensing, 2014,94(8):13-24.Image segmentation is one of key steps in object based image analysis of very high resolution images. Selecting the appropriate scale parameter becomes a particularly important task in image segmentation. In this study, an unsupervised multi-band approach is proposed for scale parameter selection in the multi-scale image segmentation process, which uses spectral angle to measure the spectral homogeneity of segments. With the increasing scale parameter, spectral homogeneity of segments decreases until they match the objects in the real world. The index of spectral homogeneity is thus used to determine multiple appropriate scale parameters. The performance of the proposed method is compared to a single-band based method through qualitative visual interpretation and quantitative discrepancy measures. Both methods are applied for segmenting two images: a QuickBird scene of an urban area within Beijing, China and a Woldview-2 scene of a suburban area in Kashiwa, Japan. The proposed multi-band based segmentation scale parameter selection method outperforms the single-band based method with the better recognition for diverse land cover objects in different urban landscapes.

DOI

[50]
Cheng J, Bo Y, Zhu Y, et al.A novel method for assessing the segmentation quality of high-spatial resolution remote-sensing images[J]. International Journal of Remote Sensing, 2014,35(10):3816-3839.Image segmentation quality significantly affects subsequent image classification accuracy. It is necessary to develop effective methods for assessing image segmentation quality. In this paper, we present a novel method for assessing the segmentation quality of high-spatial resolution remote-sensing images by measuring both area and position discrepancies between the delineated image region (DIR) and the actual image region (AIR) of a scene object. In comparison with the most frequently used area coincidence-based methods, our method can assess the segmentation quality more objectively in that it takes into consideration all image objects intersecting with the AIR of a scene object. Moreover, the proposed method is more convenient to use than the existing boundary coincidence-based methods in that the calculation of the distance between the boundary of the image object and that of the corresponding AIR of the scene object is not required. Another benefit of this method over the two types of method above is that the assessment procedure of the segmentation quality can be conducted with less human intervention. The obtained optimal segmentation result can ensure maximal delineation of the extent of scene objects and can be beneficial to subsequent classification operations. The experimental results have shown the effectiveness of this new method for both segmentation quality assessment and optimal segmentation parameter selection.

DOI

[51]
于欢,张树清,孔博,等.面向对象遥感影像分类的最优分割尺度选择研究[J].中国图象图形学报,2010,15(2):352-360.影像分割是面向对象遥感影像分类的基础步骤,而分割尺度又是影像分割的核心问题。研究针对面向对象遥感影像分类中的最优分割尺度选择问题,以分割后影像区域对象矢量边界线与欲分类目标对象真实矢量边界的吻合程度为标准,通过两者多向距离量化吻合程度,提出了一种最优分割尺度定量选择的新方法——矢量距离指数法。通过两种实验,同步验证了该方法的正确性与适用性,实验1将基于矢量距离指数法选择的最优分割尺度结果与较为成熟的人为试错法的选择结果比较,结果表明针对7种地类的矢量距离指数均可以正确反映最优分割尺度;实验2挖掘了矢量距离指数法选择的结果与分类精度的关系,结果表明其中5种地类在矢量距离指数法选择的最优分割尺度上均达到了最大的分类精度,另外2种地类的分类结果最符合实地情况,与欲分类目标最为接近。基于矢量距离指数法基本原理,研究针对分割过程中的"淹没"与"破碎"现象,进一步提出了能够反映两者矛盾程度的尺度指数,该指数能够真实反映针对某种特定地物类型分割尺度的大小状况,为衡量"破碎"与"淹没"的矛盾程度提供了一种定量工具,在分割尺度选择过程中具有重要的指示意义。

DOI

[ Yu H, Zhang S Q, Kong B, et al.Optimal segmentation scale selection for object-oriented remote sensing image classification[J]. Journal of Image and Graphics, 2010,15(2):352-360. ]

[52]
刘大伟,黄磊,李斌兵.一种基于对象相似性的遥感影像最优分割尺度评价方法[J].大地测量与地球动力学,2013,33(6):137-140.针对遥感影像分割中最优尺度的选择问题,提出了根据参考对象和分割对象的灰度相似性和形状相似性构建分割尺度评价函数的方法。首先通过实地调绘和目视解译的方法选取典型地物作为参考对象,然后计算参考对象与分割对象的灰度相似性和形状相似性,并最终得到评价函数值,从而确定最优分割尺度。基于该评价函数,对WorldView-2多光谱影像进行了实验,通过实验验证了该方法的有效性和适用性。

[ Liu D W, Huang L, Li B B.An assessment method for optimal segmentation scale of remote sensing image based on similarity between objects[J]. Journal of Geodesy and Geodynamics, 2013,33(6):137-140. ]

[53]
Strasters K C, Gerbrands J J.Three-dimensional image segmentation using a split, merge and group approach[J]. Pattern Recognition Letters, 1991,12(5):307-325.Abstract A 3-D segmentation algorithm is presented, based on a split, merge and group approach. It uses a mixed (oct/quad)tree implementation. A number of homogeneity criteria is discussed and evaluated. An example shows the segmentation of mythramycin stained cell nuclei.

DOI

[54]
May R.Suevey image segmentation techniques[J]. Computer Vision Graphics & Image Processing, 1985,29(1):100-132.

[55]
Weszka J S, Rosenfeld A.Threshold evaluation techniques[J]. IEEE Transactions on Systems Man & Cybernetics, 1977,8(8):622-629.

[56]
Zé, Boudj R.Filtrage, seuillage automatique, contraste et contours : du pré-traitement à l'analyse d'image[J]. Bibliogr, 1988.Etude de quelques aspects du traitement et de l'analyse d'image : présentation d'un lissage adaptatif mettant en évidence les régions qui composent une image; introduction de la notion de contraste utile en seuillage d'image; segmentation d'image; techniques d'extraction d'information par seuillage d'image et détection de contours; classification de formes utilisant la courbure

[57]
Chen H C, Wang S J.The use of visible color difference in the quantitative evaluation of color image segmentation[C]. IEEE International Conference on Acoustics, Speech, and Signal Processing, 2004. Proceedings. IEEE Xplore, 2004.

[58]
Sahoo P K, Soltani S, Wong A K C, et al. A survey of thresholding techniques[J]. Computer Vision Graphics & Image Processing, 1988,41(2):233-260.In digital image processing, thresholding is a well-known technique for image segmentation. Because of its wide applicability to other areas of the digital image processing, quite a number of thresholding methods have been proposed over the years. In this paper, we present a survey of thresholding techniques and update the earlier survey work by Weszka ( Comput. Vision Graphics & Image Process 7 , 1978 , 259–265) and Fu and Mu ( Pattern Recognit. 13 , 1981 , 3–16). We attempt to evaluate the performance of some automatic global thresholding methods using the criterion functions such as uniformity and shape measures. The evaluation is based on some real world images.

DOI

[59]
Ohtsu N.A Threshold Selection Method from Gray-Level Histograms[J]. IEEE Transactions on Systems Man & Cybernetics, 1979,9(1):62-66.

[60]
Levine M D, Nazif A M.Dynamic measurement of computer generated image segmentations[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 1985,7(2):155-64.This paper introduces a general purpose performance measurement scheme for image segmentation algorithms. Performance parameters that function in real-time distinguish this method from previous approaches that depended on an a priori knowledge of the correct segmentation. A low level, context independent definition of segmentation is used to obtain a set of optimization criteria for evaluating performance. Uniformity within each region and contrast between adjacent regions serve as parameters for region analysis. Contrast across lines and connectivity between them represent measures for line analysis. Texture is depicted by the introduction of focus of attention areas as groups of regions and lines. The performance parameters are then measured separately for each area. The usefulness of this approach lies in the ability to adjust the strategy of a system according to the varying characteristics of different areas. This feedback path provides the means for more efficient and error-free processing. Results from areas with dissimilar properties show a diversity in the measurements that is utilized for dynamic strategy setting.

DOI PMID

[61]
Pal N R, Bhandari D.Image thresholding: some new techniques[J]. Signal Processing, 1993,33(2):139-158.Certaines des techniques de sélection de seuil existantes ont été passées en revue de manière critique. Deux algorithmes basés sur une mesure nouvelle d'entropie conditionnelle de l'image partitionnée ont été formulés. L'algorithme de seuillage à erreur approximativement minimale de Kittler et Illingworth a été implanté, cet algorithme considérant une distribution de Poisson pour les niveaux de gris au lieu de la distribution normale communément utilisée. Une justification de l'emploi de la distribution de Poisson a également été donnée. Cette méthode se trouve être bien meilleure aussi bien du point de vue de la convergence que de celui du résultat de la segmentation. Les méthodes proposées ont été appliquées à un certain nombre d'images et il a été constaté qu'elles produisaient de bons résultats. L'évaluation objective des seuils a été conduite en utilisant la divergence, l'uniformité de région, la corrélation entre images originale et segmentée, et l'entropie de deuxìeme ordre.

DOI

[62]
Woodcock C E, Strahler A H.The factor of scale in remote sensing[J]. Remote Sensing of Environment, 1987,21(3):311-332.Thanks to such second- and third-generation sensor systems as Thematic Mapper, SPOT, and AVHRR, a user of digital satellite imagery for remote sensing of the earth's surface now has a choice of image scales ranging from 10 m to 1 km. The choice of an appropriate scale, or spatial resolution, for a particular application depends on several factors. These include the information desired about the ground scene, the analysis methods to be used to extract the information, and the spatial structure of the scene itself. A graph showing how the local variance of a digital image for a scene changes as the resolution-cell size changes can help in selecting an appropriate image scale. Such graphs are obtained by imaging the scene at fine resolution and then collapsing the image to successively coarser resolutions while calculating a measure of local variance. The local variance/resolution graphs for the forested, agricultural, and urban/suburban environments examined in this paper reveal the spatial structure of each type of scene, which is a function of the sizes and spatial relationships of the objects the scene contains. At the spatial resolutions of SPOT and Thematic Mapper imagery, local image variance is relatively high for forested and urban/suburban environments, suggesting that information-extracting techniques utilizing texture, context, and mixture modeling are appropriate for these sensor systems. In agricultural environments, local variance is low, and the more traditional classifiers are appropriate.

DOI

[63]
王志华, 孟樊, 杨晓梅,等. 高空间分辨率遥感影像分割尺度参数自动选择研究[J].地球信息科学学报,2016,18(5):639-648.lt;p>面向对象解译技术在高分辨率遥感影像信息提取中得到广泛应用,但影像分割的基础问题仍严重制约其自动化水平,尤其是分割参数选择。因此,本文以广泛使用的分型网络演化分割算法为例,开展尺度参数选择研究。借鉴对遥感影像分辨率敏感的局部方差指标,引入边长和面积权重,构造加权局部方差(WLV)指标,对多个分割结果进行评价,进而实现最佳尺度参数选择。在珠江区域2.5 m的SPOT 5融合影像上进行实验,通过计算最佳分割结果与人工分割结果的相似度对WLV进行定量验证。此外,还对WLV在分割对象最小为一个像元、最大为整景影像的全范围尺度参数的变化规律进行了实验,结果表明:在WLV随尺度参数的变化曲线中,不同极大值点的分割结果反映了实验区不同景观层级上的斑块,其中第1个极大值点对应的分割结果能够较好地反映影像的最小可识别单元。</p>

DOI

[ Wang Z H, Meng F, Yang X M, et al.Study on the automatic selection of segmentation scale parameters for high spatial resolution remote sensing images[J]. Journal of Geo-information Science, 2016,18(5):639-648. ]

[64]
张俊,朱国龙,李妍.面向对象高分辨率影像信息提取中的尺度效应及最优尺度研究[J].测绘科学,2011,36(2):107-109.本文从面向对象的遥感信息提取中的尺度效应研究入手,对影像对象的分形维数、紧凑度、面积、均值、标准差和与邻域均值差分等特征进行了实验。在此基础上,根据"类内同质性大,类间异质性大"的最佳分类原则,提出了面向对象的RMAS方法,该方法的思想是,当对象RMAS值最大时,对象内部的异质性最小,对象外部的异质性最大,此时的分割尺度为类别提取的最优分割尺度。根据最优尺度下信息提取精度最高的原理,实验验证了该方法的可行性,且能获得较好的分类结果。

[ Zhang J, Zhu G L, Li Y.Scale effect and optimal scale in object-oriented information extraction of high spatial resolution remote sensing image[J]. Science of Surveying and Mapping, 2011,36(2):107-109. ]

[65]
Fotheringham A S, Brunsdon C, Charlton M.Quantitative geography: perspectives on spatial data analysis[J]. Sage Publications, ppXI - XII Isaaks E. and R. Mohan Srivastava, 2000,50(1):143-163.Abstract: . This paper provides an overview of quantitative geography. In addition to discussing what quantitative geography is, the paper details the methods that have come to define it. Six broad categories are used to discuss the range of methods found in quantitative geography: geographic information systems; airborne sensing (global positioning system, photogrammetry, and remote sensing); statistics and exploratory spatial data analysis; mathematics and optimization; regional analysis; and computer science and simulation. Particular emphasis is given to the state of the art in each area and the contributions of geographers, with associated discussion on major unresolved issues and future research directions.

DOI

[66]
张建廷,张立民,徐涛.遥感图像的异质性测度分割效果评价[J].测绘科学技术学报,2015,32(5):479-482.针对高分辨率遥感图像分割结果的评价问题,提出一种基于异质性测度的非监督分割评价方法.首先,通过全局方差和加权Moran指数分别表示对象内异质性和对象间异质性,并利用二者归一化后的和式来对整体分割结果进行评价.其次,为了进行局部分割结果评价,提出一种基于对象方差和局部Geary指数的异质性测度.最后,利用多分辨率分割方法对遥感图像进行分割,并利用提出的方法进行分割评价.实验结果表明,提出的方法能够对不同分割尺度结果进行有效评价,同时可以对过分割区域和欠分割区域进行判断.

DOI

[ Zhang J T, Zhang L M, Xu T.Heterogeneity measure based segmentation performance evaluation for remote sensing image[J]. Journal of Geomatics Science and Technology, 2015,32(5):479-482. ]

[67]
Espindola G M, Camara G, Reis I A, et al.Parameter selection for region-growing image segmentation algorithms using spatial autocorrelation[J]. International Journal of Remote Sensing, 2006,27(14):3035-3040.Region‐growing segmentation algorithms are useful for remote sensing image segmentation. These algorithms need the user to supply control parameters, which control the quality of the resulting segmentation. An objective function is proposed for selecting suitable parameters for region‐growing algorithms to ensure best quality results. It considers that a segmentation has two desirable properties: each of the resulting segments should be internally homogeneous and should be distinguishable from its neighbourhood. The measure combines a spatial autocorrelation indicator that detects separability between regions and a variance indicator that expresses the overall homogeneity of the regions.

DOI

[68]
何敏,张文君,王卫红.面向对象的最优分割尺度计算模型[J].大地测量与地球动力学,2009,29(1):106-109.面向对象的影像分析方法适合高分辨率遥感影像信息提取,其核心问题在于实现对高分辨率遥感影像的多尺度分割。提出了一种针对面向对象分析方法中多尺度分割的最优分割尺度计算模型,并进行了影像分割实验。结果表明,此模型能快速地获取可靠的最优分割尺度。

[ He M, Zhang W J, Wang W H.Optimal segmentation scale model based on object-oriented analysis method[J]. Journal of Geodesy and Geodynamics, 2009,29(1):106-109. ]

[69]
张仙,明冬萍.面向地学应用的遥感影像分割评价[J].测绘学报,2015(S1):108-116.

[ Zhang X, Ming D P.Geo-application oriented evaluations of remote sensing image segmentation[J]. Acta Geodaetica et Cartographica Sinica[J]. 2015(S1):108-116. ]

Outlines

/