Segmentation

The first step in object-based image classification is segmentation: calculating discrete regions of image objects. This is achieved by stratification of an image. These image-objects are used further as the basic unit of analysis for developing image-analysis strategies, including classification and change detection.

To date, various image-segmentation techniques have been developed with mixed results. The earlier developments in segmentation techniques, however, were in the machine-vision domain and aimed at the analysis of patterns, the delineation of discontinuities on materials or artificial surfaces, and the quality control of products. Later, these concepts were used in remote sensing (RS) for attribute identification.

Functionally, the segmentation process partitions an image into statistically homogenous regions or objects (segments) that are more uniform within themselves and differ from their adjacent neighbors. Another conceptually important aspect of image segmentation is its relevance to spatial-scale theory in RS, which describes how local variance of image data in relation to the spatial resolution can be used to select the appropriate scale for mapping individual land-use classes. Image segmentation defines the size and the shape of image objects and influences the quality of the follow-on analysis: classification.

However, a single segmentation algorithm does not yet exist that can reproduce the same objects as a human can identify intuitively. To attempt to overcome this issue, image objects are created that are as homogeneous as possible and grouped together by using a classification process to create objects that can come close to human perception. This can be an iterative process in which various segmentation algorithms are used, or a specific algorithm is used with various parameters, to achieve a suitable result.

A data set of input imagery contains multiple channels. The underlying algorithm is a hierarchical step-wise region grown by using random "seeds" spread over the entire image. This method can be classified as a "bottom-up" optimization routine that starts with a pixel and ends with segments that are groups of like pixels.

The criteria that defines the growth of a region can be based on the difference between the pixel intensity and the mean of the region. The algorithm assesses the local homogeneity based on the spectral and spatial characteristics. The size of the object is controlled with the value for scale, which you specify. The larger the scale, the larger will be the output object. Other homogeneity criteria are based on shape and compactness.

The result of the segmentation process is as follows:
  • An initial abstraction of the original data
  • Creation of a vector (polygon) representation of the image objects

The polygons, at this level of abstraction, can be considered image objects in the image domain and do not necessarily represent objects in the real world.

Required available memory
Segmentation is intense computationally and, as such, relies on the amount of memory (RAM) on your computer. The amount of memory used depends on the size (in pixels and channels) of the input raster. To calculate the amount of RAM (in gigabytes) that will be required by Object Analyst to process a file, you can use the following formula:
  • nGB = nX × nY × (4 × nC + 29) ÷ 795364314.1
  • where:
  • nGB is the amount of required memory (RAM in gigabytes) to perform the segmentation
  • nX is the number of pixels in the x-dimension (pixels)
  • nY is the number of pixels in the y-dimension (lines)
  • nC is the number of channels
References
For more information about region-growing segmentation, scale, shape, and compactness parameters, see the following published works:
  • Baatz, Martin, and Arno Schäpe. "Multiresolution segmentation: an optimization approach for high quality multi-scale image segmentation." Angewandte Geographische Informationsverarbeitung XII 58 (2000): 12-23.
  • Dey, Vivek, Yun Zhang, and Ming Zhong. A review on image segmentation techniques with remote sensing perspective. na, 2010.

© PCI Geomatics Enterprises, Inc.®, 2026. All rights reserved.