The first step in object-based image classification is segmentation: calculating discrete regions of image objects. This is achieved by stratification of an image. These image-objects are used further as the basic unit of analysis for developing image-analysis strategies, including classification and change detection.
To date, various image-segmentation techniques have been developed with mixed results. The earlier developments in segmentation techniques, however, were in the machine-vision domain and aimed at the analysis of patterns, the delineation of discontinuities on materials or artificial surfaces, and the quality control of products. Later, these concepts were used in remote sensing (RS) for attribute identification.
Functionally, the segmentation process partitions an image into statistically homogenous regions or objects (segments) that are more uniform within themselves and differ from their adjacent neighbors. Another conceptually important aspect of image segmentation is its relevance to spatial-scale theory in RS, which describes how local variance of image data in relation to the spatial resolution can be used to select the appropriate scale for mapping individual land-use classes. Image segmentation defines the size and the shape of image objects and influences the quality of the follow-on analysis: classification.
However, a single segmentation algorithm does not yet exist that can reproduce the same objects as a human can identify intuitively. To attempt to overcome this issue, image objects are created that are as homogeneous as possible and grouped together by using a classification process to create objects that can come close to human perception. This can be an iterative process in which various segmentation algorithms are used, or a specific algorithm is used with various parameters, to achieve a suitable result.
A data set of input imagery contains multiple channels. The underlying algorithm is a hierarchical step-wise region grown by using random "seeds" spread over the entire image. This method can be classified as a "bottom-up" optimization routine that starts with a pixel and ends with segments that are groups of like pixels.
The criteria that defines the growth of a region can be based on the difference between the pixel intensity and the mean of the region. The algorithm assesses the local homogeneity based on the spectral and spatial characteristics. The size of the object is controlled with the value for scale, which you specify. The larger the scale, the larger will be the output object. Other homogeneity criteria are based on shape and compactness.
The polygons, at this level of abstraction, can be considered image objects in the image domain and do not necessarily represent objects in the real world.
© PCI Geomatics Enterprises, Inc.®, 2026. All rights reserved.