A Survey of Thresholding Techniques
A Survey of Thresholding Techniques
A Survey of Thresholding Techniques
net/publication/220632416
CITATIONS READS
2,223 5,614
3 authors:
Andrew Wong
University of Waterloo
229 PUBLICATIONS 7,718 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
Segmentation of Color Images using Information Theory and PSO View project
All content following this page was uploaded by Sasan Soltani on 06 November 2018.
SURVEY
AND
Y. c. CHBN
Department of Electrical Engineering, University of Waterloo, Waterloo, Canada N2L 3GI
Received July 7,1984; accepted July 9,1987
1. INTRODUCTION
A popular tool used in image segmentationis thresholding. This paper represents
a survey of a variety of thresholding (also known as binarization) techniques
including both global and local thresholding. For the sake of discussion, global
techniques are further classified as: point-dependent and region-dependent tech-
niques. Several global thresholding methods are examined in detail to evaluate their
performance based on a given set of test images. In this paper, we endeavor to
update the earlier survey papers by Weszka [44] and Fu and M u [13]. W e also
attempt to evaluate the performance of several global thresholding methods using a
shape measure and a region uniformity measure as criterion functions. The evalua-
tion is conducted on a set of images digitized from photographs of a portrait and
natural scenes.Although our primary goal is to present a survey of bilevel threshold-
ing methods, extensions of these methods to m u ltithresholding problems are also
treated in this paper.
Let N be the set of natural numbers, (x, y) be the spatial coordinate of a
digitized image, and G = (0, 1, . . . , I - 1) be a set of positive integers representing
gray levels. Then, an image function can be defined as the m a p p ing f : N x N + G .
The brightness (i.e., gray level) of a pixel with coordinate (x, y) is denoted as
f(K Y)-
Let t E G be a threshold and B = {b,, b,} be a pair of binary gray levels and
b,,b, E G . The result of thresholding an image function f(. , a) at gray level t is a
*This work was supported by the Natural Sciencesand Engineering Reearch Council, Canada, under
Grants A4716 and G1367.
233
0734-189X/88 $3.00
Copyright 0 1988 by Academic Press, Inc.
All rights of reproduction in any form reserved.
234 SAHOO ET AL.
p,= “I.
n
Also, by convention, the gray level 0 is the darkest and the gray level 1 - 1 is the
lightest.
2. GLOBAL THRESHOLDING: POINT DEPENDENT TECHNIQUES
A. p-tile Method
One of the earliest thresholding methods is the p-tile method [lo]. In this method,
an image is assumed to consist of dark objects in a light background. By assuming
that the percentage of the object area is known, the threshold is defined as the
highest gray level which maps at least (100 - p)% of the pixels into the objects in
the thresholded image. For example, suppose an object occupies 20% of an image,
then the image should be thresholded at the highest gray level that allows at least
20% of the pixels to be mapped into the object. Obviously, this method is not
applicable to images whose object area is not known.
B. Mode Method
For images with distinct objects and background, the histogram of the gray levels
will be bimodal. In this case, a threshold can be chosen as the gray level that
corresponds to the valley of the histogram. The technique is called the mode method
[31]. Though this method is simple, it cannot be applied to images with extremely
unequal peaks or to those with broad and flat valleys.
C. Ostu Method
This method, as proposed in [27], is based on discriminant analysis. In this
method, the threshold operation is regarded as the partitioning of the pixels of an
SURVEY OF THRESHOLDING TECHNIQUES 235
image into two classes C,, and C, (e.g., objects and background) at gray level t. That
is, C, = {O,l,. . . , t}andC,={t+l,t+2,...,1-l}.Leta~,u&andc$bethe
within-class variance, between-class variance, and the total variance, respectively.
An optimal threshold can be determined by m inimiz,ing one of the following
(equivalent) criterion functions with respect to t:
4
q=<, and K=y.
UT UW
Of the above three criterion functions, n is the simplest. Thus, the optimal
threshold f* is
t* = ArgMinn,
tEG
where
I-1 I-1
U$= C(i-jJT)*Pi, PT= zipi?
i=O i=O
t
wO= CPi, 01 = 1 - wg,
i=O
PT- CL,
po=II, p‘= i ip,.
I-%= l-oo’
*0 i=O
is introduced. Ei measures the balance of the histogram about the gray level g,. For
spurious concavities, which usually lie only on one side of the histogram, the values
of E; will be small. Thus, spurious concavities can be eliminated by ignoring
maxima of h - h when Ei is small. The remaining maxima indicate possible
locations for thresholding, but they may not be optimal. Other gray levels near these
maxima should also be considered for possible improvement.
E. Entropic Methods
In these recently developed methods, the gray level histogram is considered as a
I-symbol source. The optimal threshold is obtained by applying information theory.
Pun [32] has shown the maxir&in g H’ is equivalent to maximizing the evaluation
function
+ l--
4
[ 1 log,U
4
- PA
log,m~~p,+l~.--~ p,-d
Ht = - i p;log,p;,
i=o
I-l
HT = - C PihePi>
i=O
‘t’ CPi’
i=O
SURVEY OF THRESHOLDING TECHNIQUES 237
In his second algorithm, Pun [33] proposes the use of an anisotropy coefficient, (Y,
in thresholding, where
i=o
a= I-1
1;oPi10gt3 Pi ’
5 p; 2 0.5.
i=O
However, Kapur et al. [16] have shown that this algorithm will always threshold an
image with t* 2 m, thus introducing unnecessary bias.
--PO Pl Pt
P, ’ P, ’. * . ’ p,
and
H,(t) = - i Flog, ; )
i=o t i ti
l-l
H,(t) = - c Alog, & .
i=t+l ’ - ‘t i ti
Then the optimal threshold t* is defined as the gray level which maximizes
H,,(t) + H,,,(t), that is,
238 SAHOO ET AL.
t* = Argzg{S(t) + g(t)},
where
and
i II
l-l
? i=:lpi .
i=l
F. Moment-Preserving Method
In this method [40], the threshold values are computed deterministically in such a
way that the moments of an image to be thresholded are preserved in the output
(binary) image. This i” moment mi is calculated as
m, = i ‘$ g’h(g), i = 1,2,3,
g-0
where n is the total number of pixels in the image. The threshold value t* is
obtained from the gray level histogram of the image by choosing t* as the PO-tile,
where p. is given by
z - ml
PO = (c; - &,y2
and
P(g) = i P,P(gli)9
r=l
where
However, the parameters pi, u/, and Pi (i = 1,2) of the mixture density p(g)
associated with an image to be thresholded are not usually known. To overcome the
difficulty of estimating these unknown parameters, Kittler and Illingworth [19]
introduced a criterion function J(t), which is given by
-2{wbg,pl(t) + p20)1%920)~~
where
t I-1
l%(t)=
and
l-l
c (g - dt))2h(d
&) = i g=t+l
P,(t)
240 SAHOO ET AL.
that is, the (i, j) element of M is the frequency that gray level i occurs as a
4-neighbor of gray level j.
Because of homogeneity, pixels interior to the objects or background should
contribute mainly to the near-diagonal entries of M . Also, pixels near an edge
should contribute mainly to the off-diagonal entries of M because of the gray level
change near an edge. Therefore, the matrix M can be used to define two new
histograms:
(a) A histogram based on the near-diagonal entries of M . This histogram
should have a deep valley between the object and the background gray levels.
(b) A histogram based on the off-diagonal entries of M . This histogram should
have a sharp peak between the object and the background gray levels.
A threshold for the image can then be chosen from the gray level range in which
the valley in (a) overlaps with the peak in (b).
(ii) (Gray Level, Local Average Gray Level) Scatter Plot Method
This method [18] is quite similar to the co-occurrence matrix method discussed in
the previous section. In a (gray level, local average gray level) scatter plot, the origin
is defined as the upper left-hand comer, the gray level increases from left to right,
and the local average gray level, taken over a window, increases from top to bottom.
The intensity of a point on this plot is proportional to the occurrence frequency of
the corresponding pair (gray level, local average gray level). In Kirby and Rosenfeld
[18], the local averages are taken over a 3 x 3 square window.
The near-diagonal entries in the scatter plot represent pixels whose local average
gray level is close to the gray level; such pixels are likely to be interior to the objects
or background because of homogeneity. Thus, the histogram of the gray levels of
these pixels should have a deep valley. On the other hand, the off-diagonal entries in
the scatter plot represent pixels near an edge, so the gray level histogram obtained
from these pixels should have a sharp peak. As in the co-occurrence matrix method,
a threshold can be determined from the gray level range in which the valley and the
peak of the above histogram overlap with each other.
Using the notations of Section 3B(i), the transition matrices defined in [9] can be
written as
Th = M&O),
T, = JqlJn,2)7
Tub= T,+ T,,.
Any one of the above transition matrices can be used in threshold selection and we
shall refer to the transition matrix as T in the following discussion. It was reported
in [9] that the results obtained with different transition matrices do not differ
significantly.
Let qj be the (i, j) entry of T and t be a threshold dividing the set of gray levels
into two classes: C, = (0, 1, . . . , t}andC,={t+l,t+2,...,1-l}.ThenTcan
be partitioned into four regions defined by the following parameters:
I-1 l-l
a= i &j, b= C C Tj>
i=O j=CJ i=t+l j=t+l
I-1 r
c= i ‘g 71j, d= c c qj.
i=O j=t+l i=f+l j=O
Thus a, b, c, and d represent the total number of transitions within Co, within C,,
from C, to C,, and from C, to C,, respectively. The joint and conditional
probabilities of the transition between Co and C, can be estimated by
c+d
P,(t) =
a+b+c+d’
and
Deravi and Pal [9] call P,(t) and P,(t) interaction measures. They also point out
that P,(t) is similar to the “business measure” in [47] and that P,(t) does not
depend directly on the histogram. The optimal threshold, t*, is obtained by
maximizing either one of the above interaction measures.
D. Relaxation Metho&
The idea of relaxation was introduced by Southwell [38,39] to improve the
convergence of recursive solution for system of linear equations. In image segmenta-
tion, relaxation is applied as follows. The pixels of an image are first probabihsti-
tally classified into “light” and “dark” classes based on their gray levels. Then the
probability of each pixel is adjusted according to the probabilities of the neighbor-
ing pixels. This adjustment process is iterated so that “light probabilities” (resp.
“dark probabilities”) become very high for pixels belonging to the light (resp. dark)
regions. An attractive feature of these relaxation methods is that they are p
processing techniques, as opposed to the sequential techniques discussed so far.
SURVEY OF THRESHOLDING TECHNIQUES 243
l - gi
Piqdark= -I-d’
and
gi - d
&i&t- -- 1-d’
Though the above method is simple, it does not work in cases where the object
and background gray levels do not lie on different halves of the gray level histogram.
To avoid this problem, Rosenfeld and Smith [36] suggest another initialization
scheme. Let m be the mean gray level. Then, if gj > m , let
1 1 gi-m
Piqlight=2+-- 2 l-m’
and if gi I m , let
1 lm-g,
p!&&=Z+---
2 m-d’
Fekete et al. [ll] suggest an approach in which they assume the histogram can be
divided into two Gaussian subpopulations so that the gray level distribution can be
written as the sum of two Gaussian distributions. The parameters of these Gaussian
distributions are determined by a method suggested in [7]. They find that faster
convergence of the relaxation process can be obtained with this method.
Zucker et al. [52] propose the following equation for updating the probabilities:
They point out that maximizing C(p) is equivalent to reducing the inconsistency
and ambiguity of the pixel labeling.
Here we suggest a similar method that uses a different criterion function derived
from information theory. In this method the optimal labeling scheme is determined
by
I-1
Pi(‘j)
I( Pi3 4i) = i Pi(‘j ___
j=l e 4iCxj> .
i I
F. Other Methods
Thresholding techniques for optical character recognition systems have also
received much attention. Because of the wide range of print quality distortions over
a single document, a combination of threshold operators is often used, with each
operator sensitive to a different type of distortion. Bartz [3], for example, combines
four linear threshold operators to form a single threshold. An example of these
operators is T = ku + c, where u is the average contrast over previously scanned
characters, and k and c are optimizing parameters. Wolfe [50] suggests a two-step
procedure to deal with the problem of shading in printed characters. During the first
step, the mean gray level of each pixel is computed by averaging the gray levels over
a 4 x 4 window. A pixel X is considered as belonging to a character if its mean
gray level is darker than the mean gray levels of the two pixels whose orientations
differ by 180’ and are distance 8 apart. The second step is similar to the first step,
but a larger window is used.
In Ullmann [41], the threshold of a pixel X is selected based on the gray levels in
a 5 x 5 window, W, of X. Only the pixels which are labeled n contribute to the
threshold decision (Fig. 1).
Two experimental rules, to be described later, are used to determine the threshold
of X. Let n, be the highest (brightest) gray level in W. Then rule (1) is applied if
n x < 40 and rule (2) is applied when n x > 40. The two rules are:
(1) Label X as an object point if for some point n in W, we have g(X) - g(n)
< 7, where g( -) means the gray level of (e) and r is some predefined threshold;
otherwise label it as a background point.
(2) Label X as an object point if for at least one n in W, we have g(X) <
g(n)/p, where 1-1is some predefined constant; otherwise label it as a background
point.
Morrin [25] uses gray level versus gradient value plots to convert gray scale
images with superior resolution and contrast for thresholding. Panda [28] suggests a
method in which the threshold applied at a point depends on both the gray level and
the edge value of that point. Some results and comparisons are given in [28].
0 n n n 0
n n 0 II n
n 0 x 0 n
n n Cl 0 n
0 n n n 0
4. LOCAL THRESHOLDING
In local thresholding, the original image is partitioned into smaller subimages and
a threshold is determined for each of the subimages. This yields a thresholded image
with gray level discontinuities at the boundaries of two different subimages. The
threshold of a region can be determined by either the point-dependent method or
the region-dependent method. A smoothing technique is then applied to eliminate
the discontinuities.
Chow and Kaneko [7, S] suggest the use of a 7 X 7 window for local thresholding.
In their method, the original image is divided into 7 x 7 subimages and a threshold
is computed for each subimage. However, a threshold is not computed for sub-
images with unimodal gray level histogram. Thresholds for such subimages are
interpolated from neighboring subimages. For a bimodal subimage, the threshold is
computed as follows. First the gray level histogram for a subimage is approximated
by a sum of two Gaussian distributions, then the threshold is obtained by minimiz-
ing the classification error with respect to the threshold value. Some experiments on
this method were done in [26].
For X-ray angiograms Fernando and Monro [12] suggest a local thresholding
method. Global thresholding techniques have been found to be unsatisfactory for
these images which are usually unimodal with a very narrow peak. According to this
new method, the original image is partitioned with 16 nonoverlapping subimages
and the entropic thresholding technique of Pun [32] is applied to determine the
threshold value for each of these subimages. Finally, the entire thresholded image is
processed by a low-pass filter to eliminate the gray level discontinuities at the
boundaries of subimages.
5. MULTITHRESHOLDING METHODS
Many global thresholding methods, such as Ostu [27], Pun [32,33], Kapur et al.
[16], moment preserving [40], minimum error [19] can be extended to the case of
multithresholding. In this section we discuss three multithresholding methods which
are not included in the earlier sections of this paper.
F(k) = “1”
‘h(g) ’
g=o
where F’ and F” are the first and second derivatives of F, respectively. It is pointed
out in [5] that C(x) is noisy and oscillatory and should be smoothed and approxi-
mated for use in thresholding. The zeros of the curvature determines the thresholds
as well as the gray level to be assigned to each class.
c(t)
CL(t)= -N(t)
with p(t) = 0 if N(t) = 0 and C(t) is the total contrast detected by threshold t, and
N(t) is the number of edges detected by t. For multithresholding any initial
threshold is first selected and then a new histogram of p(t) is computed by
removing the contribution of the already detected edges by the initial threshold.
This procedure is continued until the maximum average contrast for any threshold
fell below some m inimum average contrast criterion 8 > 1.
6. EXPERIMENTAL RESULTS
Not all the methods surveyed in this paper are automatic in nature (i.e., they
require no human interactions). Some require user’s feedback since unique optimal
threshold is not possible. In local thresholding methods, the image is divided into
smaller subimages and suitable global methods are adopted to compute the optimal
thresholds for each subimage. It seems that global methods are frequently used for
threshold&g. For this reason, we intend to evaluate some of the global thresholding
methods, namely the Ostu method [27], the Pun method [32], the Johannsen and
Bille Method [15], the method of Kapur et al. [16], the co-occurrence matrix
method [2], the histogram concavity analysis method [35], the method due to Deravi
and Pal [9], the moments preserving method [40], and the m inimum error method
FIG. 2. (a) Digitized cameraman image; (b) histogram of cameraman image.
[19]. These methods are considered for a comparative study since they are automatic
and in most cases do not require a subjective judgement for finding the best
threshold from a set of optimal threshold values. The above mentioned methods
are applied to three images, “cameraman”, the “building”, and the “model”. The
cameraman image is digitized from the printed version of the same image in [32].
This is done with a raster of 415 X 395 and the quantization is 256 gray levels. The
digitized image and its gray level histogram are shown in Fig. 2. The building image
is also digitized from the printed version of the same image in [32] with a raster of
411 x 403 and quantized to 256 gray levels. The image of the model is digitized
with a raster of 321 X 314 and quantized to 256 levels. The building and the image
of the model are shown in Figs. 3 and 4 along with their gray level histograms. The
threshold values obtained by methods considered above are presented in Table 1.
For the co-occurrence matrix method the distance between cells d was chosen to
be 1 and 5% of the elements of the M matrix that lie farthest from the diagonal was
projected onto the diagonal. Incidentally a threshold value of 63 was found for all
three images. This value did not change when 10% of the elements instead of 5% was
used. However, when d changed from 1 to 2 (resp. 3) the threshold values for the
FIG. 4. (a) Digitized image of the model; (b) histogram of the model image.
cameraman and the building images did not change, but the threshold value of the
model image changed to 30 (resp. 33). On changing the percentage of elements of M
projected onto the diagonal to 1% and 2% while keeping d values to 2 and 3,
respectively, we have obtained a threshold value 63 in both cases. Thus, whenever
higher values of d are used, it is advisable to decrease the percentage of elements of
A4 that lie farthest from the diagonal and projected onto the diagonal.
The Deravi and Pal method [9] was implemented for the cameraman picture only.
With their method, two local m inima were found, one at 45 and the other at 116.
Since thresholding at 45 does not provide a better binary image we select 116 as the
optimal threshold value. Figures 5-7 show the binary images obtained by the
thresholding methods considered in this section.
7. MEASURES FOR EVALUATING THRESHOLDING METHODS
In digital images, uniformity and shape of the objects play great roles in
separating objects from the background. The amount of agreement of these two
aspects of every binary image with the real image has been evaluated for the three
test images considered in the previous section. The uniformity measure used in this
TABLE 1
Optimal Threshold Value for the Set of Test Images
Co-occurrence matrix 63 63 63
Deravi & Pal 116
Histogram concavity 127 47 127
Johamwn & Bille 75 79 114
Kapur et al. 123 69 81
Minimum error 111 37 113
Moment-preserving 91 76 76
Ostu 86 73 91
b 1321 131 28 63
250 SAHOO ET AL.
FIG. 5. Binary images of the cameraman: (a) co-occurrence matrix method (t’ = 63); (b) Deravi and
Pal method (t* = 116); (c) histogram concavity method (t* = 127); (d) Johannsen and Bille method
(t* = 75); (e) Kapur, Sahoo, and Wong method (t* = 123); (f) minimum error method (t* = 111): (8)
moment-preserving method (z* = 91); (h) Ostu method (r* = 86); (i) Pun method [32] (t* = 131).
paper is adopted from Levine and Nazif [22]. For a given threshold value t, the
uniformity measure U(t) is given by
u1’+ (72”
U(t) = 1 - --y--y
where
Ri = Segmented region i
f (x, y) = The gray level of the pixel (x, y)
c f(w)
(x>Y)ER,
Pi = Ai ’
Ai = Number of pixels in R,, i = 1,2,
SURVEY OF THRESHOLDING TECHNIQUES 251
FIG. 5.-Continued
and
C = A normalization factor.
Levine and Nazif [22] have included a weighting factor in the calculation of
uniformity measure U for images with more than two regions.
A measure, called shape meusure S, is used for the measurement of the shape of
the object in the test images. The measure S for a given image is calculated as
follows: (a) assign a generalized gradient value A(x, y) to every pixel (x, y); (b) if
the pixel (x, y) has a gray value higher than the average of its neighbors then assign
the “ + ” sign to the generalized gradient value A(x, y), else assign the “ - ” value;
FIG. 6. Binary images of the building: (a) co-occurrence matrix method (t* = 63); (b) histogram
concavity method (t* = 47); (c) Johannsen and Bille method (t* = 79); (d) Kapur, Sahoo, and Wong
method (t* = 69); (e) minimum error method (f* = 37): (f) moment-preserving method (t* = 76): (g)
Ostu method (t* = 73); (h) Pun method [32] (t* = 28).
where frqx, y) is the average gray value in the neighborhood iV(x, y), t is the
threshold value of the image, C is a normalization factor, and
ifx20
W(x) = ( -: ifx < 0.
The computation of the generalized gradient value A(x, y) of the pixel (x, y) is
carried out using the formula
l/2
i
k=l
0; + fi~,(D, + D,>- fiD,(~, - 0,)
1 .
where
Using both measures, the threshold value obtained according to each me&i&
mentioned earlier are evaluated. Tables 2a, 2b and 2c show the results of this
SURVEY OF THRESHOLDING TECHNIQUES 253
FIG. 6.-Continued
evaluation. Note that these values are renormalized according to the best possible
threshold for each measure and the rank of each method is indicated within the
square bracket.
8. DISCUSSION OF EVALUATION RESULTS
Analyzing Table 2a, we find that methods of Ostu, moment-preserving, and
Johannsen and Bille are better threshold selection methods for the cameraman
image with regards to region uniformity and shape measure. For bimodal images
(see Table 3), the optimal values of uniformity and shape measure do not differ
254 SAHOO ET AL.
FIG. 7. Binary images of the model: (a) co-occurrence matrix method (t* = 63); (b) histogram
concavity method (t* = 127); (c) Joharmsen and Bille method (1: = 114); (d) Kapur, Sahoo, and Wong
method (t* = 81); (e) minimum error method (t* = 113); (f) moment-preserving method (I* = 76); (g)
Ostu method (t* = 91).
FIG. ?‘-Continued
TABLE 2a
Evaluation Results for Cameraman Image
TABLE 2b
Evaluation Results for Building Image
Method Threshold Uniformity (U) Shape (S)
Co-occurrence matrix 63 0.9098 [5] 0.8863 [5]
Histogram concavity 41 0.5889 [6] 0.5597 [6]
Johannsen & Bille 19 0.9833 [3] 0.9990 [2]
Kapur et al. 69 0.9792 [4] 0.9647 [4]
Minimum error 31 0.3820 [7] 0.2987 [7]
Moment-preserving 16 0.9980 [2] 0.9995 [l]
Ostu 73 0.9986 [l] 0.9931 [3]
Pun [32] 28 0.1988 [8] 0.0131 [X]
TABLE 2c
Evaluation Results for Image of Model
visual information to supplement our findings. For visual analysis of the binary
images, it seems reasonable to consider some important features such as facial
details and camera of the cameraman picture, edges of the building on the
right-hand side of the building image, and facial and hair features in the model
image. Along with these observations, one should also consider the amount of
distortion and loss of information in the thresholded image. A close look at the
binary images of the cameraman in Fig. 5, we observe that methods such as Pun
(Fig. 5i), histogram concavity (Fig. 5c) and co-occurrence matrix (Fig. 5a) do not
provide a good threshold for this image. This can also be observed from Table 2a.
We also observe that the co-occurrence matrix method yields a binary image in
which the details of the face and the camera of the cameraman are lost, whereas
Pun’s and histogram concavity methods yield binary images with distortions. Also,
Ostu method and moment-preserving method do not retain the details of the face in
the cameraman image. The methods which retain the facial details and other
valuable information of the image are the Kapur et al. method and the m inimum
error method. Notice that the Kapur et aE. method ranks seven (in Table 2a)
according to uniformity as well as shape measure, but it gives a good binary image,
which retains the facial details of the cameraman.
A similar examination of the images in Fig. 6 reveals that the moment-preserving
(Fig. 6f), co-occurrence matrix (Fig. 6a), Ostu (Fig. 6h), Kapur et al. (Fig. 6d),
Johannsen and Bille (Fig. 6c) methods provide reasonably good threshold values for
SURVEY OF THRESHOLDING TECHNIQUES 257
TABLE 3
Optimal Value for the Uniformity and Shape Measures
-
Measure Cameraman Building Model
-.
Uoiformity 87 74 93
Shape 86 78 57
FIG. 8. Binary images of the cameraman: (a) optimal uniformity (t* = 87); (b) optimal shape
(r* = 86).
the building picture. Among these methods, the Johannsen and Bille method and
the moment-preserving method yield good binary images. Regarding the binary
images in Fig. 7, we observe that the Kapur et al. method and the moment-preserv-
ing method are better threshold selection techniques. For the same image, the Ostu
method also provides a good threshold value. However, it is not as good as the two
methods mentioned.
Thus, from the visual aspect of the binary images as well as the objective
evaluation based on the uniformity and shape measures,we find that Johannsen and
Bille, Kapur et al., moment-preserving, and Ostu methods are good thresholding
methods. In most cases,Pun [32], histogram concavity, and m inimum error methods
do not render good performance in comparison to other methods mentioned in this
section. In an earlier study [12], unsatisfactory performance of the Pun method [32]
was observed by Fernando and Monro.
9. COMMENTS AND CONCLUSION
The indicators used in this paper for evaluating an automatic thresholding
method are shape meaSureand uniformity measure.Table 3 presents the gray levels
which optimize the uniformity and shape measure for the test images. The set of
digital test images, when binarized at the gray levels where S and U attain
maximum, are shown in Figs. 8, 9, and 10. From these binary images it is evident
that one may use these measures for image thresholding.
In summary, we have presented a survey of various thresholding methods and
evaluated the performance of some global methods which are automatic in nature.
All the methods described here optimize some criterion functions and provide
FIG. 10. Binary images of the model: (a) optimal uniformity (I* = 93); (b) optimal shape (t* = 57).
justifications for such optimization. We only investigate how the optimal values
behave corresponding to the uniformity and shape measures to a given set of test
images. From this study, we conclude that for the above set of test images
Johannsen and Bille, Kapur et al., Tsai’s moment-preserving, and Ostu methods are
reasonably good thresholding methods if one demands more uniformity and better
shape of the object in the binary image. However, whether the aforementioned
methods, in general, will perform well or not remains open.
ACKNOWLEDGMENTS
We are thankful to the referee for helping to improve the presentation of this
paper.
REFERENCES
1. N. Abramson, Information Theon, und Coding, McGraw-Hill, New York, 1963.
2. N. Ahuja and A. Rosenfeld, A note on the use of second-order gray-level statistics for threshold
selection, IEEE Trans. Systems Man Cybernet. SMC-8, 1978, 895-899.
3. M. R. Bartz, Optimizing a video processor for OCR, in Proceedings, Intemarional Joint Conference on
AT, 1969, pp. 79-90.
4. B. Bhanu and 0. Faugeras, Segmentation of images having unimodal distributions, IEEE Trans.
Pultern Anui. Mach. Intell. PAMI-4, 1982, 408-419.
SURVEY OF THRESHOLDING TECHNIQUES 259
35. A. Rosenfeld and P. De La Terre, Histogram concavity analysis as an aid in threshold selection,
IEEE Trans. Systems Man Cybernet. SMC-13,1983, 231-235.
36. A. Rosenfeld and R. C. Smith, Thresholding using relaxation, IEEE Trans. Pattern Anal. Mach.
Intell. PAMI-3, 1981, 598-606.
37. P. K. Woo, Theory and Applications of Some Measure of Uncertainties, Ph.D. thesis, Department of
Applied Mathematics, University of Waterloo, Waterloo, 1986.
38. R. Southwell, Relaxation Methoak in Engineering Science, A Treatise on Approximate Computatron,
Oxford Univ. Press, London, 1940.
39. R. Southwell, Relaxation Methods in Theoreticai Physics, Oxford Univ. Press (Clarendon), London.
1946.
40. W. Tsai, Moment-preserving thresholding: A new approach, Comput. Vision Graphics Image Process.
29, 1985, 377-393.
41. J. R. Ullman, Binarization using associative addressing, Pattern Recognit. 6, 1974, 127-135.
42. S. Wang and R. M. Haralick, Automatic multithreshold selection, Comput. Vision Graphics Image
Process. 25, 1984, 46-67.
43. S. Watanabe, and the CYBEST group, An automated apparatus for cancer processing CYBEST,
Comput. Vision Graphics Image Process. 3, 1974, 350-358.
44. J. S. We&a, A survey of threshold selection techniques, Comput. Vision Graphics Image Process. 7.
1978, 259-265.
45. J. S. Weszka, R. N. Nagel, and A. Rosenfeld, A threshold selection technique, IEEE Trans. Compuf
C-23, 1974, 1322-1326.
46. J. S. Weszka and A. Rosenfeld, Threshold Selection 4, TR-336, University of Maryland Computer
Science Center, 1974.
47. J. S. Weszka and A. Rosenfeld, Threshold evaluation techniques, IEEE Trans. Systenls Man
Cybernet. SMC-8,1978, 622-629.
48. J. S. Weszka and A. Rosenfeld, Histogram modification for threshold selection, IEEE Trans. Systems
Man Cybernet. SMC-9,1979, 38-51.
49. J. S. Weszka, J. A. Veretuon, and A. Rosenfeld, A Technique for Facilitating Threshold Selection for
Objects Extraction from Digital Pictures, TR-243, University of Maryland Computer Science
Center, 1973.
50. R. N. Wolfe, A dynamic thresholding scheme for quantization of scanned image, in Proceedings,
Automatic Pattern Recognition, 1969, pp. 143-162.
51. A. Y. Wu and A. Rosenfeld, Threshold selection using quadtree, IEEE Trans. Pattern Anal. Mach.
Intell. PAMI-4, 1982, 90-94.
52. S. Zucker, R. Hummel, and A. Rosenfeld, An application of relaxation labelling to line and curve
enhancement, IEEE Trans. Comput. C-26,1977, 394-403.