Artificial Neural Networks in Pattern Recognition: Mohammadreza Yadollahi, Ale S Proch Azka
Artificial Neural Networks in Pattern Recognition: Mohammadreza Yadollahi, Ale S Proch Azka
Artificial Neural Networks in Pattern Recognition: Mohammadreza Yadollahi, Ale S Proch Azka
IN PATTERN RECOGNITION
Mohammadreza Yadollahi, Aleš Procházka
Institute of Chemical Technology, Department of Computing and Control Engineering
Abstract
1 Introduction
Nowadays, artificial neural networks (ANN) are applicable in many scientific fields such as
medicine, computer science, neuroscience and engineering. It is believed that they will receive
extensive application to biomedical systems and engineering. They can be also applied in infor-
mation processing system to solve pattern analysis problems or feature vectors classification [1]
for example. This paper describes pre-processing stages before the use of artificial neural net-
works that include (i) filtering tools to reject noise, (ii) image segmentation to divide the image
area into separate regions of interest, and (iii) the use of mathematical method for extraction
of feature vectors from each image segment. The set of feature vectors form finally the pat-
tern matrix as the input to the artificial neural networks for their classification. This process
is followed by the application of ANN for classification of feature vectors by the unsupervised
learning process based on the Kohonen’s method. All algorithms used were designed in the
Matlab environment.
2 Mathematical Background
The mathematical methods used include the discrete Fourier transform for signal analysis, ap-
plication of difference equations for digital filtering and optimization methods for training of
artificial neural networks.
M
y(n) = bk x(n − k) (3)
k=0
where {y(n)} and {x(n)} are output and input sequences respectively and M is the filter order.
Fig. 1 displays the given sequence, output of FIR filter and both the frequency response of the
FIR filter and frequency spectrum of the given sequence.
−2
0 50 100 150 200
Index
(b) Output of FIR filter
1
−1
0 50 100 150 200
Index
(c) Frequency response of FIR filter
2
0
0 0.2 0.4 0.6 0.8 1
Frequency [Hz]
Figure 1: FIR filtering (a) the given simulated sequence formed by the sum of two harmonic
functions, (b) result of its processing by the FIR filter of order M = 60, and (c) frequency
spectrum of the given sequence together with the frequency response of the FIR filter
Infinite Impulse Response (IIR) filters [4] or recursive filters have feedback from the output to
input. They can be described by the following general equation
M
M
y(n) = − ak y(n − k) + bk x(n − k) (4)
k=1 k=0
Owing to the lower number of necessary coefficients to obtain similar accuracy the IIR filters
are more efficient and faster requiring less storage comparing to FIR filters. On the other hand
while FIR filters are always stable the IIR filters can become unstable [4].
• Single-Input Neuron (Fig. 2(a)): The scalar input p is multiplied by the scalar weight
w. The bias b is then added to the product w p by the summing junction and the transfer
function f is then applied for n=wp+b. This transfer function is typically a step function
or a sigmoidal function shifted by value of b producing the output a.
• Multiple-Input Neuron (Fig. 2(b)): Individual inputs forming a vector p=[p1 , p2 , ..., pR ]
are multiplied by weights w=[w1,1 , w1,2 , ..., w1,R ] and then fed to the summing junction.
The bias b is then added to the product w p forming value
n = w p + b = w1,1 p1 + w1,2 p2 + ... + w1,R pR + b (5)
which can be written in the following matrix form
n = wp + b (6)
This value is then processed by the non-linear transfer function to form the neuron output
a = f (w p + b) (7)
• Layer of Neurons (Fig. 2(c)): Each element of the input vector p is connected to each
neuron using coefficients of the weight matrix W.
⎡ ⎤
w1,1 w1,2 ... w1,R
⎢ w2,1 w2,2 ... w2,R ⎥
W=⎢ ⎣ ... ...
⎥
... ... ⎦
wS,1 wS,2 ... wS,R
w1,1
P
2
....
w1,R 1
P
R
1 1 1 1
n2 f a2 Sum
1
f Sum
2
f Sum
3
f 3
P Sum P3 n12 a12 n22 a22 n32 a 2
3
1 2 3
b 2 b 2 b 2
b2 .... .... .... .... .... .... .... ....
1 1 1 1
ns f as Sum f1 Sum f2 Sum f3 3 3
Sum n1s1 a1s1 n2s2 a2s2 n3s3 a s
.... ....
2 2 1 3 3 2
w s ,s w s ,s
a=f(wp+b) b1s1 2 2
b s
3 3
b s
bs
wS,R 1 1 1
1 1 1
w s ,R
P PR
R a1=f1(w1p+b1) a2=f2(w2a1+b2) a3=f3(w3a2+b3)
a3=f3(w3f2(w2f1(w1p+b1)+b2)+b3)
Figure 2: Different structures of neural networks presenting (a) a single-input neuron with a
bias, (b) a multiple-input neuron with biases, (c) a layer of neurons, and (d) multiple layers of
neurons
The i-th neuron has a summer that gathers its weighted inputs and bias to form its own
scalar output n(i) of an S-element net vector n
n = Wp + b (8)
This value is then processed by the non-linear transfer function to form the neuron output
a = f (W p + b) (9)
The row indices on the elements of matrix W indicate the destination neuron of the weight
and the column indices indicate which source is the input for that weight [7].
• Multiple Layers of Neurons (Fig. 2(d)): Each layer has its own weight matrix W, its
own bias vector b, a net input vector n and an output vector a. The weight matrix for the
first layer can be denoted as W(1) , the weight matrix for the second layer as W(2) , etc.
The network has R inputs, S (1) neurons in the first layer, S (2) neurons in the second layer,
etc. The outputs of layers one and two are the inputs for layers two and three. Thus layer
2 can be viewed as a one-layer network with R = S (1) inputs, S = S (2) neurons and weight
matrix W(2) of size S (1)×S (2). The layer 2 has input a(1) and output a(2) . The layers of a
multilayer network play different roles. A layer that produces the network output is called
an output layer while other layers are called hidden layers. The three layer network has
one output layer (layer 3) and two hidden layers (layer 1 and 2) [7].
3 Image Segmentation
Image segmentation [9] forms an important step to image regions classification. The Watershed
transform (WT) can be used to contribute to this task as it can find the watershed ridges in an
image. It assigns zero to all ridges detected and it assigns the same unique labels to all points
inside separate catchment basins (regions) using the gray-scale image as a topological surface.
The Distance transform (DT) forms the fundamental part of the Watershed transform used for
the binary image [9, 10] and based upon the calculation of Euclidean distance of each pixel to
the nearest pixel with the value 1. Result of this transformation [10] is formed by
0
for ai,j = 1
bi,j = (12)
min
∀k,l,ak,l =1 (i − k)2 + (j − l)2 for ai,j = 0
where ai,j is each pixel and ak,l is nearest pixel for i = 1, 2, ..., M and j = 1, 2, ..., N . Application
of defined formula by Eq. (12) is showed in following example of matrix
⎛ ⎞
1 1 0 0 0
⎜ 1 1 0 0 0 ⎟
⎜ ⎟
Ibw = ⎜
⎜ 0 0 0 0 0 ⎟
⎟ (13)
⎝ 0 1 1 1 0 ⎠
0 0 0 0 0
Fig. 3 displays a simulated gray-scale image, its conversion to the binary image, results of the
application of the distance transform and the final image presenting image edges obtained by
the watershed transform.
Figure 3: Process of segmentation including (a) simulated gray-scale image, (b) its conversion
to the binary image, (c) results of the evaluation of the distance transform of complementary
binary image, and (d) watershed transform
4 Results
The pre-processing stages of image filtering, segmentation, and feature extraction were applied
for real biomedical images of size 128 × 128 to enable the following classification by artificial
neural networks discussed. Fig. 4 presents the set of all these processes applied for a biomedical
image (a) and results of its median filtering (b), segmentation by the Watershed transfom (c),
image segments DFT features (d) and results of classification using artificial neural networks
and Kohonen’s learning rule (e).
(a) Biomedical image (b) Filtering (c) Segmentation
Figure 4: Processing stages of a selected biomedical image presenting (a) the given image,
(b) results of its median filtering, (c) evaluation of the Watershed transfom, (d) image seg-
ments DFT features plot, and (e) results of classification using artificial neural networks by the
Kohonen’s learning rule with class boundaries.
In the present study the median filtering has been used and results compared with other
methods. A special attention has been paid to estimation of image regions properties. Both DFT
and selected statistical characteristics have been used to form feature vectors of all segments
of the image. To allow visualization of results only two features were selected to form feature
matrix
p1,1 ... p1,k ... p1,Q
P=
p2,1 ... p2,k ... p2,Q
Then the matrix WS,2 was used to classify image features into S classes.
⎡ ⎤
w1,1 w1,2
⎢ .. .. ⎥
⎢ . . ⎥
⎢ ⎥
W=⎢ w
⎢ k,1 w k,2
⎥
⎥
⎢ .. .. ⎥
⎣ . . ⎦
wS,1 wS,2
The learning process has been based upon the Kohonen’s method to classify feature vectors with
a selected result on Fig. 4(e). The calculation of a typical structure is done in every cluster by
Table 1: Evaluation of classification.
No.of segment Class Mean distance Typical cluster Criteria value
1 A
2 A 0.0650 6
6 A
3 B 0.0850 3
4 B
5 C 0.16107
7 C
8 C 0.1060 5
9 C
10 C
computing distance of each pattern to weights of the corresponding neuron using relation
d(i) = (p1,j − wi,1 )2 + (p2,j − wi,2 )2 (15)
The feature vector with the least distance in the cluster points to the typical pattern. Evaluating
the mean distance for every cluster by relation
1
N
M SE = d(i) (16)
N
i=1
where N stands for the number of pattern values in the selected class it is possible to detect the
cluster compactness.
The quality of the separation process is then evaluated using the mean distances of patterns in
all classes and distances of their gravity centers.
5 Conclusion
Further studies will be devoted to the use of different mathematical methods for obtaining
compact cluster allowing the efficient classification.
6 Acknowledgements
The work has been supported by the research grant of the Faculty of Chemical Engineering
of the Institute of Chemical Technology, Prague No. MSM 6046137306
References
[1] R.Nayak, L.C.Jain and B.K.H. Ting. Artificial Neural Networks in Biomedical Engineer-
ing: A Review. In Proceedings Asia-Pacific Conference on Advance Computation.2001.QUT
Digital Repository :http://eprints.qut.edu.au/1480/.
[2] Shripad Kulkarni. Discrete Fourier Transform(DFT) by using Vedic Mathematics. Oc-
tober 2008.http://www.vedicmathsindia.org/Vedic%20Math %20IT/5.DFT%20Using%20
Vedic%20Math%20Kulkarni.pdf.
[3] Paulo Roberto, Nunes de Souza, Mára Regina and Labuto Fragoso da Silva.
Fourier Transform Graphical Analysis: an Approach for Digital Image Processing. Universi-
dade Federal do Espı́rito Santo.2009. http://w3.impa.br/ lvelho/sibgrapi2005/html/p13048
/p13048.pdf.
[4] Saeed V. Vaseghi. Multimedia Signal Processing: Theory and Applications in Speech,
Music and Communications . John Wiley and Sons, 2007. ISBN: 9780470062012.
[5] Belle A. Shenoi. Introduction to Digital Signal Processing and Filter Design. John Wiley
and Sons, 2005. ISBN: 9780471464822.
[6] Eeglossary. Neural Networks: Definition and Recommended Links . 1 May 2009.http://
www.eeglossary.com/neural-networks.htm.
[7] Martin T. Hagan, Howard B . Demuth and Mark H. Beale. Neural Network
Design. PWS Pub., 1996. ISBN: 9780534943325.
[8] Unsupervised learning. Lecture 8.Artificial Neural Networks. Neg nevitsky, Pearson
Education, 2002.www.neiu.edu/ mosztain/cs335/lecture08.ppt.