SSP Paper PDF

Download as pdf or txt
Download as pdf or txt
You are on page 1of 4

International conference on Signal Processing, Communication, Power and Embedded System (SCOPES)-2016

Speech Enhancement Using Combination of Digital


Audio effects with Kalman Filter
Ummidala Santosh Kumar Dr. G. Manmadha Rao
PG Scholar, Department of ECE Associate Professor, Department of ECE
IEEE Student Member GMR Institute of Technology
GMR Institute of Technology, India Rajam, Andhra Pradesh, India
uskscientist@gmail.com manmadharao.g@gmrit.org

Abstract— The term “Quality of Speech” in Speech Enhancement Spectral subtraction is used for enhancing speech degraded by
techniques is associated with Clarity and Intelligibility. Till now additive stationary background noise, but it is affected by
due to the variable nature and characteristics of noise with time musical noise and also it does not remove noise during the
and process to process, Speech Enhancement is a difficult problem
in Noisy environment. In this paper, we proposed a method to
silence period [2]. In Wiener filter based speech enhancement
improve the quality of speech based on combination of Digital method original speech signal is recovered by minimizing
Audio Effects with Improved Adaptive Kalman Filter when only Mean Square Error (MSE) between the clean speech and the
corrupted speech is available. In this approach to enhance the estimated signal [3]. Spectral and wiener filter based speech
Speech content in the Noisy speech signal, Digital audio effects are enhancement algorithms require the characteristics of clean
used. A Digital Expander generates an audio effect which operates speech. But in real time clean speech may not available in all
on a low signal level and create more likely sound characteristics.
And further, noise is removed by Auto Regressive modeled
the cases. From the literature study, we found that some of the
improved adaptive Kalman filter. The performance of the techniques have been proposed to enhance the speech. In [4]
proposed method with additive color noise is found to be better using harmonic structure of speech signal, speech is recovered
compared to other spectral subtraction, wiener and Kalman filter form noisy speech signal, in [5] sinusoidal model is adopted, in
methods in terms of Signal-to-Noise ratio and intelligibility. [6] MMSE estimator to enhance the speech was introduced by
Ephraim. At first The advantage in the use of Kalman filter for
Keywords— Kalman filter; intelligibility; Digital audio effect; digital
speech enhancement was proposed by K.K Paliwal and A.Basu
expander; Wiener filter; Spectral Subtraction..
by using estimation of speech signal parameters from clean
I. INTRODUCTION speech before it corrupted by white noise is proposed in [7].
And further extended to the random and colored noises [8]. In
The primary objective of many Speech Enhancement
these methods a tradeoff should be maintained between SNR
algorithms is to improve the perceptual quality of extracting
and intelligibility.
speech signal from noisy speech. Noise estimation is the major
Later, many changes are made to the Kalman filter for
component in speech enhancement techniques, because better
better improvement, it does not meet the expectations and also
noise estimation gives a high quality of speech extraction. Till
complexity is more. In this paper with less complexity and
now, removing noise from noisy speech is challenging issue
better performance a new adaptive Kalman filter based method
because spectral properties of non-stationary noise is very
with the combination of nonlinear digital filter called digital
difficult to estimate and predict. Noise estimation is a careful
expander is proposed to recover the speech signal from noisy
issue in speech enhancement algorithms since if the noise
speech. The additive noise is modeled as the AR process based
power is more than speech power, then that speech content may
on linear prediction coefficient estimation (LPC) in Kalman
be removed due to treating that as a noise. Due to the wide use
filtering algorithm [7]. In addition to coefficient estimation this
of Speech processing in many applications like
paper solved problem of de-noising the random and colored
teleconferencing systems, speech recognition based security
noises. We considered an assumption that the colored noise is
devices, biomedical signal processing, hearing aids, ATM
also an autoregressive process [8]. So we estimated its AR
machines and computers, Speech enhancement is a hot research
coefficients and variances by linear prediction estimation in the
area in signal processing and remains a challenging issue
same way.
because of most of the cases only the noisy speech is available
In this paper, to overcome above stated problem a new
[1]. Over the past years, researchers have developed different
adaptive Kalman filter based method with preprocessing of a
types of efficient algorithms to improve the noisy speech even
digital audio effecting technique called digital expander is
though still it poses a challenge to the researches because of
proposed to recover the speech signal from a sequence (frame)
characteristics of noise signal varies in a dramatic manner over
of noisy speech signals and the additive noise is modeled as the
time and application to application. There are many speech
AR process [9]. This estimation of time-varying auto regressive
enhancement techniques are proposed using filtering approach
(AR) speech model parameters are based on linear prediction
by researchers last ten years such as spectral subtraction
coefficient estimation (LPC). In addition to coefficient
method, wiener filtering, Kalman filter method and so on.

978-1-5090-4620-1/16/$31.00 ©2016 IEEE

1208
International conference on Signal Processing, Communication, Power and Embedded System (SCOPES)-2016

estimation, this paper solved problem of de-noising the colored Kalman Filtering:
noise. We made an assumption that the noise is also an Kalman filtering is one of the effective speech
autoregressive process [10]. So we estimated its AR enhancement technique, in which speech signal is usually
coefficients and variances by LPC in the same way. modeled as autoregressive (AR) model and represented in the
In this paper the content is organized as follows. In Section state-space domain. A Kalman filter is an estimation and
II we mentioned the theoretical and mathematical description updating process.
of proposed method. Section III is deals with Implementation In this process both the speech signal and the
and evaluation of the proposed method. Simulation results are
placed at the end. additive noise signals are treated as ( )
( ) respectively and expressed in terms of th order
autoregressive model (AR) as follows
II. MATHEMATICAL DESCRIPTION
( )= ( − )+ ( ) (1)
Digital Audio Effects:
Digital Audio Effects can be classified as Basic ( )= ( − )+ () (2)
Filtering, Time Varying Filters, Delays, Modulators, Non-
linear Processing, Spacial effects. Non-linear Digital Filters are
characterized by creating harmonic and inharmonic frequency
And Noisy speech can be expressed as
components which are not present in the original signal
intentionally or unintentionally. In Dynamic Processing signal
envelope is controlled to minimize harmonic distortion using ( )= ( )+ ( ) (3)
compressors or limiters.
Digital expander is a signal limiter which minimizes Where ( ) is the th sample of the speech single,
the distortion in the speech. Expander operates at low signal ( ) is the th sample of the additive noise, ( ) is the th
levels to boost the dynamics of the signal and it is useful to sample of noisy speech. And or AR model parameters.
create a more likely sound characteristic [9].
AR modeled speech signal can be expressed in State-
The signal x(n) is determined from the input with
space form shown below.
variable attack and release time data. The logarithm of this x(n)
signal is compared with the threshold value. If the signal is
( + 1) = ( ) ( ) + ( ( ), 0, … … … . ,0) (4)
above the threshold, then the difference is multiplied by the
negative slope of the limiter LS. Then the output is applied to
( ) … ( ) 0 … 0 0
antilogarithm. The control factor f (n) obtained is then ⎡ ⎤
1 … 0 0 … 0 0
smoothed with a first-order low pass filter. If the signal ( ) ⎢ ⎥
⎢ ⋮ ⋱ ⋮⎥
lies below the threshold level, then the signal ( ) is set to f (n) ( )=⎢ (5)
0 1 0 … 0 0⎥
= 1. The delayed input ( − 1) is multiplied by the 0 0 1 … 0 0⎥

smoothed control factor ( ) to give the output ( ). The ⎢ ⋮ ⋱ ⋮⎥
Figure (a) shows a digital expander block diagram. The ⎣ 0 … 0 0 … 1 0⎦
logarithm of the signal ( ) is taken and multiplied by 0.5.
The value obtained is compared with two thresholds in order to ( + 1) = ( ) ( ) + ( ( ), 0, … … … . ,0) (6)
determine the operating range of the static curve. If thresholds
are crossed, the resulting difference is multiplied by the ( ) … ( ) ( )
corresponding slope and antilogarithm of the result is taken. A 1 … 0 0
first-order low-pass filter subsequently provides the attack and ( )= (7)
⋮ ⋱ 0 ⋮
release time. 0 … 1 0

From above equations state vector ( ) and driving noise


vector ( ) can be written as.

( ) ( )
( )= , ( )= (8)
( ) ( )
From Eq.(3) and (4)

( + 1) = ( ) ( ) + ( ) (9)
Figure (a): block diagram of Digital Expander
( )= ( )

1209
International conference on Signal Processing, Communication, Power and Embedded System (SCOPES)-2016

intelligibility. The corresponding waveforms are shown below.


Here ( ) = ( )
= = Experimental results show that the proposed technique is
( )
effective for speech enhancement compare to conventional
Kalman filter. Iterative Kalman filter and proposed method
= (1,0, … … 0, ) with d+1 dimension and with q results and waveforms are placed below.
dimension.From[8] noise suppression can be done by
calculating the variance, Kalman gain.
Estimation: state vector propagation, parameter covariance
matrix propagation
Iterative Kalman filter time updating process is done by
following equations

= + (10)
= + (11)
Updating: compute Kalman gain, state vector update, parameter
covariance matrix update
The coefficients in the above equations are updated every time
frame by using following Discrete Kalman filter update
equations

= ( + ) (12)
= + ( − ) (13)
= (1 − ) (14)
These parameters are updated for each iteration.

Figure (a): Block diagram for combination of Digital Audio


Effect with modified adaptive Kalman filter
III. IMPLEMENTATION AND EXPERIMENTAL RESULTS
Figure (4) is output waveform of Iterative adaptive
Figure (a) shows the block diagram for combination Kalman filter, Figure (5) is output of proposed
of Digital Audio Effect with modified adaptive Kalman filter method(combination of digital audio effects with Kalman
based speech enhancement method. Based on it Matlab code is filter), Figure(6) is comparison between clean speech and
developed. A noisy speech is generated using a Clean Speech, recovered speech using proposed method, Figure (7) is related
which is taken from the Noizeus Database and random values to output Spectrogram waveforms of Kalman and Proposed
(random noise or color noise) are added to the clean speech. method, Table (1) represent corresponding SNR ratios.
Later it passed through a Digital Expander. And the output of
Digital Expander is shown in Figure3. Digital Expander
expanding factor value is set to 0.5. It is further applied to
Iterative modified Kalman filter to suppress the noise. We set
the Kalman filter AR model order to P=20. These 20 AR
coefficients are updated for every time frame of 25ms duration
which is chopped by Hanning window and analyzed using the
linear prediction analysis method (LPC).
The additive measurement noise is assumed to be
Figure 1: Input signal_5db
stationary during the each small frame. LPC coefficient
estimation order is taken as 13 for both noisy speech and noise
signals. Number of iterations are set to be 7. Real time noisy
signals (NOIZEUS database) of 0dB, 5dB, 10dB and 15dB are
considered for performance analysis, with Hanning window.
We have observed and tabulated the results of basic Spectral
Subtraction, Wiener Filter, Kalman filter methods and
compared with Digital Audio Effect based Kalman filtering
method. Compared to all these methods, proposed algorithm Figure2: Noisy Speech Signal
giving better improvement in terms of SNR as well as

1210
International conference on Signal Processing, Communication, Power and Embedded System (SCOPES)-2016

Noise/ Spectral Wiener Kalman Proposed


Method Subtractio filter filter Method
n
Clean -7.0125 -6.3115 -3.2859 -1.4122
speech
0dB -10.5875 -9.0075 -4.6595 -2.5242
Figure3: Noisy speech Vs Digital Expander output 5dB -8.5654 -7.9854 -3.5105 -2.2545
10dB -7.4850 -7.0985 -3.4589 -2.0587
15dB -7.2106 -6.6002 -3.2857 -1.5600

Table1: Input-Output SNR Comparisons


IV. V. CONCLUSION
In the present study, an improved method for speech
enhancement by combining Digital Audio Effecting techniques
Figure (4): Kalman Filter based Speech Enhancement method with improved Adaptive Kalman filter technique is proposed.
Output waveform In this paper, we discussed the drawbacks of basic methods
such as speech enhancement with spectral subtraction and
wiener filter methods. Even though other Kalman filter
approach based speech enhancement methods are giving better
results than a conventional Kalman filter, more complexity is
involved.it leads to more time taken process. In this paper, we
proposed a method to overcome the disadvantages of earlier
methods in terms of performance and Speed. All these methods
Figure (5): Combination of Digital Audio Effect with Kalman are simulated using MATLAB and input output SNR values of
Filter based Speech Enhancement recovered Output respective methods are compared. Performance of Proposed
method is analyzed with different Input SNR noise level. It is
observed that the proposed method gives better output SNR
values and its performance is comparatively superior for both
stationary and non-stationary signals.

References
[1] "Speech Enhancement" by J. Benesty, J. Chen, Eds., and S. Makino,
Figure (6): Proposed Method output (Kalman with Digital Springer, Berlin, 2005.
Expander) and clean speech waveforms [2] S.Boll,“Suppression of acoustic noise in speech using spectral
subtraction,” IEEE Transaction on Speech, Signal Process., volume.
ASSP-27,no.2 pp.113-120,1979
[3] T.V.Sreenivas and P.kirnapure,“Codebook constrained Wiener filtering
for speech enhancement”, IEEE Trans. Speech and Audio processing,sep
1996.
[4] D.V.Anderson and M.A.Clements “Audio signal noise reduction using
harmonic modeling,” in Proc.IEEE Int. conf. Acoust. ICASSP,1999
[5] by J.Jensen and J.H.L Hansen,”Speech enhancement using a constrained
iterative sinusoidal model,”IEEE Trans. Speech Audio Process., vol.9
,2001.
[6] Y.Epharaim,”A minimum mean square error approach for speech
enhancement,”in Proc. IEEE conf 1990.
[7] K.K.Paliwal and A.Basu,“A speech enhancement method based on
Kalman filtering” Proceedings of ICASSP’87,pp.177-180. Dallas, TX,
USA,1987.
[8] B.Chabane and B.Daoued”On the use of Kalman Filter for enhancing
speech corrupted by Colored Noise,” WSEAS Trans. On signal
processing 2008 Dec.
[9] ”Digital Audio Signal Processing,” second edition, by Udo Zolzer
[10] “Noizeus- A noisy speech corpus for evaluation of speech enhancement
algorithms,” http://ecs.utdallas.edu/loizou/speech/noizeus
Figure (7): Spectrogram of clean & noisy speech signals, [11] Hu.Y and Loizou.P,”Evaluation of objective quality measures for speech
enhancement,” IEEE Trans. On speech and audio processing, 2008
Kalman filter method, and proposed method

1211

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy