Speech Recognition - MATLAB
National Conference on Latest Initiatives & Innovations in Communication and Electronics (IICE 2016)
International Journal of Computer Applications (0975 – 8887)
Figure 3.2.8: Spectrum of 'Eight'

4. EXPERIMENTAL TESTING
Our speech recognition system was speaker dependent, so it
relied on a single user's voice. To train the system, we created
a database of nine words. After training, a real-time speech
input was given to the system through a good-quality
microphone. The system divided the real-time speech sample
into frames, that is, small segments of contiguous samples.
The energy of each frame was then calculated using the simple
energy formula:
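$$E_x = \sum_{n=1}^{N} |x(n)|^2 \tag{2}$$
where x(n) is the n-th sample of the frame and N is the number
of samples in the frame.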
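The framing and energy step can be illustrated with a minimal sketch. It is written in plain Python rather than MATLAB and is not the authors' code. The function name `frame_energies` is hypothetical, the frames are assumed to be non-overlapping, the energy is assumed to be the sum of squared samples, and the default frame length of 160 samples is taken from Section 4.1.

```python
def frame_energies(samples, frame_len=160):
    """Split samples into consecutive non-overlapping frames
    and return the energy (sum of squared samples) of each frame."""
    energies = []
    # Step through the signal one full frame at a time;
    # a trailing partial frame is dropped (an assumption of this sketch).
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        energies.append(sum(x * x for x in frame))
    return energies


# Toy signal: 4 samples of silence followed by 4 samples of amplitude 0.5.
signal = [0.0] * 4 + [0.5] * 4
print(frame_energies(signal, frame_len=4))  # [0.0, 1.0]
```

A silent frame yields an energy near zero, while a voiced frame yields a clearly larger value, which is what makes energy usable as a detection cue.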
The calculated energy was then analyzed by a speech detection
algorithm to separate the words.
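One simple way to separate words from per-frame energies is to look for runs of frames whose energy exceeds a threshold. The sketch below, in plain Python, illustrates that idea only. The paper does not give the actual decision rule, so the fixed threshold and the function name `find_words` are assumptions made for illustration.

```python
def find_words(energies, threshold):
    """Return (first_frame, last_frame) index pairs for each run of
    consecutive frames whose energy is above the threshold."""
    words = []
    start = None
    for i, e in enumerate(energies):
        if e > threshold and start is None:
            start = i                      # a word begins at this frame
        elif e <= threshold and start is not None:
            words.append((start, i - 1))   # the word ended on the previous frame
            start = None
    if start is not None:                  # signal ended while still inside a word
        words.append((start, len(energies) - 1))
    return words


# Hypothetical per-frame energies: two loud bursts separated by silence.
energies = [0.01, 0.9, 1.2, 0.02, 0.01, 0.8, 0.7]
print(find_words(energies, threshold=0.1))  # [(1, 2), (5, 6)]
```

In practice the threshold would have to be tuned to the microphone and background noise, and a fixed value is used here only to keep the example short.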
4.1 SPEECH DETECTION ALGORITHM
The speech detection algorithm was developed by processing
the prerecorded speech samples frame by frame within a
simple loop. We divided the whole signal into segments of
160 samples, and each segment was examined by the system.
To classify each frame we used a combination of signal energy
and zero-crossing rate. This calculation was simple to
implement with MATLAB's mathematical and logical operators.

4.2 ACOUSTICAL MODEL
It is very important to create an acoustical model for the
detection of each uttered word, so we created one. Different
sounds are produced by the human vocal cords, and different
sounds can have different frequencies. A power spectral
density measure can be a better way to predict these
frequencies, so we determined the frequencies from power
spectral density measurements. Because speech can be treated
as short-term stationary, MFCC features were again extracted,
and the words pronounced by the user were detected.

5. RESULTS
Real-time results were obtained in the lab. The user spoke
into the microphone, and the text representation was displayed
on the computer screen, as shown in Figure 5.1. The
implementation results of the speech-to-text conversion
system are as follows:
IJCATM : www.ijcaonline.org