Emotion Recognition From Physiological Signals: Review
Emotion Recognition From Physiological Signals: Review
Review
300
Emotion recognition from physiological signals 301
physiological signals were detected from these subjects as they frequent breaths. Generally, emotions with negative valence
tested the different emotional states. The tests were carried out cause irregular breathing. The energy of this signal is in the
with an incentive system controlled by computer, scrolling range of 0.1–10 Hz [6].
IAPS images. These images are intended to induce the expres-
sion of the six emotional states (amusement, joy, disgust, fear, 2.2.4. Electromyogram signal (EMG)
neutrality and sadness). Emotional tone is an involuntary, permanent and moderate
contraction of muscle fuelled by nervous energy [7]. The
2.2. Experimental equipment slight tension that affects any muscle at rest to exaggerate the
The equipment used for this experiment is the ProComp In- effort is an expression of emotion changes as a state of mental
finiti (figure 1). This is a multimodal device with eight chan- stress. It is shown that muscle activity increases during stress
nels used for biofeedback data acquisition in real time [5]. and during emotions with negative valence [1].
Physiological signals related to emotional states are de- The EMG signal is filtered by the band pass optimal finite
tected by means of six signals: skin conductance (SKC), blood impulse response filter given by equation (1), with a band pass
volume pulse (BVP), respiratory volume (RV), electromyo- frequency of 20–125 Hz [8].
J Med Eng Technol Downloaded from informahealthcare.com by University of Notre Dame Australia on 05/15/13
σ s = 2
1 T
∑
T − 1 n=1
(s(n) − µs )2
(3)
1 T−1
δ s = ∑ s(n + 1) − s(n)
T − 1 n=1
(4)
Figure 1. ProComp Infiniti Materiel.
Figure 2. The 9th sample of BVP signal (relative measure) for the six emotions for subject 2.
Figure 3. The 9th sample of EMG signal (mV) for the six emotions for subject 2.
Figure 4. The 9th sample of RV signal (relative measure) for the six emotions for subject 2.
For personal use only.
Figure 5. The 9th sample of SKT signal (°C) for the six emotions for subject 2.
γs = ∑ s(n
T − 2 n=1
+ 2) − s(n)
(7) where C is a constant that controls the compromise between
the number of classification errors and the margin width (the
The six parameters from a to f are called the Picard param- margin width being the smallest distance between training set
eters [9]. and separating hyperplane).
g. Ratio max/min of the physiological signal: Different kernel functions can be used. Such functions can
be as cited below [13]:
R = max(s(n))/min(s(n)) (8) a. Linear kernel:
h. Euclidian distance: Let sN(n) be the signal to the neutral K(x i , x j ) = x i . x j (15)
emotion. The Euclidean distance between the signal to such
an emotion and the signal sN(n) is calculated as follows: b. Polynomial kernel:
1
K(x i , x j ) = (x i . x j + c)n (16)
Ds = 2 2
1 T
∑
For personal use only.
(17)
σ
2
3.2. Frequency parameters
As frequency parameter evaluation considered in this study
the mean and the standard deviation of the spectral coher- The SVM method is applicable for binary classification tasks,
ence function. Let Sq (ω) and SN (ω) be respectively the signal but there are extensions to the multiclass classification [14].
spectrum to such an emotion and the signal spectrum to the Formally, the training and testing samples can be ordered
neutral emotion. The absolute interspectral density can be here in M classes {C1, C2,..., CM}. The one against all method
given as [10]: consists of constructing M binary classifiers by assigning the
label 1 to samples of one class and the label−1 to all others[14].
|SqN (ω) |
2
≤ Sq(ω) SN (ω) (10) In the test phase, the classifier that gives the highest margin
wins the vote.
This value can be normalized to give the spectral coherence
function: Class of x = arg maxk (hk(x)), k Є {1,….,M} (18)
Figure 6. The 9th sample of SKC signal (μS) for the six emotions for subject 2.
For personal use only.
Figure 7. The 9th sample of HR signal (bat.min-1) for the six emotions for subject 2.
subjects, linear and polynomial kernels generate improved that pertinent parameters extracted from physiological data
recognition rates compared to the Gaussian kernel. Similarly, are relatively separable.
using the linear kernel, we obtain recognition rates relatively Tables 1–4 show that if we combine the six Picard parameters
improved compared to the polynomial kernel. This means with the ratio R or the six Picard parameters with the coher-
Table 1. Recognition rate for different combinations of parameters for Table 6. Recognition rate for different combinations of physiological signals
subject 1. for subject 2.
Linear (%) Poly (%) Gauss (%) Linear (%) Poly (%)
6 Picard parameter 50 41.67 16.67 BVP 58.33 25
6 Picard parameter R 75 66.67 58.33 Fc 50 33.33
6 Picard parameter D 66.6 66.67 25 EMG 25 41.67
6 Picard parameter meancoh, σcoh 75 75 50 SKC 50 25
All parameters 83.3 75 25 SKT 58.33 50
RV 33.33 8.33
BVP,Fc 50 41.67
Table 2. Recognition rate for different combinations of parameters for
subject 2. BVP,Fc,EMG 58.33 58.33
Linear (%) Poly (%) Gauss (%) BVP,Fc,EMG,RV 75 41.67
6 Picard parameter 66.67 58.33 33.33 BVP,Fc,EMG,RV,SKT 75 50
6 Picard parameter R 75 41.67 25 All signals 83.33 66.67
J Med Eng Technol Downloaded from informahealthcare.com by University of Notre Dame Australia on 05/15/13
BVP,Fc,EMG,RV,SKT 75 66.67
Table 4. Recognition rate for different combinations of parameters for All signals 83.33 75
subject 4.
Linear (%) Poly (%) Gauss (%)
6 Picard parameter 75% 66.67% 41.67% Table 8. Recognition rate for different combinations of physiological signals
for subject 4.
6 Picard parameter R 83.33% 75% 50%
6 Picard parameter D 66.67% 50% 25%
Linear (%) Poly (%)
BVP 41.67 16.67
6 Picard parameter meancoh, σcoh 91.67% 91.67% 41.67%
Fc 50 41.67
All parameters 91.67% 83.33% 25%
EMG 58.33 50
SKC 25 33.33
SKT 33.33 50
Table 5. Recognition rate for different combinations of physiological signals
for subject 1. RV 41.67 50
Linear (%) Poly (%) BVP,Fc 50 25
BVP 50 16.67 BVP,Fc,EMG 58.33 66.67
Fc 33.33 25 BVP,Fc,EMG,RV 66.67 66.67
EMG 58.33 25 BVP,Fc,EMG,RV,SKT 83.33 66.67
SKC 25 33.33 All signals 91.67 83.33
SKT 25 41.67
RV 33.33 25 signal alone cannot give improved recognition rates of emotions.
BVP,Fc 58.33 25 For the four subjects, the tables also show that if we increase the
BVP,Fc,EMG 66.67 58.33 number of physiological signals which are combined, we obtain
BVP,Fc,EMG,RV 66.67 66.67 improved recognition rates. The best recognition rate of 83.33%
BVP,Fc,EMG,RV,SKT 66.67 66.67 is obtained when all physiological signals are combined.
All signals 83.33 75 From these results, we conclude that combination of physi-
ological signals is feasible and efficient to give good recogni-
ence function, the recognition rate is improved compared to tion rates of emotions.
the results found using only six Picard parameters. Thus, we
conclude that the six Picard parameters, coherence function,
5. Conclusion
ratio R and distance D characterize well the six emotions.
Tables 5–8 show the results obtained by combining different In this paper, an approach to emotion recognition has been
physiological signals. These tables show that one physiological proposed, studied and evaluated. This approach is based on