tmpDB29 TMP
tmpDB29 TMP
tmpDB29 TMP
Edited by
Gernot Mller-Putz, Gnther Bauernfeind, Clemens Brunner,
David Steyrl, Selina Wriessnegger, Reinhold Scherer
c 2014
Verlag der Technischen Universit
at Graz
http://ub.tugraz.at/Verlag
ISSN 2311-0422
ISBN 978-3-85125-378-8
DOI 10.3217/978-3-85125-378-8
Preface
After the positive responses to the last event, first time held as a conference, we decided to keep the style and
expand the event also with satellite events, since we want to provide a platform for scientific exchange within the
Brain-Computer Interface research community. Here, researchers can present their own work either in the form of
a talk or as a poster. For this purpose, we encouraged participants to submit papers, which were peer-reviewed
and are published in this ebook. The BCI conferences held in Graz, Austria, may be considered to be a European
initiative in the field of EEG-based Brain-Computer Interfaces that contributes to a stronger orientation towards
scientific cooperation.
We are lucky that outstanding experts in the field, Dr. Scott Makeig (Swartz Center for Computational Neuroscience, Institute for Neural Computation, University of California San Diego), Prof. Nick Ramsey (UMC Utrecht,
Rudolf Magnus Institute, Department of Neurology and Neurosurgery), and Manfred Halver (FFG - Austrian Research Promotion Agency, European and International Programmes) were able to accept our invitation to present
keynote addresses at the conference.
During preparation of this conference many people have been involved. Here we want to acknowledge the work of
the reviewers who contributed with their expertise and knowledge:
Allison Brendan
Brouwer Anne-Marie
Cichocki Andrzej
Cuntai Guan
Farquhar Jason
Guger Christoph
Kanoh Shinichiro
K
ubler Andrea
Mattia Donatella
M
uller-Putz Gernot
Ojemann Jerey G.
Rupp R
udiger
Silvoni Stefano
Tangermann Michael
Zander Thorsten
Bauernfeind G
unther
Brunner Clemens
Cincotti Febo
Daly Ian
Friedrich Elisabeth
Halder Sebastian
Kleih Sonja
Lee Adrian Kc
McCane Lynn
Nam Chang S.
Pinegger Andreas
Ryan David
Sitaram Ranganatha
Wagner Johanna
Zoltan Bart
Billinger Martin
Brunner Peter
Congedo Marco
Darvas Felix
Gao Shangkai
Horki Petar
Kothe Christian
Leeb Robert
Mehring Carsten
Noirhomme Quentin
Ramsey Nick
Scherer Reinhold
Solis Escalante Teo
Wander Jeremiah
Blankertz Benjamin
Chavarriaga Ricardo
Coyle Damien
Faller Josef
Grosse-Wentrup Moritz
Jin Jing
Kreilinger Alex
Lotte Fabien
M
uller Klaus-Robert
Ofner Patrick
Ron Angevin Ricardo
Seeber Martin
Steyrl David
Wriessnegger Selina
We gratefully acknowledge the support of the Graz University of Technology for providing the facilities and thank
the sta of the Institute of Knowledge Discovery, BCI Lab, for their dedicated assistance.
DOI:10.3217/978-3-85125-378-8-1
Key Laboratory of Advanced Control and Optimization for Chemical Processes, Ministry of
Education, East China University of Science and Technology, Shanghai, China
2
Brain embodiment lab, School of Systems Engineering, University of Reading, Reading, UK
jinjingat@gmail.com, i.daly@reading.ac.uk
Abstract
In this paper, a new paradigm is presented, to improve the performance of audiobased P300 Brain-computer interfaces (BCIs), by using spatially distributed natural
sound stimuli. The new paradigm was compared to a conventional paradigm using
spatially distributed sound to demonstrate the performance of this new paradigm. The
results show that the new paradigm enlarged the N200 and P300 components, and
yielded significantly better BCI performance than the conventional paradigm.
1 Introduction
Visual-based P300 brain-computer interfaces (BCIs) requires users to have control over their gaze
direction and, therefore, are not useable by blind patients or other patients who cannot maintain eye
gaze [1]. In order to enlarge the group of users who may benefit from use of P300 BCIs, the audiobased P300 BCI was developed [2-3]. It has been shown that an audio paradigm using spatially
distributed stimuli is better than using different pitches from one speaker [4]. Hhne et al (2011)
presented a paradigm using high, medium, and low unnatural spatially distributed sounds from
headphones [5]. However, these two studies did not show that this paradigm was better than a
paradigm using same sound played by spatially distributed speakers. In our study, a new audio-based
P300 BCI was designed by using three different natural sounds. The three different natural sounds
were female, male, and child voices, which were played by six spatially distributed speakers. This
paradigm was called the 123 pattern. Another paradigm using unnatural sounds (a beep),
randomly played by six spatially distributed speakers, was used to evaluate the efficacy of our
proposed pattern, this was called the beep pattern.
Corresponding author
Corresponding author
Article ID 001-1
DOI:10.3217/978-3-85125-378-8-1
exxperimental prrocedure beforre any subjectts participated. All subjects native languuage was Manddarin
Chhinese. Two suubjects (labeleed S7 and S9) had used a vissual-based P300 BCI beforee this study.
Figure 1: T
The location off speakers and sounds that
were played by tthe speakers.
w
Article ID 001-2
DOI:10.3217/978-3-85125-378-8-1
3 Resultss
123-P(%)
B-P(%)
S1
75
75
S2
90
15
S
S3
880
995
S4
80
65
S5
90
70
S6
10
10
S7
80
60
S8
80
55
S9
70
90
S10
990
115
S11
60
50
S
S12
995
330
Avgg
7522.77
5329.22
Table 1: Perrformance (classsification accuracy) from onlline feedback ruuns across all 12 participantss who
participated in online runs. 123-P denotes th
he 123 patternn and B-P the beep patternn.
Figure 2 shoows the topogrraphic map off r-squared valuues during preesentation of tthe 123 patternn and
thee beep patternn. It shows thaat the 123 patttern obtained higher r-squaared values thaan that of the beep
paattern in the ceentral region foor the N200, and
a in the centtral and parietaal regions for the P300.
Article ID 001-3
DOI:10.3217/978-3-85125-378-8-1
Table 1 shows the online classification accuracy using five trials for constructing the average ERP
waveform. Paired samples tests were used to show the difference between the two patterns. This
shows that the classification accuracy of the 123 pattern is significantly higher than that of the beep
pattern (t=2.3, p<0.05).
5 Acknowledgements
This work was supported in part by the Grant National Natural Science Foundation of China,
under Grant Numbers 61074113, 61105122, 61203127 and 61305028 and was also supported by the
Fundamental Research Funds for the Central Universities (WH1114038, WH1314023).
References
[1] Klobassa D S, Vaughan T M, Brunner P, Schwartz N E, Wolpaw J R, Neuper C, Sellers E W.
(2009) Toward a high-throughput auditory P300-based brain-computer interface, Clin.
Neurophysiol. 120, 1252-1261
[2] Hill N J, Lal T M, Bierig K, Birbaumer N, Schlkopf B (2004) An auditory paradigm for braincomputer interface NIPS, books.nips.cc/papers/files/nips17/NIPS2004_0503.ps.gz
[3] Furdea A, Halder S, Krusienski D J, Bross D, Nijboer F, Birbaumer N, Kbler A 2009 An auditory
oddball (P300) spelling system for brain-computer interfaces, Psychophysiology 46, 617-625.
[4] Belitski A, Farquhar, Desain P 2011 P300 audio-visual speller J. Neural Eng. 8, 025022
[5] Hhne J, Schreuder M, Blankertz B, Tangermann M 2011, A novel 9-class auditory ERP
paradigm driving a predictive text entry system Front Neurosci 22,5:99
[6] Beville H J, Lawson D 1987 Attention to central and peripheral visual space in a movement
detection task III. Sepatate effects of auditory deprivation and acquisition of a visual language
Brain Res 405, 284-294
[7] Kbler A, Furdea A, Halder S, Hammer E M, Nijboer F, Kotchoubey B 2009 A brain-computer
interface controlled auditory event-related potential (p300) spelling system for locked-in patients
Ann N Y Acad Sci 1157, 90-100.
Article ID 001-4
DOI:10.3217/978-3-85125-378-8-2
1 Introduction
One of the major concerns in recent studies is the discrimination between vegetative and
minimally conscious state (Georgiopoulos, et al., 2010). The correct discrimination between these two
conditions has major implication in subsequent rehabilitation of patients. In particular, establishing a
communication with them would be advantageous and desirable. Several techniques, including
functional magnetic resonance imaging (fMRI), cognitive event-related potentials (ERPs) and
quantitative EEG analysis (qEEG), are currently developed to assess patients correctly and to attempt
the communication with them (Monti, et al., 2010) (Cruse, et al., 2010) (John, et al., 2011). This work
describes a procedure to investigate a subjects pattern of activation during mental imagery tasks. It
aims to design a brain computer interface system for communication. Healthy subjects and patients
with different levels of disorders of consciousness underwent EEG recording during yes/no personal
questions. The first aim of the study was to develop a procedure of features selection in order to
reduce the number of electrodes required. The second aim was to design a classifier that, after the
training with two questions with known answers, is able to forecast the third unknown answer.
2 Methods
2.1 Subjects
Five healthy subjects (HS) (age 26 to 37) and two patients (P) took part in the study. The first
patient was a 21-year-old female and her level of cognitive functioning (LCF) was 7. The second
patient was a 28-year-old female and her LCF was 3. The etiology of the injury was traumatic for both
patients.
*
Collected and analysed the data and created the first version of this document
Analyzed the data and supervised this document
Article ID 002-1
DOI:10.3217/978-3-85125-378-8-2
2.2 Protocol
The experiment consisted of a Communication Trial. Simple yes/no personal questions, with
known answers, were asked to the subjects (e.g. Are you married?). Subjects were instructed to
imagine for 30 seconds a movement of the right hand for an affirmative answer and a movement of
the right foot for a negative answer. The Trial comprised six questions which were repeated six times
for HS and twice for P. The experiment was repeated on two consecutive days (sessions) for HS only.
No feedback was provided to the subjects.
where i is the electrode index, j the band index, k the repetition index, n the number of repetitions, and
P1, P2 and P3 are the power of the first, second and third answer of the triple, respectively. If the
third answer is the same as the first one, s tends to zero, while if the third one is equal to the second, s
tends to 1. Using s we calculated the similitude between equal answers and we selected the 10 BE that
optimize s for all HS. The classification of the third answer is performed using the following
conditions:
Article ID 002-2
DOI:10.3217/978-3-85125-378-8-2
The aim of this preliminary selection on the HS was to reduce the problem dimensionality. After this
selection, we applied again the same criterion for each HS and each P separately, with the aim to find
a list of BEC according to the similitude criterion.
Article ID 002-3
DOI:10.3217/978-3-85125-378-8-2
future practical application of our protocol, each communication session will be preceded by a brief
configuration session in which the classification algorithm selects the optimum electrode subset from
the fixed BE. Furthermore it will be necessary to train the classifier with two known questions. Even
if the procedure is long and repetitive, it guarantees an high classification accuracy for the patients.
Table 1: The table shows, for each HS, each P and each session (S), the best couples electrode-band
(BEC) selected using the similitude index s and the related classification accuracy (CA).
S
HS 1
HS 2
HS 3
HS 4
HS 5
P1
P2
1
2
1
2
1
2
1
2
1
2
1
1
BEC
O1
Cz-PO4
C3- T6-PO3
P3-PO4
O1-PO3-P3
Fc1-Fc2
Fc2
Cz
PO4
Fc2-Fz
PO3-C3
PO3-C3
C3
O1-Cz-PO3P3-PO4
Fc1-P3
C3-P3
C3-Cz-PO4
P3-Fc1-T6
C3
C3
Fz
Fc2
PO3-O1-T6
O1-PO4
Fz-T6-Fc1P3-O1-PO3
Fz-C3
C3
Fc2-PO3
O1-PO3
O1
PO4
CA
MeanSD
90 %
90 %
80 %
80 %
83.512%
100 %
75 %
80 %
60 %
100 %
80%
80%
100%
9014.1%
References
Cruse, D., Chennu, S., Fernandez-Espejo, D., Payne, W. L., Young, G. B., & et al. (2010). Detecting
Awareness in the Vegetative State: Electroencephalographic Evidence for Attempt
Movements to Command. PLos ONE, e49933.
De Lange, F., Jensen, O., Bauer, M., & Toni, I. (2008). Interaction between posterior gamma and
frontal alpha/beta oscillation during imagined actions. Frontiers in Human Neuroscience,
10.3389.
Georgiopoulos, M., Katsakiori, P., Kefalopoul, Z., Elleul, J., Chroni, E., & al., e. (2010). Vegetative
Stae and Minimally Conscious State: A Review of the therapeutic Interventions. Stereotactic
and Functional Neurosurgey(88), 199-207.
Ishizu, T., Noguchi, A., Ito, Y., Ayabe, T., & Kojima, S. (2009). Motor activity and imagery modulate
the body-selective rgion in the occipital-temporal area: A near-infrared spectroscopy study.
Neuroscience Letter, 85-89.
John, E. R., Halper, J. P., Lowe, R. S., Merkin, H., Defina, P., & et al. (2011). Source imaging of
QEEG as a method to detect awareness in a person in vegetative state. Brain Injury, 426-432.
Lebon, F., Lotze, M., Stinear, C. M., & Byblow, W. D. (2012). Task-Dependent Interaction between
Parietal and Controlateral Promary Motor Cortex during Explicit versus Implicit Motor
Imagery . PLos ONE, e37850.
Monti, M. M., Vanhaudenhuyse, A., Coleman, M. R., Boly, M., Pickard, J. D., & et al. (2010). Willful
modulation of brain activity in disorder of consciousness. New Engalnd Journal of Medicine,
579-589.
Article ID 002-4
DOI:10.3217/978-3-85125-378-8-3
Abstract
This work presents a comparative study between low and high frequencies of visual stimulation used in a Brain-Computer Interface (BCI) based on Steady State Visual Evoked
Potentials (SSVEP). This comparison has the goal of evaluating the visual tiredness produced by flickering visual stimuli in two distinct frequency ranges (low and high frequency).
For this purpose, five volunteers with disabilities operated a wheelchair through a SSVEPbased BCI. In the experiments, each subject answered a questionnaire about performance
and tiredness associated to the use of the BCI. Average ITR obtained for low- and highfrequency stimuli were 20.3 bits/min and 15.0 bits/min, respectively. Despite of a lower
average ITR, it was found that high-frequency stimuli were more comfortable and could
lead to a better performance in the accomplishment of the navigation tasks.
Introduction
Materials
A BrainNet BNT-36 acquisition system was used to acquire EEG signals. Twelve EEG channels
with the reference electrode at the left ear lobe and filtered between 0.1 and 100 Hz were digitized
at 600 Hz. Using the extended international 10-20 system, the locations for the electrodes were
P7 , PO7 , P5 , PO3 , POZ , PO4 , P6 , PO8 , P8 , O1 , O2 , and OZ .
Article ID 003-1
DOI:10.3217/978-3-85125-378-8-3
Two FPGA are used to produce stimuli in low- and high-frequencies. The stimuli in lowfrequency range are shown in a 12 LCD display at a distance of 0.5 m from the user. They
are composed of four black/white checkerboard stripes, flickering on the screen at 5.6 Hz (top),
6.4 Hz (right), 6.9 Hz (bottom) and 8.0 Hz (left), as illustrated in Figure 1 (a). On the other
hand, stimuli in high frequency range are illuminated by high efficiency green LEDs flickering
at 37 Hz (top), 38 Hz (right), 39 Hz (bottom) and 40 Hz (left) on the LCD sides, also shown in
Figure 1 (a). Each checkerboard stripe or led corresponded to a movement of the wheelchair:
forward, left, right and stop.
Methods
Before operating the wheelchair, each volunteer performed a training session with the BCI.
They were asked to follow the verbal cues to gaze at a stripe for 30 seconds. Visual feedback
denoting the detected stripe was presented to the user. Then, the volunteers could operate the
wheelchair. The four tasks are illustrated in Figure 1 (b). The room dimensions are 8.75m long
by 7.07m wide. The goal of all tasks was to reach the area next to the door.
Finally, the volunteers answered a questionnaire with questions related to tiredness and
comfort when using the BCI. This questionnaire allows evaluating the influence of tiredness
and concentration on user performances qualitatively. The questions were:
(A) Are you tired?
Article ID 003-2
DOI:10.3217/978-3-85125-378-8-3
(B) Did the screen oscillations interfere with your concentration? (This question is related to screen
oscillations due to wheelchair movements, particularly when it begins or ends a movement).
(C) Was the stimuli colour annoying?
These questions should be answered according to the ranking: 1 - None; 2 - A little; 3 Medium; 4 - Quite.
The EEG signal processing method is fully described in [3]. Basically, the EEG is filtered
and then, the Power Spectral Density (PSD) was determined. Later, a Spectral F-Test (SFT)
is applied in the feature extraction step [1]. A classifier based on a decision tree was implemented with attributes that maximize the discrimination among classes. The training step is
unnecessary for this classifier because its operation is straightforward. Moreover, baseline or
reference signal are unnecessary and supervisor intervention is not required. Thus, since the
user sits on the wheelchair and wears the EEG cap, he will be ready to use the BCI. This BCI
worked asynchronously and detections were performed at each second, accordingly a command
is sent to the wheelchair every second. On one hand, the BCI considered the first three harmonics in SSVEP detection for low frequencies. On the other hand, only the first harmonic
was considered for high frequencies.
Results
Table presents the hit rate and the ITR obtained just before the wheelchair operation for
low- and high-frequency stimuli. Table 1 presents the average detection accuracy (Acc) among
the four classes and its respective average ITR, calculated according to [5]. The number of
navigation tasks completed by each volunteer is presented as well.
Table 1: Results for low- and high-frequency stimulation.
Vol
1
2
3
4
5
Average
Acc SD
46% 6.05
44% 7.5
78% 5.35
62% 9.88
41% 11.81
54% 8.12
Low Frequency
ITR
Completed Tasks
8.9
4
7.4
1
53.5
2
26.4
2
5.3
1
20.3
2
Acc
60% 13.12
40% 10.24
63% 15.04
51% 13.12
41% 9.6
51% 12.22
High Frequency
ITR
Completed Tasks
23.7
4
4.7
3
27.8
4
13.4
3
5.3
2
15.0
3
Stimulation parameters are a very important issue for a SSVEP-based BCI implementation
and can aect the system performance and the user comfort and safety. A statement issued in
Article ID 003-3
DOI:10.3217/978-3-85125-378-8-3
Table 2: Questionnaire applied to the volunteers after the robotic wheelchair operation.
Question
A
B
C
Frequency Range
Low
High
Low
High
Low
High
Vol1
Medium
A little
None
None
None
A little
Vol2
Medium
A little
Medium
Medium
Quite
A little
Vol3
A little
None
Medium
None
None
A little
Vol4
Medium
A little
None
None
Quite
None
Vol5
A little
None
A little
A little
None
None
Median
Medium
A little
A little
None
None
A little
BCI bibliography is that high-frequency stimulation produces less visual tiredness than lower
frequency stimulation [2], [6]. However, is high-frequency stimulation less annoying than lower
frequency stimulation? The current research tries to answer partially this question by evaluating the two commonly used stimulation system in SSVEP-based BCI, checkerboards in
low-frequency and LEDs in high-frequency. Note that the dierent stimulus patterns could
aect the SSVEP signals.
In low frequency stimulation, only volunteer Vol1 could execute the four navigation tasks.
Volunteers reported less tiredness for high-frequency LED stimulation than for low-frequency
checkerboard stimulation (except Vol1), which was statistically significant (p=0.025). However,
the other analyzed variables such as colour and display movement, do not seem to aect the
volunteer performance, since those dierences were non-significant (p>0.05).
Although average ITR values in low-frequency (20.3 bits/min) and in high-frequency (15
bits/min) were dierent, the volunteers could perform more tasks with high-frequency stimuli.
Then, less tiring stimuli are visually more comfortable and could lead to a better performance.
Although the SSVEP are stronger in low-frequency range (and consequently their detection
is easier), the developed BCI could detect the visual evoked potentials in high-frequency range
with good performance. Results show that the LED stimuli in high-frequency range produce
lower visual tiredness on the users, compared with low-frequency checkerboard. Note that the
small sample size limits the conclusions exposed on this paper.
References
[1] Antonio Mauricio F. L. Miranda de S
a, Humberto C. Thiengo, Ingrid S. Antunes, and David M.
Simpson. Assessing time- and phase-locked changes in the eeg during sensory stimulation by means
of spectral techniques. Proc. IFMBE 2009, 25:21362139, 2009.
[2] Pablo F. Diez, Vicente Mut, Eric Laciar, and Enrique Avila. Asynchronous bci control using
high-frequency ssvep. Journal of Neuroengineering and Rehabilitation, 8(39), 2011.
[3] Sandra Mara Torres M
uller, Antonio Mauricio F. L. Miranda de S
a, Teodiano Freire Bastos-Filho,
and M
ario Sarcinelli-Filho. Spectral techniques for incremental SSVEP analysis applied to a BCI
implementation. Proceedings of V Latin American Congress on Biomedical Engineering, CLAIB
2011, IFMBE Proceedings, 33:4pp, 2011.
[4] R. L. Scheaer and J. T. McClave. Probability and Statistics for Engineers. Editorial Iberoamericana, Mexico, 1993.
[5] Francois-Benoit Vialatte, Monique Maurice, Justin Dauwels, and Andrzej Cichocki. Steady-state
visually evoked potentials: Focus on essential paradigms and future perspectives. Progress in
Neurobiology, 90:418438, 2010.
[6] Y. Wang, R. Wang, Xiaorong Gao, B. Hong, and Shangkai Gao. A practical VEP based brain
computer interface. IEEE Trans. on Neural Syst. and Rehab. Eng., 14(2):234239, 2006.
[7] Danhua Zhu, Jordi Bieger, Gary Garcia Molina, and Ronald M. Aarts. A survey of stimulation
methods used in SSVEP-based BCIs. Computational Intelligence and Neuroscience, 702357, 2010.
Article ID 003-4
DOI:10.3217/978-3-85125-378-8-4
Brain Embodiment Lab, School of Systems Engineering, University of Reading, RG6 6AY, UK
i.daly@reading.ac.uk
Interdisciplinary Centre for Computer Music Research, University of Plymouth, PL4 8AA, UK
Abstract
A Brain-computer music interface (BCMI) is developed to allow for continuous modification of the tempo of dynamically generated music. Six out of seven participants are
able to control the BCMI at significant accuracies and their performance is observed to
increase over time.
Keywords Brain-computer music interface, Music generation, EEG, Online continuous control
Introduction
Brain-computer music interfaces (BCMIs) are devices that allow control over, or interaction
with, music via brain activity and without activation of the eerent nervous system [6]. BCMIs
therefore allow individuals to interact with, create, or modify music in situations where this
would not otherwise be possible. For example, individuals with severe movement restrictions
may benefit from use of a BCMI for recreation or as a therapeutic device.
Previous studies have presented BCMIs for passive modification of musical properties, such
as beat and loudness, via measurements of complexity of dierent frequency bands in the
electroencephalogram (EEG) [5]. They have also demonstrated selection from a discrete set of
two-tone bass frequency drone sounds via emotional imagery [3] and selection from a discrete
set of musical scores via steady-state visual evoked potentials (SSVEPs) [4].
Thus, BCMIs have been shown to allow active control of music generation via selection
from a discrete set of instructions or passive interaction with music. However, there is an
opportunity to further investigate whether BCMI users can wilfully modify musical properties
on a continuous scale during a period of online music control.
Therefore, a BCMI is constructed to allow users to modulate the tempo of a piece of music
dynamically via intentional control. Specifically, users are able to increase the tempo of the
music via kinaesthetic motor imagery (MI) [7] and decrease the tempo via relaxation. Music
tempo is mapped to the strength of the users motor imagery allowing them to move the tempo
continuously across a specified range.
2
2.1
Methods
Participants
Seven healthy right-handed individuals (median age=23; SD=2.9; 6 males) voluntarily participated in the experiment. All participants gave informed consent, and the study was approved
as per the University of Reading guidelines for ethics.
Article ID 004-1
2.2
DOI:10.3217/978-3-85125-378-8-4
Measurements
EEG was recorded from 19 channels positioned via the international 10/20 system, referenced
to a central electrode at FCz, and a ground electrode at AFz.
EEG was sampled at a rate of 1,000 Hz via a Brain Products BrainAmp EEG amplifier
(Brain Products, Germany). Impedances were kept below 5 k for all participants.
2.3
Music generation
To sustain novelty and avoid complications caused by user familiarity, a generative algorithm
was implemented to produce monophonic piano music, with real-time control over tempo.
The generative algorithm allows the user to specify three parameters from which it creates
sequences of tone rows (string of pitch classes with no repeated notes) and a pool of rhythm
data. In our system the generated row is used to supply the selection of notes from which the
musical sequences are derived.
Our BCMI uses 6 notes in a tone row. The rhythm pool is generated according to a list of
default values (eight quavers per sequence), with a number of variations introduced according
to the starting parameters. The rhythm pool is used to select notes from the generated tone
rows randomly, creating a large variation of possible musical sequences (musical streams created
by combining notes from the tone row with rhythms from the pool) from a small amount of
seed data. The starting note was randomly selected for each participant, and tempo was scaled
between the range of 99 beats per minute (bpm) to 288 bpm.
2.4
Brain-computer interface
Musical tempo was mapped to the mean alpha (8-13 Hz) band-power recorded over electrodes
F3, T3, C3, Cz, and P3 centred over the left motor cortex. Participants were instructed to
increase the tempo of the music via kinaesthetically imagining squeezing a ball in their right
hand or decrease the tempo by relaxing. Mean alpha band-power was inverted, scaled by a
constant k, and mapped to music tempo. Thus, relaxing increases alpha and decreases tempo.
A single trial of the BCMI control paradigm first presented a fixation cross from -4 s to 0 s.
Music playing began at a fixed tempo of 148 bpm with the appearance of the cross. An arrow
cue (either up; increase tempo, or down; decrease tempo) was presented in the centre of the
screen (visual angle 7.5 ) at second 0. The cue remained on screen for 12 seconds, while the
tempo was mapped to the alpha band power recorded over the left motor cortex and updated
every 100 ms. Upon disappearance of the cue the music generation ceased and a visual feedback
stimuli was presented for 500 ms. This took the form of either a happy face or a sad face.
The paradigm was split into 9 runs. The first was a calibration run to train the parameter
k. This run contained 30 trials in pairs of increase and decrease tempo trials. After each pair k
was either increased by ( = 10 at the start of the run) if tempo was reduced in both trials,
decreased if tempo increased in both trials, or held constant. If k was held was halved.
For all subsequent runs the term k was held constant at the value arrived at after the
calibration run. Each run contained 18 trials and participants were given breaks between runs.
The BCMI was constructed using open standards developed in the TOBI framework [1].
2.5
Analysis
EEG was visually inspected for artefacts via a scorer blinded to the contents of the trials and
results. Trials were rejected if they contained artefacts on channels F3, T3, C3, or P3.
Article ID 004-2
Participant
1
2
3
4
5
6
7
Avg.
DOI:10.3217/978-3-85125-378-8-4
Session
4
5
Avg.
0.444 0.450 0.444 0.500 0.556 0.500 0.550 0.500 0.50 (0.04)
0.357 0.527 0.518 0.300 0.548 0.341 0.732 0.571 0.52 (0.14)
0.708 0.764 0.750 0.687 0.625 0.556 0.500 0.375 0.66 (0.13)
0.800 0.800 0.875 0.800 0.889 0.687 0.700 0.889 0.80 (0.07)
0.500 0.500 0.500 0.512 0.667 0.722 0.556 0.587 0.53 (0.08)
0.917 0.812 0.722 0.632 0.611 0.687 0.71 (0.12)
0.300 0.300 0.312 0.625 0.833 0.750 0.417 0.333 0.38 (0.22)
0.50 0.53 0.52 0.63 0.63 0.69 0.55 0.54
(0.23) (0.19) (0.19) (0.13) (0.13) (0.14) (0.12) (0.19)
Table 1: Balanced accuracies over sessions for each participant. Statistically significant accuracies (p < 0.05) are in bold. Bonferroni correction is applied on a per participant basis.
Event related (de)synchronisation (ERD/S) was identified in the EEG recorded over the left
motor cortex by extracting band-power measures of alpha (8-13 Hz) frequency band activity
relative to a baseline period, defined as the fixation cross presentation period (-4 to 0 s).
Balanced accuracy was used to evaluate each users ability to control the BCMI [2].
Results
A total of 154 trials were excluded due to artefact contamination (the presence of blinks,
movement, and other sources of noise). Additionally, one participant (participant 6) did not
complete the last two runs of the experiment. Thus, a total of 818 (84.16 %) of the trials were
artefact free and may be used to evaluate user performance.
Table 1 reports accuracies achieved by each participant over the sessions. Figure 1 illustrates
the median and standard deviation of accuracies over sessions achieved by participants.
0.6
Balanced accuracy
0.9
0.8
0.7
0.6
0.5
0.4
0.4
0.2
0.2
0.4
0.6
0.3
0.2
1
Session
0.8
2
10
11
12
Time (s)
Figure 2 illustrates the median change in alpha band-power over the left motor cortex
(channel C3) relative to the pre-cue baseline period. Note, for the reduce tempo (motor imagery)
condition the user was able to produce a sustained event-related desynchronisation (ERD).
Article ID 004-3
DOI:10.3217/978-3-85125-378-8-4
Discussion
Music can be actively controlled via 6 out of our 7 BCMI users with significant accuracy
in one or more sessions. It may be argued that multiple comparison correction is needed.
However, sessions are not independent, thus Bonferroni correction on a per session basis is
not appropriate. Instead, Bonferroni correction is applied to mean accuracies per participant,
revealing 3 participants to be able to control the BCMI at significant accuracies.
There is considerable variability and for some sessions participants were not able to control
the BCMI at significant accuracies. Nonetheless, our results demonstrate that music tempo can
be understood and utilised as a feedback mechanism by the majority of BCMI users.
These results open up the possibility of allowing users a greater level of control of music
than previously explored [4, 3]. The performance increase over sessions visible in figure 1 is
also encouraging as it suggests that users can, through increased experience, improve their
ability to control tempo. We also note, however, that in the final sessions our users displayed
a performance reduction. This may be due to fatigue, but further exploration is required.
Future work will seek to compare users ability to learn to control properties of a piece
of music with feedback modalities more traditionally used in BCI control (for example, visual
feedback). We will also explore the eects of more advanced computational methods on the
performance of the BCI. Additionally, the interaction of music and emotion is an interesting
research angle and it would be very interesting to extend the work presented in [3] to allow
continuous emotional imagery based control of musical features such as tempo, mode, or timbre.
Therefore, future work will also look at extending our BCMI to continuously detect emotions.
Acknowledgments
This work was supported by the EPSRC grants (EP/J003077/1 and EP/J002135/1).
References
[1] C Breitwieser, I Daly, C Neuper, and G Muller-Putz. Proposing a Standardized Protocol for Raw
Biosignal Transmission. IEEE trans. on bio-med. eng., 59(3):852859, December 2012.
[2] Kay Henning Brodersen, Cheng Soon Ong, Klaas Enno Stephan, and Joachim M. Buhmann. The
Balanced Accuracy and Its Posterior Distribution. In 2010 20th Int. Conf. on Patt. Recon., pages
31213124. IEEE, August 2010.
[3] S. Makeig, G. Leslie, T. Mullen, D. Sarma, N. Bigdely-Shamlo, and C. Kothe. First Demonstration
of a Musical Emotion BCI. Aecti. Comput. and Int. Interact. Lect. Notes in Comp. Sci., 6975:487
496, 2011.
[4] E. R. Miranda, W. L. Magee, J. J. Wilson, J. Eaton, and R. Palaniappan. Brain-Computer Music
Interfacing (BCMI): From Basic Research to the Real World of Special Needs. Music and Medicine,
3(3):134140, March 2011.
[5] Eduardo Reck Miranda. Brain-Computer music interface for composition and performance. IntJ
Dis Human Dev, 5(2), 2006.
[6] E.R. Miranda, S. Roberts, and M. Stokes. On Generating EEG for Controlling Musical Systems.
Biomedizinische Technik, 49(1):7576, 2004.
[7] Christa Neuper, Reinhold Scherer, Miriam Reiner, and Gert Pfurtscheller. Imagery of motor actions:
dierential eects of kinesthetic and visual-motor mode of imagery in single-trial EEG. Brain
research. Cognitive brain research, 25(3):66877, 2005.
Article ID 004-4
DOI:10.3217/978-3-85125-378-8-5
Introduction
Stroke is one of the leading causes of death and disability in western societies [1]. Cognitive
impairments, such as attention deficits play a prominent role [2] and EEG biofeedback (or
neurofeedback, NF) has been proposed as an eective treatment [3]. Negative slow cortical potentials (SCP) reflect summation of synchronized excitatory postsynaptic potentials from apical
dendrites, whereas positive SCPs derive from reduced inflow to apical dendrites or inhibitory
activity. In line with this view, studies show that an increase of negative SCP amplitudes
is related to better task performance when attention and motor reactions are required [4, 5].
However, the knowledge about the relation between attention and slow cortical potentials [6]
has not yet been exploited for improving attentional functioning in stroke [2].
In this study we sought to relate the progress in SCP control during NF training to latencies
(as a measure of attention) in two reaction time (RT) tasks. In the simple task subjects had
to monitor a series of 31 random letters for 8 occurrences of the letter X and respond to X
as fast as possible via a button-click. In the learning task subjects had to monitor a series of
31 random letters for 6 occurrences of the two-letter sequence AX and respond to X as
fast as possible via a button-click. Optimal performance in this task required subjects to learn
the contingency of A being always followed by X. ITI was 690 ms, and both tasks were
presented in blocks of nine sequences each.
Across training, we hypothesized a learning eect for SCP regulation, i.e. increasing separation of negativity and positivity trials, and for the learning tasks, i.e. reduced latencies. Our
main interests were in the association of regulatory success and response latencies, hypothesizing that successful SCP regulation would correlate with reduced latencies in the learning but
not in the simple task.
Article ID 005-1
DOI:10.3217/978-3-85125-378-8-5
Methods
Each NF session included 250 trials in which cortical positivity had to be increased and 250
trials in which cortical negativity had to be increased. Trials lasted for 8 s (baseline: 0-2 s,
active phase: 2-8 s). Online feedback, from electrode Cz referenced to linked mastoids, consisted
of a circle whose size and color indicated whether subjects regulated successfully with regard
to baseline activity. Trials were judged successful, and success indicated to the participant,
if brain activation was regulated as required by the task (towards positivity or negativity,
respectively). Vertical eye movements were corrected for using a regression procedure. After
getting acquainted with the RT tasks, eleven healthy student subjects (mean age: 23.20, SD:
5.20, range 20-38) trained for three days during one week. On days one and three, training
consisted of two sessions (A & B), and each session comprised one neurofeedback training
and the two RT-tasks. On day two, the single training session consisted of the sequence RTtasksneurofeedback trainingRT-tasks. Here, we evaluate associations between latencies and
amplitudes in the EEG at days one and three.
Preprocessing steps for the oine analysis corresponded to the steps used during online
training (10 Hz low-pass filtering, detrending to compensate for DC drift, epoching, alignment
to pre-trial baseline, correction for ocular artifacts, exclusion of trials with absolute voltages
exceeding 100 V ). Then, trials were averaged per regulatory condition, and the mean activity
during the 2 s long interval starting from 4 s after the beginning of each trial exported for
statistical analysis. One participant was excluded due to low quality of the EEG recording,
and another participant did not participate in all training sessions and was therefore excluded.
The total sample size available for analyzing the association between amplitude and latencies
is, thus, n = 9.
To analyze the relation between latencies in the RT tasks and success during SCP regulation,
we followed a two-step approach. First, we established that latencies in the RT tasks decreased,
and that SCP regulatory success, i.e. the absolute dierence between negativity and positivity
trials, increased across training. Then, we analyzed the association between the change in
latencies and the change in SCP regulatory success between day three and day one. Reported
p-values are two-sided.
Results
Repeated measures ANOVA of latencies in the learning task revealed significant eects of
training day (F1,9 = 3.51, p = .047), session (F1,9 = 10.37, p = .011), and their interaction
(F1,9 = 7.01, p = .027). Follow-up analysis indicated that latencies decreased across sessions at
day one but remained stable thereafter (see Figure 1 A, Table 1). Latencies in the simple task
did not change across training (all p > .30). Mean number of errors was low (range 1 4%)
and did not dier between conditions (all p > .16).
Results for SCP regulation (see Figure 1 B) showed significant eects of regulation (F1,9 =
16.63, p = .003), the interaction of regulation and training day (F1,9 = 11.63, p = .008) and a
marginal eect of the interaction of regulation and session within each day (F1,9 = 4.68, p =
.059; see Table 1).
Article ID 005-2
DOI:10.3217/978-3-85125-378-8-5
Day
1
1
3
3
Session
a
b
a
b
SCP (V )
M
SD
7.94
8.61
8.10
7.95
8.85
8.96
21.59 13.77
100
10
20
30
40
Analysis of the association between learning in the learning task and learning of SCP regulation between day three and day one indicated a large but marginal negative association
(Spearmans = .63, p = .076). Figure 2 shows that subjects who were able to increase the
dierence in SCP amplitudes between days three and one, tended to show faster response latencies on day three in comparison to day one. In the simple task, changes in latencies between
day three and day one were not associated with changes in SCP regulatory success ( = .017,
p = .982).
Article ID 005-3
DOI:10.3217/978-3-85125-378-8-5
Discussion
To summarize, our data show that healthy participants can learn to regulate their SCPs during
the course of a short training. Latencies in the learning task decreased across training, but
latencies in the simple task remained stable. Further, learning of SCP regulation was marginally
associated with reduced latencies in the learning, but not in the simple task. However, because
we employed a correlative design it is unknown whether changes in SCP regulatory success play
a causal role for latencies in the learning task.
Our results support previous findings that successful SCP regulation is associated with
faster reaction times [5]. However, this association was found only for a task which involved
learning, but not in a simple RT task. This may suggest that the choice of tasks used in
neurofeedback studies may play a crucial role. We thus speculate that associations between
SCP regulatory success and behavioral data should be strongest with tasks of medium difficulty
involving executive functions [7].
Our results show that successful regulation of SCPs is possible even with limited training
and that this change is associated with increased performance in a task requiring attention and
motor reaction. Based on these encouraging results, studies are underway aiming to replicate
these findings in a large sample of chronic stroke patients.
Acknowledgments
This work is supported by the European ICT Programm (Project FP7-287320, contrastproject.eu). We thank Anna-Sophia Fritsch for her support during data collection.
References
[1] P. Heuschmann, O. Busse, M. Wagner, M. Endres, A. Villringer, J. R
other, P. Kolominsky-Rabas,
and K. Berger. Schlaganfallh
aufigkeit und versorgung von schlaganfallpatienten in deutschland.
Aktuelle Neurologie, 37(07):333340, 2010.
[2] J. A. Michel and C. A. Mateer. Attention rehabilitation following stroke and traumatic brain injury.
a review. Europa medicophysica, 42(1):5967, 2006.
[3] Lonnie Nelson. The role of biofeedback in stroke rehabilitation: Past and future directions. Topics
in Stroke Rehabilitation, 14(4):5966, 2007.
[4] Werner Lutzenberger, Thomas Elbert, Brigitte Rockstroh, and Niels Birbaumer. The eects of
self-regulation of slow cortical potentials on performance in a signal detection task. International
Journal of Neuroscience, 9(3):175183, 1979.
[5] B. Rockstroh, T. Elbert, W. Lutzenberger, and N. Birbaumer. The eects of slow cortical potentials
on response speed. Psychophysiology, 19(2):211217, 1982.
[6] N. Birbaumer, T. Elbert, A. G. Canavan, and B. Rockstroh. Slow potentials of the cerebral cortex
and behavior. Physiological reviews, 70(1):141, 1990.
[7] Aljoscha C. Neubauer and Andreas Fink. Intelligence and neural efficiency. Neuroscience & Biobehavioral Reviews, 33(7):10041023, 2009.
Article ID 005-4
DOI:10.3217/978-3-85125-378-8-6
Introduction
2
2.1
Methods
Task description
The experimental task was similar to the one described by Milekovic et al. [7] in which the
subject had to play a simple video game. The subject used the thumbstick of a gamepad to
control the angle in which the cursor on the screen moved. The task was to avoid collisions of
the cursor with blocks dropping from the top of the screen with a constant speed. The speed of
the falling blocks was set to a level that the game was challenging and the player collided with a
Article ID 006-1
DOI:10.3217/978-3-85125-378-8-6
block from time to time. In case of a collision, the game resumed for 1 second and then stopped.
The delay of 1 second was introduced to make sure that the reaction measured in the EEG
originates from the subject recognizing the collision (outcome error) and not from the game
stopping or restarting. To study the execution error, which is happening when the interface
delivers erroneous feedback, the angle of the cursor movement was modified for the duration of
2 seconds. The degree of modification was randomized (45 , 90 , 180 to either the left or
the right side). The time between two execution errors was randomized to be between 5 and 8
seconds.
2.2
Experimental setup
10 healthy subjects (mean age: 24.1 1.1 years) were recruited for this study. EEG was measured with two g.tec g.USBamp amplifiers and a Brainproducts Acticap System. 29 electrodes
were placed on the scalp of the subject to measure EEG, while 3 electrodes where placed below
the outer canthi of the eye and above the nasion for electrooculogram (EOG) recordings. The
data was recorded with a sampling rate of 512 Hz and a 50 Hz notch filter was applied to
filter out power line noise, as well as an additional bandpass filter between 0.5 Hz and 60 Hz.
The position of the thumbstick as well as information about outcome or execution errors was
transmitted to the recording software using the parallel port of the computer.
2.3
The data was segmented into dierent trials with a length of 1 second: execution errors, timelocked to the start of an angle modification; outcome errors, time-locked to the collision event;
and noError trials, where neither a collision nor an angle modification has happened during the
trial or in the 1 second before or after the trial. For each subject about 1 hour of EEG was
recorded resulting on average in 597 22 execution errors, 86 30 outcome errors and 475 39
noError trials.
An EOG-based regression method was used to reduce the eect of eye artifacts. To estimate
classification accuracies we used a 10-fold cross-validation. For classification we used a Support
Vector Machine (SVM) with a linear kernel. For all channels the samples in the time range 0.2 s
to 0.9 s were used for classifiation and the signal was rereferenced to the common average. To
investigate how well the error can be classified, outcome error and execution error, respectively,
were classified against noError trials. To see if the two types of errors can be discriminated, we
also classified execution errors against outcome errors. Since the number of trials were dierent
for each class, the dataset was balanced to obtain an even amount of trials for each class.
To test if the subjects movements (due to gamepad control) or eye movements influence
classification, classification was also done on the EOG data and on the recorded position of the
thumbstick.
Results
Averaged over all subjects, execution errors and noError trials could be classified with an average accuracy of 65.0 % based on EEG, 50.9 % based on EOG and 52.5 % based on the thumbstick
position. For outcome error against noError trials, average accuracies of 73.9 % (EEG), 54.9 %
(EOG) and 56.0 % (thumbstick) could be reached. For the classification of the two error types,
execution error and outcome error, we obtained average accuracies of 75.3 % (EEG), 56.4 %
(EOG) and 55.3 % (thumbstick). Detailed results of the classification on the EEG data is shown
Article ID 006-2
DOI:10.3217/978-3-85125-378-8-6
in table 1. For the classification results of the EEG data we performed a permutation test (1200
permutations) to test for significance and all EEG results were found to be significantly above
chance level.
Table 1: Classification accuracies based on EEG data obtained by 10-fold cross-validation.
Classes were balanced and therefore the chance level is at 50 %. All results are significantly
above chance level.
subject Execution vs. Outcome Outcome vs. noError Execution vs. noError
S01
77.9 %
74.4 %
69.7 %
S02
76.3 %
78.5 %
65.4 %
S03
69.6 %
68.2 %
59.9 %
S04
72.1 %
75.0 %
60.1 %
S05
70.2 %
60.3 %
64.3 %
S06
67.7 %
76.5 %
63.4 %
S07
73.6 %
76.3 %
62.8 %
S08
85.0 %
80.4 %
68.6 %
S09
78.1 %
71.3 %
64.5 %
S10
82.1 %
78.0 %
71.2 %
mean
75.3 %
73.9 %
65.0 %
Figure 1 shows the average waveform of the execution error at electrode Cz for all subjects
as well as the topographic distribution of the potential. Although the waveform of the two error
potentials diered strongly, the topographic distribution was very similar for both errors and
all subjects with the maximum around electrode FCz and Cz.
Figure 1: Execution error-related potential at electrode Cz. The colored lines depict the ErrP for
the dierent subjects, while the bold black line is the average over all subjects. The topographic
distribution of the potential averaged over all subjects at 285 ms and 360 ms is shown at the
top. For display of the ErrPs, the dierence between the error trials and noError trials was
calculated.
Since the angle of the movement was randomly modified with dierent degrees, we also
tested if execution errors with a dierent degree can be classified, e.g. 45 against 180 , but
did not achieve significant results.
Article ID 006-3
DOI:10.3217/978-3-85125-378-8-6
In this study we could show that ErrPs can be measured in the EEG, due to an erroneous
response during continuous feedback. While we have further shown that two dierent types of
errors can be discriminated in EEG (outcome error vs. execution error), we could not detect the
severity of an execution error (e.g. 45 or 180 ). When looking at the shape and topographic
distribution of the execution error, it is similar to the results of earlier studies [2, 6], thereby
showing that classification is based on an electrophysiological response and not on artifacts.
This is supported by the lower classification accuracies obtained on EOG and gamepad data,
which are not significantly above chance level for most subjects.
Although we were able to classify both types of error, the classification accuracy needs to
be improved to be useable for adaptation or error correction in a BCI application. Therefore it
needs to be tested if the power spectrum of the EEG yields additional information to classify
those ErrPs. While we could show that ErrPs are elicited during continuous feedback and we
are able to classify them based on the EEG, the classification itself is still event-locked and it
needs to be tested in a further study how well classification works if it is done continuous [8].
Acknowledgements
The authors would like to thank Johannes Anufrienko for programming the experimental task.
This study was funded by by the German Federal Ministry of Education and Research (BMBF,
BFNT F*T, Grant UT
u 01 GQ 0831) and the Deutsche Forschungsgemeinschaft (DFG, Grant
RO 1030/15-1, KOMEG).
References
[1] Pierre W Ferrez and Jose del R Millan. Error-related eeg potentials generated during simulated
braincomputer interaction. Biomedical Engineering, IEEE Transactions on, 55(3):923929, 2008.
[2] Martin Sp
uler, Michael Bensch, Sonja Kleih, Wolfgang Rosenstiel, Martin Bogdan, and Andrea
K
ubler. Online use of error-related potentials in healthy users and people with severe motor impairment increases performance of a P300-BCI. Clinical Neurophysiology, 123(7):13281337, 2012.
[3] Nico M Schmidt, Benjamin Blankertz, and Matthias S Treder. Online detection of error-related
potentials boosts the performance of mental typewriters. BMC neuroscience, 13(1):19, 2012.
[4] Martin Sp
uler, Wolfgang Rosenstiel, and Martin Bogdan. Online adaptation of a c-VEP BrainComputer Interface (BCI) based on Error-related potentials and unsupervised learning. PloS one,
7(12):e51077, 2012.
[5] Alberto Llera, Marcel AJ van Gerven, Vicenc G
omez, Ole Jensen, and Hilbert J Kappen. On
the use of interaction error potentials for adaptive brain computer interfaces. Neural Networks,
24(10):11201127, 2011.
[6] Alex Kreilinger, Christa Neuper, and Gernot R M
uller-Putz. Error potential detection during continuous movement of an artificial arm controlled by braincomputer interface. Medical & biological
engineering & computing, 50(3):223230, 2012.
[7] Tomislav Milekovic, Tonio Ball, Andreas Schulze-Bonhage, Ad Aertsen, and Carsten Mehring.
Error-related electrocorticographic activity in humans during continuous movements. Journal of
neural engineering, 9(2):026007, 2012.
[8] Tomislav Milekovic, Tonio Ball, Andreas Schulze-Bonhage, Ad Aertsen, and Carsten Mehring. Detection of Error Related Neuronal Responses Recorded by Electrocorticography in Humans during
Continuous Movements. PLoS ONE, 8(2):e55235, 02 2013.
Article ID 006-4
DOI:10.3217/978-3-85125-378-8-7
Introduction
2
2.1
Method
RC paradigm
Because the available frequencies in a SSVEP BCI are often restricted, as the RC paradigm in
P300 BCI, the targets in SSVEP BCI system are also detected by their row and column coordinates (see Figure 1). 36 characters arranged in a 6 6 matrix are employed in our paradigm
because it includes all the basic items necessary for spelling. Each cell in the proposed SSVEP
BCI flickers between white and black at a constant frequency. Furthermore, we employed six
frequencies, set at 8.18, 8.97, 9.98, 11.23, 12.85, and 14.99 Hz, in the design of a periodic stimulus mechanism. In Figure 1, the numbers on the top and left sides of the graphical user
interface (GUI) denoting the stimulus frequencies. Specifically, to determine the row coordinate
Article ID 007-1
DOI:10.3217/978-3-85125-378-8-7
del
Figure 1: Illustrations for the stimulus configuration of the proposed SSVEP BCI.
of the target, all the items in same row flickered at same frequencies. Subsequently, all items
in the same column flickered at same frequencies to detect the targets column coordinate.
2.2
During the spelling process, the subjects might reduce their level of attention on the centre of
the SSVEP flicker, thus lead to a decrease in accuracy [4]. To this end, we designed a real-time
feedback mechanism to improve the efficiency of the SSVEP speller. Feedback is given in realtime to the subject by changing the color of the current target character to green. Specifically,
the system first changes the color of the characters within the selected row, and then changes
the color of the character associated with selected row and column coordinates. During the
spelling process, if subjects find that the character they are looking at has not changed color,
they must increase their focus on the target stimuli.
2.3
CCA-RV method
In our approach, canonical correlation analysis (CCA) was used to calculate the correlation
between the stimulus frequency and the multi-channel electroencephalogram (EEG) data. The
correlation coefficient of each stimulus frequency is calculated in real-time (scorei (t)). Then we
T
obtained the average non-target scores (scoreN
(t)) of each frequency associated with dierent
i
time points following the initial stimulus presentation. Finally, the SSVEP response scores were
evaluated in the following manner:
Scorei (t) =
2.4
T
scorei (t) scoreN
(t)
i
N
T
scorei (t) + scorei (t)
(1)
We employ the parctical ITR (PITR) proposed by Townsend et al. for the optimal spelling
time selection, which provides a more realistic estimation of ITR [5]. The time corresponding
to maximum PITR is selected as optimal stimulus time. The PITR can be expressed as follows:
Article ID 007-2
DOI:10.3217/978-3-85125-378-8-7
(3)
column
where t
and t
are the row and column stimulus times, respectively, and I is the time
between successive selections.
The optimal stimulus time is estimated using calibration data and is fixed prior to online
BCI use. During online spelling, the SSVEP speller provides the spelling results once the
optimal stimulus time is met. To avoid visual fatigue, we set the maximum stimulus time to 10
s. The peak PITR was taken as a measure of the possible performance of the SSVEP speller.
2.5
Experimental Setup
Six healthy subjects participated in the study. All subjects had normal or corrected-to-normal
vision. All subjects signed an informed consent form in accordance with the Declaration of
Helsinki. EEG signals were recorded using a BrainAmp DC Amplifier. Nine-channel active
electrodes were selected for the SSVEP detection and were placed at Pz, P3, P4, Oz, O1, O2,
POz, PO7 and PO8, referenced to P8 and grounded to Fpz. The EEG signals were sampled at
250 Hz and filtered using a 50-Hz notch filter. The experiments were performed in a normal
office room. The subjects were seated in a comfortable chair located approximately 70 cm from
a 2700 LED monitor. The oine session consisted of two dierent stimuli conditions - one with
real-time feedback and one without. Six runs of each condition were collected. In each run,
the subjects were required to input 12 symbols in a random order to avoid adaptation. For
each letter selection, the SSVEP stimuli appeared on the screen and remained for 10 s. During
online spelling, the subjects were required to spell their own names in Latin letters three times.
When an incorrect symbol was detected, the subject had to correct the misspelling by selecting
the del option located at the bottom right of the matrix, followed by the correct letter.
EXPERIMENTAL RESULTS
Figure. 2 presents a comparison of the average classification results achieved using the proposed
approaches (CCA-RV and real-time feedback) and traditional SSVEP approaches (CCA and
without real-time feedback). The results indicate that the proposed approaches achieved higher
average classification accuracy than the traditional SSVEP approaches. The online performance
of the proposed SSVEP speller is summarized in Table 1. The average PITR obtained across
all subjects using the fixed optimization approach reached 37.59 5.55 bits/min.
Subject
S1
S2
S3
S4
S5
S6
AVG
STD
Task
Characters
30
24
24
15
36
27
26
7.01
Stimulus
Time (s)
8.24
3.92
3.92
4.48
4.72
4.40
4.95
1.64
Practical
Selections
30
28
32
19
50
31
31.67
10.13
PITR
(bits/min)
30.29
44.91
39.30
35.59
33.24
42.21
37.59
5.55
Article ID 007-3
DOI:10.3217/978-3-85125-378-8-7
100
98
96
Accuracy (%)
94
92
CCA, without realtime feedback
CCARV, with realtime feedback
CCA, without realtime feedback
CCARV, with realtime feedback
90
88
86
84
3
10
Time (s)
Conclusions
In this paper, a optimized SSVEP BCI system was proposed in order to increase ITR. Only six
frequencies was used to establish the SSVEP BCI with 36 items. In addition, we proposed the
CCA-RV method and the real-time feedback mechanism to enhance the accuracy. To achieve
reasonable online performance, we designed a fixed optimal approach for selecting stimulus
time. Experimental results suggest that the proposed SSVEP speller can provide improved
performance compared to traditional BCI approaches. More specifically, online spelling PITR
using the fixed optimization approach achieved 37.59 5.55 bits/min.
Acknowledgments
This work was supported in part by the National High-Tech R&D Program of China (863
Program) under Grant 2012AA011601, the National Natural Science Foundation of China under
Grant 61375117.
References
[1] H. Cecotti. Spelling with non-invasive brain-computer interfaces-current and future trends. J. Physiol. - Paris, vol. 105, pp. 106114, 2011.
[2] F. Vialatte, M. Maurice, J. Dauwels, and A. Cichocki. Steady-state visually evoked potentials:
Focus on essential paradigms and future perspectives. Prog. Neurobiol., vol. 90, pp. 418438, 2010.
[3] J. R. Wolpaw, D. J. M. N. Birbaumer, P. G, and T. M. Vaughan. Brain-computer interfaces for
communication and control. Clin. Neurophysiol., vol. 113, no. 6, pp. 767791, 2002.
[4] S. Kelly, E. Lalor, C. Finucane, R. Reilly, and J. Foxe. Visual spatial attention tracking using highdensity SSVEP data for independent brain-computer communication. IEEE Trans. Neural Syst.
Rehabil. Eng., vol. 13, no. 2, pp. 172178, 2005.
[5] G. Townsend, B. K. LaPallo, C. B. Boulay, D. J. Krusienski, G. E. Frye, C. K. Hauser, N. E.
Schwartz, T. M. Vaughan, J. R. Wolpaw, and E. W. Sellers. A novel P300-based brain-computer
interface stimulus presentation paradigm: moving beyond rows and columns. Clin. Neurophysiol.,
vol. 121, no. 7, pp. 11091120, 2010.
Article ID 007-4
DOI:10.3217/978-3-85125-378-8-8
1 Introduction
The BCI application Brain Painting (Mninger, et al., 2010) allows an end-user to paint on a virtual
canvas using brain activity without the requirement of motor pathways. Brain Painting is controlled by
a BCI with event-related potentials (ERP) as input signal.
Brain Painting allows the end-user to select basic shapes of different color, opacity and size on a
virtual canvas displayed on a monitor, such that the user can combine them and compose pieces of art.
This system was designed to match the requirements of paralyzed or locked-in end-users (Zickler,
Halder, Kleih, Herbert, & Kbler, 2013). Formerly restricted to lab or demonstration purposes, the
system has been adapted for home use by Holz and colleagues (Holz, Botrel, Kaufmann, & Kbler,
under review) and was given to two locked-in patients so they could use Brain Painting on a regular
basis for several months. Our first end-user reported to be highly satisfied with the system, and we could
measure an improvement in the quality of life. The end-user expressed the wish to have more painting
features and a signal validation tool that would ensure a correct setup prior to every use.
Following the user-centered design (Kbler, et al., Manuscript submitted for publication), Brain
Painting Version (V) 2 has been developed to face the new challenges encountered in home use
environment. Thus, Brain Painting V2 is not only an extension of Version 1, but has been newly
programmed to incorporate additional painting features such as lines and text. An autocalibration was
also implemented that can easily be performed prior to every session. With the following study, we
aimed at evaluating the usability of the system with a sample of healthy users in the laboratory.
Article ID 008-1
DOI:10.3217/978-3-85125-378-8-8
2 Software
The Brain Painting V2 application uses, Python 2.7, Qt, Pygame and BCI2000. It is composed of 3
independent but connected modules: (I) The graphical user interface (GUI), coded in Python and Qt,
contains the painting canvas in the middle and a status bar on the left (see Figure 1) to display important
information such as selected colors and brush size. The menu bar on the top allows to load or save
paintings. Below the menu bar, there are play, pause and stop buttons that respectively allow to start,
pause and stop a Brain Painting session. (II) Running in background, BCI2000 processes the EEG
signal, classifies and sends stimulations to the speller. The GUI uses the Telnet protocol interface of
BCI2000 to load parameters, start and stop signal processing. (III) The custom P300 speller matrix
feedback uses Pygame. It displays the Brain Painting matrix such that one symbol represents one
painting function on the canvas. Famous faces flashes are implemented for stimulation to increase the
signal-to-noise ratio (Kaufmann, et al., 2013). Several matrices were created to include all requested
painting functions, and can be reached from the main menu.
Figure 1: (left) Graphical User interface with the Canvas and the status bar, positioned on the side. The session
control buttons are above the status bar. (right) 6x8 Speller matrix with symbols associated with Brain Painting
functions positioned in front of the participant. The painting on the left was composed during a live demonstration
in the Psychologie und Gehirn conference in Wrzburg (PuG 2013; http://tagung.dgpa.de/).
3 Methods
Ten healthy participants (mean age 25 SD=2,8, all students) were included. Each participant was
seated in a comfortable chair for the duration of the study (see Figure 2) with a speller stimulation
monitor 0.5 to 1 meter distance in front. The second monitor, placed aside, displayed the Brain Painting
canvas. A custom 14 electrodes EEG cap (Debener, Minow, Emkes, Gandras, & Vos, 2012) was used
for EEG recording. The cap is a low-budget combination of a customer grade EEG amplifier with
passive EEG electrodes commonly used in lab experiments.
Teaching someone how to use the Brain Painting is the first step for its use. Bearing in mind
independent home use of Brain Painting, we created a tutorial video that would ensure a complete
understanding of Brain PaintingV2 without the experimenter being present. The tutorial video lasted 29
min and presented all Brain PaintingV2 functions available in the matrix, by displaying their effect on
the painting Canvas (see Figure 1). A female speaker uses a mouse cursor to navigate in the matrices
and select available functions while commenting every action. Brain PaintingV2 features were
summarized in 12 different categories such as colors, lines or text and presented point per point.
Article ID 008-2
DOI:10.3217/978-3-85125-378-8-8
Calibration trials comprised 15 stimulations of each rows and columns. Stimulated symbols were
overlaid during 62.5ms with a picture of Einstein. The inter stimulus interval was set to 125ms. The
user was instructed to focus on one symbol and count how many times the overlaying Einstein face was
visible. After 10 trials, classifier weights were adjusted using a stepwise-linear discriminant analysis
for 800 ms post-stimulus. The calibration process returned the minimum number of sequences required
to reach 100% spelling accuracy. To ensure high accuracy the number of sequences returned was
increased by 2; and maintained throughout copy and free painting.
During the Copy Painting task participants were asked to select target indicated on the matrix.
Each and every selection had an effect on the canvas. If a wrong selection was made, the feedback
instructed the user to revert the error using the undo function within the matrix. Accuracy was
retrieved during this Copy Painting task, and reported the number of correct selections divided by the
number of total selections.
In the Free Painting mode, participants were given 15 to 60 minutes to compose two paintings of
their own with the new Brain PaintingV2. The experimenter did not provide any hint or help concerning
the painting interface during the Free Painting task nor in the Copy Painting task. Time spent on Free
Painting was recorded
Participants were given visual analogue scales (VAS) at three different time points during the
session: after the tutorial (t1), the Copy Painting task (t2) and the Free Painting task (t3). They were
asked to rate, on a scale from 0 to 10, how well they understood each of the 13 Brain Painting features
we distinguished.
After the Free Painting task, users were asked to answer on a VAS from 0 to 10, how satisfied and
how frustrated they were during the Free Painting session. Participants also filled in the NASA-TLX to
indicate the workload of the Brain Painting V2 (0 for no workload to 100 for maximum workload).
Subject
Nb.
Seq.
Accuracy
Work
load
Satisfaction
Frustration
A
B
C
D
E
F
G
H
I
J
Mean
SD
4
4
5
5
5
5
6
4
5
5
4.8
.63
.97
.95
.95
1
1
.91
.90
.87
.97
.84
.94
.05
19
66
75
54
n/a
59
40
74
20
60
52
21
7.7
5.1
3.7
9.6
9.4
6.2
7.1
4.7
5.8
7
6.6
1.9
.9
6.5
4
0
1.2
4.5
2.5
8.1
4.8
.8
3.3
2.7
Free
Painting
duration
32
23
68
60
60
51
58
53
52
55
51
12
Selection
/
minute
3.3
3.6
2.9
2.8
2.9
2.9
2.5
3.3
2.9
2.8
3
.31
Evaluation (t3)
9.9
8.6
6.8
9.2
1
6.7
8.4
8.5
8.7
7.9
8.5
1.1
Table 1: Individual results indicating parameters, performance and evaluation results. Mean and standard
deviation are mentioned at the bottom of the table. Nb. Seq. states for number of sequences.
4 Results
Participants achieved mean accuracy of 94% (see Table 1 for results of all participants) during the
Copy Painting task. Participants spelled with an average of 4.8 sequences which allowed an average of
Article ID 008-3
DOI:10.3217/978-3-85125-378-8-8
3 selections per minute. According to the VAS participants understood well the Brain Painting features
with an average rating of 9 after the tutorial video (t1), 9 after the Copy Painting task (t2) and 8.5 after
the Free Painting task (t3). During the Free Painting task, participants painted during on average 51
minutes. In the VAS applied after Free Painting, participants reported being satisfied with their Free
Painting session (M=6.6) and reported low to average frustration (M=3.3). The total workload according
to the NASA TLX was M= 52.
5 Discussion
Firstly, our results indicate that the auto-calibration worked fine for all subjects. Further, the tutorial
video was very effective because participants reported good understanding of the Brain Painting
features. Accuracy was high during Copy Painting corroborating the functionality of Brain Painting V2.
After more than 1 hour of sustained use, participants reported average to high satisfaction and low
frustration for a medium workload. That indicated good efficiency and high overall satisfaction (Kbler,
et al., Manuscript submitted for publication). They spent on average 51 minutes with Free Painting
although the experimenter only asked for at least 15 minutes, meaning that participants truly enjoyed
Brain Painting. The low-budget EEG cap renders the system even more applicable. It is now safe to
provide our locked-in patients with the Brain Painting V2, and investigate whether they will benefit
from the practical outcomes of the enhancements of Brain PaintingV2.
Acknowlegments
The study was funded by the European Community for research, technological development and
demonstration activities under the Seventh Framework Programme (FP7, 2007-13), project grant
agreement number 288566 (BackHome). This paper reflects only the authors views and funding
agencies are not liable for any use that may be made of the information contained herein.
References
Debener, S., Minow, F., Emkes, R., Gandras, K., & Vos, M. (2012). How about taking a low-cost, small,
and wireless EEG for a walk? Psychophysiology, S. 1617-1621.
Holz, E. M., Botrel, L., Kaufmann, T., & Kbler, A. (under review). A. Long-term independent braincomputer interface home use improves quality of life of a patient in the locked-in state: A case
study. Archives PMR.
Kaufmann, T., Schulz, S. M., Kblitz, A., Renner, G., Wessig, C., & Kbler, A. (2013). Face stimuli
effectively prevent braincomputer interface inefficiency in patients with neurodegenerative
disease. Clinical Neurophysiology, pp. 893-900.
Kbler, A., Holz, E. M., Kaufmann, T., Zickler, C., Kleih, S. C., Staiger-Slzer, P., . . . Mattia, D.
(Manuscript submitted for publication). The User-Centered Design in Brain-Computer
Interface (BCI) research: A novel perspective for evaluating the usability of BCI- controlled
applications. Journal of Neurorehabilitation and Neural Repair.
Mninger, J. I., Halder, S., Kleih, S. C., Furdea, A., Raco, V., Hsle, A., & Kbler, A. (2010). Brain
Painting: first evaluation of a new braincomputer interface application with ALS-patients and
healthy volunteers. Frontiers in neuroscience, S. 4.
Zickler, C., Halder, S., Kleih, S. C., Herbert, C., & Kbler, A. (2013). Brain Painting: Usability testing
according to the user-centered design in end users with severe motor paralysis. Artif Intell
Med., 99-110.
Article ID 008-4
DOI:10.3217/978-3-85125-378-8-9
This paper presents a framework for the development of cognitive skills. In particular,
we focus on the definition and provision of cognitive rehabilitation tasks to people with
neurological disorders returning at home after a discharge. Those tasks will be executed
through a Brain-Neural Computer Interface system and will consist of desktop-based serious games. The framework and the implemented games are part of the BackHome project
and are currently under testing at the end-users facilities, in Belfast and W
urzburg.
Introduction
Article ID 009-1
DOI:10.3217/978-3-85125-378-8-9
services provided by the BackHome platform. Moreover, the implemented games are fully
integrated in the telemonitoring and home support system developed within the project [7].
Method
Working directly with service providers for people with traumatic brain injury, a framework
for the development of cognitive skills has been created against which cognitive tasks will be
mapped. The adopted framework as well as the implemented games have been defined together
with Occupational Therapists and Speech and Language Therapists. Initially, two focus groups
were held (N= 8; N=58) to identify the user requirements for the overall system. A research
collaboration was then formed with Specialist neurological Occupational Therapists and Speech
and Language therapist who have worked with us over the past year to develop and review the
various prototype of the CR application. The therapists have meet with the researchers on
three occasions to date in the lab (N=10; N=9; N=3). The technical developers integrated
the therapists recommendation into the system for the therapists to review the next iteration
of the CR prototype. The system development has been iterative and incremental to enable
reflections with the therapists prior to changes in the system becoming permanent.
As shown in Figure 1, the domains emerging with the framework reflect various levels of cognitive complexity: perception, attention and concentration, memory, and executive functions.
The sequential approach is important, as problems in lower levels will impact on the ability of
the person to advance in their cognitive competencies. The framework provides a backdrop to
CR tasks performed using a BNCI system. It also provides a base for the future development
of validated CR interventions by therapists with expertise in this area.
Currently, two serious games have been defined and implemented: find-a-category and memorycard. In this paper, we presents the memory-card game that is aimed at enhancing memory
skills. The game starts showing on the P300 interface all the cards covered (see Figure 2), the
user chooses which card turning out and then the second one looking for its pair. If the pair is
Article ID 009-2
DOI:10.3217/978-3-85125-378-8-9
not found, the cards are covered again. On the contrary, the game follows with the remaining
cards. The game ends when all the pairs have been found.
Dierent levels of difficulty have been defined, depending on the ability of the user. In
particular, difficulty increases by augmenting the number of available cards -i.e., the dimension
of the P300 matrix (level 1, 8 cards; level 2, 12 cards; and level 3, 20 cards)- and the complexity
of the adopted images (level 1, shapes; level 2, fruits; and level 3, animals). Figure 3-a shows an
example for each of the given levels (for the sake of clarity, the P300 matrix has been omitted).
Results are then shown to the user in numerical and graphical view (see Figure 3-b). The
numbers of corrected moves with respect to the total moves represents the score. The score
for the last 8 played games (a line for each level) is presented. Moreover, the number of moves
together with the elapsed time are shown and some statistics given. All the results are also sent
to the therapist.
Figure 3: A screenshot of memory-card game for each given level (a) and the results display
(b).
Discussion
In this qualitative research a user centered approach has been adopted. User centered design is
a process of engagement with end users that adopts a range of methods to place the end of user
of a technology at the center of the design process in terms of development and evaluation.
It is beyond the scope of our work to develop, validate, and illustrate clinical outcomes.
Nonetheless, therapists are able to interact with users in real time, assigning game sessions
depending on the level of disability, thanks to a therapist station2 . Then, they monitor the
use and outcomes of the CR tasks to attain therapeutic results. In fact, the ability for the
therapist to plan, schedule, telemonitor and personalize the prescription of CR tasks using a
therapist station will facilitate that the user performs those tasks inside her therapeutic range
(i.e. motivating and supporting her progress), in order to help to attain beneficial therapeutic
results.
The second-year-prototype of BackHome integrates all the implemented games as well as
the therapist station. That prototype is currently under testing by the end-users of the project,
2 https://station.backhome-fp7.eu:8443/BackHome
Article ID 009-3
DOI:10.3217/978-3-85125-378-8-9
Acknowledgments
The research leading to these results has received funding from the European Communitys,
Seventh Framework Programme FP7/2007-2013, BackHome project grant agreement n. 288566.
References
[1] David Braddock, Mary C Rizzolo, Micah Thompson, and Rodney Bell. Emerging technologies and
cognitive disability. Journal of Special Education Technology, 19(4), 2004.
[2] Keith D Cicerone, Cynthia Dahlberg, Kathleen Kalmar, Donna M Langenbahn, James F Malec,
Thomas F Bergquist, Thomas Felicetti, Joseph T Giacino, J Preston Harley, Douglas E Harrington,
et al. Evidence-based cognitive rehabilitation: recommendations for clinical practice. Archives of
physical medicine and rehabilitation, 81(12):15961615, 2000.
[3] J. Daly, E. Armstrong, F. Miralles, E. Vargiu, G.R. M
uller-Putz, C. Hintermller, C. Guger, A. Kuebler, and S. Martin. Backhome: Brain-neural-computer interfaces on track to home. In RAatE 2012
- Recent Advances in Assistive Technology & Engineering. 2012.
[4] P. Rego, P.M. Moreira, and L.P. Reis. Serious games for rehabilitation: A survey and a classification towards a taxonomy. In Information Systems and Technologies (CISTI), 2010 5th Iberian
Conference on, pages 16, June 2010.
[5] Marcia J Scherer. Assistive technologies and other supports for people with brain impairment.
Springer Publishing Company, 2011.
[6] David J Schretlen and Anne M Shapiro. A quantitative review of the eects of traumatic brain
injury on cognitive functioning. International Review of Psychiatry, 15(4):341349, 2003.
[7] Eloisa Vargiu, Juan Manuel Fern
andez, Sergi Torrellas, Stefan Dauwalder, Marc Sol`
a, and Felip
Miralles. A sensor-based telemonitoring and home support system to improve quality of life through
bnci. In Assistive Technology: From Research to Practice, AAATE 2013. Encarnao, P., Azevedo,
L., Gelderblom, G.J., Newell, A., Mathiassen, N.-E. (Eds.), 2013.
Article ID 009-4
DOI:10.3217/978-3-85125-378-8-10
Abstract
Performance variation is one of the critical issues to be resolved in brain-computer
interface field. Subjects exhibit different performances from session to session and even
across trials. To overcome this issue, three strategies have been commonly proposed,
including extraction of robust features, adaptation methods, and monitoring and
rejection of bad trials. In this work, we suggest a new multi-classifier strategy using trial
data shuffling. This strategy generates different classifiers according to various noise
states. Our proposed strategy showed an improvement in classification performance of
approximately 3 percent, and a trial-wise quality measure facilitated to monitor bad
trials. This seems a promising method to improve the reliability of the BCI system.
1 Introduction
Brain-Computer Interface (BCI) suffers from performance variation. Within subjects, the nonstationary properties of brain signals are considered to be a major cause of this problem. Existing
strategies can be categorized as follows: 1) extraction of robust features (Cho et al., 2012), 2)
adaptation of feature or classifier (Krusienski et al., 2011; Shenoy et al., 2006; Satti et al., 2010), and
3) monitoring and rejecting bad trials (Ferrez and del R Millan, 2008). Brain signals yield both
meaningful (which is used to decode a users intention or thought) and non-relevant information
(background noise). We focused primarily on the first type of information under the assumption that
meaningful information is the same over trials. Therefore, most studies have assumed that features are
extracted well and reflect task-related information. In practice, however, noise varies over time; this
variation influences the overall signal property, which degrades the performance of the feature
extractor. Thus, noise information should be considered carefully. Moreover, a user may change the
way s/he conducts a mental task or may not be in a proper mental state. Therefore, these trials may
violate the assumption above, thus calling into question the use of the methods in Strategies 1 and 2.
Grosse-Wentrup and Schlkopf (2012) demonstrated that the prediction of poor mental state, in which
a user is not likely to generate a motor imagery-related signal, is possible before the user begins the
imagination task. It is remarkable that BCI can predict a users condition in advance; therefore, the
system enables us to deal more readily with the potentially unreliable state of a user. In this study, we
propose a new multi-state driven (MSD) method that deals with the problem of fluctuations in BCI.
The concept of this MSD strategy is described in Section 2. Materials, evaluation and results are
presented in the subsequent sections. Finally, we discuss related ideas and our future work.
*
Corresponding author, scjun@gist.ac.kr: Research supported by National Research Foundation of Korea (Grant No. NRF2013R1A1A2009029).
Article ID 010-1
DOI:10.3217/978-3-85125-378-8-10
Article ID 010-2
DOI:10.3217/978-3-85125-378-8-10
using invariant Common Spatio-Spectral Pattern (iCSSP) and FLDA, as in (Cho et al., 2012). This
classifier was applied in the online experiment. Subjects conducted 150 trials for two conditions (75
trials for each) over 3 runs. A feedback trial similar to the trials in the offline experiment was designed.
However, in the feedback trials, we gave a classified result and moved the ball in that direction.
2.3 Evaluation
The purpose of this study was to evaluate our proposed strategy and compare it to the conventional
strategy that uses whole trials for one classifier. For this evaluation, we fixed informative intervals as
8-30 Hz and 0.4-2.4 sec. for spectral and temporal filtering (Ahn et al., 2012). The evaluation was
conducted through in-phase and phase to phase performances. To estimate performance in the
calibration/feedback phase, we first applied cross-validation to check in-phase performance. Whole
trials were divided into 10 groups and 7 groups were chosen as a training set, while the remaining 3
groups were used as a testing set. CSP and FLDA were applied, such that 120 iterations yielded 120
accuracy estimates; we assigned the mean accuracy to conventional (CV) performance. We also
estimated MSD accuracy. In 10 groups of trials, we chose nine groups as training data and the
remaining group was used as testing. From 9 groups in the training set, 7 groups were selected to
generate a classifier; therefore, we were able to construct an MSD decoder consisting of 36 classifiers.
This decoder evaluated each trial in the test data, and the process was repeated through 10-fold
validation; thereafter, a trial-wise discriminant value was estimated and we calculated MSD accuracy.
Finally, we obtained both CV and MSD accuracy for each calibration and feedback phase.
Conventional phase-to-phase performance is designed so that a classifier constructed by calibration
data evaluates the trials in the feedback phase. We applied this conventional method for our
calibration and feedback data through CSP and FLDA. In addition, MSD was implemented and
applied. The MSD decoder constructed from calibration data was applied to discriminate the trials in
the feedback phase.
MSD produces trial-wise discriminability from classifiers, which facilitates the evaluation of trial
quality. It is possible to reject a bad trial that falls beneath a certain threshold. This was evaluated for
phase to phase performance. The MSD decoder constructed using calibration data evaluated trials in
the feedback phase and bad trials were rejected. For this evaluation, we used a 50% threshold, which
means that the MSD decoder identified a certain class that received at least more than one evaluation.
However, we applied different criteria to examine how the accuracy changed. Based on this criterion,
we accepted trials showing higher probability than the threshold and the hit rate was calculated.
Table 1: Performance (%) comparison between conventional approach (CV) and MSD
Datasets
In-phase (Calibration)
In-phase (Feedback)
85.0
87.5
92.8
95.8
86.6
90.8
80.6
84.2
88.5
89.2
80.9
85.0
89.4
91.7
81.8
86.7
60.2
60.0
60.8
61.7
75.3
78.3
85.7
87.5
80.6
83.2
10.5
11.3
0.00097656 (***)
72.9
80.7
58.9
62.0
65.8
71.4
94.1
95.3
80.5
83.3
84.8
87.3
76.5
79.3
62.9
66.0
51.6
51.3
61.7
62.0
87.1
87.3
94.7
94.0
74.3
76.7
14.3
14.0
0.0068359 (*)
A1
A2
A3
B1
B2
B3
C1
C2
C3
D1
D2
D3
Mean
Standard Deviation
p value (*:<.01, **:<.005, ***:<.001)
CV
MSD
CV
MSD
Phase to phase
CV
MSD
75.3
78.7
75.3
75.3
70.7
75.7
70.0
71.3
80.7
80.7
69.3
75.3
77.3
81.3
58.0
61.3
51.3
56.0
50.0
51.3
72.0
76.0
92.0
93.3
70.2
73.0
12.1
11.7
0.0019531 (**)
Article ID 010-3
DOI:10.3217/978-3-85125-378-8-10
Figure 1: Accuracy (left) and accepted trials (right) in % over varying threshold levels (50% to 100%).
Each thin, gray line denotes the results for each dataset and the thick, blue line indicates the mean.
References
Ahn, M., Hong, J., & Jun, S. (2012). Feasibility of approaches combining sensor and source features
in braincomputer interface. Journal of Neuroscience Methods, 204(1), 168 - 178.
Cho, H., Ahn, M., Sangtae, A., & Jun, S. (2012). Invariant Common Spatio-Spectral Patterns. 3rd
TOBI Workshop, (pp. 31-32).
Ferrez, P., & del R Millan, J. (2008). Error-related EEG potentials generated during simulated braincomputer interaction. IEEE transactions on bio-medical engineering, 55(3), 923-929.
Grosse-Wentrup, M., & Schlkopf, B. (2012). High gamma-power predicts performance in
sensorimotor-rhythm brain-computer interfaces. Journal of Neural engineering, 9(4), 046001.
Krusienski D. J., Grosse-Wentrup M., Galan F., Coyle D., Miller K. J., Forney E., Anderson C. W.
(2011) Critical issues in state-of-the-art brain-computer interface signal processing. Journal
of Neural engineering, 8(2), 025002.
Lotte, F., Congedo, M., Lcuyer, A., Lamarche, F., & Arnaldi, B. (2007). A review of classification
algorithms for EEG-based braincomputer interfaces. Journal of Neural Engineering, R1.
Satti, A., Guan, C., Coyle, D., Prasad, G. (2010) A Covariate Shift Minimisation Method to Alleviate
Non-stationarity Effects for an Adaptive Brain-Computer Interface. 20th International
Conference on Pattern Recognition, (pp. 105108).
Shenoy, P., Krauledat, M., Blankertz, B., Rao, R., & Mller, K.-R. (2006). Towards adaptive
classification for BCI. Journal of neural engineering, R13-23.
Article ID 010-4
DOI:10.3217/978-3-85125-378-8-11
{anne-marie.brouwer,maarten.hogervorst,jan.vanerp}@tno.nl,
manjebrinkhuis@gmail.com, boris@cortext.nl,
Abstract
The P300 event-related potential (ERP) can be used to infer whether an observer is
looking at a target or not. Common practice in P300 BCIs and experiments is that
observers are asked to fixate their eyes while stimuli are presented. First studies have
shown that it is also possible to distinguish between target and non-target fixations on
the basis of single ERPs in the case that eye movements are made, and ERPs are
synchronized to fixations (fixation-related potentials or FRPs) rather than to stimulus
onset. However, in these studies small object sizes ensured that participants could only
identify whether the object was a target or non-target after fixating on it. We here
compare (non-)target FRPs when objects are identified before versus after fixation. We
also examine ERPs from static eyes conditions. FRP shapes are in accordance with the
notion that the late component of the P300 is associated with identifying a target, and
eye movements do not substantially affect the P300. Even when the time of object
identification is unknown, it is possible to distinguish between target and non-target
FRPs on a single FRP basis. These results are important for fundamental science and for
developing applications to covertly monitor observers interests.
1 Introduction
The P300 event related potential (ERP) occurs 300-500 ms after a stimulus has been presented that
draws attention, either through bottom-up mechanism as in odd-ball paradigms or through conscious
guidance, as made use of in P300 BCIs. In P300 experiments and BCI paradigms, participants are
usually asked not to move their eyes around the time that the P300 is expected to occur, in order to
avoid eye movement artefacts in the EEG. It is however expected that the same kinds of processes and
ERPs occur when observers actively sample their visual environment themselves by making eye
movements. In this case, one would need to examine the EEG traces relative to self-paced fixation
onset (fixation-related potential, or FRP). There are only few studies that demonstrate that fixationrelated P300s are elicited by top-down determined target identification (Kamienkowski et al., 2012;
Brouwer et al., 2013). These studies control for potentially confounding factors that (may) differ
systematically between target and non-target fixations, such as fixation duration, preceding saccade
length and low-level visual properties of the objects of interest. Brouwer et al. (2013) showed that
target and non-target FRPs could be distinguished above chance on a single FRP level. In the
Article ID 011-1
DOI:10.3217/978-3-85125-378-8-11
experiments by Kamienkowski et al. and Brouwer et al., small object size ensured that participants
could only identify the object (target or non-target) after fixating it. However, in real life, targets can
also be identified parafoveally (in the corner of the eyes), prior to fixation on the target. ERP
experiments in which participants fixated a fixation cross demonstrated that target stimuli presented in
the periphery also elicit P300s. We expect P300 FRPs to occur earlier when targets are identified
before than after fixation. In the current study, we examine FRPs when a target has been identified
before fixating it (large target) and after (small target). We also include a condition in which the eyes
do not move in order to examine whether planning (or inhibiting) a saccade affects the P300 latency
and amplitude. Our main interest is in whether we can still distinguish between targets and non-targets
at a single FRP level in the case that time of target identification relative to fixation is unknown. This
would be required if one is interested in monitoring the interest of naturally behaving observers.
2 Methods
EEG-electrodes were placed at Fz, Pz, Cz, POz, Oz, P3, P4, PO7 and PO8 with a ground electrode
at FPz. The EEG electrodes were referenced to linked mastoid electrodes. Four electrodes were used
for EOG recording: two at the outer canthi left and right; one above and one below the left eye. Both
horizontal and vertical EOG-electrodes were referenced to each other. Data were recorded at 256 Hz
and filtered online using a 0.1 Hz high-pass, a 100 Hz low-pass and a 50 Hz notch filter.
Stimuli were shown on a 20 tft-screen with a refresh rate of 60 Hz and at a viewing distance of 60
cm. They were circular patches with a black center and white surround, or the other way around. One
type was designated as target, the other as non-target. Center and surround had an equally sized
surface area. They were shown against a gray background in one of four locations (top-left, top-right,
bottom-left, or bottom-right) at 12 of visual angle from a fixation cross, located in the center. The
stimuli were sized either 4 or 0.25 in diameter, so that only the large stimuli could be identified
when gaze was at the fixation cross, whereas an eye movement was required for the small stimuli. All
patches were surrounded by a black square frame which had thickness of two pixels, allowing the
participants to easily detect the location of the (small) stimulus.
Twelve observers participated. During the task, the fixation cross was present continuously. One
after the other, the stimuli were presented for 1000 ms in one of the four corners (random order) with
an inter-stimulus interval of 750 ms. The participants were asked to count targets. After every 21
trials, the participant indicated the number of counted targets through the keyboard.
There were three conditions: one in which participants were asked to keep fixating the cross and
that contained only large stimuli (static-large); one in which they were asked to make an eye
movement to the stimulus as soon as it appeared that contained only large stimuli (saccade-large); and
the same one but with small stimuli (saccade-small). Each condition contained 336 trials, 112 of
which were target presentations. Targets and non-targets were presented in random order. Trials from
the two saccade conditions were presented in random order within one saccade block. Half of the
participants started with the saccade block, the other with the static fixation block. Type of target
(black center or white center) was also counterbalanced within these two participant groups.
To determine fixation onset in the saccade condition, we detected the peak EOG eye movement
speed after stimulus onset and set eye fixation onset at the end of the saccade related peak. For some
analyses, we identified and removed eye movement artefacts from the EEG using ICA (Jung et al.,
2000). Independent components that reflected EM activity were selected manually by comparing
components with the original EOG data. For eleven participants the first two components and for one
the first three components were identified to reflect EM and removed from the original data.
For each participant, we estimated whether ERPs (running from stimulus onset until 500 ms later)
and FRPs (from fixation onset until 500 ms later) could be correctly classified as associated with a
Article ID 011-2
DOI:10.3217/978-3-85125-378-8-11
target or non-target. These estimates were produced using a five-fold cross-validation procedure. We
used the same classification pipeline as described in Brouwer et al. (2013). In short, an L2-regularized
logistic regression classifier was trained on (9 electrodes * 129 samples) 1161 dimensional feature
vectors. All adjustable parameters were optimized independently of the test sets. The classification
procedure was performed on 8 different sets of data as indicated in Table 1. For ERPs, we determined
classification performance for data from the static large condition. For FRPs, we determined
classification performance for data from the two saccade conditions separately as well as together. All
sets of data were examined with and without eye movement artifacts removed. We used Equal Error
Rate as a measure of classification performance since this measure is independent of the percentage of
misses (targets classified as non-targets) and false alarms (non-targets classified as targets). It reflects
the proportion of errors for the case that the percentage of false alarms and misses are equal.
3 Results
Figure 1 shows the average target- and non-target ERPs and FRPs in the most relevant comparison
conditions for electrode POz (where effects tended to be clearest, consistent with Brouwer et al.,
2013), after removing the eye movement artefacts. The shaded areas represent p-values lower than
0.01 (dark) or 0.05 (light) as indicated by paired sample t-tests performed on every time sample of
target and non-target values. As expected, ERPs (i.e. stimulus locked traces) and especially the target
minus non-target difference trace are similar, regardless of eye movements being made. Also as
expected, the difference peak of FRPs (i.e., fixation locked traces) occurs earlier when objects could
be identified in the periphery (large object) compared to when not (small objects).
Figure 1: ERPs (left) and FRPs (right) for target (blue) and non-target (red) traces in different conditions,
averaged over participants for electrode POz. Dotted curves indicate the difference wave.
Table 1 shows the results of the ERP and FRP classification analysis, separately for each
condition, for the two saccade conditions grouped, and both with and without eye movement artefacts
removed. Single trial classification is possible above chance in all cases as indicated by Wilcoxon
signed rank tests on 0.5 (chance level) minus Equal Error Rates per participant (all eight p-values
<0.01). On average, the lowest error rates are achieved when eyes are static and EEG traces are locked
to stimulus onset, while the highest error rates are observed for the most difficult case, i.e. traces
Article ID 011-3
DOI:10.3217/978-3-85125-378-8-11
locked to fixation onset and unknown time of object identification (large and small stimuli mixed).
Removing eye movement artefacts does not increase error rate, confirming the idea that distinction
between targets and non-targets is based on brain rather than eye signals.
Eye artefacts removed
Raw traces
Static large
0.290.06 [0.20-0.39]
0.290.06 [0.21-0.38]
Sacc large
0.350.08 [0.15-0.42]
0.350.06 [0.23-0.43]
Sacc small
0.310.06 [0.17-0.38]
0.400.05 [0.30-0.48]
Sacc small & large
0.370.06 [0.19-0.44]
0.410.04 [0.32-0.46]
Table 1: Equal Error Rates averaged across participants the standard deviation. Between brackets are the
minimum and maximum Equal Error Rates as observed across participants. Chance level is at 0.5.
Stim locked
Fix locked
4 Discussion
We found that we can still distinguish between target and non-target FRPs in the case that time of
target identification relative to fixation is unknown. This is important when one is interested to use
FRPs to monitor an observers interest since for most visual exploration tasks, time of target
identification is unknown. Equal Error Rates may seem high at first sight in comparison to other P300
studies. However, one has to note that these results reflect single trial classification and are based on
data with a relatively large proportion of targets which is expected to result in relatively low P300
amplitudes.
Since attention and eye movements are intimately connected (Kowler, 2011) one might have
expected that planning or inhibiting saccades would interfere with P300 latency and amplitude.
However, we found that the static-large and saccade-large FRPs are very similar.
As expected, when objects are identified before fixation, the FRP P300 difference peak between
targets and non-targets occurs earlier compared to identification after fixation. The FRPs in the
saccade-small condition nicely reflect parafoveal and foveal visual processing. The first peak (after a
peak that corresponds with a presaccadic spike potential) is similar for target and non-target traces, as
well as for the traces in the saccadic-large condition. This peak probably reflects the detection of a
relevant visual event in the periphery, i.e. stimulus onset. Then, there is a peak only for targets which
is delayed in the case of small target size since targets can only be identified after an eye movement
has been executed. The target trace of saccade-small may show us the P3a and P3b (Polich, 2007)
more separated in time than is usually the case in experimental settings.
References
Brouwer, A-M., Reuderink, B., Vincent, J., van Gerven, M.A.J., van Erp, J.B.F. (2013).
Distinguishing between target and nontarget fixations in a visual search task using fixationrelated potentials. Journal of Vision, 13(3):17, 110.
Jung, T.P., Makeig, S., Humphries, C., Lee, T.W., McKeown, M.J., Iragui, V., Sejnowski, T.J. (2000).
Removing
electroencephalographic
artifacts
by
blind
source
separation.
Psychophysiology,37, 163-78.
Kamienkowski, J.E., Ison, M.J., Quiroga, R.Q., Sigman, M. (2012). Fixation-related potentials in
visual search: A combined EEG and eye tracking study. Journal of Vision, 12(7):4, 1-20.
Kowler, E. (2011). Eye movements: the past 25 years. Vision research, 51(13), 1457-83.
Polich, J. (2007). Updating P300: an integrative theory of P3a and P3b. Clinical neurophysiology,
118(10), 2128-48.
Article ID 011-4
DOI:10.3217/978-3-85125-378-8-12
Introduction
Brain-computer interfaces (BCIs) can provide a means of communication for severely paralyzed
persons. Many potential users of BCIs lose gaze control due to progressive neurological disorders
or acquired brain injuries. Thus, these users can benefit from BCIs that are controlled without
visual stimulation.
The P300 event-related potential (ERP) is a component of the electroencephalogram (EEG)
that can be used for controlling BCIs in the visual and also in the auditory domain. Controlling
a P300 BCI requires a focus of attention on one of several stimuli that are presented in a random
order. When selecting symbols with an auditory P300 BCI one can use this method to first
select the row in the letter matrix and then the column. How easy this is for the user depends
on the auditory stimuli. If they dier too much some stimuli may involuntarily attract the
attention of the user. If they are very similar the user may not be able to dierentiate the
stimuli, especially if presentation times are short. Our first implementation of an auditory
P300 BCI used spoken numbers as stimuli [1, 11]. These can be dierentiated but require a
long presentation time. Ideally, for a P300 BCI the sound can be dierentiated with the onset
of the stimulus. Other early implementations also needed long letter selection times and did not
achieve high accuracies [9]. Our second implementation, inspired by [16], used artificial tones
with additional spatial information [7]. This enabled faster presentation of the stimuli but they
were difficult to discriminate for some users. In a refined version animal sounds were shown
to be much better stimuli for auditory P300 BCIs [14]. They require only short presentation
times but can still be easily discriminated. Additionally, using groups of alike sounding animals,
stimuli sets that are similar but still dierentiable can be created. Most recently, training was
shown to lead to substantial improvements in communication speed with healthy participants
[2]. So far multi-choice auditory BCIs have not been successfully used by a person with severe
motor impairments [11, 17]. In this study we proceed to show for the first time that training
enables severely motor impaired users to control an auditory P300 BCI.
Article ID 012-1
2
2.1
DOI:10.3217/978-3-85125-378-8-12
Methods
Participants
Five users with severe motor impairments took part in the study. User 1 (male, 70 years old)
was diagnosed with muscle dystrophy (ALS-functional rating scale revised (ALS-FRSR) score
40; range 0-48), User 2 (male, 56 years) was diagnosed with diuse brain damage due to hypoxia
(ALS-FRSR 18), User 3 (female, 45 years) was diagnosed with multiple sclerosis (ALS-FRSR
24), User 4 (female, 53 years) was diagnosed with ALS (ALS-FRSR 25) and User 5 (male, 73
years) was also diagnosed with ALS (ALS-FRSR 35).
2.2
Procedure
Each of the participants performed five separate sessions on dierent days with the auditory
P300 BCI system. Each session consisted of spelling two words with five symbols (two times
AGMSY) for calibration of the classifier (stepwise linear discriminant analysis; which was retrained for every session) and five words with five symbols (chosen to make the stimuli needed
for selection equally distributed) for feedback (VARIO, GRUEN, RUBIO, TUMBI, PHLEX).
There was a pause of twelve seconds (to give the user enough time to between letter selections.
Between rows and columns there was a pause of two seconds. Each stimulus was presented for
187.5 ms with 250 ms between stimuli. During training each stimulus was presented ten times.
For the five words spelled with feedback stimulus repetitions was adjusted individually to the
number of sequences needed to reach 70% plus three (to prevent ceiling eects).
The stimuli used in this study were a duck, bird, frog, gull and pigeon sound appearing to
originate from the left, middle left, front, middle right and right side of the participant (via
simulation of direction using stereo headphones). Each sound codes for a row and a column in
a matrix with 5x5 symbols (the letters A-Y). The user attends to one of the sounds to select
the row, then after a short pause to one of the sounds to select the column.
2.3
Data acquisition
EEG was recorded with a g.tec g.USBamp EEG amplifier with a bandpass from 0.1 to 30 Hz,
notchfilter at 50 Hz and 256 Hz sampling rate. Sixteen g.gamma electrodes were positioned at
FC3, FCz, FC4, C3, Cz, C4, CP5, CPz, CP6, P3, Pz, P4, Po5, Poz, Po6 and Oz.
BCI2000 was used for controlling all aspects of stimulus presentation, signal processing and data
recording [15]. Recordings were made on a Hewlett-Packard ProBook 6460b with a dual-core
CPU, 4 GB of RAM and a 64-bit Windows 7.
2.4
Signal processing
Stepwise linear discriminant analysis was used for online classification of the data. Information
transfer rates (ITRs) were calculated with the formula suggested by Wolpaw [19].
Results
Average ITR increased from 0.17 bits/min to 3.09 bits/min for all users. Note that these values
include the two users that did not learn to control the BCI (User 2 and User 3). The three
users that learned to control the BCI increased their ITRs from 0.15 bits/min to 5.12 bits/min.
Article ID 012-2
DOI:10.3217/978-3-85125-378-8-12
Symbol selection accuracies increased from 11.2% to 52.8% for all users and from 9.3% to 84%
for the three users that learned to control the BCI.
Discussion
Three out of five users learned to control the BCI through training. None of the users were
able to control the BCI in the first session. This shows that the training is not only a reliable
method to increase ITR but mandatory for users with severe motor impairments.
Compared to the data from previous implementations of auditory P300 BCI spellers the extent
of the performance increase becomes apparent. In the initial implementation by Furdea et al.
[1] the ITR with healthy participants was on average 1.54 bits/min. The same implementation
was evaluated with patients and none of the four participants achieved accuracies above 50%,
which would be needed to transfer information [11]. A first implementation with spatial cues
and artificial tones lead to an increase in ITR to 2.76 bits/min when using artificial tones [7]
and to 4.23 bits/min when using animal sounds [14]. Finally, we were able to show that training
leads to an ITR of 5.59 bits/min in healthy participants [2], an increase to 360% compared to
2009. In this paper we were able to show that motor impaired users can reach 3.09 bits/min
(this includes two users that did not learn to control the BCI) which is an ITR unprecedented
by motor impaired users with auditory P300 BCIs.
Unrelated to BCIs it has been shown before that training can increase the amplitude of ERPs
[10, 18]. We believe that the increase in BCI performance can be attributed to similar eects.
Users 2 and 3 did not learn to control the BCI. This may be related to the diagnosis: user
2 was diagnosed with diuse brain damage due to hypoxia and user 3 with multiple sclerosis.
Both may have an impact on the generation of the ERPS needed for BCI control. This was
particularly evident for user 3 who did exhibit not any stimulus locked responses, also to
auditory oddballs.
The data shown here underlines again that no BCI works for every user. This is especially
the case with BCIs that have high attentional demands, such as auditory spellers, which also
a large percentage of healthy controls fails to use [3]. Using binary choice auditory BCIs may
be a viable alternative [5, 4, 6, 13, 12]. Nonetheless, it cannot be concluded from this data
that the presented auditory BCI will enable users who have previously been unable to use an
auditory BCI will now be able to do so [11, 8, 17]. The diversity of the epidemiologies that lead
to such severe motor impairments that only communication with non-visual BCIs is possible
will always require an individualized solution.
Acknowledgements
The study was funded by the European Community for research, technological development
and demonstration activities under the Seventh Framework Programme (FP7, 2007-13), project
grant agreement number 288566 (BackHome). This paper reflects only the authors views and
funding agencies are not liable for any use that may be made of the information contained
herein.
References
[1] A. Furdea, S. Halder, D. J. Krusienski, D. Bross, F. Nijboer, N. Birbaumer, and A. K
ubler. An auditory oddball (P300) spelling system for brain-computer interfaces. Psychophysiology, 46(3):617
25, May 2009.
Article ID 012-3
DOI:10.3217/978-3-85125-378-8-12
Article ID 012-4
DOI:10.3217/978-3-85125-378-8-13
Abstract
This paper describes the inclusion of temporal signal information in the process of
Empirical Mode Decomposition (EMD) applied to the electroencephalogram (EEG). Key
information in the classification of motor imagery EEG is very frequency centric but also
contains strong temporal cues due to the nature of the experimental paradigm. Taking
the temporal dynamics of the EEG into account, multiple snapshots of the signal in time
are input into a multi-variate EMD process. Features were created from the processed
signal using Common Spatial Patterns and these were input to a Support Vector Machine
for classification. The results showed that the added temporal dynamics gave no major
improvement to the sensitivity or specificity compared to regular EMD.
Introduction
Motor Imagery (MI) works by using the changes in brain activity over the sensorimotor cortex
during imagined localised limb movements [5]. These changes include the suppression of the
rhythm on the contralateral hemisphere. The rhythm is limited to a frequency band of 8-13
Hz, predominantly around 10 Hz. There is also weaker resonance activity in the beta bands
around 20 Hz. During the commencement of MI there will be a large decrease in potential
followed immediately by a large increase and then a return to pre-MI levels in the contralateral hemisphere as the limb being imagined moved. The lateral hemisphere will see a similar
eect but to a much lesser degree, so called Event Related Desynchronisation/Event Related
Synchronisation (ERD/ERS).
Empirical Mode Decomposition (EMD) is a process that can decompose non-stationary
and non-linear signals into a group of frequency harmonics called Intrinsic Mode Functions
(IMFs) and residual signal noise [2]. This makes it readily applicable to electroencephalogram
(EEG) recordings. In this paper we work with the specific Brain-Computer Interface (BCI) [5]
paradigm of MI. EMD has already been shown to be a useful signal processing technique for
MI [1]. Here we introduce a new version of EMD that uses Takens theorem [?] to incorporate
temporal data into the EMD sifting process. This new algorithm will be tested in classifying
MI trials, compared to standard EMD.
Any continuous single-spatial channel signal can be converted into a multi-temporal channel
signal using Takens theorem as described in reference [3]
x(t) = (x(t
), x(t
2 ), ..., x(t
(m
1) ) 2 R,
where is the lag and m is the embedding dimension. A similar method was used to apply
Independent Component Analysis (ICA) to a single EEG channel [3]. The embedding dimension
Article ID 013-1
DOI:10.3217/978-3-85125-378-8-13
needs to be greater than 2D + 1, where D is the number of signal sources. From [3] we set the
dimension size of m = 30 and = 1.
2
2.1
Methodology
Dataset
The EEG dataset contains 90 MI trials from 11 dierent subjects. Each trial is 8.2 seconds long,
consisting of 4.1 seconds neutral activity, a stimulus cue that is randomly either left or right,
and 4.1 seconds of sustained MI. The recordings were made with a 64-channel EEG running
BCI2000 software with a sampling frequency of 160 Hz [6]. The electrodes selected for analysis
were FC3, FC4, C5, C3, Cz, C4, C6, CP3, CP4, T7 and T8 according to the 10-10 system and
were determined to cover the motor cortex. This is the same electrode selection as previous
EMD studies [4].
2.2
Feature Extraction
EMD works by applying an iterative sifting process to the data, where the mean envelope
of each channel is subtracted repeatedly until the data has been decomposed into a group of
oscillations of varying frequencies that are symmetrical with respect to the x axis. However,
Park et al [4] came up with an enhanced version that uses spatial information to help form
the envelopes, causing each channels IMFs to be equal in number to the rest and occupy the
same frequency bands, making them consistent across all channels and far easier to analyse,
this is called Multi-variate EMD (MEMD) and was shown to perform well compared to existing
standard methods such as wavelet transforms and the application of a Butterworth filter.
MEMD works by calculating the mean envelope of n-directional signals, instead of just one
signal in isolation, by projecting the signal along dierent directions in n-variate spaces and
then averaging the projections to get the local mean. Low discrepancy Hammersley sequences
are used to obtain quasi-uniform points on high dimensional spheres to form the projection
vectors. In this case instead of applying MEMD to EEG data of multiple channels, we use
dynamical embedding to create a multi-temporal signal and apply MEMD to that. The aim of
this is to use temporal data to form more accurate and consistent IMFs from a signal channel
using the temporal data, this is called Temporal MEMD (T-MEMD). Channels FC3, FC4, C5,
C3, Cz, C4, C6, CP3, CP4, T7 and T8 according to the 10-10 system were selected for analysis.
The resulting IMFs for each trial, for either EMD or T-MEMD, are un-embedded and
then selected or discarded based on two dierent methods, one knowledge-based and one
performance-based. The knowledge-based method attempts to select relevant IMFs based on
previous knowledge of their features. A Fast-Fourier Transform (FFT) is calculated for the
duration of the signal that was recorded after the cue as it contains the MI. IMFs with at
least 5% of their total power between 8-13 Hz (the rhythm) are considered to have possible
MI relevant data and are kept. The performance-based method uses a brute force method to
identify the combination of IMFs that gives the best result per subject in terms of the number
of trials correctly classified. As with this dataset the number of IMFs per channel never exceeds
13 and the IMFs containing useful information never exceeds 5 in number, it means all possible
combinations of IMFs (maximum number of combinations: 2379) can be tested in a practical
amount of time.
The selected IMFs are summed to form the processed signal and features are created for
each trial using Common Spatial Patterns (CSP). CSPs calculate a set of spatial filters that
Article ID 013-2
DOI:10.3217/978-3-85125-378-8-13
Figure 1: This figure shows the spectrograms for the average processed signal with the waveform itself
superimposed in purple. The plots are for electrodes C3 and C4, for the classes think Left and think
Right respectively for the EEG data of User 11. The first row is regular EMD and the bottom row is
T-MEMD with the stimulus occurring exactly half way across the x-axis. In this case, T-MEMD has
created clearer ERD/ERS peaks.
Method
EMD
knowledge-based
T-MEMD
knowledge-based
EMD
brute force
T-MEMD
brute force
53.3
50.3
49.2 10.6
46.5
38.7 24.9
52.1
28.9 30.6
35.6
37.8
53.2
49.0 11.6
37.5
44.2 28.2
45.6
30.1 29.3
73.3
75.6
68.9
68.9
5.3
69.0
53.2 16.2
68.4
50.6 15.2
62.2
71.1
70.7
69.6
6.8
65.0
46.5 20.3
66.8
51.9 14.4
Table 1: The sensitivity, specificity and standard deviation of each EMD and T-MEMD method for a
single dataset applied to all channels, the average sensitivity, specificity and standard deviation of each
EMD and T-MEMD method for all datasets applied to all channels and applied to a single channel,
C3 and C4 respectively.
maximise the variances of one class and minimises them in the other [7]. The features are then
input to a Support Vector Machine (SVM). As there were only 90 trials per user it was decided
to use the Leave One Out method (LOO). None of the CSP features or brute force classifier
made use of the test trials.
EMD and T-MEMD both achieved very similar performance with the dierence between the
two being well within their respective standard deviations in every case. The brute force method
Article ID 013-3
DOI:10.3217/978-3-85125-378-8-13
returned significantly higher performance than the knowledge based method for both decomposition methods. Whilst the IMFs chosen with the brute force method were not consistent
between users or decomposition methods, they did focus on the frequency bands known to
contain MI information that were identified in the introduction (Figure 1). A single channel
analysis of the high-performing brute force method was carried out to further see if embedding temporal data added anything to the EMD process. As CSPs need multiple channels to
function the variance of each processed trial was used as the input of the SVM.
As Table 1 shows, there is negligible dierence in performance indicating that the added
temporal dynamics do not contain any new information. In part, this may be due to the
fact that the underlying processes for MI aect all channels, and whilst ERD/ERS and the
rhythm are expected to be stronger on one side of the brain versus the other, the changes still
occur simultaneously for both hemispheres. This implies that the information content for MI
is inherent in the lateralisation of the changes - i.e. spatially. In [3] adding temporal dynamics
to ICA had a greater impact than with EMD because ICA contains zero temporal information,
whilst EMD still uses the data laid out in chronological order. EMD can also only discard
background noise if it is unstructured due to its criteria for identifying IMFs.
Conclusion
Ultimately the added temporal dynamics did not significantly improve the classification performance. However it might still have some eect on performance if applied to an EEG signal
with significant temporally independent features, which is not the case in MI. Another way to
enhance the method could be to incorporate both spatial and temporal information by decomposing all channels in their multi-temporal form simultaneously using MEMD.
References
[1] S.R.H. Davies and C.J. James. Novel use of empirical mode decomposition in single-trial classification of motor imagery for use in brain-computer interfaces. In Engineering in Medicine and Biology
Soc. (EMBC), 2013 35th Annual Intl. Conf. of the IEEE, pages 56105613, July 2013.
[2] Norden E. Huang, Zheng Shen, Steven R. Long, Manli C. Wu, Hsing H. Shih, Quanan Zheng,
Nai-Chyuan Yen, Chi Chao Tung, and Henry H. Liu. The empirical mode decomposition and the
hilbert spectrum for nonlinear and non-stationary time series analysis. Proc. of the Royal Society
of London. Series A: Mathematical, Physical and Engineering Sciences, 454(1971):903995, 1998.
[3] C. James and D. Lowe. Using dynamical embedding to isolate seizure components in the ictal EEG.
Science, Measurement and Technology, IEE Proceedings -, 147(6):315320, Nov 2000.
[4] Cheolsoo Park, D. Looney, N. ur Rehman, A. Ahrabian, and D.P. Mandic. Classification of motor
imagery bci using multivariate empirical mode decomposition. Neural Systems and Rehabilitation
Engineering, IEEE Transactions on, 21(1):1022, Jan 2013.
[5] G. Pfurtscheller, C. Brunner, A. Schlgl, and F.H. Lopes da Silva. Mu rhythm (de)synchronization
and EEG single-trial classification of dierent motor imagery tasks. NeuroImage, 31(1):153 159,
2006.
[6] Gerwin Schalk, Dennis J. Mcfarl, Thilo Hinterberger, Niels Birbaumer, and Jonathan R. Wolpaw.
Bci2000: A general-purpose brain-computer interface (bci) system. IEEE Transactions on Biomedical Engineering, 51, 2004.
[7] Zhisong Wang, Alexander Maier, Nikos K. Logothetis, and Hualou Liang. Single-trial classification
of bistable perception by integrating empirical mode decomposition, clustering, and support vector
machine. EURASIP J. Adv. Sig. Proc., 2008.
Article ID 013-4
DOI:10.3217/978-3-85125-378-8-14
Signals measured during brain-computer interface (BCI) tasks are nonstationary, which
can lead to classification errors. The error-related potential (ErrP) has been proposed for
BCI error detection as well as partially-supervised classifier adaptation. We discuss how
the ErrP can be incorporated into several adaptive classification methods, and the unique
sensitivity that these methods have to misidentification of the ErrP. We find that the risk
associated with these methods varies as a function of false positive rate for a realistic ErrP
detector receiver operating characteristic and we recommend individualized biasing of the
ErrP detector to account for these eects.
Introduction
Adaptive classification of brain-computer interfaces (BCIs) can be used to address the inherent
non-stationarity of EEG data during mental tasks such as motor imagery. Class labels typically
used for classifier adaptation are not available in a true BCI session, thus unsupervised adaptation has been employed as an alternative to supervised adaptation [9]. However, unsupervised
methods may not be suitable when the nonstationarity aects relative class positions.
A potential compromise is to use error-related potentials (ErrPs) to generate labels for
partially-supervised adaptation. Adaptation using such uncertain labels has been proposed by
[5, 10], and [7] adapted an SVM classifier in the context of a code-modulated visual evoked
potential speller, with benefits for participants. However, the validity of the labels depend
on the accuracy of the ErrP detector, with some correct trials inevitably being interpreted as
incorrect, and some incorrect trials being interpreted as correct. There is limited discussion on
the risk associated with adaptation using incorrect labels and what methods are most suitable
in this situation. To this end, we evaluate two adaptation methods across several ErrP detection
accuracies. We assume stationary performance of the ErrP detector after the results of [2], but
note that risks would increase with a nonstationary assumption.
The performance of motor imagery classifiers is dependent on choice of frequency band,
and the authors of [8] showed that the most discriminative frequency changes from session
to session. Therefore we build a classifier based on the filter bank common spatial pattern
(FBCSP) [1] framework that uses a majority weighted vote from linear discriminant analysis
(LDA)-based classifiers in each FBCSP band. This framework allows us to adapt both the
ensemble weights and the base LDA classifiers separately or concurrently to either re-weight
individual frequency components or change the decision boundaries in each band. In this study
we evaluate the consequences of incrementally adapting these two components of the classifier
at several accuracies of the ErrP detector.
Article ID 014-1
DOI:10.3217/978-3-85125-378-8-14
Methods
Data are taken from dataset IIb of the IVth BCI competition [4], which is comprised of 9
participants performing 5 sessions of left and right hand motor imagery, with 120-160 trials in
each session. EEG is recorded with three bipolar electrodes above C3, Cz, C4 at 250 Hz with
a 50 Hz notch filter. We epoch the data and apply a zero-phase filter bank of 4 Hz pass-band
non-overlapping filters from 4 40 Hz. In each band we extract CSP features from a 2 s window
starting 1.5 s post cue. The first and last CSP features are used, which results in two features
from each filter band.
We train LDA classifiers on the features from each band and combine their decisions using
a weighted majority vote. Data from session 1 are used for training and the remaining sessions
are used for testing. Initial weights are determined using a 10-fold cross-validation evaluating
Cohens kappa from each base classifier and normalizing the results. Then, the base LDA
classifiers are retrained using all the data from session 1.
To simulate ErrPs with realistic detection accuracies, we estimated the values of the receiver
operating characteristic (ROC) curves from the online ErrP detectors found in [6] and evaluated
several points along the median curve, which appear in Table 1. Using these false positive rates
() and true positive rates, we simulated ErrPs for each trial in sessions 2-4 as in [10].
We adapt both the base LDA classifiers (denoted BaseAdapt) and the weights of the
ensemble (denoted Reweight) incrementally after every trial. Our estimate of the true class,
y 2 {0, 1}, is derived from the classifiers output on the current trial, y 2 {0, 1}, and our belief
2 {0, 1} (i.e. we detect an ErrP). Thus, y is incorrect whenever
that an error occurred, E
E is incorrect. BaseAdapt updates the class means and global covariance according to the
supervised LDA classifier in [9]; the Pmean-Gcov unsupervised adaptive classifier from this
group is included for comparison. The learning parameter, , is set to 0.05. The Reweight
to implement a variant of the dynamic weighted majority of [3] that decrements
strategy uses E
the weight of incorrect experts by a factor of 0.9 and increments correct experts by a factor of
1.1. No experts are removed or created. We simulate the performance of these two adaptation
strategies, as well as their combination (denoted Hybrid), 50 times with independent randomly
on each repetition.
generated E
False Positive Rate ()
True Positive Rate
0.05
0.70
0.10
0.79
0.15
0.87
0.20
0.93
0.25
0.93
0.30
0.95
0.35
0.95
0.40
0.98
Table 1: False positive and corresponding true positive rates of the ErrP detector.
Results
The average classification accuracies across all 4 evaluation sessions and all participants are
shown in Fig. 1a for each adaptation strategy. We see that across most values of , on average the semi-supervised adaptation improved the accuracies over the case of no adaptation
(horizontal line). Errorbar length indicates 2 standard deviations (2 ) of all 50 simulation
repetitions averaged across participant and session. , shown as a function of in Fig. 1b,
gives a quantitative measure of the risk associated with each adaptive method. With increasing
, there is increasing risk that the adaptation could be detrimental instead of beneficial. In
general, Fig. 1 indicates that the accuracy of the BaseAdapt method decreases with increasing
, while increases. Even at low , it performs no better than its unsupervised counterpart.
Article ID 014-2
DOI:10.3217/978-3-85125-378-8-14
The Reweight method has optimal accuracy and minimal at 0.2. The Hybrid method
obtains the best optimal average classification accuracies, but it also has the highest across
most values of ; this is likely because it combines variability from both methods.
3.5
A
70
65
Static
PmeanGcov
BaseAdapt
Reweight
Hybrid
60
55
0.05
0.1
0.35
75
2.5
1.5
0.4
BaseAdapt
Reweight
Hybrid
0.05
0.1
0.35
0.4
95
Static
PmeanGcov
BaseAdapt
Reweight
Hybrid
90
85
80
75
70
65
60
55
0.35
0.3
0.25
0.2
0.15
0.1
0.05
50
45
01
02
03
04
05
06
07
08
09
B
0.4
Best false positive rate,
100
01
02
03
04
05
06
07
Participants
08
09
BaseAdapt
Reweight
Adaptation Type
Hybrid
Figure 2: a. Barplot of average classification accuracies (across session) at the best false positive
rate for each participant. Error bars indicate of 50 simulations averaged across session. b.
Best false positive rate chosen for each participant (refer to legend) for each adaptation method.
Article ID 014-3
DOI:10.3217/978-3-85125-378-8-14
Discussion
We find that the variance, and thus the risk associated with employing these adaptation methods
changes across in a manner unique to each method. The adaptation of the LDA base classifiers
has lower risk at low , while the re-weighting method has lower risk when and true positive
rate are balanced. The hybrid method combines the risk from both methods such that, although
the average performance is the highest, it also has a high variance for most .
LDA may be more sensitive to high because when a false positive occurs, the error is due
to a sample more likely to be further from the adapted class mean than in the case of a false
negative. This drives classes closer together, which can lead to erratic movement of the class
boundary as discussed in [10]. The Reweight method may not favor a single error type as such.
We found that, averaged across all participants, re-weighting the ensemble had the lowest
risk. However, we find that the adaptation method with the best performance is unique to
individuals, so that participant specific adaptation methods may be required. A few participants appear to benefit much more from the Reweight method compared to the BaseAdapt
method. This may be due to particularly strong shifts in the most discriminative frequency for
these participants. However, re-weighting may not be helpful for individuals with stationary
discriminative frequency. These findings also suggest that ErrP detectors should be biased to
particular depending on the adaptation method chosen and the individual.
Future work should attempt to reduce the risk associated with adaptation using potentially
uncertain labels. This may be achieved by combining the detection of an ErrP with the confidence of the BCI task classifier, or using a fuzzy classifier where weights of training samples
are determined by the ErrP strength on individual trials.
References
[1] K. K. Ang, Z. Y. Chin, H. Zhang, and C. Guan. Filter bank common spatial pattern (FBCSP) in
brain-computer interface. In IEEE IJCNN, pages 23902397. IEEE, 2008.
[2] P. W. Ferrez and J. R. Mill
an. Error-related EEG potentials generated during simulated braincomputer interaction. IEEE Transactions on Biomedical Engineering, 55(3):923 929, 2008.
[3] J. Z. Kolter and M. A. Maloof. Dynamic weighted majority: An ensemble method for drifting
concepts. J Mach Learn Res, 8(12):2755 2790, 2007.
[4] R. Leeb, F. Lee, C. Keinrath, R. Scherer, H. Bischof, and G. Pfurtscheller. Braincomputer
communication: motivation, aim, and impact of exploring a virtual apartment. IEEE T Neur Sys
Reh, 15(4):473482, 2007.
[5] A. Llera, M. van Gerven, V. G
omez, O. Jensen, and H. Kappen. On the use of interaction error
potentials for adaptive brain computer interfaces. Neural Networks, 24(10):11201127, 2011.
[6] N. M. Schmidt, B. Blankertz, and M. S. Treder. Online detection of error-related potentials boosts
the performance of mental typewriters. BMC Neurosci, 13:19, Feb. 2012.
[7] M. Sp
uler, W. Rosenstiel, and M. Bogdan. Online adaptation of a c-VEP brain-computer interface(BCI) based on error-related potentials and unsupervised learning. PLOS ONE, 7(12):e51077,
Dec. 2012.
[8] K. P. Thomas, C. Guan, C. T. Lau, A. P. Vinod, and K. K. Ang. Adaptive tracking of discriminative frequency components in electroencephalograms for a robust braincomputer interface. J
Neural Eng, 8(3):036007, 2011.
[9] C. Vidaurre, M. Kawanabe, P. von B
unau, B. Blankertz, and K. R. M
uller. Toward unsupervised
adaptation of LDA for braincomputer interfaces. IEEE T Biomed Eng, 58(3):587597, 2011.
[10] T. J. Zeyl and T. Chau. A case study of linear classifiers adapted using imperfect labels derived
from human event-related potentials. Pattern Recogn Lett, 37(0):5462, Feb. 2014.
Article ID 014-4
DOI:10.3217/978-3-85125-378-8-15
1 Introduction
One of the main goals of the project BackHome is to advance existing BCI systems such that
they can be used as assistive technology in a home environment and operated without the need of
technical skills. Based on previous work (Faller, et al., 2013), we implemented a BCI system that
provides end users with a simple graphical interface and several applications for communication,
creativity, cognitive rehabilitation, entertainment and smart home control. Thereby we aim at
facilitating activities of daily living for persons with severe paralysis and improving the users social
integration. From the beginning of the project, we engaged with potential end-users to gather users
needs and feedback on the functionalities of a first prototype. This user-centered approach (Kbler,
Holz, & Kaufmann, 2013) informed the development of the current BCI prototype that allows users to
control the following applications: a multimedia player and web browser (see Halder et al., in
preparation), a speller, Brain Painting for creative expression (Mninger et al., 2010), a TV, a
webcam and two games (see Vargiu et al., this conference). The first game (find-a-category) aims at
improving the semantic and reasoning skills of the participant. A certain category is presented and
users have to identify the picture that matches the category among four alternatives (see Figure 1A).
The level of difficulty is varied by the level of abstraction needed to identify the correct picture. The
second game (pairs) is an implementation of the memory-cards game, in which users have to find a
pair of matching cards. In the beginning, all cards are depicted face down. Each card is assigned a
*
Article ID 015-1
DOI:10.3217/978-3-85125-378-8-15
number and users can uncover the cards by choosing the corresponding number in the control matrix.
The game is finished once all pairs are found. The level of difficulty is varied by the number of cards
and the level of complexity of the depicted pictures/symbols shown. At the end of each game, the
results are shown in a numerical and graphical display. The cognitive rehabilitation games can be
assigned to specific users via a web application (therapist station). The sessions can then be loaded by
the users by choosing the session symbol from the main games menu. All applications are
controlled with a P300 system based on control matrices that were first proposed for a spelling system
(Farwell & Donchin, 1988).
2 Methods
2.1 Framework of Evaluation
The evaluation of the BCI Prototype followed the framework for evaluation suggested by Zickler
et al. (2011). It is based on ISO recommendations for the evaluation of usability (ISO-9241-210) and
requires the estimation of effectiveness and efficiency of the system and of user satisfaction.
Effectiveness is operationalized as the accuracy of symbol/command selections in percent correct.
Efficiency is estimated through the calculation of the information transfer rate using the formula
suggested by Wolpaw et al. (2002) and the NASA-Task Load Index (NASA-TLX) as an indicator of
the subjective workload. Satisfaction was assessed with an extended version of the Quebec User
Evaluation of Satisfaction with assistive Technology (extended QUEST 2.0, Demers, Weiss-Lambrou
& Ska, 2002), a visual analogue scale (VAS, ranging from 0=not at all satisfied to 10=completely
satisifed) and a custom usability questionnaire concerning the design of the system.
Figure 1. Exemplary question of the category game (A) and matrix used to control the application (B). Users
choose a number to indicate the right answer. By focusing on the particular icon, they can also switch to a
different application. In Figure B the first row is overlaid with pictures of faces (in this publication they are
pixelated due to copyright). Figure C shows a user in front of the two screens during BCI operation.
Article ID 015-2
DOI:10.3217/978-3-85125-378-8-15
Users were asked to complete an experimental protocol that included writing ten letters with the
spelling application, playing the two games (minimum of 5 and 8 selections required), writing and
posting a short twitter message (18 selections) and controlling a webcam (3 selections). To navigate
the menus and switch between applications 7 selections were required. To make a certain selection
users were asked to focus on the symbol they should select and silently count the number of times it
was highlighted. During the screening run, used to acquire data for the classifier, users had to select 5
letters from a 3x3 matrix. Each row and column flashed for a total 15 times during the screening run.
After feature weights had been determined, the number of flashes was adaptive and varied for every
selection (dynamic stopping). Rows and columns were flashed with pictures of famous faces to
improve classification accuracies (Kaufmann et al., 2013). During all tasks (except spelling), users
could correct mistakes. If, with three attempts, users were unable to select a certain symbol, it was
selected by the investigator, who then indicated the next selection. After completing the BCI tasks
users were asked to fill in the questionnaires.
3 Results
All users had control over the BCI and reached high accuracies (>85%) with four of the five
applications. The average accuracies for the different tasks are depicted in Figure 2. Accuracy for
menu navigations was 72%. Due to the varying number of stimulus repetitions needed for one
selection and different matrix sizes, bitrates can only be estimated for individual tasks and
participants. Highest bitrates were achieved with the web browser (Twitter), with bitrates ranging
from 11-30 bits/minute.
It is more important to note that 98% of symbols could be selected, with a maximum of three
attempts for one selection. Subjective workload was indicated with the NASA-TLX as moderate
(score of 51.8414.34; highest possible score is 100).
The overall satisfaction score assessed with a VAS was 7.481.37 (range 6-10). The global score of
the QUEST 2.0 was also high (4.330.59), indicating that users were quite satisfied with the
system. The score included items concerning the dimensions, weight, adjustment, safety, comfort,
ease of use, effectiveness of the system and professional services (instructions). The average score for
the BCI specific items reliability, speed, learnability and aesthetic design was lower, but still high
(4.050.68). While users criticized the aesthetic design of the electrode cap (not suited for everyday
life, clinical/scientific look, necessity of gel, restricting cables), they did not name it as one of
the most important aspects. Instead, they stressed the importance of effectiveness, ease of use and
learnability. Generally, users were satisfied with the high selection accuracies, but stated that the
reliability of the software should be improved (it had to be restarted occasionally). Further, some users
Article ID 015-3
DOI:10.3217/978-3-85125-378-8-15
suggested to improve the design of the software that they criticized as pixelated. Several users
remarked positively on the many different applications that the system offers.
4 Discussion
With the proposed P300 BCI system study participants could switch between and control multiple
applications ranging from entertainment to smart home control. The web browser allows to gain
access to internet based communication (e.g. via social media platforms). The evaluation revealed
some aspects of the system that could be improved to increase its usability. This includes
improvements of the hardware, such as less conspicuous EEG equipment as well as improvements of
the software (e.g. design of the user interface and stability). Some symbols were particularly difficult
to select (e.g. for the camera task), probably because they differed in size. We will continue to
evaluate the system with end users and, in an iterative process, implement new features to increase
user friendliness of the system to ultimately reach our goal of implementing a system that can be
employed in daily life to reduce social exclusion and increase independence of the end users.
Acknowledgments
This work was funded by the FP7 EU research project BackHome (No. 288566). This paper
reflects only the authors views and funding agencies are not liable for any use that may be made of
the information contained herein.
References
Demers, L., Weiss-Lambrou, R., & Ska, B. (2002). Quebec user evaluation of satisfaction with
assistive technology (QUEST 2.0): An overview and recent progress. Technology and
Disability, 14, pp. 101-105.
Faller, J., Torrellas, S., Costa, U., Opisso, E., Fernndez, J. M., Kapeller, C., . . . Mller-Putz, G.
(2013). Autonomy and Social Inclusion for the Severely Disabled: The BrainAble Prototype.
Proc. of the 5th Int. BCI Meeting 2013, (pp. 372-3). Asilomar, CA, USA.
Farwell, L., & Donchin, E. (1988). Talking off the top of your head: toward a mental prosthesis
utilizing event-related brain potentials. Electroen Clin Neuro, 70(6), pp. 510-23.
Kaufmann, T., Schulz, S., Kblitz, A., Renner, G., Wessig, C., & Kbler, A. (2013). Face stimuli
effectively prevent brain-computer interface inefficiency in patients with neurodegenerative
disease, 124(5). Clin Neurophysiol, 893-900.
Kbler, A., Holz, E., & Kaufmann, T. (2013). A user centred approach for bringing BCI controlled
applications to end-users. In R. Fazel-Rezai, Brain-Computer Interface Systems - Recent
Progress and Future Prospects (pp. 1-19). Rijeka, Croatia: InTech.
Mninger, J. I., Halder, S., Kleih, S. C., Furdea, A., Raco, V., Hsle, A., & Kbler, A. (2010). Brain
Painting: First Evaluation of anew Brain-Computer Interface Application with ALS-Patients
and Healthy Volunteers. Frontiers in Neuroscience, 4:182, doi:10.3389/fnins.2010.00182.
Wolpaw, J., Birbaumer, N., McFarland, D. J., Pfurtscheller, G., & Vaughan, T. M. (2002). Braincomputer interfaces for communication and control. Clin Neurophysiol, 113(6), pp. 767-91.
Zickler, C., Riccio, A., Leotta, F., Hillian-Tress, S., Halder, S., Holz, E., . . . Kbler, A. (2011). A
Brain-Computer Interface as Input Channel for a Standard Assistive Technology Software.
Clinical EEG and Neuroscience, 42(2), pp. 236-244.
Article ID 015-4
DOI:10.3217/978-3-85125-378-8-16
Ozan Ozdenizci
, Timm Meyer2 , M
ujdat C
etin1 and Moritz Grosse-Wentrup2
1
Introduction
While initially conceived as communication devices for severely paralyzed patients, braincomputer interfaces (BCIs) have recently been considered in the context of post-stroke motor
rehabilitation [1]. Here, BCIs are employed to synchronize movement intent, as decoded by a
BCI from sensori-motor rhythms, with congruent haptic feedback, e.g., as delivered by an exoskeleton [2]. This artificial re-establishment of the sensorimotor feedback loop has been shown
to support modulation of sensorimotor-rhythms [3] and result in enhanced post-stroke recovery
[4]. In this paper, we argue that, motivated by the impressive results achieved to date, the
current focus should be extended beyond sensori-motor- to also include associative brain areas.
This argument is based on empirical evidence that a variety of brain rhythms beyond those in
primary sensorimotor areas are related to the extent of post-stroke impairment. For instance,
the global ratio of - to -power of the brains electromagnetic field has been found to correlate
with the extent of post-stroke disability and predict subsequent recovery [5]. We interpret such
abnormal activation patterns as disturbances of the balance of large scale cortical networks [6],
and argue that re-establishing their natural balance by means of BCI-based neurofeedback is
likely to support the brain in post-stroke recovery.
In order to turn this hypothesis into a viable stroke rehabilitation protocol, several interrelated problems need to be addressed. Firstly, we need to identify which large scale networks
are involved in sensori-motor learning in healthy control subjects. Next, we have to investigate how activation patterns of and between these networks are disturbed in stroke patients,
and elucidate how these disturbances relate to post-stroke recovery. And thirdly, we need to
train patients via neurofeedback to establish activation patterns of large scale networks that
are associated with good post-stroke recovery.
In the present work, we address the viability of the last issue. In a recent publication, we have
identified EEG correlates of motor-learning performance [7]. In particular, we have provided
evidence that parieto-occipital -power during rest as well as during movement preparation
predicts performance in a subsequent reaching movement. This led us to hypothesize that
parieto-occipital -power reflects activity in a cortical network that is tuned by the brain to
optimize motor-learning performance. Here, we propose an adaptive neurofeedback training
Article ID 016-1
DOI:10.3217/978-3-85125-378-8-16
scheme to modulate parieto-occipital -power in a way that we predict will enhance motorlearning performance. Based on experimental data from five healthy subjects we argue that,
firstly, subjects are able to modulate parieto-occipital -power by means of neurofeedback,
and, secondly, that this modulation is not task-specific but extends into the inter-trial restingperiods. Taken together, these results indicate the feasibility of using neurofeedback to support
subjects in generating brain activation patterns that are associated with good motor-learning
performance. This constitutes an important building block towards a stroke therapy based on
neurofeedback training of associative brain areas.
2
2.1
Methods
Experimental Data
Five healthy subjects were recruited for this study, each of which completed two training sessions. Each session lasted one hour with one week interval between sessions. Prior to the
training sessions all participants gave their informed consent after the training procedure was
explained to them in accordance with guidelines set by the Max Planck Society. During the
training sessions, a 120-channel EEG was recorded at 1 kHz sampling rate, using active EEG
electrodes and a QuickAmp amplifier (BrainProducts, Gilching, Germany). Electrodes were
placed according to the 10-20 system, with Cz as the initial reference electrode. All data was
re-referenced to common average reference.
2.2
Each training session included one resting-state baseline and eight training blocks with one
minute breaks in between. For the resting-state baseline, subjects were instructed to relax
for five minutes with eyes open, looking at a fixation cross displayed centrally on a screen
approximately 1.5 m in front of the subjects. This resting-state data was used to calibrate the
feedback for the subsequent training session. Specifically, the 120-dimensional raw EEG data
x[t] was first spatially filtered by a filter w to obtain a one-dimensional signal y[t] = wT x[t].
The spatial filter w was taken from a group-wise independent component analysis (ICA) of a
previous study [7], in which the corresponding independent components -power was found to
predict performance in a motor-learning task. Figure 1a displays the source localization results
obtained for this independent component in parieto-occipital areas (adapted from [7]). Next,
log-bandpower of the spatially filtered signal was computed in the -band (8-14 Hz), using a
FFT in conjunction with a Hanning window, in a sliding window of 2 s with a window step
size of 100 ms. This signal processing pipeline was used to compute the mean and standard
(a)
(b)
Article ID 016-2
DOI:10.3217/978-3-85125-378-8-16
2.3
Following the last training session, the data of all subjects and sessions was visually inspected
for contamination by ocular artifacts. The data of one session of one subject was discarded,
as it was heavily contaminated by eye blinks. To assess the overall eect of neurofeedback
training, we then pooled the data of all subjects and sessions. Specifically, we computed the
correlation of trial-number, ranging from one to 120, with parieto-occipital -power, averaged
across subjects and sessions, in each trial. In this way, we estimated the linear trend in parietooccipital -power across a training session. We also computed this correlation for -power at
each individual recording channel in order to investigate the topography of -power modulation.
Finally, we computed the same correlation only using the EEG data of the resting-periods prior
to each trial.
(a)
(b)
(c)
Figure 2: a) Grand average of trial -log-bandpower values (red is the linear fit). b) Topography
of grand average log-bandpower correlations with trial numbers within a session of 120 trials in
-band (for trial phases). c) Same topography in (b) obtained for rest phases.
Article ID 016-3
DOI:10.3217/978-3-85125-378-8-16
Results
Discussion
In this pilot study, we presented an adaptive neurofeedback training paradigm that enables subjects to up-regulate parieto-occipital -power. Importantly, we could provide evidence that upregulation of -power was sustained in the inter-trial periods, indicating that subjects learned
to induce a stable state-of-mind that we predict will be beneficial for motor-learning [7]. In
future work, this training paradigm will be carried out by stroke patients prior to a rehabilitation session. We hypothesize that this will be beneficial for motor-learning and hence support
post-stroke motor recovery.
References
[1] M. Grosse-Wentrup, D. Mattia, and K. Oweiss. Using braincomputer interfaces to induce neural
plasticity and restore function. Journal of Neural Engineering, 8(2):025004, 2011.
[2] M. Gomez-Rodriguez, M. Grosse-Wentrup, J. Hill, A. Gharabaghi, B. Sch
olkopf, and J. Peters.
Towards brainrobot interfaces in stroke rehabilitation. In IEEE International Conference on
Rehabilitation Robotics (ICORR), pages 16. IEEE, 2011.
[3] M. Gomez-Rodriguez, J. Peters, J. Hill, B. Sch
olkopf, A. Gharabaghi, and M. Grosse-Wentrup.
Closing the sensorimotor loop: haptic feedback facilitates decoding of motor imagery. Journal of
Neural Engineering, 8(3):036005, 2011.
[4] A. Ramos-Murguialday et al. Brainmachine interface in chronic stroke rehabilitation: A controlled
study. Annals of Neurology, 74(1):100108, 2013.
[5] S. P. Finnigan, M. Walsh, S. E. Rose, and J. B. Chalk. Quantitative EEG indices of sub-acute
ischaemic stroke correlate with clinical outcomes. Clinical Neurophysiology, 118(11):25252532,
2007.
[6] S. L. Bressler and V. Menon. Large-scale brain networks in cognition: emerging methods and
principles. Trends in Cognitive Sciences, 14(6):277290, 2010.
[7] T. Meyer, J. Peters, T. O. Zander, B. Sch
olkopf, and M. Grosse-Wentrup. Predicting motor
learning performance from electroencephalographic data. Journal of NeuroEngineering and Rehabilitation, 11:24, 2014.
[8] S. Othmer, S. F. Othmer, and D. A. Kaiser. EEG biofeedback: An emerging model for its
global ecacy. In J. Evans and A. Abarbanel, editors, Introduction to Quantitative EEG and
Neurofeedback, pages 243310. Academic Press, New York, 1999.
[9] G. Schalk, D. J. McFarland, T. Hinterberger, N. Birbaumer, and J. R. Wolpaw. BCI2000: A
general-purpose brain-computer interface (BCI) system. IEEE Transactions on Biomedical Engineering, 51(6):10341043, 2004.
[10] BCPy2000, http://bci2000.org/downloads/BCPy2000/.
Article ID 016-4
DOI:10.3217/978-3-85125-378-8-17
Introduction
Faced with rising classroom sizes and the higher incidence of learning diculties, such as ADD,
we need a tool that can improve a users ability to remember study material. Such a tool may
also be useful for addressing age-related memory decline. We propose a system that tailors
instruction to each individuals brain dynamics. The system will use neural activity reflecting
memory encoding and retrieval to choose optimal presentation times and intervals to improve
memory. This system can be considered a passive BCI where the system monitors the brain in
real-time to extract information related to memory encoding/retrieval.
We review findings on the neural correlates of long-term memory which can be used for an
EEG-based passive BCI system and give a summary of the classification results applied to
single-trial EEG data to identify neural signatures reflecting memory encoding and retrieval.
2.1
Encoding Success
There are significant dierences in the spectral and temporal patterns between remembered and
forgotten trials during study item presentation which are known as subsequent memory eects
(SMEs). The dierence in event-related potential (ERP) between subsequently remembered
and forgotten items is also known as the Dm (or dierence due to memory) eect [13, 12]. It is
observed around 400 to 800ms after study item presentation. Brain oscillations during encoding
Article ID 017-1
DOI:10.3217/978-3-85125-378-8-17
also distinguish between subsequently remembered and forgotten items (see [6] for a review).
Power decreases for the remembered items typically occur in the alpha (7-12 Hz) and low beta
(12-19 Hz) bands [7, 5] of the EEG signal.
We used single-trial classification to successfully distinguish between remembered and forgotten pictures using the temporal and spectral information in the EEG during stimulus presentation [10]. By combining the information from multiple time windows, the encoding success
classifiers achieved an overall accuracy of 58 % across all 18 subjects. The classifiers gave
accuracies significantly over chance (p < 0.05) for 9 subjects [9].
2.2
Encoding preparedness
EEG, MEG, and ECoG studies have shown that spectral dierences in brain activity immediately preceding study item presentation also show significantly dierent patterns for the subsequently remembered and forgotten trials. This dierence is observed in multiple frequency
bands ranging from theta (4-7 Hz) [4] to the high beta (19-30 Hz) [3] bands.
These good/bad (for subsequent memory retention) brain states were successfully identified
from the spectral information in the EEG signal between -300 to 0 ms before stimulus presentation of pictures in [10]. The classifier combined information from 9 overlapping subbands of
the EEG signal (4-7, 6-10, 7-12, 10-15, 12-19, 15-25, 19-30, 25-35, and 30-40 Hz). The overall
classification accuracy of the encoding preparedness classifiers across 18 subjects was 57.2 %
where 9 subjects showed significantly over chance results.
Because study items were presented at a fixed interval without awareness of the subjects
brain state, we used the 10 % of presentations with the highest and lowest classifier confidence
as analogs for the best and worst sets in a real-time system. The rate of remembered items gave
a 50 % improvement when the study items were presented at the best times compared to the
worst (59.6 % items remembered during good brain states vs. 40 % during bad brain states). For
the 10 subjects with the highest overall classification accuracy, the rate of remembered items
was a 74 % improvement when the study items were presented at the best times compared to
the worst (63.4 % items remembered during good brain states vs. 36.5 % during bad ).
2.3
Confidence at retrieval
The parietal ERP old/new eect is the dierence in ERP between correctly rejected new and
correctly detected old trials during memory retrieval. The old trials show a positive-going
deflection compared to the new trials in the left parietal channels between 500-800 ms after
stimulus presentation [14]. The frontal ERP old/new eect also distinguishes between correctly
rejected new and correctly detected old trials. The ERPs show a more negative peak for less
familiar trials in the frontal channels between 300-500 ms after stimulus presentation [2].
We used the EEG data during the recognition phase of a memory experiment and found that
it is possible to distinguish unsuccessfully from successfully retrieved studied items with 58.4%
accuracy [11] where 20 out of 34 subjects gave significantly over chance results. The likelihood
of remembering a study item for trials with the 10 % highest and lowest classifier outputs were
0.8 and 0.45 respectively suggesting that the confidence at retrieval classifier outputs reflect the
level of retrieval strength during the test phase.
Article ID 017-2
(a)
(b)
DOI:10.3217/978-3-85125-378-8-17
(c)
The three classifiers above (encoding preparedness, encoding success, confidence at retrieval)
can be incorporated into a passive BCI system to assist users in improving memory. A common
real-world learning task is paired-associate learning (e.g. word-meaning lists for students or
vocabulary learning for a second language). In this section, we propose a system that incorporates the above classifiers to improve paired-associate learning. The system will present study
items to the users following a continuous recognition paradigm where there is no distinction
between study and test phases. Items may occur multiple times in the list for restudy.
The encoding preparedness classifier monitors the users brain state in real-time to identify
optimal brain states for study item presentation. When the system identifies a near-optimal
time for presentation, it gives a scheduled study item (e.g. a target word in a foreign language)
to the user. The user responds with either New for items they have not seen before, Dont
know for items for which they do not remember the associations, or with their guess for the
previously given association (e.g. the same word in English). After the response, the user
receives feedback from the system with the correct association pair. When the user is finished
studying/restudying the given association, the next target item appears on the screen at the
next detected near-optimal time. With a first-time user, the classifiers can be initialized using
training data from other users or the study items can be given at fixed time points. These initial
classifiers can be adapted to the user in an online manner at the second (test) presentation of
each item (when the behavioral results are obtained) with the stored EEG data from memory
encoding/retrieval.
For a new study item, the lag for re-presentation is determined by the encoding success
classifier. If the likelihood of encoding success is high (low) for a given item , the item may
be re-presented with a long (short) lag. For a restudy item, the lag for re-presentation (for
further restudy) is determined by the subjects response and the confidence at retrieval classifier
output. Incorrect user responses will be re-presented with a short lag; for correct responses, the
confidence at retrieval classifier output will determine the lag duration. Spacing models can
also be incorporated to specifically determine these lags for optimal retention [1].
One disadvantage of the proposed system is that the throughput (rate of learned material
presentation) may be lower than a conventional tutoring system due to the waiting period for
good brain states. We can overcome this issue by using the good brain states to present the
most critical or dicult1 items. At intervening times, the remaining study material can be
1 Dicult items may be identified from average hit rates from multiple users or based on previous behavioral/classification results for the specific user.
Article ID 017-3
DOI:10.3217/978-3-85125-378-8-17
given to the user with a typical presentation rate [8]. As a long-term goal, we plan to explore
the eects of long term use of our system. We hypothesize that the implicit neurofeedback
users would get from being presented study items only when they are ready could help them
remain in a receptive brain state more often.
Conclusion
In this paper, we proposed a passive BCI system for assisting memory formation and retention.
Our system has three components: 1) it will infer the users preparedness for learning to
present study items at estimated optimal times; 2) it will monitor the brain activity during
learning/encoding to assess whether the encoding process was successful or not; 3) it will also
extract information related to the users confidence during re-presentation of the item to assess
the level of reinstatement. The incorporation of spacing models was also discussed to specifically
determine these lags for optimal retention. Other systems that monitor study performance or
user state could also be integrated with the proposed system.
This research was funded by NSF grants CBET-0756828, SBE-0542013 and SMA-1041755,
NIH Grant MH64812, and a KIBM Innovative Research Grant.
References
[1] N. J. Cepeda, N. Coburn, D. Rohrer, J. Wixted, M. Mozer, and H. Pashler. Optimizing distributed
practice: theoretical analysis and practical implications. Exp Psychol, 56:236246, 2009.
[2] T. Curran. Brain potentials of recollection and familiarity. Memory & Cognition, 28:92338, 2000.
[3] J. Fell, E. Ludowig, B. P. Staresina, T. Wagner, T. Kranz, C. E. Elger, and N. Axmacher. Medial
temporal theta/alpha power enhancement precedes successful memory encoding: evidence based
on intracranial EEG. Journal of Neuroscience, 31(14):53925397, 2011.
[4] S. Guderian, B. H. Schott, A. Richardson-Klavehn, and E. Duezel. Medial temporal theta state
before an event predicts episodic encoding success in humans. PNAS, 106:536570, 2009.
[5] S. Hanslmayr, B. Spitzer, and K.-H. B
auml. Brain oscillations dissociate between semantic and
nonsemantic encoding of episodic memories. Cerebral Cortex, 19(7):16311640, 2009.
[6] S. Hanslmayr and T. Staudigl. How brain oscillations form memories - a processing based perspective on oscillatory subsequent memory eects. NeuroImage, 85(2):648655, 2013.
[7] W. Klimesch, H. Schimke, M. Doppelmayr, B. Ripper, J. Schwaiger, and G. Pfurtscheller. Eventrelated desynchronization (ERD) and the Dm eect: Does alpha desynchronization during encoding predict later recall performance? Intl. J. Psychophysiology, 24:4760, 1996.
[8] R. Mayer. Applying the Science of Learning. Pearson Education, Incorporated, 2010.
[9] G. M
uller-Putz, R. Scherer, C. Brunner, R. Leeb, and G. Pfurtscheller. Better than random? A
closer look on BCI results. International Journal of Bioelectromagnetism, 10(1):5255, 2008.
[10] E. Noh, G. Herzmann, T. Curran, and V. R. de Sa. Using single-trial EEG to predict and analyze
subsequent memory. NeuroImage, 84:712723, 2014.
[11] E. Noh, M. V. Mollison, T. Curran, and V. R. de Sa. Single-trial identification of failed memory
retrieval. submitted to the 48th Asilomar Conference on Signals, Systems, and Computers, 2014.
[12] K. A. Paller, M. Kutas, and A. R. Mayes. Neural correlates of encoding in an incidental learning
paradigm. Electroencephalography and Clinical Neurophysiology, 67(4):360371, 1987.
[13] T. F. Sanquist, J. W. Rohrbaugh, K. Syndulko, and D. B. Lindsley. Electrocortical signs of levels of
processing: perceptual analysis and recognition memory. Psychophysiology, 17(6):568576, 1980.
[14] E. L. Wilding and M. D. Rugg. An event-related potential study of recognition memory with and
without retrieval of source. International Journal of Psychophysiology, 119:889905, 1996.
Article ID 017-4
DOI:10.3217/978-3-85125-378-8-18
1
The University of Tokyo, Tokyo, Japan
Life Science Center of TARA, University of Tsukuba, Tsukuba, Japan
3
RIKEN Brain Science Institute, Wako-shi, Japan
tomek@bci-lab.info & http://bci-lab.info/
Abstract
We study the extent to which contactless and airborne ultrasonic tactile display
(AUTD) stimuli delivered to the palms of a user can serve as a platform for a brain
computer interface (BCI) paradigm. Six palm positions are used to evoke combined somatosensory brain responses, in order to define a novel contact-less tactile BCI. A comparison is made with classical attached vibrotactile transducers. Experiment results of
subjects performing online experiments validate the novel BCI paradigm.
Introduction
Stateoftheart brain computer interfaces (BCIs) are typically based on mental visual or auditory paradigms, as well as motor imagery paradigms, which require extensive user training
and good eyesight or hearing. In recent years, alternative solutions have been proposed to make
use of a tactile modality [1, 2, 3] to enhance braincomputer interfacing efficiency. The concept
reported in this paper further extends the brains somatosensory channel by the application of
a contactless stimulus generated with an airborne ultrasonic tactile display (AUTD) [4]. The
rationale behind the use of the AUTD is that, due to its contactless nature, it allows for a more
hygienic application, avoiding the occurrence of skin ulcers (bedsores) in patients in a lockedin
state (LIS). This paper reports very encouraging results with AUTDbased BCI (autdBCI) in
comparison with the classical paradigm of vibrotactile transducerbased somatosensory stimulus (vtBCI) attached to the users palms [3]. The rest of the paper is organized as follows.
The next section introduces the materials and methods used in the study. The results obtained
in online experiments with 13 healthy BCI users are then discussed. Finally, conclusions are
formulated and directions for future research are outlined.
Thirteen male volunteer BCI users participated in the experiments. The users mean age was
28.54, with a standard deviation of 7.96 years. The experiments were performed at the Life
Science Center of TARA, University of Tsukuba, at the University of Tokyo and at RIKEN
Brain Science Institute, Japan. The online (real-time) EEG autdBCI and vtBCI paradigm
K.
Article ID 018-1
DOI:10.3217/978-3-85125-378-8-18
3.1.
20
3.1.
21
3.2
3.2
A user
during
249
2 = 498
the autdBCI
experiment with both palms
placed under the AUTD array
with ultrasonic transducers.
3.3
250mm
150Hz
3.2.
2.0gf
12mm
2
3.1
experiments were conducted in accordance with the WMA Declaration
of Helsinki - Ethical
Principles for Medical Research Involving Human Subjects and the procedures were approved
3.3
and designed in agreement with the ethical committee guidelines of the Faculty of Engineer2
ing, Information and Systems at University of Tsukuba, Japan. The AUTD stimulus generator
NT-9UH2PK
produced vibrotactile contactless stimulation of the human skin via the air using
focused ultra- [27]
sound [4, 5]. The eect was achieved by generating an ultrasonic radiation static force produced
2
249 2 = 498
by intense sound pressure amplitude (a nonlinear acoustic phenomenon). The radiation pres- 1
10
1
sure deformed the surface of the skin on the palms, creating a tactile
sensation.
An array of 6
= 60
50
1
ultrasonic transducers30Hz
mounted on the AUTD created
the focussed10 radiation pressure
at an
arbitrary focal point by choosing a phase
3.3 shift of each transducer appropriately (the socalled
phased
array technique). Modulated radiation pressure
1 created a sensation of
60 6tactile
= 360vibration
250mm
1cm
similar to the one delivered by classical vibrotactile
transducers attached to the users palms.
HIHX09C005-8[26]
TheHiWave
AUTD device developed
by the authors [4, 5] adhered to ultrasonic medical standards and
150Hz the permitted skin absorption levels (approximately 40 times below the limits).
did not exceed
The eective vibrotactile sensation was set to 50 Hz [5] to match with tactile skin receptors
3.1
and notch filters for power line interference rejection. As a reference, in the second vtBCI
experiment, contact vibrotactile stimuli were also applied to locations on the users palms via
the transducers HIHX09C005-8. Each transducer in the experiments was set to emit a square
acoustic frequency wave at 50 Hz, which was delivered from the ARDUINO microcontroller
board with a custom batterydriven and isolated power amplifier and software developed in
house and managed from a MAX 6 visual programming environment. The two experiment
set-ups above are presented in Figure 1. Two types of experiments were performed with the
volunteer users. Psychophysical experiments with footbuttonpress responses were conducted
in order to test uniform stimulus difficulty levels from response accuracy and time measurements. The subsequent tactile oddball online BCI EEG experiments evaluated the autdBCI
paradigm efficiency and allowed for a comparison with the classical skin contactbased vtBCI
reference. In both the above experiment protocols, the users were instructed to spell sequences
of six digits representing the stimulated positions on their palms. The training instructions
were presented visually by means of the BCI2000 [6] and MAX 6 programs with the numbers
1 6 representing the palm locations as depicted in the left panel of Figure 1. The EEG signals
were captured with an EEG amplifier system g.USBamp by g.tec Medical Engineering GmbH,
Article ID 018-2
10
60
DOI:10.3217/978-3-85125-378-8-18
Figure 2: The autdBCI (blue - targets; red - nontargets) and vtBCI (green - targets; black nontargets) grand mean averaged ERP responses, together with standard error bars. Due to
limited space, of the 16 measured, only electrodes Cz and Pz are presented.
Austria, using 16 active electrodes. The electrodes were attached to the head locations: Cz,
Pz, P3, P4, C3, C4, CP5, CP6, P1, P2, POz, C1, C2, FC1, FC2, and FCz, as in the 10/10
extended international system. The ground electrode was attached to the FPz position, and
the reference was attached to the left earlobe. No electromagnetic interference was observed
from the AUTD or vibrotactile transducers operating with frequencies notchfiltered together
with power line interference from the EEG. The EEG signals captured were processed online
with a BCI2000based application [6], using a stepwise linear discriminant analysis (SWLDA)
classifier [7] with features drawn from 0 800 ms ERP intervals decimated by a factor of 20.
The stimulus length and interstimulusinterval were set to 400 ms, and the number of epochs
to average was set to 15. The EEG recording sampling rate was set at 512 Hz, and the high
and low pass filters were set at 0.1 Hz and 60 Hz, respectively. The notch filter to remove
power line interference was set for a rejection band of 48 52 Hz. Each user performed three
experiment runs (randomized 90 targets and 450 non-targets each). As a feedback the spelled
numbers (palm position assigned digits as in Figure 1) were shown on a display to the user.
The averaged evoked responses to targets and nontargets are depicted together with standard
error bars in Figure 2. The BCI six digit sequences spelling accuracy analyses for both the
experiments for the various averaging options are summarized in Figure 3. The chance level
was of 16.6%. The mean accuracies for 15-trial averaged ERPs were 63.8% and 69.4% for autdBCI and vtBCI, respectively. The maximum accuracies were 78.3% and 84.6% respectively.
The dierences were not significant, supporting the concept of AUTDbased tactile stimulus
usability for BCI. However, a single trial classification oine analysis of the collected responses
resulted in mean accuracies of 83.0% for autdBCI and 53.8% for vtBCI, leading to a possible
19.2 bit/min and 7.9 bit/min, respectively. In the case of the autdBCI, only a single users
results were bordering on the level of chance, and four subjects attained 100% (10 trials averaging). On average, lower accuracies were obtained with the classical vtBCI, with which
three users bordered on the level of chance, and only one user scored 100% accuracy level in
SWLDAclassified averaged responses. This case study demonstrates results obtained with a
novel sixcommandbased autdBCI paradigm. We compared the results with classical vibrotactile transducer stimuli already generated. The experiment results obtained in this study
Article ID 018-3
DOI:10.3217/978-3-85125-378-8-18
Figure 3: Averaged autdBCI and vtBCI six digits spelling accuracy results colour coded separately for each user, with standard error bars depicted.
confirm the validity of the contactless autdBCI for interactive applications and the possibility
to further improve the results with the utilization of single trialbased linear classification. The
EEG experiment with the paradigm confirms that contactless (airborne) tactile stimuli can
be used to create six commandbased interfaces. The results presented oer a step forward in
the development of novel neurotechnology applications. Due to the still not very high interfacing rate achieved by users in the case of online BCI, the current paradigm obviously requires
improvement and modification. These requirements determine the major lines of study for
future research. However, even in its current form, the proposed autdBCI can be regarded as
a practical solution for LIS patients (locked into their own bodies despite often intact cognitive functioning), who cannot use vision or auditory-based interfaces due to sensory or other
disabilities.
References
[1] G.R. Muller-Putz, R. Scherer, C. Neuper, and G. Pfurtscheller. Steady-state somatosensory evoked
potentials: suitable brain signals for brain-computer interfaces? Neural Systems and Rehabilitation
Engineering, IEEE Transactions on, 14(1):3037, March 2006.
[2] A-M. Brouwer and J.B F Van Erp. A tactile P300 brain-computer interface. Frontiers in Neuroscience, 4(19), 2010.
[3] H. Mori, Y. Matsumoto, S. Makino, V. Kryssanov, and T.M. Rutkowski. Vibrotactile stimulus
frequency optimization for the haptic BCI prototype. In Proceedings of The 6th International
Conference on Soft Computing and Intelligent Systems, and The 13th International Symposium on
Advanced Intelligent Systems, pages 21502153, Kobe, Japan, November 20-24, 2012.
[4] T. Iwamoto, M. Tatezono, and H. Shinoda. Non-contact method for producing tactile sensation
using airborne ultrasound. In Manuel Ferre, editor, Haptics: Perception, Devices and Scenarios,
volume 5024 of Lecture Notes in Computer Science, pages 504513. Springer Berlin Heidelberg,
2008.
[5] K. Hamada. Braincomputer interface using airborne ultrasound tactile display. Master thesis, The
University of Tokyo, Tokyo, Japan, March 2014. In Japanese.
[6] G. Schalk and J. Mellinger. A Practical Guide to BrainComputer Interfacing with BCI2000.
Springer-Verlag London Limited, 2010.
[7] D.J. Krusienski, E.W. Sellers, F. Cabestaing, S. Bayoudh, D. J. McFarland, T. M. Vaughan, and
J. R. Wolpaw. A comparison of classification techniques for the P300 speller. Journal of Neural
Engineering, 3(4):299, 2006.
Article ID 018-4
DOI:10.3217/978-3-85125-378-8-19
Faculty of life and Health Sciences, University of Ulster, Northern Ireland, UK.
2
Cedar Foundation, Northern Ireland, UK
3
Institute for Knowledge Discovery, Graz University of Technology, Graz, Austria
4
G.tec Medical Engineering GmbH, Schiedlberg, Austria
s.martin@ulster.ac.uk
Abstract
This paper focuses on the evaluation of a P300 BCI prototype to migrate from the
lab to the home of people with acquired brain injury (ABI). Our evaluation involved
both, end users and healthy controls. Overall, lower accuracy scores were recorded for
the end-user group (55%) compared to 78% in the control group. The findings further
indicate that participants were satisfied with the BCI but felt frustrated when it did not
respond to their input. Overall, our evaluation indicated that BCI systems can work for
people with ABI and the results will inform the development of subsequent prototypes.
1 Introduction
Brain-Computer Interfaces (BCI) are hardware and software systems that respond to brain signals,
recorded by the electroencephalogram (Wolpaw et al. 2002). The goal of BCI technology is to
increase independence, communication, rehabilitation outcomes, environmental control and social
inclusion. Although BCIs can support a number of applications, little evidence of exploration of the
systems beyond the laboratory has been undertaken. There are many reasons for this for example;
complexity of the system and access to the target population makes this difficult. This research,
carried out as part of the BackHome project (FP7/2007-2013), aims to build on laboratory-based
results to develop a BCI system for home use.
Limited research has explored BCI technology with ABI end-users (Mulvenna et al. 2012).
Post ABI a number of barriers can impact on a persons quality of life, including physical function,
cognition and communication. This ambitious project will identify user requirements and system
usability within this population by adopting a user-centered approach (Lightbody et al. 2011). Enduser feedback will inform the technical developers through an iterative process in the design and
development of the BCI system throughout the project. It is anticipated that the final prototype will be
a system on which a number of services can be offered to support the transition from hospital to home,
increase therapeutic outcomes, enable communication and home monitoring and control.
Article ID 019-1
DOI:10.3217/978-3-85125-378-8-19
2 Methods
Eleven people were recruited to evaluate the prototype. Five participants (4 female, M= 35.6 years,
range: 26-45; N= 3 had no prior experience of BCI) in a control group in advance of working directly
with six target end users (1 female, M= 36 years, range: 25-48) who are living with ABI. The study
design set out that each participant would aim to complete a predefined experimental protocol on three
different occasions. The experimenter instructed all participants in detail prior to the measurement and
checked whether they understood the paradigm before starting the measurement. Adherence to the
protocol was monitored throughout the measurement.
The BCI prototype was implemented in Matlab Simulink (MathWorks, USA) and used a P300
based paradigm. One screen displayed the P300 speller matrix, while the other displayed the user
interface used for controlling Facebook, Twitter and a desk light. The distance between user and
screen was approximately one meter. EEG was acquired from eight active Ag/AgCl electrodes
(g.Gamma, g.tec, Austria), at the positions Fz, Cz, P3, POz, P4, PO7, Oz and PO8. The channels were
referenced to the right earlobe and a ground electrode was placed at FPz. Signals were amplified by a
g.USBamp (g.tec, Austria), sampled at 256 Hz and band filtered between 0.5 and 30 Hz.
The set-up phase measured the time from sitting in front of the equipment until commencing the
testing protocol. This included placing the cap/electrodes, adding gel, testing the signals, and creating
the classifier. For classifier training, users were instructed to consecutively count the number of
flashes of five specific letters in a 6 X 6 speller matrix. This data was then down-sampled to 64 Hz.
Step-wise linear discriminant analysis then automatically determined the most discriminative features
from the eight channels and the signal points in the 800 ms epochs after flash onset and setup the
classifier model.
The testing phase required the participant to complete a 30-step protocol. The researcher guided
the participant through the process, which included the selection of fifteen letters (Spelling task) and
fifteen selections to navigate the system such as turn on/ off a light and read messages on Facebook or
Twitter. Erroneous selections were not corrected. If users were unable to make the correct selection
after three attempts the step was abandoned and they were directed to the next step in the protocol.
Each participant aimed to complete the protocol in three sessions on three different days, followed
by the visual analogue scale (VAS) questionnaire to rate overall satisfaction. After the final evaluation
session participants completed the extended Quebec User Evaluation of Satisfaction with Assistive
Technology (QUEST) 2.0 (Demers, Weiss- Lambrou & Ska, 2002), a customized usability
questionnaire and the (NASA-TLX) NASA-Task Load Index to assess workload Sharek (2009).
3 Results
All of the set up was undertaken by non-BCI-experts and took 15.5 minutes on average with the
control group as oppose to 27.6 minutes for the end users. The most challenging aspect of the set up
was to assure good signal quality in all electrodes. Figure 1 below illustrates that the EEG responses
are neurophysiologically sound over the multiple training sessions for both the control and end users
although some differences are noted in the strength of deflection of the P300 component. The Panels
(A) and (B) show the averaged target and non-target EEG response in channel Cz over all healthy and
disabled users. Both curves are averages over multiple training sessions. As baseline correction, we
subtracted the mean of the 500 ms window before stimulus onset from every epoch. We rejected
epochs with an overall average absolute amplitude higher than +8 muV.
All five participants in the control group successfully completed the 30-step protocol over three
sessions with an average accuracy of 78% (range: 65% to 91%). A total of 14 out of 18 attempted
protocols were fully completed by the end users. The evaluation presents an average accuracy of 55%
Article ID 019-2
DOI:10.3217/978-3-85125-378-8-19
(individual session range= 41% to 79%) for those completing a full protocol. Four protocols were
partially completed up to step ten, three were stopped as the system was failing to respond and one
because the participant had reached the cut off time of two hours using the system. The four protocols
that were partially completed (accuracy ranged from 36% to 62%) brought the overall average
accuracy down to 55%. One participant did not complete the evaluation as the system stopped
responding to his commands after one complete protocol and two partly completed protocols.
Specifically the final four steps of the protocol recorded considerably lower accuracy ranging from
37% to 45% especially in comparison to the control group that ranged from 71% to 83%.
Table 1: Outcomes from Control Group and End User Evaluation
Measurement Tool
Time
Set up
Complete Protocol
Controls
Cedar Participants
78% (9.4)
83% (13.4)
55% (10.6)
62% (10.4)
Efficacy
NASA TLX
-----------
Satisfaction:
VAS
QUEST 2.0
QUEST added items
7.6 (1.67)
4.23 (1.9)
4 (0.88)
7.8 (1.9)
3.8 (.86)
3.9 (1.19)
Figure 1: Panel (A) shows the brain response for healthy and Panel (B) for disabled users.
All participants indicated satisfaction following each session, with the control group reporting an
average VAS score of 7.6 (range: 5-9) and target end-users average score of 7.8 (range: 5-10). The
control group mean QUEST score was 4.23 (4= quite satisfied) and the average score of the added
BCI related items was 4. The target end users reported a slightly lower average QUEST score of 3.8
(4= quite satisfied/3= more or less satisfied) and the average score of the added items was 3.9. The
aspects rated as most important were effectiveness, comfort, ease of use, speed, and reliability.
Additionally, target end users highlighted safety as an important fact to consider in the BCI design.
The subjective workload using the NASA-TLX reported moderate to high workload scores (ranging
from 44.66- 75.26 of 100 with a mean of 58.56).
4 Discussion
This evaluation demonstrates that it is feasible, valuable and worthwhile to engage directly with
Article ID 019-3
DOI:10.3217/978-3-85125-378-8-19
people who have ABI in the development of a BCI system for home use. The work described here is
part of an iterative process to improve the functionality and usability of BCI for users with ABI.
Recruiting people without ABI, as a control, is helpful to test out the protocol and provide comparison
data in advance of going to your target end user group. The results highlighted that although the end
users were able to use the system the set up time was longer, tasks took longer to complete and the
accuracy was lower. It is possible this is due to the participants residual cognitive impairment as a
result of ABI such as difficultly concentrating for periods of time as well as decreased stamina,
memory and attention. Mental fatigue was indicated as an issue for the end-user group in the usability
questionnaire and highlighted by the last four steps of the protocol recording considerably lower
accuracy scores. The key findings from both groups included frustration when selections were
incorrect and difficulty navigating through some aspects of the system. Equally, both groups
recommended changes to the user interface, appearance of the cap/wires, and the control group
suggested changes to the onscreen keyboard.
A number of limitations for every day use of the system by non-BCI-experts emerged from this
evaluation. The most challenging aspect of the set up was achieving a stable signal from all the
electrodes. It was also difficult to determine why the system was not responding to a particular user.
There could be a number of reasons for the reduced response rate such as noisy signals, a system
failure, participant fatigue level or insufficient classifier accuracy however it is currently impossible
for the non-expert user to resolve the issue to support the user.
The focus of the project is to move BNCI systems from the laboratory to the home of people with
neurological conditions such as ABI. The findings indicated that BCI systems can work for people
with ABI, which is promising, and will be used to inform the development and design of the
subsequent two prototypes within the project. The evaluation provides important information to
improve the prototype design and enhance the ability of the BCI to improve individuals functional
ability, quality of life, and independence.
5 Acknowledgements
Funded by the FP7 EU research project BackHome (grant agreement n 288566). We thank Josef Faller
and our partners from University of Wrzburg and Barcelona Digital for technical help.
6 References
Demers, L., Weiss-Lambrou, R., & Ska, B. (2002). Quebec user evaluation of satisfaction with assistive
technology (QUEST 2.0): An overview and recent progress. Technology and Disability, 14, pp. 101-105.
Lightbody, G., Ware, M., Mccullagh, P., Mulvenna, M.D., Thomson, E., Martin, S., Todd, D., Medina,
V.R., & Martinez, S.C. (2010). A user-centred approach for developing brain-computer interfaces. In:
Proceedings of Pervasive Computing Technologies for Healthcare, Munich 2010, pp. 1-8.
Mulvenna, M Lightbody, G., Thomson, E. McCullagh, P. Ware, M., and Martin, (2012). Realistic
expectations with brain computer interfaces. Journal of Assistive Technologies, 6 (4), pp. 233 244.
Nicolas-Alonso, L. F. & Gonez-Gil, J. (2012). Brain computer interfaces, a review. Sensors, 12, pp.
1211 1279.
Sharek, D. (2009). NASA-TLX Online Tool (Version 0.06) [Internet Application]. Research Triangle,
NC. Retrieved from http://www.nasatlx.com
Wolpaw, J., Birbaumer, N., McFarland, D. J., Pfurtscheller, G., & Vaughan, T. M. (2002). Braincomputer interfaces for communication and control. Clinical Neurophysiology, 113(6), pp. 767-791.
Article ID 019-4
DOI:10.3217/978-3-85125-378-8-20
This study provides a comprehensive test of the headrelated impulse response (HRIR)
to an auditory spatial speller braincomputer interface (BCI) paradigm, including a comparison with a conventional virtual headphonebased spatial auditory modality. Five BCI
naive users participated in an experiment based on five Japanese vowels. The auditory
evoked potentials obtained produced encouragingly good and stable P300responses in online BCI experiments. Our case study indicates that the auditory HRIR spatial sound
paradigm reproduced with headphones could be a viable alternative to established multi
loudspeaker surround sound BCIspeller applications.
Introduction
A brain-computer interface (BCI) is capable of providing a speller for disabled people with
conditions such as amyotrophic lateral sclerosis (ALS). Although the currently successful visual
modality may provide a fast BCI speller, patients at an advanced stage who are in a lockedin
state cannot use the modality because they lose all intentional muscle control, including even
blinking and movements of the eyes. An auditory BCI may be an alternative method because
it does not require good eyesight. However, the modality is not as precise as the visual.
We propose an alternative method to extend the previously published spatial auditory BCI
(saBCI) paradigm [1] by making use of a headrelated impulse response (HRIR) for virtual
sound image spatialization with headphonebased sound reproduction. Our research goal is a
virtual spatial auditory BCI using HRIRbased spatialized cues in the part of the noninvasive,
stimulusdriven, auditory modality which does not require longterm training. Experiments
were conducted to reproduce and provide a comparison with previously reported vectorbased
amplitude panning (VBAP)based spatial auditory experiments [1]. The more precise HRIR
based spatial auditory BCI stimulus reproduction was used to simplify previously reported real
sound sources generated with surround sound loudspeakers [2].
HRIR appends interaural intensity dierences (IID), interaural time dierences (ITD), and
spectral modifications to create the spatial stimuli, while VBAP appends only IID. HRIR allows
for more precise and fully spatial virtual sound image positioning, even without utilizing the
users owner HRIR measurements [3].
The next section of this paper describes the experiment setup and the HRIRbased saBCI
paradigm, together with EEG signal acquisition, preprocessing and classification steps. In the
The author was supported by the Strategic Information and Communications R&D Promotion Program
(SCOPE) no. 121803027 of The Ministry of Internal Aairs and Communications in Japan.
Corresponding author.
Article ID 020-1
DOI:10.3217/978-3-85125-378-8-20
third section, the event related potentials (ERP), and especially the P 300 response latencies are
described, with a classification and discussion of the HRIRbased saBCI paradigm information
transfer rate (ITR) results, including a comparison with the conventional method. Finally, the
conclusions and future research directions are indicated.
Methods
All of the experiments were performed at the Life Science Center of TARA, University of
Tsukuba, Japan. Five paid BCI-naive users participated in the experiments. The average age of
the users was 21.6 years (standard deviation 0.547 years; five females). The psychophysical and
online EEG BCI experiments were conducted in accordance with The World Medical Association
Declaration of Helsinki - Ethical Principles for Medical Research Involving Human Subjects. The
experiment procedures were approved and designed in agreement with the ethical committee
guidelines of the Faculty of Engineering, Information and Systems at the University of Tsukuba,
Japan. Five Japanese vowels (a, i, u, e, o) were used in this experiment. The vowels were taken
from a sound dataset of female voices [4]. The monaural sounds were spatialized using the public
domain CIPIC HRTF Database provided by the University of California, Davis [5]. Each
Japanese vowel was set on a horizontal plane at azimuth locations of 80 , 40 , 0 , 40 , 80
for the vowels a, i, u, e, o, respectively. The psychophysical experiments were conducted to
investigate the response time and recognition accuracy. The users were instructed to respond by
pressing the button as soon as possible after they perceived the target stimulus, as in a classical
oddball paradigm [6]. In a single psychophysical experimental run, 20 targets and 80 non-targets
were presented. An online EEG experiment was conducted to investigate the P300 response with
BCInaive users. The brain signals were collected with a biosignal amplifier system g.USBamp
by g.tec Medical Engineering GmbH, Austria. The EEG signals were captured by sixteen active
gelbased electrodes attached to the following head locations Cz, Pz, P3, P4, Cp5, Cp6, P1, P2,
Poz, C1, C2, FC1, FC2, and FCz, as in the extended 10/10 international system.The ground
electrode was attached on the forehead at the FPz location, and the reference on the users left
earlobe. BCI2000 software was used for the saBCI experiments to present stimuli and display
online classification results. A single experiment was comprised of five runs which contained
10 target and 40 non-target stimuli. Each run contained five selections. The stimulus duration
was set to 250 ms, the interstimulus interval (ISI) to 150 ms, and brain signal ERPs were
averaged 10 times for each vowel classification. In brief, the single experiment was comprised
of 25 selections. The EEG sampling rate was set to 512 Hz, and a 50 Hz notch filter to remove
electric power line interference was applied in a rejection band of 48 52 Hz. The band pass
filter was set with 0.1 Hz and 60 Hz cut-o frequencies. The acquired EEG brain signals
were classified online by the BCI2000 application using a stepwise linear discriminant analysis
(SWLDA) classifier with features drawn from the 0 800 ms ERP interval.
Results
This section presents and discusses results obtained from the psychophysical and EEG experiments conducted with five users, as described in the previous section. In the psychophysical
experiment, the accuracy rates for all stimuli were above 94%. The majority of responses were
concentrated at the 350 ms latency. There were no significant dierences in the response times
between the target stimuli as tested by ANOVA (p < 0.05). The results of the EEG experiment
are depicted in Figure 1. The left panel shows the grand mean averaged ERP results at four
Article ID 020-2
DOI:10.3217/978-3-85125-378-8-20
max(AUC) at 488 ms
Averaged targets
0.55
0.54
0.53
0.52
0.51
0.5
Cz
Pz
P3
P4
C3
C4
CP5
CP6
P1
P2
POz
C1
C2
FC1
FC2
FCz
2
1
[V]
0.56
0
1
2
0
300
600
0.49
0.48
min(AUC) at 285 ms
0.56
0.55
Cz
Pz
P3
P4
C3
C4
CP5
CP6
P1
P2
POz
C1
C2
FC1
FC2
FCz
2
1
[V]
0.47
0
1
2
0
300
600
0.54
0.52
0.51
0.5
0.49
0.48
0.47
0.56
Cz
Pz
P3
P4
C3
C4
CP5
CP6
P1
P2
POz
C1
C2
FC1
FC2
FCz
0.54
0.52
0.5
AUC
0.53
0.48
0
300
600
time [ms]
Figure 1: Grand mean averaged ERP and AUC scores leading to final classification results for
the participants. The left panel shows the averaged ERP for all participants. The purple line
shows the brain waves for targets, and blue line is for non-targets. The centre panel presents
the head topographies at the maximum and minimum AUC scores as obtained from the right
bottom panel. The top right panel presents averaged ERP responses to the target and the
middle panel to the nontarget stimuli. The right bottom panel visualizes the AUC analysis
results of target versus nontarget response distribution dierences.
representative electrodes. The centre panel provides the results as scalp topographies at the
maximum and minimum area under curve (AUC) of a receiver operating with characteristic
values [7] for target vs. nontarget latencies. It also demonstrates the EEG electrode positions used in the experiments. The top right panel indicates the averaged ERP responses of
all electrodes to the target, and the second panel shows responses to the nontarget stimuli.
The bottom panel indicates the AUC of target versus non-target responses, clearly confirming
the usability of 400 600 ms latencies for the subsequent classification. Table 1 presents the
classification accuracies of the P300 responses as obtained with the SWLDA classifier and the
ITR scores. The average score was obtained as a mean value calculated from 1 5 runs (the
training run was not included in the calculation of accuracies). The ITR is a major comparison
measure [7] among the BCI paradigms. All five users scored above the five vowel sequences
spelling chance levels of 20%. There was one user who achieved 100% accuracy, which was the
best in the experiments reported. We also compared the ITR scores with a VBAPbased spatial
auditory BCI, which is regarded as a conventional method [1]. The VBAP experiment was conducted in 2 runs and with 16 BCInaive users in [1]. The electrode positions were the same as in
our current experiments. The sound stimuli were presented with small earfitting headphones
in both the modalities. The ISI was set to 500 ms in the VBAP experiment, and to 150 ms
in the HRIR experiment. In the VBAP modality, the average ITR score was 1.05 bit/min and
the best was 1.78 bit/min. In the HRIR modality, the average ITR was 1.35 bit/min and the
best was 2.40 bit/min. The ITR scores of the HRIR experiment were recalculated for 2 runs,
Article ID 020-3
User
#1
#2
#3
#4
#5
1
60%
0%
20%
20%
0%
2
80%
40%
20%
20%
40%
3
40%
100%
0%
20%
40%
Run
4
5
20%
40%
80% 100%
40%
80%
20%
40%
80%
60%
DOI:10.3217/978-3-85125-378-8-20
Average
48%
64%
32%
24%
44%
Best
80%
100%
80%
40%
80%
ITR [bit/min]
Average
Best
2.26
9.60
5.27 18.58
0.46
9.60
0.06
1.21
1.70
9.60
Table 1: Vowel spelling accuracies and ITRs of each user obtained in the EEG experiments
the same as for the VBAP experiment. HRIR based modality produced better results than the
VBAP based modality for both the average and the best score.
Conclusions
The EEG results presented confirm the P300 responses of BCInaive users. The mean accuracy
was not very good owing to the short ISI, but the accuracy tends to improve when the number of
run increases. Therefore, more attention training or interface using practice may be necessary
for BCInaive users. The ITR scores were higher compared (no significance analysis due to
dierent user groups) with our previous study using HRIR stimuli, and also compared with the
previously reported VBAPbased spatial auditory BCI. Nevertheless, the current study is not
able to compete with the faster visual BCI spellers. Furthermore, it is necessary to improve
the ITR for a more comfortable spelling. We plan to continue research with larger numbers of
sound stimuli, a better suited ISI, and more complex spatial sound patterns.
References
[1] M. Chang, N. Nishikawa, Z. R. Struzik, K. Mori, S. Makino, D. Mandic, and T. M. Rutkowski.
Comparison of P300 responses in auditory, visual and audiovisual spatial speller BCI paradigms.
In Proceedings of the Fifth International Brain-Computer Interface Meeting 2013, page Article ID:
156, Asilomar Conference Center, Pacific Grove, CA USA, June 3-7, 2013. Graz University of
Technology Publishing House, Austria.
[2] T. M. Rutkowski, A. Cichocki, and D. P. Mandic. Spatial auditory paradigms for brain computer/machine interfacing. In International Workshop On The Principles and Applications of Spatial
Hearing 2009 (IWPASH 2009) - Proceedings of the International Workshop, page P5, Miyagi-Zao
Royal Hotel, Sendai, Japan, November 11-13, 2009.
[3] J. Schnupp, I. Nelken, and A. King. Auditory Neuroscience - Making Sense of Sound. MIT Press,
2010.
[4] S. Amano, S. Sakamoto, T. Kondo, and Y. Suzuki. Development of familiarity-controlled word lists
2003 (FW03) to assess spoken-word intelligibility in Japanese. Speech Communication, 51(1):7682,
2009.
[5] V. R. Algazi, R. O. Duda, D. M. Thompson, and C. Avendano. The CIPIC HRTF database. In 2001
IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics, pages 99102.
IEEE, 2001.
[6] J. Wolpaw and E. Winter Wolpaw, editors. Brain-Computer Interfaces: Principles and Practice.
Oxford University Press, 2012.
[7] M. Schreuder, B. Blankertz, and M. Tangermann. A new auditory multi-class brain-computer
interface paradigm: Spatial hearing as an informative cue. PLoS ONE, 5(4):e9813, 04 2010.
Article ID 020-4
DOI:10.3217/978-3-85125-378-8-21
1 Introduction
Patients with motor impairment due to central nervous system diseases such as stroke and
Parkinsons disease have difficulties in their daily-life activities, and many of them are not able to
perform basic body movements without caregivers or assistive devices. To now, many researchers
have developed various neurorehabilitation strategies for recovering their damaged motor functions.
Recently, researchers have shown that a combinatory neurorehabilitation strategy that
simultaneously uses both passive and active exercise can dramatically enhance the outcomes of
conventional passive exercises. For example, Joa et al. compared brain activations induced by the
passive exercise (passive motor execution by functional electrical stimulation (FES)), the active
exercise (voluntary contraction), and combination of both using functional magnetic resonance
imaging (fMRI) (Joa, 2012), and demonstrated that the combination of the passive and active
exercises may be more effective for successful neurorehabilitation.
Nevertheless, this combinatory strategy has a critical limitation in that the practitioners cannot
know how actively the patients perform the active motor execution or motor imagery during passive
exercise. Therefore, it is needed to monitor patients active engagement in the rehabilitation program.
Article ID 021-1
DOI:10.3217/978-3-85125-378-8-21
2 Methods
2.1 Subjects
A total of eight healthy subjects (six males and two females, 24-30 years old) were enrolled in our
study. All subjects were right-hander and had normal or corrected-to-normal vision. None of them had
a previous history of neurological, psychiatric, or other severe diseases that might affect the
experimental results.
Article ID 021-2
DOI:10.3217/978-3-85125-378-8-21
meaningful channels among 40 channels. Second, we classified the passive exercise alone and the
combinatory strategies using the hemodynamic responses of the selected channels. We used five
moving windows with different windows sizes of 1, 2, 4, 5, and 10 seconds and extracted features by
moving the windows along the time axis with 50% overlap. Five features (mean, variance, kurtosis,
skewness, and slope) were evaluated for each moving window. The linear bayes normal classifier
implemented in the PRtools package was used for the binary classification (Duing, 2000). The 10 by
10 cross-validation was performed for the estimation of classification accuracy.
3 Results
Figure 1 shows topographic maps of oxy-Hb concentration changes induced by the passive
exercise alone and the combinatory strategies. It could be observed from the figures that the oxyHb concentration changes during both combinatory strategies were stronger and broader than that
during passive exercise alone.
Figure 1: Topographic map of oxy-Hb concentration changes in each task. Left and right yellow circles
reprecent locations of C3 and C4, respectively.
Table 1 shows the results of the single-trial pattern classification. The passive exercise alone and
the combinatory strategies could be classified with classification accuracy larger than 70%, which is
high enough to be used as an indicator to evaluate whether the users actively performed active motor
execution or motor imagery during passive exercise (Choularton, 2004).
Article ID 021-3
Subject No.
1
2
3
4
5
6
7
8
Mean
DOI:10.3217/978-3-85125-378-8-21
References
K.-L. Joa, Y.-H. Han, C.-W. Mun, B.-K. Son, C.-H. Lee, and Y.-B. Shin. (2012). Evaluation of the
brain activation induced by functional electrical stimulation and voluntary contraction using
functional magnetic resonance imaging. Journal of neuroengineering and rehabilitation
D. T. Delpy, M. Cope, P. van der Zee, S. Arridge, S. Wray, and J. Wyatt. (1988). Estimation of
optical pathlength through tissue from direct time of flight measurement. Physics in medicine and
biology
S. Lee, D. Koh, A. Jo, H. Y. Lim, Y.-J. Jung, C.-K. Kim, Y. Seo, C.-H. Im, B.-M. Kim, and M.
Suh. (2012). Depth dependent cerebral hemodynamic responses following direct cortical electrical
stimulation (DCES) revealed by in vivo dual-optical imaging techniques. Optics express
R, P. W. Duin. (2000) PRTools version 3.0: A Matlab toolbox for pattern recognition. Delft
University of Technology
Article ID 021-4
DOI:10.3217/978-3-85125-378-8-22
Abstract
The goal of this study was to investigate the relationship between heart rate variability (HRV) and
performance in a clinical application for brain-computer interfaces, namely Neurofeedback training
(NFT) for children with autism spectrum disorder (ASD). HRV parameters in an initial pre-training
test predicted performance as well as performance improvements in social cognition following NFT,
confirming a relationship between the autonomic nervous system and social cognition. Furthermore,
HRV improved after each NFT session in comparison to pre-session levels. However, no direct
relationship between resting HRV and NFT performance was found. The decrease in HRV over
sessions might be explained by an increase in mental effort due to an increase in difficulty of the NFT
protocol. Nonetheless, HRV can serve as a predictor for NFT effects on social dysfunction in autism
and HRV-biofeedback might lead to further improvements in social cognition for autistic children.
1 Introduction
Recent studies have suggested that heart rate and its variability (HRV) can be used as predictor for
mental effort in a brain-computer interface (BCI) (Pfurtscheller et al. 2013) and can predict P300based BCI performance (Kaufmann et al. 2012). Additionally, a positive relation between HRV and
social cognition was recently shown in able-bodied individuals using the Reading the Mind in the
Eyes-test (RMET; Baron-Cohen et al. 2001; Quintana et al. 2012). These links between brain and
body could be especially important for clinical applications of BCIs such as Neurofeedback training
(NFT) for social dysfunction in children with autism spectrum disorder (ASD).
In this study, an EEG-based NFT for children with ASD was implemented as in previous work
(Pineda et al. 2008, 2014). ASD is characterized with deficits in social and communicative skills such
as imitation and empathy. Besides neurophysiological abnormalities, deficits in the social engagement
system have been linked to the regulation of heart rate (Porges 2007). The goal of this study was to
investigate if the reported positive relation between HRV and social cognition as well as BCI
performance can be replicated in children with ASD. Therefore, (1) resting HRV was correlated with
performance in the RMET which was used as an indicator for social cognition and NFT success and
(2) HRV in a resting baseline before and after each NFT session was correlated to NFT performance.
2 Methods
Thirteen children (aged 6-17, one female) with a confirmed diagnosis of ASD participated in a
pre- and posttest and in sixteen 1-hour NFT sessions twice a week. During the NFT sessions, EEG
Article ID 022-1
DOI:10.3217/978-3-85125-378-8-22
was recorded from one electrode over the right sensorimotor cortex (C4), sampled at 256 Hz and
filtered for mu (8-12 Hz), theta (3-8 Hz) and high beta (18-30 Hz) frequency bands. Before and after
each NFT session, a 5-min baseline was recorded for HRV analyses (see last paragraph of methods).
The children were trained to control a video game involving social interactions by modulation of their
mu frequency band. They were not provided with specific control strategies but learned by operant
conditioning via feedback. The beta and theta frequency bands inhibited positive feedback in the game
if the amplitudes in these frequencies exceeded a certain threshold. The thresholds for mu, beta and
theta were set as a function of an initial preceding resting period of baseline activity. The amplitude
value for mu was set in the first session and then shaped to get higher in the following sessions. In
contrast, the amplitude values of theta and beta were shaped to get lower in the subsequent sessions.
Performance during the NFT was calculated as: Performance = Hitrate x Difficulty
(1)
A hit was defined as fulfilling all threshold criteria (e.g. above mu and below beta and theta) and
thus triggering positive feedback in the game. In order to make different parts of the game
(2)
comparable, the hitrate was defined as: Hitrate =
Due to the continuous shaping of the threshold, the possibility of fulfilling all threshold criteria
decreased and thus difficulty increased over sessions. In order to adjust the hitrate for the level of
difficulty, the distances (obs) between the shaped threshold and the preceding baseline were
considered. Distances were calculated so that positive numbers reflected the threshold being set easier
than the baseline values and negative numbers reflected the threshold being set more difficult than the
baseline values. The observed distances were then normed (norm) to the defined standard distance
(std). The standard distance was set in a way that the mu () threshold was 50% lower and the beta
() and theta () threshold was 50% higher than the preceding baseline value which resulted in a
difficulty of 1 for the first session. In order to ensure that a distance of zero and negative distance
values still show reasonable results, a logarithmic transformation was used:
Difficulty =
norm =
(3)
In the pre- and posttest, two 6-min baselines with open and closed eyes were recorded before the
children completed the RMET. In this test, pictures of individuals eye regions were shown. Based on
the eyes, children had either to determine what the individual is thinking or feeling out of 4 possible
choices presented at the four corners of the display, or what gender (male or female) the individual is.
The percentage of correct responses (Corr%), as well as the reaction time (RT), was calculated.
The electrocardiogram (ECG) was recorded at a sampling rate of 2048 Hz from electrodes
attached to the left wrist and the right side of the neck. HRV parameters were analyzed in all 5-min
resting baselines before and after each NFT session as well as in the pre- and posttest. The ECG data
was down-sampled to 512 Hz and the interbeat interval (IBI) detection and artifact correction were
made with the software ARTiiFACT (Kaufmann et al. 2011). The statistical parameters for HRV were
calculated with KUBIOS (Tarvainen et al. 2014) and included the SDNN in ms (i.e. standard
deviation of IBIs) in the time domain and the power (ms2) and percentage (%) of high- (HF; 0.15-0.4
Hz) and low-frequency (LF; 0.04-0.15 Hz) measures in the frequency domain derived using Fast
Fourier Transformation. SDNN and HF have a positive, LF (%) a negative association with HRV.
Article ID 022-2
DOI:10.3217/978-3-85125-378-8-22
3 Results
First, the relationship between HRV and the RMET was investigated. A normal distribution was
found for all variables. Children had more Corr% in the gender (M=84, SE=5) than in the emotion
recognition (M=52, SE=6) task (F1,12=74.5, p<.01) confirming the differences in difficulty of the
tasks, and more Corr% in the post- (M=70, SE=5) than in the pretest (M=65, SE=6) by trend
(F1,12=3.9, p<.1) suggesting improvement as a function of training. There were neither significant
effects for RT in the RMET nor for HRV parameters between pre- and posttest.
HRV parameters in the resting baseline of the pretest - but not of the posttest - correlated with
Corr% and RT in the RMET of the pre- and posttest (Table 1). In order to adjust the correlations for
possible influence of age and gender of the participants, partial correlations were calculated. All
correlations above r=.3 were in the expected direction: The higher SDNN and HF, the higher the
Corr% and the shorter the RT in the RMET. For the LF-percentage, the opposite occurred.
Second, HRV parameters during the baselines before and after each NFT session were correlated
with hitrate, difficulty and performance during each NFT session. However, the correlations failed to
show any consistent pattern. Hitrate decreased over sessions (F7,84=9.1, p<.01), whereas difficulty and
performance increased (F7,84=3.7/1.9, p<.05/.1). The parameters SDNN and HF power showed
significantly higher HRV in the baselines after than before each NFT session (F1,12=44.4/13, p<.01).
However, HF (ms2, %) decreased and LF (%) increased over the training sessions (F7,84=2.3-3.4,
p<.05).
Pretest
Eyes
open
SDNN
HF (ms2)
HF (%)
LF (%)
Eyes
closed
HF (%)
LF (%)
Pretest
Gender
Emotion
Corr%
RT
Corr%
RT
.49
-.29
-.41
.61*
.44
-.31
.57(*)
-.47
.50
.25
.20
-.19
-.55(*)
-.25
-.22
.10
.63*
-.62*
-.07
.05
.36
-.37
-.53(*)
.51
Gender
Corr%
RT
.47
-.40
.44
-.42
-.47
.68*
.42
-.70*
.67*
-.71*
Posttest
-.47
.54(*)
Emotion
Corr%
RT
-.44
.66*
-.40
.61*
.48
-.40
-.40
.40
.63*
-.63*
-.47
.51
Table 1: Bivariate partial correlation coefficients controlled for age and gender between parameters of
HRV and RMET. The significance level is indicated with the asterisks (* p<.05, (*) p<.1).
Article ID 022-3
DOI:10.3217/978-3-85125-378-8-22
could be predicted by HRV in able-bodied individuals. In the present study, HRV improved after each
NFT session compared to before, which suggests that the NFT-game was successful in inducing
stress-free social interactions. However, across training sessions, HRV decreased, whereas
performance increased. As the increase in performance was the product of increasing difficulty and
decreasing hitrate, we could explain the decrease in HRV as an increase in mental effort (Pfurtscheller
et al. 2013; Cowley et al. 2013). Analyzing HRV during game play itself and not only in the baseline
before and after each NFT might reveal more consistent correlations with performance parameters.
To conclude, although we could not show consistent correlations between resting HRV and NFT
performance, this study extends the idea that HRV is a good predictor for behavioral performance in
social cognition in individuals with ASD, and can be used as an indicator of mental effort rather than
performance during NFT. Moreover, the results suggest that HRV-biofeedback might improve NFT
effects in social cognition for children on the spectrum.
Acknowledgements
This research was supported by a fellowship provided by the Max Kade Foundation to the Department
of Cognitive Science, UCSD and by grant funding from the ISNR Research Foundation. The authors
thank Kaitlin Huennekens and Anna Christina Macari for help with the data analyses and all
colleagues from the CNL and HWU for their support.
References
Baron-Cohen S, Wheelwright S, Hill J, Raste Y, Plumb I (2001). The Reading the Mind in the Eyes
Test Revised Version: a Study with Normal Adults, and Adults with Asperger Syndrome or Highfunctioning Autism. Journal of Child Psychology and Psychiatry 42: 24151.
Cowley B, Ravaja N, Heikura T (2013). Cardiovascular physiology predicts learning effects in a
serious game activity. Computers & Education 60: 299309.
Kaufmann T, Stterlin S, Schulz S M, Vgele C (2011). ARTiiFACT: a Tool for Heart Rate Artifact
Processing and Heart Rate Variability Analysis. Behavior Research Methods 43: 116170.
Kaufmann T, Vgele C, Stterlin S, Lukito S, Kbler A (2012). Effects of Resting Heart Rate
Variability on Performance in the P300 Brain-computer Interface. International Journal of
Psychophysiology 83: 33641.
Pfurtscheller G, Solis-Escalante T, Barry R J, Klobassa D S, Neuper C, Mller-Putz G R (2013). Brisk
Heart Rate and EEG Changes During Execution and Withholding of Cue-paced Foot Motor
Imagery. Frontiers in Human Neuroscience 7: 379.
Pineda J A, Brang D, Hecht E, Edwards L, Carey S, Bacon M, Futagaki C, et al (2008). Positive
Behavioral and Electrophysiological Changes Following Neurofeedback Training in Children with
Autism. Research in Autism Spectrum Disorders 2: 557581.
Pineda J A, Carrasco K, Datko M, Pillen S, Schalles M (2014). Neurofeedback Training Produces
Normalization in Behavioral and Electrophysiological Measures of High Functioning Autism. In
Mirror Neurons: Fundamental Discoveries, Theoretical Perspectives and Clinical Implications,
edited by P F Ferrari and G Rizzolatti. Royal Society Publishing, London.
Porges, S W (2007). The Polyvagal Perspective. Biological Psychology 74: 11643.
Quintana D S, Guastella A J, Outhred T, Hickie I B, Kemp A H (2012). Heart Rate Variability Is
Associated with Emotion Recognition: Direct Evidence for a Relationship Between the Autonomic
Nervous System and Social Cognition. International Journal of Psychophysiology 86: 16872.
Tarvainen M P, Niskanen J-P, Lipponen J, Ranta-Aho P O, Karjalainen P (2014). Kubios HRV-heart
Rate Variability Analysis Software. Computer Methods and Programs in Biomedicine 113: 210
20.
Article ID 022-4
DOI:10.3217/978-3-85125-378-8-23
1 Introduction
To date BCIs have hardly been established at the patients homes. To our knowledge there exist
only 3 studies presenting independent BCI use [1-3]. The current study aimed at demonstrating proofof-principle of independent BCI use and investigating usability of independent BCI use in severely
disabled patients. A P300 BCI for creative expression, Brain Painting, was implemented at the
patients homes. Here we present data within two patients, who used the BCI at their homes for
several months. The study covering data of the first 14 months (200 sessions) for the first patient is
published in its extended form in [4]. In this paper we present follow-up data and also data of a second
patient, who has been participating in the study for 5 months.
2 Method
For a detailed description of methods the reader is referred to [4].
2.1 Patients
Both patients (one female), aged 73 and 74, were diagnosed with ALS. Both are artists and with
progress of disease related paralysis were no longer able to paint. Patient 1 is in the locked-in state
with preserved eye-movements. Patient 2 is tetraplegic, able to talk and to move his head.
Article ID 023-1
DOI:10.3217/978-3-85125-378-8-23
independent home use [4], in the way that Brain Painting can be started within a few clicks and BCI
parameters, such as calibration data, are loaded automatically. In an initial meeting including two
consecutive days, BCI was calibrated and the family and/or assistants were trained in setting up the
BCI system. BCI is used independent of the help of BCI experts. BCI data is automatically
transmitted to a remote-server to the BCI lab. BCI use is monitored and supported in case of e.g.
technical problems using a remote desktop application. Patient 1 was visited in further meetings for
e.g. improvement of the system or recalibration. At date of submission, patient 1 has been using the
BCI for 23 months and patient 2 for 5 months. Both studies are ongoing.
3 Results
3.1 Face valid measures of BCI use
In 23 months patient 1 painted in 271 sessions, with a total painting time of 403.48 hours with a
mean painting duration of M=89.33 (SD=52.87) minutes. Patient 2 painted with the BCI in 58 sessions
within 5 months, resulting in a total painting time of 64.60 hours with a mean painting duration of
M=66.82 (SD=34.56) minutes.
Article ID 023-2
DOI:10.3217/978-3-85125-378-8-23
VAS satisfaction
VAS satisfaction
10
Rating
Rating
10
5
0
5
0
100
200
Number of session
25
50
Number of session
Figure 1: VAS satisfaction for patient 1 (left) and patient 2 (right) over time (271 and 58 sessions).
Zero indicates not at all satisfied and 10 very satisfied.
3.4 Paintings
Patient 1 created more than 120 paintings, patient 2 more than 30. Two paintings are depicted in
figure 2.
Figure 2: Paintings: Black bird by patient 1 (left) and Patchwork by patient 2 (right), with
permission of the artists.
4 Discussion
The present study demonstrates that independent BCI use is possible with high satisfaction and
gratification for the patients. BCI use was challenged by technical problems, BCI control and personal
factors. Technical problems, occurring mostly at the beginning of the study (in patient 1), could be
solved and fully removed. Dissatisfaction with the painting occurred also only at the beginning of the
study and disappeared with more practice in using the Brain Painting matrix. Attentional issues could
be reduced through frequent breaks within sessions and time management, e.g. BCI session is not
started, if the patient is too tired. Disease related factors, such as cough, however are less likely to be
overcome and will remain in daily-life. In most of the BCI sessions end-users rated BCI control as
Article ID 023-3
DOI:10.3217/978-3-85125-378-8-23
being medium. BCI control depended on the adjustment of EEG cap and electrodes, but also on
personal factors (e.g. concentration and cough). Problems on BCI control could be decreased by
improvements on hard- and software, e.g. EEG cap and electrodes or an auto-calibration option [6].
Interestingly patients indicated moderate to high satisfaction even if BCI control is moderate and not
100%. Patient 2 states: I do not want to focus of my attention on my cough. I have enough time and I
refine until I am satisfied with it [painting]. This is the way I did it in the past as a [hand] painting
artist []. This shows that, even with low(er) BCI control, the patient can nevertheless be satisfied
with BCI as assistive technology. Patient 1 points out: [] Brain Painting makes me happy and
satisfied everytime again, if it works - if not, then frustration and disappointment. Fortunately this
happens in only few cases. BP completely changed my life. I can paint again and be creative again
using colours [].
5 Conclusion
BCI controlled painting is accepted well as assistive technology by the severely disabled endusers. Brain Painting exactly matches the artists need for creative expression and positively impacts
the quality of life of the patients. P300-BCIs have come of age and can be used independently in
daily-life. To replicate our findings, inclusion of more patients is planned.
6 Acknowledges
This work is supported by the European ICT Programme Project FP7- FP7-288566 (Backhome).
This paper only reflects the authors views and funding agencies are not liable for any use that may be
made of the information contained herein.
References
[1] Neumann, N., A. Kbler, J. Kaiser, T. Hinterberger, and N. Birbaumer, Conscious perception of
brain states: mental strategies for brain-computer communication. Neuropsychologia, 2003.
41(8): p. 1028-36.
[2] Karim, A.A., T. Hinterberger, J. Richter, J. Mellinger, N. Neumann, H. Flor, A. Kubler, and N.
Birbaumer, Neural internet: Web surfing with brain potentials for the completely paralyzed.
Neurorehabil Neural Repair, 2006. 20(4): p. 508-15.
[3] Sellers, E.W., T.M. Vaughan, and J.R. Wolpaw, A brain-computer interface for long-term
independent home use. Amyotroph Lateral Scler, 2010. 11(5): p. 449-55.
[4] Holz, E.M., L. Botrel, T. Kaufmann, and A. Kbler, Long-term independent brain-computer
interface home use improves quality of life of a patient in the locked-in state: A case study.
Arch Phys Med Rehabil, accepted for publication.
[5] Zickler, C., S. Halder, S.C. Kleih, C. Herbert, and A. Kbler, Brain Painting: usability testing
according to the user-centered design in end users with severe disabilities. Artif Intell Med,
2013. 59(2): p. 99-110.
[6] Kaufmann, T., Vlker, S., Gunesch, L., and Kbler, A. Spelling is Just a Click Away - A usercentered brain-computer interface including auto-calibration and predictive text entry. Front.
Neurosci. 2012;6:72.
Article ID 023-4
DOI:10.3217/978-3-85125-378-8-24
Abstract
The mismatch negativity (MMN) is commonly studied in oddball paradigms
comprising only one deviant. Recently, multifeature oddball paradigms comprising
multiple deviants have been developed and might represent a valuable tool in clinical
assessment where time is a limiting factor. This study addressed whether (1) duration
MMNs elicited by physically identical stimuli differ between a unifeature vs.
multifeature oddball paradigm, and (2) the magnitude of the standard vs. deviant
difference would affect the MMN amplitude. Event-related potentials were recorded in
26 healthy participants who listened to three oddball paradigms (two unifeature, one
multifeature). Results showed that (1) duration deviants elicited larger MMNs in the
multifeature oddball, and (2) MMN amplitudes increased with increasing deviantminus-standard distance in terms of the target stimulus feature. Taken together, duration
oddballs elicit reliable MMNs in multifeature paradigms and may therefore represent a
valuable tool in clinical settings.
1 Introduction
The mismatch negativity (MMN) is frequently used to investigate discrimination skills and residual
cognitive functioning in patients with disorders of consciousness (DOC). These results may then serve
as a basis for the future application of a brain-computer interface (BCI). Traditionally, the MMN is
studied in auditory oddball designs using a secondary task to prevent elicitation of an N2b
contaminating the MMN effect. It peaks between 150 and 250 ms (Duncan et al., 2009). Typical
paradigms comprise one standard and one rare deviant, thus focusing on only one feature of auditory
stimuli. However, attention span of DOC patients is short, so paradigms have to be brief, with a
maximum on information, but at the same time allow for a reliable detection of MMNs in single
subjects. Using oddball paradigms comprising several deviant tones in various dimensions may be a
promising tool to record event-related potentials (ERPs) in response to more than one deviating
stimulus within a short period of time. These multifeature oddball paradigms were first developed by
Ntnen, Pakarinen, Rinne, and Takegata (2004) whose tone sequence comprised deviants in five
different dimensions: frequency, intensity, location, duration, and a silent gap in the middle of a tone.
They found MMN responses in the multifeature paradigm to be as large as in the traditional unifeature
oddball, which was also supported by (Pakarinen et al., 2009).
*
Article ID 024-1
DOI:10.3217/978-3-85125-378-8-24
With this study we addressed whether (1) the MMNs elicited by physically identical stimuli differ
as a function of paradigm uni- vs. multifeature oddball), and (2) the amplitude of the MMN increases
with the increasing deviant-minus-standard distance (DSD) in terms of the target stimulus feature in
three different dimensions (frequency, intensity, and duration).
2 Methods
Event-related potentials were recorded in 26 healthy participants (mean age = 34.1 years, SD =
9.9, 14 females). EEG was recorded with a 32 active electrodes system (gTec, Graz, Austria) with a
sampling frequency of 512 Hz. The data were filtered online between 0.1 to 100 Hz (butterworth).
The paradigms were presented while participants watched a silent movie and were required to press a
key when a certain scene appeared. A scale for subjectively experienced effort ranging from 0 to 220
(Eilers, Nachreiner, & Hnecke, 1986) was administered after each paradigm.
Auditory stimulation was realized in three different paradigms. All paradigms shared the same
harmonic tone as standard with 500+1000+1500 Hz, 75 ms duration (rise/fall 5 ms) and an intensity
of 70dB. The paradigms used were the following: (1) a duration oddball called MMNAbsolut
comprising 900 standards and 100 odds differing in duration (50 ms), (2) a second duration oddball
paradigm called MMNProportion comprising 900 standards and 100 odds differing in duration as well
(37 ms) and (3) a multifeature oddball called MMNMultifeature with 600 standards and 600 deviants
varying in three dimensions: duration, frequency, and intensity. The deviants in the multifeature
paradigm were 37 and 50 ms in the duration domain, 450+900+1350 and 400+800+1200 Hz in the
frequency domain, and 65 and 60dB in the intensity domain. Stimulus onset asynchrony was 500 ms
in all paradigms.
Offline, the EEG data were band-pass filtered between 0.1 and 25 Hz, automatic artefact
correction was performed, and segments from 0 to 500 ms were averaged. Finally, grand averages
were obtained. For all calculations, mean amplitudes under the curve for the electrode Fz were entered
into analyses. All analyses were carried out in MATLAB (The Math Works, Inc., M.A.), statistics
were calculated in SPSS 17.0 (SPSS Inc., IL). Relevant time windows for component analyses in all
paradigms were defined by visual inspection and set to 120 to 245 ms post-stimulus in all paradigms.
3 Results
Results were obtained in repeated measures ANOVAs including the factors paradigm (unifeature
vs. multifeature), DSD (small vs. large), and deviant type (duration, frequency, intensity).
Article ID 024-2
DOI:10.3217/978-3-85125-378-8-24
vs. multifeature paradigm. An MMN was elicited in all conditions. The results reveal a significant
main effect of paradigm (F(1, 25) = 6.29, p = .019) with larger amplitudes in the multifeature
paradigm (MMultifeature = -2.73, MUnifeature = -2.26) and a significant main effect of DSD (F(1, 25) =
22.03, p < .001) with larger amplitudes when DSD was large (MDSD small = -1.76, MDSD large = -3.23).
Thus, for duration deviants, larger amplitudes were found in the multifeature paradigm and when the
DSD was large.
Secondly, we calculated an ANOVA including the factors deviant type and DSD to compare
MMN amplitudes elicited by duration, frequency, and intensity deviants in the multifeature paradigm.
An MMN was elicited in all conditions. MMN amplitudes did not differ according to the deviant type,
but varied as a function of DSD (F(1, 25) = 71.95, p < .001) with larger amplitudes for large DSDs
(MDSD small = -1.97, MDSD large = -3.34). Furthermore, DSD effects varied as a function of deviant type
(F(2, 50) = 3.45, p = .040): The difference in MMN amplitudes between small and large DSD was
smallest for frequency deviants (MDiff frequency = .76), larger for duration deviants (MDiff duration = 1.48),
and largest for intensity deviants (MDiff intensity = 1.85). All SIDAK corrected post-hoc comparisons
between DSDs were significant (p < .013).
Thus, also for intensity and frequency deviants, MMN amplitudes increase with larger DSDs.
Figure 1: recorded MMN potentials at Fz in the unifeature oddball paradigms MMNAbsolut (a) and
MMNProportion (b)
Figure 2: recorded MMN potentials at Fz in the multifeature oddball paradigm for duration deviants (a),
frequency deviants (b), and intensity deviants (c)
Article ID 024-3
DOI:10.3217/978-3-85125-378-8-24
References
Duncan, C. C., Barry, R. J., Connolly, J. F., Fischer, C., Michie, P. T., Ntnen, R., (2009).
Event-related potentials in clinical research: Guidelines for eliciting, recording, and quantifying
mismatch negativity, P300, and N400. Clinical Neurophysiology, 120, 18831908.
doi:10.1016/j.clinph.2009.07.045
Eilers, K., Nachreiner, F., & Hnecke, K. (1986). Entwicklung und berprfung einer Skala zur
Erfassung subjektiv erlebter Anstrengung. Zeitschrift fr Arbeitswissenschaft, 40(4), 215224.
Horvth, J., Czigler, I., Jacobsen, T., Maess, B., Schrger, E., & Winkler, I. (2007). MMN or no
MMN: No magnitude of deviance effect on the MMN amplitude. Psychophysiology, 45.
doi:10.1111/j.1469-8986.2007.00599.x
Jaramillo, M., Paavilainen, P., & Ntnen, R. (2000). Mismatch negativity and behavioural
discrimination in humans as a function of the magnitude of change in sound duration.
Neuroscience Letters, 290, 101104. doi:10.1016/S0304-3940(00)01344-6
May, P. J. C., & Tiitinen, H. (2010). Mismatch negativity (MMN), the deviance-elicited auditory
deflection, explained. Psychophysiology, 47(1), 66122. doi:10.1111/j.1469-8986.2009.00856.x
Ntnen, R., Pakarinen, S., Rinne, T., & Takegata, R. (2004). The mismatch negativity (MMN):
towards the optimal paradigm. Clinical Neurophysiology, 115, 140144.
doi:10.1016/j.clinph.2003.04.001
Pakarinen, S., Lovio, R., Huotilainen, M., Alku, P., Ntnen, R., & Kujala, T. (2009). Fast multifeature paradigm for recording several mismatch negativities (MMNs) to phonetic and acoustic
changes in speech sounds. Biological Psychology, 82, 219226.
doi:10.1016/j.biopsycho.2009.07.008
Article ID 024-4
DOI:10.3217/978-3-85125-378-8-25
Real-time artifact correction enables EEGbased feedback inside the fMRI scanner
C. Zich1, S. Debener1, M. De Vos2, C. Kranczioch1, I. Gutberlet3
Neuropsychology Lab, University of Oldenburg, Germany.
Methods in Neurocognitive Psychology, University of Oldenburg, Germany.
3
BlindSight GmbH, Schlitz, Germany.
catharina.zich@uni-oldenburg.de
1
Abstract
The aims of this contribution were (1) to investigate the feasibility of motor imagery
(MI) electroencephalogram (EEG)-based online feedback during continuous functional
magnetic resonance imaging, and (2) to examine the validity of electric MI signatures by
comparing EEG MI data obtained inside and outside the MRI scanner. Participants
(N=22) imagined hand movements, which led to an event-related desynchronization
(ERD) of the contralateral sensorimotor rhythm. The contralateral ERD of the online
corrected as well as of the offline corrected EEG data correlated strongly with the EEG
data recorded outside the MRI scanner. We provide a proof-of-principle demonstration
that meaningful EEG-based feedback inside the MRI scanner is possible for MI.
1 Introduction
As both movement execution and motor imagery (MI) decrease the amplitude of sensorimotor
rhythms, termed event-related desynchronization (ERD), MI electroencephalogram (EEG)-based
neurofeedback seems a promising intervention supporting the recovery of motor impairments. For the
improvement of MI practice it is highly relevant to investigate the neurophysiological correlates of MI
EEG-based feedback using high spatial resolution imaging. Combining MI EEG-based feedback with
functional magnetic resonance imaging (fMRI) allows addressing several important aspects, such as a)
a investigation of the role of feedback; b) a better understanding of individual differences in MI; and c)
an identification of the brain patterns underlying the neural correlates of EEG MI feedback.
However, to successfully implement MI EEG-based neurofeedback inside the fMRI scanner good
EEG signal quality must be achieved, which requires the online correction of the gradient artifact (GA)
and the ballistocardiogram (BCG) artifact. As a first step we investigated the feasibility of real-time
EEG-based MI feedback during fMRI, as this would open the door to address the above stated aspects.
The validity of the MI EEG data collected inside the MRI scanner was verified by comparing the ERD
after online and offline artifact correction with the ERD obtained outside the MRI scanner.
2 Methods
2.1 Subjects and Task
Twenty-four healthy individuals (11 females, 20-30 years of age, mean age: 23.9 years) with no MI
experience participated in this study, which was approved by the local ethics committee. The data from
two subjects had to be excluded because of noncompliance with task instructions. Participants were
Article ID 025-1
DOI:10.3217/978-3-85125-378-8-25
measured twice with a six-week interval in between. During the first session concurrent EEG-fMRI was
recorded and during the second session the same experiment was recorded outside the scanner in the
EEG lab. Each session consisted of four blocks and each block comprised 40 trials (20 left and 20 right
hand). Subjects were instructed to execute a finger tapping movement in the first block and to imagine
the same movement in the last three blocks. Two imagery blocks were performed without feedback and
in the last one online feedback was provided. Stimulus presentation was controlled with OpenViBE and
followed the standard Graz MI protocol [1], except that we employed longer inter-trial intervals (4.5 to
9 sec) to accommodate the fMRI protocol timing.
2.3 EEG and fMRI Data Acquisition
EEG data were collected from 64 equidistant scalp sites, with Cz as reference, Iz as ground and ECG
at the lower back, using an MR-compatible BrainAmp system and the BrainVision Recorder software
1.20.0506 (Brain Products GmbH, Gilching, Germany). Raw EEG was sampled at 5kHz, both GA and
BCG artifacts were corrected online and corrected data were passed on to OpenViBE via a direct
network link. FMRI data acquisition was performed on a 3T Siemens MRI scanner (Siemens AG,
Erlangen, Germany). During functional measurements 420 T2*-weighted gradient echo planar imaging
volumes (3.1 x 3.1 x 3.0 mm voxels, 0.75 mm gap, TR = 1.5s, FoV = 200 * 200, Flip Angle = 90, 27
transversal slices) were obtained within each block. The EEG data obtained outside the scanner were
acquired using the same paradigm and EEG setup, except for a lower sampling rate of 500 Hz.
2.4 Online processing of EEG Data
To enable real-time EEG-based neurofeedback inside the MRI scanner, EEG data had to be
immediately corrected for MRI artifacts in all MI blocks. Online artifact corrections of GA and BCG
were performed stepwise using the BrainVision RecView software 1.42 (Brain Products GmbH,
Gilching, Germany). The online GA correction method employed is based on the average artifact
attenuation method proposed by [2] with the notable difference that online corrections can only
incorporate the already recorded data into the GA correction template. An artifact subtraction template
of 1500ms was built, based on the MR system TR events and the corrected data were down sampled to
500 Hz using a block down sampling algorithm. Subsequently the data were filtered using a Butterworth
low-pass with an edge frequency of 35 Hz (48 dB slope).
Online BCG correction first performed a search for the most suitable BCG template within the first
30 seconds of GA-corrected EEG data. This prototypical BCG template was then used in a moving
template matching approach to identify subsequent valid BCG episodes. Minimum correlation threshold
for BCG detections was 0.7, with an allowed average amplitude ratio range from 0.6x to 1.4x relative
to the prototypical BCG data. The total BCG correction window length varied between 800 (-100 to
700ms post R-wave) and 900 ms (-100 to 800 ms post R-wave) depending on the average inter-beatinterval of the subjects as determined before the main recordings commenced. The BCG artifact
subtraction template was based on a moving window comprised of 21 subsequent BCG episodes.
The two MI blocks without feedback were concatenated and filtered from 8-30 Hz. MI intervals
were extracted using EEGLAB. Artifactual segments were rejected and a common spatial pattern
algorithm was applied to train a linear discriminant analysis classifier [3]. During the feedback block
data were preprocessed online in OpenViBE, keeping the parameter settings identical to those used in
EEGLAB. The output of the classifier was translated into the length of a light blue horizontal bar
(updated at a frequency of 16 Hz).
2.5 Offline processing of EEG Data
For offline analysis standard preprocessing steps were applied on the uncorrected raw EEG data.
Offline corrections of GA and BCG artifacts were performed in separate steps using the BrainVision
Analyzer Professional software package 2.0.4 (Brain Products, Gilching, Germany). The GA correction
time window was placed from -35 to 1465 ms relative to the MR system TR marker to accurately capture
Article ID 025-2
DOI:10.3217/978-3-85125-378-8-25
all MR gradient activity. All other GA correction parameters were the same as for online GA correction
(see above). Analysis of the BCG artifact structure in the (GA corrected) data showed that the ECG
channel data was afflicted with strong additional artifacts related to respiratory cycle chest movements.
Since even strict filtering of the ECG data (3 to 12 Hz - 48 dB slope Butterworth filter) typically did
not remove these artifacts, a spatio-temporal matching approach was employed that detected BCG
episodes based on spatial and temporal BCG signatures in all EEG channels. This method resulted in
near perfect detection of valid BCG episodes. All other parameters were the same as for online BCG
correction, with the notable exception that offline BCG correction automatically determined the optimal
position and length of the BCG artifact subtraction window relative to the R-markers detected.
After offline correction, data were re-referenced to the common average. All three MI blocks were
concatenated, filtered from 8 and 30 Hz, segmented into consecutive time intervals of one second and
gross artifactual segments rejected. The remaining data were submitted to extended infomax ICA to
identify and subsequently attenuate eye-blink, eye-movement and heart-beat artifacts. Artifactcorrected EEG data were segmented to extract task-related event-related desynchronization (ERD). The
ERD time course was computed following the procedure proposed by [4].
Online and offline analysis of the EEG data recorded outside the MRI scanner was exactly the same
except for the necessity of MRI artifact correction in the former.
3 Results
As reported previously MI caused an ERD prominent above the contralateral sensorimotor areas
[e.g. 1, 5, 6]. This effect was evident for all three conditions (Figure 1), online, offline and outside the
MRI scanner. Descriptively, the relative power decrease of the contralateral ERD at electrode sites
corresponding to C3 and C4 was on average -18.5% 15.1% for the online corrected EEG signal, -31.1
14.9% for the offline corrected signal and -28.7% 13.5% for the data obtained outside the scanner.
Analysis of the ERD time course using a one-way ANOVA with factor condition (online corrected,
offline corrected, EEG recorded outside the MR) revealed a significant main effect (F2,42=12.47, p<.001,
!2=.37). Subsequent paired sample t-tests revealed no significant difference in ERD amplitude between
offline corrected data and those recorded outside the MRI scanner (t21=-.83, p=.42) but a significantly
lower ERD amplitude in the online compared to offline corrected (t21=5.83, p<.001) data. The standard
deviation within the MI interval across trials was significantly higher for online corrected compared to
offline corrected data, t21=-3.5, p=.003.
Examination of the association between the online as well as the offline corrected EEG data recorded
inside and outside the MRI scanner revealed strong correspondences for the contralateral ERD (r=.56,
p=.007; r=.53, p=.01). Analysis of the overall spatial ERD pattern was performed for the 30 electrodes
Figure 1: Grand average ERD from N=22 for online and offline corrected EEG data and data obtained outside the
scanner at C3 and C4. Blue indicates contra- and red ipsilateral electrode sites, with respect to stimulus direction.
Article ID 025-3
DOI:10.3217/978-3-85125-378-8-25
on the top of the scalp. Between the offline corrected data and the data acquired outside the scanner
high associations for right and left hand MI (r=.87, p<.001; r=.88, p<.001) were found, weather no
significant correspondences could be obtained for online corrected data and data obtained outside the
MRI scanner.
4 Discussion
Motivated by our interest in the spatial underpinnings of MI EEG-based neurofeedback, the
possibility of EEG-based feedback during fMRI acquisition for MI was explored and the validity of MI
EEG data after online and offline artifact correction examined. Although the procedure is technically
challenging, it offers the unique possibility to investigate in detail the brain patterns involved in EEGBCI operation. This may help to further develop effective MI EEG-based feedback.
For the first time we provide a proof-of-principle demonstration that EEG-based feedback during
continuous MRI scanning is possible for MI. This is illustrated by high associations of the contralateral
ERD, for EEG data obtained inside and outside the MRI scanner. The contralateral ERD was
significantly lower for online versus offline corrected data, however as EEG data and artifact correction
protocols were identical for online and offline corrections, the differences can only be attributed to the
vastly reduced efficacy of the ECG detection method used in online versus offline corrections. A
possible adaptation of the spatio-temporal matching algorithm used for the offline ECG detection thus
holds excellent promise for a further improvement of the efficacy of online EEG-fMRI based MI
protocols. Furthermore the development of more efficient online artifact correction procedures can
accelerate the correction process and thereby reduce the delay of feedback inside the MRI scanner.
However, even with the currently implemented methods, we were able to show that it is possible to
remove MRI related artifacts in real-time, which is necessary to provide online feedback.
The present findings open up new possibilities to investigate the neurophysiological underpinnings
of MI, which seems necessary in light of the promising role of MI for stroke recovery.
Acknowledgments
This work is supported by a travel grant of the Universittsgesellschaft Oldenburg to CZ.
References
[1] G. Pfurtscheller and C. Neuper. Motor imagery activates primary sensorimotor area in humans.
Neuroscience Letters, 239(2-3):65-68, 1997.
[2] P.J. Allen, O. Josephs, and R. Turner. A Method for Removing Imaging Artifact from Continuous
EEG Recorded during Functional MRI. NeuroImage, 12:230-239.2000.
[3] B.Blankertz, R. Tomioka, S. Lemm, M. Kawanabe, and K.R. Mller. Optimizeing Spatial Filters
for Robust EEG Single-Trial Analysis. IEEE Signal Processing Mag, 25(1):41-56, 2008.
[4] G. Pfurtscheller and F.H. Lopes da Silva. Event-related EEG/MEG synchronization and
desynchronization: basic principles. Electroenceph clin Neurophysiol, 110(11):1842-1857, 1999.
[5] B. Blankertz, F. Losch, M. Krauledat, G. Dornhege, G. Curio, and K.R. Mller. The Berlin Brain
Computer Interface: accurate performance from first-session in BCI-naive subjects. IEEE Trans
Biomed Eng, 55(10):2452-2462, 2008.
[6] C. Zich, M. De Vos, C. Kranczioch, and S. Debener. Wireless EEG with individualized channel
layout enables efficient motor imagery training, In press.
Article ID 025-4
DOI:10.3217/978-3-85125-378-8-26
Introduction
A brain-computer interface (BCI) allows the control of devices trough brain-signals. In combination with a neuroprosthesis, e.g., functional electrical stimulation (FES), a BCI can be
used to restore motor functions in paralyzed persons [5]. One type of non-invasive BCIs so
called sensorimotor rhythm (SMR) BCIs are usually based on the deliberate modulation of
brain oscillations [7] by movement imagery (MI). However, the control of SMR-based BCIs is
not natural and intuitive, because MIs are assigned artificially to control functions (e.g., a foot
MI controls the elbow function). Furthermore, SMR-based BCIs do not decode the imagined
movement trajectories, but the general activity at sensorimotor areas during MI. As opposed
to non-invasive BCIs, invasive BCIs were already used to decode trajectories of imagined movements and to control robotic arms [2, 3]. On the downside, invasive BCIs have the drawback
that they require a major surgical intervention with the risk of infection. Gratifyingly, Bradberry et al. [1] discovered that low-frequency electroencephalography (EEG) signals can be
used to decode executed movement trajectories, and also our group decoded 3D hand positions
from EEG signals [6]. In this work we decoded frontal and lateral hand movements from brain
sources reconstructed from the EEG. Notably, we did not trained a decoder using all sources
simultaneously and interpreted the decoder weights. This can lead to wrong interpretations as
these weights must be seen as a filter and not as a pattern. Instead, we decoded the movements
from each brain source separately and calculated the correlation coecients with the measured
movements, and obtained maps showing the involved brain regions when decoding movements.
2
2.1
Methods
Subjects
We recruited 8 right-handed and 1 left-handed subjects who got compensated for their participation (5 males, 4 females). Most of them had already participated in BCI experiments.
Subjects sat comfortably in a chair with their arms supported by arm rests.
Article ID 026-1
2.2
DOI:10.3217/978-3-85125-378-8-26
Paradigm
We recorded 3 runs. In the first run (frontal run) subjects moved their right arm in front of
them while the gaze was fixated on a cross on a screen. The second run (lateral run) was similar
to the first run, except that subjects moved their right arm laterally. In the third run (ball
run), subjects moved their right arm in front of them again, but now observed and followed
with the eyes a moving ball on the computer screen. Subjects were instructed to execute
the arm movements independently from the ball movements. In all runs, we asked subjects
to execute round, natural movements (not jaggy) with the extended right arm, and to keep
the hand closed with the thumb being on the upper side. Each run comprised of 8 65 s long
trials, with subject specific breaks between the trials to avoid muscle fatigue (usually around
1 minute). A trial started with the presentation of a cross (run 1 and 2) or a ball (run 3),
respectively, on the computer screen. Two seconds later a beep chimed indicating to the user
to start with the movement. In run 3, also a ball on the screen started to move. Additionally
to the arm movement trials, we recorded two trials where we instructed the subjects to follow
a moving ball on the screen with the eyes, but avoid any arm movement. These two trials were
used to remove the influences of eye movements from the EEG with a linear regression method
[8], and to calculate the noise covariance matrix used for source imaging (after removal of eye
movements).
2.3
Recording
We recorded the EEG with 68 passive Ag/AgCl electrodes covering frontal, central and parietal
areas, and the electrooculogram (EOG) with 3 electrodes placed above the nasion and the
two outer canthi of the eyes. Reference was placed on the left mastoid, ground on the right
mastoid. We assured that all impedances were below 5 kOhm. All biosignals were recorded with
g.USBamp amplifiers (g.tec medical engineering GmbH, Schiedlberg, Austria). We applied an
8-th order Butterworth bandpass filter with cut o frequencies at 0.01 Hz and 100 Hz, a notch
filter at 50 Hz, and then sampled the signals with 512 Hz. The position of the right hand was
tracked with a Kinect sensor device (Microsoft, Redmond, US). Here, the x-axis was orientated
leftward, y upward, and z backward with respect to the subject. We also recorded the electrode
positions with a CMS 20 EP system (Zebris Medical GmbH, Isny, Germany).
2.4
Preprocessing
We computed the independent component analysis for each run in the frequency range 0.3 Hz
70 Hz, using the extended infomax algorithm [4], and removed independent components suspected to be muscle or technical artefacts. Subsequently, we applied a zero-phase anti-aliasing
filter and downsampled data to 16 Hz for computational convenience. Then we applied a zerophase 4-th order Butterworth band-pass filter with cuto frequencies interesting for decoding
at 0.2 Hz and 2 Hz. Afterwards, we removed influences of eye movements on the EEG with
a linear regression method [8], and removed samples exceeding a threshold of 5.9 times the
median absolute deviation (MAD) of a channel to get rid of remaining artefacts. MAD is a
robust deviation measure, and the threshold corresponds to 4 times the standard deviation
when the data are normally distributed. Furthermore, we filtered the measured positions with
the same band-pass filter as used for the EEG (i.e. 0.2 Hz 2 Hz), and centered and scaled them
to a standard deviation of one. Finally, we omitted the first 5 seconds of each trial to exclude
possible existing movement onset eects.
Article ID 026-2
2.5
DOI:10.3217/978-3-85125-378-8-26
Source Imaging
To transform the data from the sensor space into the source space we used the software Brainstorm [9]. We calculated the head model using the Colin27 model included in Brainstorm, and
coregistered the electrode positions. Using the head model and a noise covariance matrix, we
calculated 15028 brain sources with the weighted minimum norm estimation (wMNE) method.
The noise covariance matrix was calculated from the two trials without arm movements after
we removed eye movements from them.
2.6
Decoding
We analyzed all runs separately using a 10-fold cross-validation. For this purpose we divided
each run in segments of 10 s length and assigned these segments to train and test sets. The
decoder [6] itself comprised of 2 multiple linear regressions between a brain source (voxel) and
the 2D position in the movement plane (frontal or lateral, respectively). We used the current
time step and time lags at ca. 60, 130, 190 ms of the EEG as the input for the decoder. X and
y positions were decoded in the frontal and ball runs, and y and z positions in the lateral run.
We decoded the positions from the test sets, and calculated the Pearson correlation coecients
between the decoded and the measured 2D positions. Subsequently, we calculated the average
of the correlation coecients across the 10 test sets. Finally, we averaged the correlation
coecients over the movement plane dimensions, i.e. x/y or y/z, respectively. This procedure
was performed for every single voxel, and we got one correlation coecient for each voxel, run
and subject.
To asses the chance level, we randomly permuted the coordinate segments and performed a
10-fold cross-validation as described above and repeated this procedure 50 times. Thus, we got
50 correlation coecients for each voxel, run and subject, and then, fitted a normal distribution
to these 50 chance correlation coecients (this is reasonable as the correlation values are usually
around 0 and not at the limits 1/-1). Subsequently, we calculated the p-values of the correlation
coecients based on the chance level distributions.
2.7
Results
The maximum correlation coecient reached by a subject averaged over all subjects were (mean
value/standard deviation) 0.47 0.09 (frontal), 0.52 0.13 (lateral), and 0.47 0.10 (ball).
The corresponding chance level correlations were 0.12 0.03 (frontal), 0.12 0.02 (lateral),
and 0.11 0.02 (ball). Figure 1 shows the subject averaged correlations of each voxel in each
run and their average over the runs. Before averaging, all non-significant correlations were set
to 0. Observable are higher correlations on the central and left motor cortex, parietal areas,
and right lateral correlations.
Discussion
We have successfully decoded executed movements from frontal and lateral arm movements
on a per brain source (voxel) basis. The subject averaged correlations indicate a contribution
of the primary motor cortex. This was expected, as subjects executed movements. However,
also contributions from parietal and lateral areas are observable. These contributions can
be external sources projected onto the margins of the head model. Such an external source
could be muscle activity, although muscle activity is thought to be most prominent in higher
Article ID 026-3
frontal run
lateral run
DOI:10.3217/978-3-85125-378-8-26
ball run
average
Figure 1: Subject averaged correlations on a per voxel basis for all 3 runs and the average of
them. Red corresponds to the maximum value of 0.33 in the lateral run, white to 50 % of the
maximum value, and correlations below 50 % of the maximum are not shown.
frequency ranges. To summarize, the findings indicate that indeed brain sources carry decodable
movement information, but the measurements are potentially contaminated by external sources.
References
[1] Trent J Bradberry, Rodolphe J Gentili, and Jose L Contreras-Vidal. Reconstructing threedimensional hand movements from noninvasive electroencephalographic signals. Journal of Neuroscience, 30:34323437, 2010.
[2] Jennifer L Collinger, Brian Wodlinger, John E Downey, Wei Wang, Elizabeth C Tyler-Kabara,
Douglas J Weber, Angus J C McMorland, Meel Velliste, Michael L Boninger, and Andrew B
Schwartz. High-performance neuroprosthetic control by an individual with tetraplegia. Lancet,
381(9866):557564, 2013.
[3] Leigh R Hochberg, Daniel Bacher, Beata Jarosiewicz, Nicolas Y Masse, John D Simeral, Joern Vogel,
Sami Haddadin, Jie Liu, Sydney S Cash, Patrick van der Smagt, and John P Donoghue. Reach
and grasp by people with tetraplegia using a neurally controlled robotic arm. Nature, 485:372375,
2012.
[4] T W Lee, M Girolami, and T J Sejnowski. Independent component analysis using an extended infomax algorithm for mixed sub-gaussian and super-gaussian sources. Neural Computation, 11(2):417
441, 1999.
[5] G R M
uller-Putz, R Scherer, G Pfurtscheller, and R Rupp. EEG-based neuroprosthesis control: a
step towards clinical practice. Neuroscience Letters, 382:169174, 2005.
[6] Patrick Ofner and Gernot R M
uller-Putz. Decoding of velocities and positions of 3d arm movement
from eeg. In Engineering in Medicine and Biology Society (EMBC), 2012 Annual International
Conference of the IEEE, pages 64066409, 2012.
[7] Gert Pfurtscheller and Fernando H Lopes Da Silva. Event-related eeg/meg synchronization and
desynchronization: basic principles. Clinical Neurophysiology, 110(11):18421857, 1999.
[8] A Schl
ogl, C Keinrath, D Zimmermann, R Scherer, R Leeb, and G Pfurtscheller. A fully automated
correction method of eog artifacts in eeg recordings. Clinical Neurophysiology, 118(1):98104, 2007.
[9] Francois Tadel, Sylvain Baillet, John C Mosher, Dimitrios Pantazis, and Richard M Leahy. Brainstorm: A user-friendly application for meg/eeg analysis. Computational Intelligence and Neuroscience, 2011.
Article ID 026-4
DOI:10.3217/978-3-85125-378-8-27
Abstract
In this study we present a control paradigm that enables subjects to drive a real
wheelchair using four navigation commands. The control is achieved through the
discrimination of two mental tasks (relaxed state versus one active Motor Imagery task),
in order to reduce the risk of misclassification. After a short training, the paradigm
becomes only audio-cued, thus avoiding the need of a graphical interface that could
distract subjects attention.
1 Introduction
A brain-computer interface (BCI) is based on analysis of the brain activity, recorded during certain
mental activities, in order to control an external device. It helps to establish a communication and
control channel for people with serious motor function problems but without cognitive function
disorder (Wolpaw, Birbaumer, McFarland, Pfurtscheller & Vaughan, 2002). Amyotrophic lateral
sclerosis (ALS), brain or spinal cord injury, cerebral palsy and numerous other diseases impair the
neural pathways that control muscles or impair the muscles themselves. Among others ways to control
such a system, Sensorimotor rhythm-based BCIs (SMR-BCI) are based on the changes of and
rhythms. These rhythms correspond to specific features of the EEG signals characterized by their
frequencies that can be modified by voluntary thoughts: when a person performs a movement (or
imagines it), it causes a synchronization/desynchronization in this activity (event-related
synchronization/desynchronization, ERS/ERD) which involves a rhythm amplitude change (Neuper &
Pfurtscheller, 1999).
Recently, a survey on BCI-controlled devices has been published (Bi, Fan, & Liu, 2013) which
shows a wide variety of systems controlling wheelchairs or mobile robots. Most of the SMR-BCI
there reported match the number of commands to the number of mental tasks. However, incrementing
the number of mental tasks can reduce the classification performance (Kronegg, Chanel,
Voloshynovskiy & Pun, 2007; Obermaier, Neuper, Guger, & Pfurtscheller, 2001). In order to provide
different commands without making the BCI performance worse, our group proposed a paradigm
based on the discrimination of only two classes (one active mental task versus any other mental
activity), which enabled the selection of four navigation commands (Ron-Angevin, Daz-Estrella &
Velasco-lvarez, 2009). The evolution of this paradigm included its adaptation to a self-paced BCI
enabling a Non-Control (NC) state (Velasco-lvarez & Ron-Angevin, 2009), the support for
continuous movements (Velasco-lvarez, Ron-Angevin & Blanca-Mena, 2010) or its conversion into
an audio-cued paradigm (Velasco-lvarez, Ron-Angevin, da Silva-Sauer, Sancha-Ros & Blanca-
Article ID 027-1
DOI:10.3217/978-3-85125-378-8-27
Mena, 2011). This control paradigm has already been tested for virtual and real environments
(Velasco-lvarez, Ron-Angevin, da Silva-Sauer & Sancha-Ros, 2013); once it has proved to be valid
for controlling a virtual wheelchair or a mobile robot, our aim is to adapt it to the use of a real
wheelchair. The use of an audio-cued paradigm is important, as a graphical interface could limit the
subjects field of view and, at the same time, distract them from the task of controlling the wheelchair
by requiring them to look at the computer screen. Furthermore, for disabled people without gaze
control a graphical interface may not be useful.
2 Methods
2.1 Data Acquisition, Initial Training and Signal Processing
In our system, the EEG is recorded from ten active electrodes, combined to result two laplacian
channels around C3 and C4 positions (right and left hand sensorimotor area, respectively) according
to the 10/20 international system. The ground electrode is placed at the FPz position. Signals are
amplified and digitized at 200 Hz by an actiCHamp amplifier (Brain Products GmbH, Munich,
Germany).
Subjects have to follow an initial training that consists of two sessions: a first one without
feedback and a second one providing continuous feedback. These two training sessions are used for
calibration purposes. The training is the same as the one used in (Ron-Angevin, 2009) and is based on
the paradigm proposed by the Graz group (Leeb, Settgast, Fellner & Pfurtscheller, 2007). It consists
of a virtual car that dodges a water puddle in the road, by moving left or right according to the mental
task carried out (relaxed state or right hand MI). An offline processing of the first session determines
the parameters for the feedback session. The same parameters are used to calibrate the system for the
navigation sessions. This processing is based on the procedure detailed in (Guger, Edlinger, Harkam,
Niedermayer & Pfurtscheller, 2003), and consists of estimating the average band power of each
channel in predefined, subject-specific reactive frequency (manually selected) bands at intervals of
500 ms. In the feedback session, the movement of the car is computed on-line every 25 ms as a result
of a Linear Discriminant Analysis (LDA) classification. The trial paradigm and all the algorithms
used in the signal processing are implemented in MATLAB 2013b (The MathWorks Inc.,
Massachusetts, USA).
Article ID 027-2
DOI:10.3217/978-3-85125-378-8-27
the rotating bar completes two turns without selecting any command), they hear the Spanish word for
"wait"; the reverse switch is indicated with "forward", since it is the first available command in the IC
state. Finally, every time the bar points to a different command, they can hear the correspondent word
("forward", "right", "back" or "left").
IIC BUS
Control board
Sonar
Sonar 3
USB or
Bluetooth
Sonar 2
Sonar 1
Computer
Control app.
Accelerometer
Magnetometer
TCP
USB
EEG
MATLAB app.
Article ID 027-3
DOI:10.3217/978-3-85125-378-8-27
3 Work in Progress
Our group is nowadays preparing experiments to test the viability of the proposed system, so the
given description of the system corresponds to a work in progress. As long as the navigation paradigm
has already been validated, we expect that, with the proper training and adaptation sessions, the
change from a mobile robot to a real wheelchair will not severely affect the performance.
4 Acknowledgements
This work was partially supported by the Spanish Ministry of Economy and Competitiveness,
project TEC 2011-26395, the European fund ERDF and the University of Mlaga.
References
Bi, L., Fan, X. -., & Liu, Y. (2013). EEG-based brain-controlled mobile robots: A survey. IEEE
Transactions on Human-Machine Systems, 43(2), 161-176. doi:10.1109/TSMCC.2012.2219046
Guger, C., Edlinger, G., Harkam, W., Niedermayer, I., & Pfurtscheller, G. (2003). How many
people are able to operate an EEG-based brain-computer interface (BCI)? IEEE Transactions on
Neural Systems and Rehabilitation Engineering, 11(2), 145-147.
Kronegg, J., Chanel, G., Voloshynovskiy, S., & Pun, T. (2007). EEG-based synchronized braincomputer interfaces: A model for optimizing the number of mental tasks. IEEE Transactions on
Neural Systems and Rehabilitation Engineering, 15(1), 50-58.
Leeb, R., Settgast, V., Fellner, D., & Pfurtscheller, G. (2007). Self-paced exploration of the
austrian national library through thought. International Journal of Bioelectromagnetism,9(4),237-244.
Neuper, C., & Pfurtscheller, G. (1999). Motor imagery and ERD. In G. Pfurtscheller, & F. H.
Lopes da Silva (Eds.), Event-related desynchronization.handbook of electroencephalography and
clinical neurophysiology, revised series (pp. 303-325). Amsterdam: Elseiver.
Obermaier, B., Neuper, C., Guger, C., & Pfurtscheller, G. (2001). Information transfer rate in a
five-classes brain-computer interface. IEEE Trans Neural Syst Rehabil Eng, 9(3), 283-288.
Ron-Angevin, R. (2009). Changes in EEG behavior through feedback presentation. Annual Review
of CyberTherapy and Telemedicine, 7(1), 184-188.
Ron-Angevin, R., Daz-Estrella, A., & Velasco-lvarez, F. (2009). A two-class brain computer
interface to freely navigate through virtual worlds. Biomedizinische Technik, 54(3), 126-133.
Velasco-lvarez, F., & Ron-Angevin, R. (2009). Asynchronous brain-computer interface to
navigate in virtual environments using one motor imagery, LNCS, 5517 (I), 698-7058.
Velasco-lvarez, F., Ron-Angevin, R., & Blanca-Mena, M. J. (2010). Free virtual navigation
using motor imagery through an asynchronous brain-computer interface. Presence: Teleoperators and
Virtual Environments, 19(1), 71-81.
Velasco-lvarez, F., Ron-Angevin, R., da Silva-Sauer, L., Sancha-Ros, S., & Blanca-Mena, M. J.
(2011). Audio-cued SMR brain-computer interface to drive a virtual wheelchair. LNCS, 6691 (I),
337-344.
Velasco-lvarez, F., Ron-Angevin, R., da Silva-Sauer, L., & Sancha-Ros, S. (2013). Audio-cued
motor imagery-based braincomputer interface: Navigation through virtual and real environments.
Neurocomputing, 121(0), 89-98. doi:http://dx.doi.org/10.1016/j.neucom.2012.11.038
Wolpaw, J. R., Birbaumer, N., McFarland, D. J., Pfurtscheller, G., & Vaughan, T. M. (2002).
Brain-computer interfaces for communication and control. Clinical Neurophysiology, 113(6), 767791.
Article ID 027-4
DOI:10.3217/978-3-85125-378-8-28
Introduction
The P300 speller, as first described by Farwell & Donchin in [1], is a BCI for spelling text. In the
original speller in [1] the six rows and six columns of the grid are highlighted in a random order.
The target element is the element on the intersection of the row and the column that elicited
a P300 as response when highlighted. A sequence containing every stimulus once is called an
iteration. This row-column paradigm (RC) thus has an iteration length of 12 stimuli.
Two common phenomena limit the speller accuracy. In adjacency distraction the patient
is distracted by the intensification of a row or column next to the actual target. Double flash
(DF) errors occur when the target element is intensified twice in rapid succession. The second
P300 wave generated has a lower amplitude [2] which can result in a wrong target selection.
To alleviate these eects, several stimulus sequence paradigms have been examined, for
example the checkerboard paradigm [4]. Unfortunately, the improvements to the accuracy were
diminished by an increased time required per iteration. New paradigms always seem to make
this compromise between accuracy and speed of spelling. The goal of this paper is to present a
paradigm that achieves a higher accuracy than RC with a speed of spelling as high as in RC.
In what follows, the new paradigm is explained in detail together with the simulation results.
The performance of this paradigm is compared with RC in a first online experiment.
For every character to spell, the new paradigm constructs a pre-determined sequence of stimulus
patterns through an optimization algorithm. This algorithm takes a full sequence of stimuli
(in this paper: a sequence of 15 iterations constructed by the RC paradigm) as initialisation
and tries to reduce the potential causes of spelling errors by swapping the highlighted elements
between stimuli. If the initial sequence, for example, contains the subsequence of stimuli S1 -S2 S3 in which the sets of characters {A, B, C, D, E, F}, {E, K, Q, W, 3, 9} and {C, I, O, U, 1, 7}
are respectively highlighted, there is clearly a DF of character E in the consecutive stimuli S1
and S2 . Swapping the characters E and U between the stimuli S2 and S3 changes the sequence
to {A, B, C, D, E, F}-{U, K, Q, W, 3, 9} -{C, I, O, E, 1, 7} and eliminates the DF.
Article ID 028-1
DOI:10.3217/978-3-85125-378-8-28
Double flashes are thus easily avoided by preventing elements to be highlighted twice in rapid
succession. Adjacency distraction itself can not be avoided. However, when the target element
is simultaneously highlighted with a neighbouring element, the generated P300 wave on this
target stimulus is also linked to this neighbour. A single distraction by a following intensification
of this neighbour is enough to wrongly select this neighbour as the target. Adjacency distraction
errors thus can be partly avoided by preventing an element to be highlighted simultaneously
with a neighbouring element in the grid. The subject then needs to be distracted twice to a
neighbour before wrong selection, which is less likely to occur. This results in the reported
decrease of adjacency distraction errors in the checkerboard paradigm [4].
Not every two elements from every two stimuli can be swapped. If two elements are highlighted in the same set of stimuli in the sequence and these stimuli appear to be target stimuli,
then it is impossible to determine which one of the two is the target element. The sequence is
said to be undecodable. A swap that does not harm this decodability is a legal swap. Only
these swaps will be considered further on.
The algorithm works in two phases. The first phase tries to remove the most severe causes
of mistakes, while the second phase optimizes the sequence further on. As each optimization
in one phase creates new optimization opportunities in the other, these phases are alternately
executed until no more optimizations are found. We allow a maximum of 5 alternate executions
to prevent the algorithm from getting stuck in a loop, constantly swapping the same elements.
For every iteration in the sequence separately, every possible pair of stimuli is examined
in random order. For every pair of stimuli, every legal swap of highlighted elements between
these stimuli is examined in random order. Two criteria are used to verify if the swap is indeed
optimizing the sequence. In the first phase these criteria are:
1. For both elements: does the swap reduce the number of DFs? If this number is not
altered, does the swap reduce the second order DFs (DFs with one stimulus in between)?
2. For both stimuli: does the swap reduce the number of direct horizontal/vertical neighbouring elements in the grid simultaneously intensified? If this number is not altered,
does the swap reduce the number of direct diagonal neighbours?
In the second phase these criteria are:
1. For both elements: does the distance in the sequence to the closest stimulus in which this
element is highlighted increase?
2. For both stimuli: is the distance in the grid of the swapped element to the closest highlighted element bigger, i.e. does the spread of highlighted elements in the grid increase?
If at least one of these questions has a strict positive answer and none of the others is strictly
negative, then the swap is executed. After considering all legal swaps, the procedure is repeated
all over again until no more optimizations are found and the algorithm moves to the next phase.
A total of 500 row-column sequences of 15 iterations each are created and compared to
their counterparts after applying the algorithm. The number of target double DFs seen by the
subject in a sequence of 15 iterations is shown in the histograms in Figure 1a. On average, 2.71
( = 1.49) double flashes are noticed in a standard RC sequence. After applying the algorithm,
this reduces by 98% to only 0.05 ( = 0.23) double flashes on average.
The probability of adjacency distraction is reflected in the number of times the subject sees
the target being flashed simultaneously with at least one of its direct (horizontal or vertical)
neighbours in the grid. In the RC paradigm the target is always flashed with a neighbour. After
applying the algorithm only 1.32% of the target intensifications occur together with a neighbour.
Article ID 028-2
DOI:10.3217/978-3-85125-378-8-28
Frequency of Occurence
Frequency of Occurence
0.3
0.8
0.6
0.4
0.25
0.2
0.15
0.1
0.2
0.05
2
3
4
5
6
# Double Flashes
4
5
6
7
8
9
Manhattan Distance from Target
10
1
0.9
0.3
0.7
Frequency of Occurence
Frequency of Occurence
0.8
0.6
0.5
0.4
0.3
0.25
0.2
0.15
0.1
0.2
0.05
0.1
0
3
4
5
# Double Flashes
(a)
4
5
6
7
8
9
Manhattan Distance from Target
10
(b)
Figure 1: Simulation results. Left: normalized histogram of the number of double flashes seen
in the sequence for the RC paradigm (top) and the new paradigm (bottom). Right: normalized
histogram of the Manhattan distance in the grid from the target to every other element for RC
(top) and the new paradigm (bottom).
Figure 1b gives a histogram of the Manhattan distance in the grid from the target to every
other element simultaneously highlighted. After application of the algorithm the highlighted
characters are clearly spread more over the grid.
After presentation of a stimulus sequence to the subject, the recorded brain signals are preprocessed. A Common Average Reference filter is applied, followed by a bandpass filter with cuto
frequencies 0.5 Hz and 15 Hz. Each EEG channel is normalized to zero mean and unit variance.
Dimensionality reduction retains 10 samples per stimulus, centered around the expected time
step of the P300 wave and uniformly distributed over the range between 150ms and 450ms after
stimulus onset. Finally, we add a bias term. The stimuli are classified as target/non-target by
detecting the presence/absence of a P300 wave in the response. The classifier used in this work
is based on the unified probabilistic model proposed by Kindermans et al. in [3].
9 subjects (6 male, 3 female) with an average age of 22.56 ( = 0.73) used the original RC
paradigm and the switching paradigm (SP) to spell the 36 characters of the grid in random
order. Both the order of the characters and the order in which the paradigms are used are
altered between subjects. In the calibration procedure, 10 characters are spelled. Calibration
is done separately for both paradigms. The stimulus and interstimulus time are 62.5 ms and
125 ms respectively. Every sequence of stimuli contains 15 iterations. Apart from subject 7,
the two tests where done on dierent days. Only subjects 5 and 8 had previous experience with
the P300 speller. The Emotiv EPOC headset is used to record the EEG.
The accuracy of the speller is defined as the fraction of characters correctly spelled. Accu-
Article ID 028-3
DOI:10.3217/978-3-85125-378-8-28
100
RC
SP
accuracy (%)
80
60
40
20
0
6
7
subject (order)
avg
Figure 2: Spelling accuracies obtained with both paradigms for the 9 subjects separately, and
their average result. For the first 4 subjects the RC paradigm is used first, for the other 5 SP.
racies for both paradigms can be found in Figure 2. Subject 1 to 4 used the RC paradigm first,
the others started with the SP paradigm. A mixed-design ANOVA shows the larger eect of
the order in which the paradigms are used (Fbetween subjects = 0.81) compared to the eect
of the paradigm itself (Fwithin subjects = 0.337). A possible cause is that, once a subject gets
used to a paradigm, it may be harder for him to adapt to a new paradigm. A more thorough
examination will be done in the future in which the eect of the order is reduced by letting the
subjects get more familiar with the speller and the paradigms before passing on to the actual
experiment. The average accuracy over all subjects when using the RC paradigm is 81.79%
compared to 86.42% when using SP. This indicates that the newly developed paradigm can
potentially achieve a higher accuracy than RC while the time per iteration is still 2.25 s.
Conclusion
The first online results with the new paradigm are promising, although not convincing. On
average, the accuracy is increased while the iteration length and thus the speed of spelling
remain the same. Starting from these pilot tests, extensive testing with real EEG caps will
have to show whether the new paradigm significantly improves the accuracy of the classifier.
Acknowledgment
This project is funded by The Special Research Fund of Ghent University.
References
[1] L.A. Farwell and E. Donchin. Talking o the top of your head: toward a mental prosthesis utilizing
event-related brain potentials. Electroencephalography and clinical Neurophysiology, 70:510523,
1988.
[2] C.J. Gonsalvez and J. Polich. P300 amplitude is determined by target-to-target interval. Psychophysiology, 39:388396, 2002.
[3] P. Kindermans, H. Verschore, and B. Schrauwen. A unified probabilistic approach to improve
spelling in an event-related potential based brain-computer interface. ieee transactions on biomedical
engineering, 60(10):26962705, 2013.
[4] G. Townsend, B.K. LaPallo, C.B. Boulay, D.J. Krusienski, G.E. Frye, C.K. Hauser, N.E. Schwartz,
T.M. Vaughan, J.R. Wolpaw, and E.W. Sellers. A novel P300-based brain-computer interface
stimulus presentation paradigm: moving beyond rows and columns. Clinical Neurophysiology,
121(7):11091120, July 2010.
Article ID 028-4
DOI:10.3217/978-3-85125-378-8-29
p.arico@hsantalucia.it
Abstract
Several ERP-based BCIs that can be controlled even without eye movements (covert
attention) have been recently proposed. However, when compared to similar systems
based on overt attention, they displayed a significantly lower accuracy. In the current
interpretation, this is ascribed to the absence of the contribution of short-latency visual
evoked potentials (VEPs) in the tasks performed in the covert attention modality. This
study aims to investigate if this decrement (i) is fully explained by the lack of VEP
contribution to the classification accuracy; (ii) correlates with lower temporal stability
of the single-trial P300 potentials elicited in the covert attention modality. We evaluated
the latency jitter of P300 evoked potentials in three BCI interfaces exploiting either
overt or covert attention modalities in 20 healthy subjects. Results highlighted that the
P300 jitter is higher when the BCI is controlled in covert attention and classification
accuracy negatively correlates with jitter.
1 Introduction
The Farwell and Donchins P300 Speller (1988) is among the most widely validated Brain
Computer Interface (BCI) paradigms for communication applications. Brunner and colleagues (2010)
have recently shown that the P300 Speller recognition accuracy was significantly decreased if the
subject was not allowed to gaze at the target stimulus. Several user interfaces designed to be used in
covert attention modality have been implemented and tested with the overall result of a lower system
performance in covert with respect to overt attention usage. The observed superiority in the system
performances under overt usage modality was mainly ascribed to the contribution of visual evoked
potential (VEP) components recorded at occipital and parieto-occipital sites (Aloise et al., 2012;
Treder and Blankertz, 2010). Another important contribution regards the P300 latency jitter that
occurs when the lag between each target stimulus onset and the related potential peak is not constant
for the different stimulus repetitions. Thompson and colleagues (2013) demonstrated that the accuracy
Wrote the manuscript and followed the experimental protocol
Developed the stimulation interface used in the study
Article ID 029-1
DOI:10.3217/978-3-85125-378-8-29
achieved with the P300 Speller was strongly correlated with the jitter in the P300 latency. In this study
we addressed the issue of whether the accuracy of BCIs used in covert attention modality i) is fully
explained by the lack of VEP contribution to the classification accuracy and/or ii) is correlated with a
lower stability of the P300 potential elicited in the covert attention with respect to the overt attention
modality.
Article ID 029-2
DOI:10.3217/978-3-85125-378-8-29
gazing at the target stimuli, iv) [P300 epoch realigned] same epoch length as above, using potentials
obtained after realignment of the single epochs, whose time courses were shifted according to the
estimated P300 latency values.
Correlation between P300 latency jitter and performance: The information transfer rate (ITR,
bit/min) was calculated at each fold of cross-validation as a function of the number of sequences in the
trial. In particular, we calculated the mean value of the ITR along the 8 stimulation sequences, in
order to have a synthetic measure of the systems performance. To assess the correlation between the
ITRMean and the P300 latency jitter, we estimated the non-parametric Spearmans rank correlation
coefficient between these variables.
3 Results
P300 latency jitter: Significant differences of P300 latency jitter elicited by the 3 interfaces were
explored by means of one-way repeated measures ANOVAs (Confidence Interval = .95) where
interface was considered as factor and the P300 latency jitter as dependent variables. The analysis
revealed a significant difference across the interfaces for the jitter (F(2, 357)=52.58; p=9x10-6). Posthoc analysis (Duncan test) showed that the GeoSpell produced a latency jitter significantly larger than
the P300 Speller and the Visual Oddball (mean: 10824ms, 7624ms, and 7438ms, respectively;
p<10-4).
BCI accuracy: Differences in the classification accuracy achieved with each of the 3 visual
interfaces and each of the 4 conditions introduced previously (Figure 1a). A two-way repeated
measures ANOVA (Confidence Interval = .95) was performed with interfaces and conditions as
factors and the accuracy per stimulation sequences as dependent variables. The analysis revealed a
significant interaction between the factors (F(6, 1428)=42.57; p=10 -9). The Duncan's multiple range
test was used for post hoc comparison. The differences in the epoch choices and the interfaces are
summarized in Figure 1b.
Correlation between P300 latency jitter and performance: The non-parametric Spearmans rank
correlation coefficient was used to evaluate the correlation between the classification accuracy as
expressed by the ITRMean values and the P300 latency jitter obtained for each interface. We found a
significant negative correlation between the latency jitter and the accuracy achieved by the subjects
with all 3 interfaces (GeoSpell: r=.17 p=.04; P300 Speller: r=.35 p=10 -4; Visual Oddball: r=.18
p=.03). Considering all the interface together, we found a significant negative correlation as well
(r=.50 p=2x10-23).
Figure 1: (a) Mean and confidence intervals (CI = 0.95) of the cross-validation target classification
accuracies achieved with the three interfaces, relative to each epoch choice as a function of the number of
stimulations; (b) Graphical representation of the differences between the epochs (WE: Whole epoch; WD:
Whole epoch decimated; P3: P300 epoch non-realigned; P3r: P300 epoch realigned) and the interfaces (GS:
GeoSpell; PS: P300 Speller; VO: Visual Oddball) in terms of accuracy, highlighted by the post hoc test.
Numbers in the circles indicate the percent mean accuracy value.
Article ID 029-3
DOI:10.3217/978-3-85125-378-8-29
4 Discussion
The overall aim of this study was to investigate the influence of the P300 latency jitter evoked
during (c)overt attention based BCI tasks on the accuracy achieved. The results proved that when the
user operates a BCI using covert attention, the latency jitter is greater than using overt attention.
Particularly, the P300 latency jitter was significantly greater when using the GeoSpell interface than
using the other interfaces. In line with previous studies (Brunner et al., 2010), our findings on the first
phenomenon clearly indicate the significant contribution of the early VEPs to the classification
accuracy only for the overt (i.e. P300 Speller) interface. Also, removing the VEP contribution from
ERPs elicited using the P300 Speller and the GeoSpell interface, the latter still performed significantly
worse than the former, suggesting that the lack of VEPs is not the only reason for the performance
decrement in the tasks performed in covert attention modality. In addition, a significant correlation
was found between the latency jitter and the BCI performances for all the interfaces.
5 Conclusion
We found that (i) even canceling the contribution of short latency VEPs, the P300 Speller interface
(used in overt attention modality) remains more accurate than the GeoSpell (used in covert attention);
(ii) the P300 latency jitter is negatively correlated with the accuracy of the BCI classifier; (iii) a
compensation of the P300 latency jitter makes the GeoSpell (used in covert attention) than the P300
Speller.
Acknowledgments
The work is partly supported by the EU grant FP7-287774 ABC and NINA research project, cofinanced by EUROCONTROL. This paper only reflects the authors views and funding agencies are
not liable for any use that may be made of the information contained herein.
References
Aloise, F., Schettini, F., Aric, P., Salinari, S., Babiloni, F., Cincotti, F., (2012). A comparison of
classification techniques for a gaze-independent P300-based brain-computer interface.
Journal of neural engineering 9, 045012. doi:10.1088/1741-2560/9/4/045012
Aric, P., Aloise, F., Schettini, F., Salinari, S., Mattia, D., Cincotti, F., 2014. Influence of P300
latency jitter on event related potential-based brain-computer interface performance. J
Neural Eng 11, 035008. doi:10.1088/1741-2560/11/3/035008
Brunner, P., Joshi, S., Briskin, S., Wolpaw, J.R., Bischof, H., Schalk, G., (2010). Does the P300
speller depend on eye gaze? J Neural Eng 7, 056013. doi:10.1088/1741-2560/7/5/056013
Farwell, L.A., Donchin, E., (1988). Talking off the top of your head: toward a mental prosthesis
utilizing event-related brain potentials. Electroencephalogr Clin Neurophysiol 70, 510523.
Thompson, D.E., Warschausky, S., Huggins, J.E., (2013). Classifier-based latency estimation: a novel
way to estimate and predict BCI accuracy. J Neural Eng 10, 016006. doi:10.1088/17412560/10/1/016006
Treder, M.S., Blankertz, B., (2010). (C)overt attention and visual speller design in an ERP-based
brain-computer interface. Behav Brain Funct 6, 28. doi:10.1186/1744-9081-6-28
Article ID 029-4
DOI:10.3217/978-3-85125-378-8-30
Introduction
In accordance with Cognitive Load Theory (CLT) [5] the type and amount of workload (WL)
during learning is crucial for successful learning and should be held in the individual optimal WL
range for each learner. Thus it seems advisable to provide technological support for learners,
which adapts instructional materials and tasks to their level of expertise and WL capacity.
As it is not feasible to use data from the same subject and the same task for classifier training
and testing in an adaptive learning environment, one can either use data from the same subject
but dierent tasks (cross-task) or from the same task, but dierent subjects (cross-subject) to
calibrate the classifier.
In a previous study [7] we used cross-task classification, where a SVM was trained on electroencephalogram (EEG)-data, recorded while participants had to solve working memory tasks.
Subsequently the SVM was tested on EEG-data recorded while solving complex mathematical
tasks. This approach lead to classifier accuracies around chance level. Thus we introduced a
cross-subject WL prediction in the present study based on linear regression, for a group of 10
subjects. As most of the EEG-based WL classifications are subject-specific, apart from a few
exceptions [8], this work is a step towards the development of EEG-based WL classifiers as it
would enable training a classifier once that could handle multiple subjects.
2
2.1
A total of 10 subjects (17 - 32 years) voluntarily participated in the EEG experiment. The
experiment had a within-subject design and comprised two tasks, a WL as well as a vigilance
Article ID 030-1
DOI:10.3217/978-3-85125-378-8-30
task. In this article we will merely report the results of the WL task. The subjects had to
solve 240 math problems with an increasing level of difficulty, by typing the solution in a given
time. The presented problems varied in difficulty as measured by the information content (Q)
according to Thomas [6], with a Q value ranging from 0.6 (easy) to 7.2 (difficult). As postulated
from Kantowitz [2] increasing task difficulty (=
increasing Q-value) always increases WL, since
by definition, an increase of task difficulty demands additional capacity. Further, an increase of
WL is characterized in EEG-data by changes in the theta-, alpha- and beta-frequency bands [3].
To avoid the classifier of being based on perceptual-motor confounds the time windows used for
analyzing EEG-data should not contain motor events. Therefore, each trial was divided into
two phases: First, the calculation phase occurred (= phase for analyzing the EEG-data), where
the problem to be solved was shown for 5 sec. Subsequently subjects had 3.5 sec to type in
their result, followed by an inter-trial interval of 1.5 sec.
2.2
EEG Recording
A set of 29 active electrodes (actiCap, BrainProducts GmbH), attached to the scalp placed
according to the extended International Electrode 10 - 20 Placement System, was used to record
EEG-signals. Three additional electrodes were used to record an electrooculogram (EOG); two
placed horizontally at the outer canthus of the left and right eye to measure horizontal eye
movements and one placed in the middle of the forehead between the eyes to measure vertical
eye movements. EOG- and EEG-signals were amplified by two 16-channel biosignal amplifier
systems (g.USBamp, g.tec). The sampling rate was 512 Hz. EEG-data was high-pass filtered
at 0.1 Hz and low-pass filtered at 100 Hz during the recording. Furthermore a notch-filter was
applied between 48 - 52 Hz to filter power line noise.
2.3
For further analysis we reduced the number of channels to 17 (FPz, AFz, F3, Fz, F4, FC3,
FCz, FC4, C3, Cz, C4, CPz, P3, Pz, P4, Oz, POz), to lower the influence of possible artifacts,
which are most prominent on the outer channels. As frequency bands were not consistent and
varied between subjects, we used a wide frequency range of 4 - 30 Hz [3].
The power spectra were calculated for each 5 sec window (= calculation phase) by using
autoregressive models based on Burgs maximum entropy method [1], using a model order of
32. In addition, the data was z-score normalized along the channels, meaning for each trial
the mean of each frequency bin equals zero. The feature selection was conducted during the
10-fold or leave-one-subject-out cross-validation only on the training data. Each feature was
correlated with the Information Content (Q) and the 125 features with the highest r2 -values
were selected for the regression analysis. For WL prediction we used a linear ridge regression
with the regularization parameter set to a fixed value of = 0.001.
As criterion for performance evaluation of the presented prediction method we used the
correlation coefficient (CC) to observe the statistical relationship between the actual and the
predicted Q-values, as well as the root-mean-squared error (RMSE) to examine the dierence
between the actual and the predicted Q-values.
Since single-trial prediction is not necessary in a learning environment, the regression output
was additionally smoothed to improve prediction accuracy at the expense of increased delay of
the system. We used a window-size of 6 trials, which still guaranteed a response time 1 min
of the system, which is feasible for the detection of WL.
Article ID 030-2
DOI:10.3217/978-3-85125-378-8-30
Figure 1: Distribution of actual and predicted WL for subject 1. The continuous red line represents the actual WL and the smoothed dashed blue line the predicted WL for each trial. Left:
Within-subject - Using a 10-fold cross-validation; Right: Cross-subject - Regression trained on
data of nine subjects and tested with data of the remaining subject 1.
Results
The performance for the within-subject WL prediction was calculated by using a 10-fold crossvalidation on 240 trials (see Table 1). For the evaluation of the cross-subject WL prediction the
regression was trained on data of nine subjects (= 240 9 trials) and tested with the complete
data-set of the remaining subject (see Table 2). The within-subject WL prediction reached on
average a correlation coefficient of CC = 0.66 and a RM SE = 1.70. Using the smoothed
regression output for within-subject WL prediction leads to more robust and even better results
with an average over all subjects of CC = 0.88 and RM SE = 1.02 (see Table 1).
As stated in Table 2 the average results over all 10 subjects for cross-subject WL prediction slightly decreased. Applying linear regression to unsmoothed data leads on average to a
CC = 0.58 and a RM SE = 1.95. Using the smoothed data for cross-subject WL prediction
improved the CC by 0.26 and the RM SE by 0.37.
The distributions of actual and predicted WL for within- and cross-subjects are exemplary
shown for subject 1 in Fig. 1. In both cases the increasing WL was successfully predicted by
using the linear regression method. It can be noticed that during the first 90 trials (Q 0.9)
and the last 30 trials (Q
6.0) the actual and predicted WL deviate strongest from each
other. This may be due to subjects being unchallenged or overburdened while solving these
tasks.
CC
CC (smooth)
RM SE
RM SE (smooth)
S1
0.43
0.69
2.20
1.39
S2
0.69
0.93
1.61
0.86
S3
0.68
0.93
1.71
0.99
S4
0.58
0.89
2.09
1.21
S5
0.84
0.94
1.11
0.68
S6
0.58
0.87
1.95
1.15
S7
0.60
0.76
1.63
1.04
S8
0.72
0.91
1.60
1.04
S9
0.65
0.93
1.82
1.08
S10
0.78
0.93
1.30
0.73
mean
0.66
0.88
1.70
1.02
Discussion
The results show that the amount of WL can be predicted using a cross-subject regression
method. Compared to earlier studies, where cross-task classification [7] leads to classification
Article ID 030-3
CC
CC (smooth)
RM SE
RM SE (smooth)
S1
0.67
0.89
1.68
1.13
S2
0.50
0.91
2.56
2.00
S3
0.65
0.88
1.69
1.25
S4
0.63
0.91
1.73
1.39
DOI:10.3217/978-3-85125-378-8-30
S5
0.63
0.89
1.72
1.60
S6
0.59
0.86
1.77
1.44
S7
0.39
0.74
2.65
2.11
S8
0.62
0.84
1.87
1.58
S9
0.52
0.71
2.04
1.57
S10
0.57
0.79
1.81
1.68
mean
0.58
0.84
1.95
1.58
Acknowledgments
The research was supported by ScienceCampus T
ubingen and the LEAD Graduate School at
the University of T
ubingen, which is funded by the German Research Foundation.
References
[1] T. M. Cover and J. A. Thomas. Elements of information theory. Hoboken,NJ: Wiley-Interscience,
2006.
[2] B. H. Kantowitz. Human Factors Psychology, chapter Mental Workload, pages 81122. NorthHolland, 1987.
[3] M. Pesonen, H. H
am
al
ainen, and C. M. Krause. Brain oscillatory 4-30 Hz responses during a visual
n-back memory task with varying memory load. Brain Research, 1138:171177, 2007.
[4] W. Samek, M. Kawanabe, and C. Vidaurre. Group-wise stationary subspace analysis - a novel
method for studying non-stationarities. In Proc. 5th Int. Brain-Computer Interface Conf., pages
1620, 2011.
[5] J. Sweller, J. J. G. Van Merrienboer, and F. Paas. Cognitive architecture and instructional design.
Educational Psychology Review, 10(3):251296, 1998.
[6] H. B. G. Thomas. Communication theory and the constellation hypothesis of calculation. Quarterly
Journal of Experimental Psychology, 15(3):173191, 1963.
[7] C. Walter, S. Schmidt, W. Rosenstiel, P. Gerjets, and M. Bogdan. Using cross-task classification
for classifying workload levels in complex learning tasks. In Proceedings of the 5th IEEE Humaine
Association Conference on Aective Computing and Intelligent Interaction, pages 876881, Geneva,
Switzerland, 2013.
[8] Z. Wang, R. M. Hope, Z. Wang, Q. Ji, and W. D. Gray. Cross-subject workload classification with
a hierarchical bayes model. NeuroImage, 59(1):6469, 2012.
Article ID 030-4
DOI:10.3217/978-3-85125-378-8-31
Introduction
We demonstrate the possibility that BCI can be learned by implicit feedback reinforcement;
i.e., without instructions, such as in many other tasks, including motor tasks. Moreover, the
subjects in the present study were not even instructed that there was an opportunity to learn,
but were led to believe that they are taking part in a passive experiment. The study is based on
a generic system we have developed that allows turning any 3D object in a virtual environment
into an SSVEP target with reliable flicker rates.
2
2.1
Method
Participants
Twenty one subjects took part in the experiment (10 males, 11 females) with a mean age of 24
(range 19-40). All subjects had normal or corrected-to-normal vision. They showed no signs of
neurological or psychiatric disorders and all gave written, informed consent. Each subject was
paid the equivalent of 12 Euro for participating in the experiment, and the study was approved
by the institutional ethics committee.
2.2
Apparatus
EEG signals were recorded using 8 g.LADYbird sintered Ag/Cl crown active ring electrodes
located on the subjects occipital lobe at pO7, PO3, POz, PO4, PO8, O1, Oz and O2 locations
according to the international 10-20 system (Figure 1). A reference electrode was positioned on
the subjects right ear lobe and a ground electrode was placed at Fpz location according to the
international 10-20 system. EEG signals were recorded at 256 Hz sample rate and amplification
Article ID 031-1
DOI:10.3217/978-3-85125-378-8-31
and analog filtering (5-100 Hz) and notch filtering at 50Hz were performed using a g.USBamp
amplifier (Guger Technologies, Schiedlberg, Austria).
2.3
The experiment included three stages. In the first training stage the system computed a classifier
model. The training sessions included 20 stimuli, 5 times of each frequency in pseudo-random
order and location on the screen. The first stage might have been repeated until the first
successful session, indicated by model fit of 90% accuracy or more to the training data. The
second stage was the task stage, including 4 sessions of immerisve experience. The experience
included a star field moving towards the subject, with a pseudo random collection of larger stars,
which were the SSVEP targets. The instruction to all subjects was to attend to the stimuli
and speculate about the reason that the stars were moving. Eleven subjects experienced the
experimental group, in which they were only told about their BCI control after 3 of the 4 task
trials. After 1 second of continous SSVEP classfication of an on screen star that star began
moving towards the subject, possibly exploding on screen if attention was kept to it long
enough. Ten subjects participated in the sham control group, in which the motion of the stars
in the first 3 trials was random. The third stage, a validation trial intended to determine BCI
accuracy, was identical to the training trial.
Classification data was collected during each training session. For each frequency 5 trials
each 7 seconds long were entered into the algorithm for classifier training, while comparing
these 7 seconds with the 6 second trial interval rest periods for zero class classification. During
the online classification of the signals at the second stage of the experiment, a 2-second running
Article ID 031-2
DOI:10.3217/978-3-85125-378-8-31
window buer was used for classification. The algorithm produced a classification result 5 times
per second (i.e., every 200ms). In order to decrease false positive classification errors a filter
was applied to the stream of classification results; a total of 6 consecutive results were required
of the same class in order to activate a game object (stars) command. This corresponds to
one second of consistent classification. Since a running buer of 2 seconds was also used, this
entails that the minimum response time of the stars (game objects) was 3 seconds.
Results
The hypothesis that implicit reinforcement induces a learning eect was tested using the two
groups accuracy results obtained at the validation stage using T-test analysis. First, optimal
time point in terms of classification accuracy was obtained; this time point for all subjects was
exactly 6 seconds post stimuli onset, and thus the accuracy per session in the following analysis
was based on the accuracy between seconds 5 and 6. The experiment group achieved an average
of 82.5% accuracy while the sham group achieved an average of 55.55%. The dierence between
the groups was found to be significant (t (19) = 2.23, p < 0.05; Figure 2).
Figure 2: Minimum error rates between groups at optimal accuracy time stamp (5-6 seconds
post stimuli onset).
In order to further confirm this finding an ANOVA with repeated measures analysis was used
(Figure 3). The within variable considered was time, sampled once per second, the between
variable was group, and the dependent variable was accuracy rate. A significant main eect was
found for time (F (9,171) = 36.43, p <.001, partial eta = 0.66). A significant eect for group
was not found (F (1, 19) = 1.05, n.s), but taking into account time a significant interaction was
found (F (9,171) =.4.05, p < .05, partial eta =.17): after five seconds of stimulus presentation,
the accuracy between the groups changed, and the sham group displayed lower accuracy rate
past that time point, while the experiment group accuracy rates continued to rise. Note that in
the first five seconds post stimuli the classification is approximately random. We do not know
why the classification in the sham group was slightly higher in this early duration.The optimum
performance in this study was around 6 seconds after the trigger, in contrary to 3-4 seconds as
reported in most SSVEP studies; we suspect that the result is the combination of a 2 second
buer and a 1 second filter that we have applied. This, together with the minimal training,
can also explain the realtively low accuracy rates. When comparing false positive results, a
significant eect for time was also found (F (9,171) = 12.21, p < .05), but no significant eect
was found between groups (F(1,19) = 0.91, p > .05 , n.s.), and no interaction was found
Article ID 031-3
DOI:10.3217/978-3-85125-378-8-31
(F(2,171) = 1.88, p > .05, n.s). These results indicate significantly higher accuracy rate for the
experiment group, but only in the duration when classification was not random.
Figure 3: Linear comparison between the groups error rates; a significant interaction was found
between time and group BCI accuracy (F (9,171) = 4.05, p < .05, partial eta =.17).
Discussion
The present findings support the hypothesis that the developed interface is easy to use and
that its activation can be accomplished without prior knowledge. Moreover, it was found
that implicit SSVEP BCI interactions are sufficient in inducing a learning eect for the skill
of operating a BCI. Middendorf et al. [3] have shown that subjects can voluntarily learn to
regulate their SSVEP amplitude by explicit training. Guger et al. [2] demonstrate that very
short SSVEP training is often enough to reach high performance. In this study we show that
an implicit reinforcement method can result in improved accuracy rates. This suggests the
possibility that BCI control may be an acquired ability similar to motor abilities; this has
both theoretic implications for understanding how BCI skills are acquired, as well as practical
implications, e.g., for locked in patients who cannot learn to control a BCI that is based on
explicit control startegies.
Acknowledgements
This project was supported by EU projects BEAMING (248620) and VERE (257695). The
authors wish to thank Gilan Jackont, Keren-OR Berkers and Yuval Kalguny for programming
the Unity environment, and Christoph Hintermuller of gTec GMBH for support with the SSVEP
module. The authors also wish to thank Beatrice Halser for her suggestions on the experimental
design.
References
[1] Henrik Gollee, Ivan Volosyak, Angus J McLachlan, Kenneth J Hunt, and Axel Gr
aser. An SSVEPbased brain-computer interface for the control of functional electrical stimulation. IEEE transactions
on bio-medical engineering, 57(8):184755, August 2010.
[2] Christoph Guger, Brendan Z Allison, Bernhard Growindhager, Robert Pr
uckl, Christoph Hinterm
uller, Christoph Kapeller, Markus Bruckner, Gunther Krausz, and G
unter Edlinger. How
many people could use an ssvep bci? Frontiers in neuroscience, 6, 2012.
[3] Matthew Middendorf, Grant McMillan, Gloria Calhoun, Keith S Jones, et al. Brain-computer
interfaces based on the steady-state visual-evoked response. IEEE Transactions on Rehabilitation
Engineering, 8(2):211214, 2000.
Article ID 031-4
DOI:10.3217/978-3-85125-378-8-32
Introduction
Recently, vibrotactilebased somatosensory modality BCIs have gained in popularity [1, 4, 2]. We
propose an alternative tactile BCI which uses P300 brain responses to a somatosensory stimulation delivered to larger areas of the users back, defined as a backtactile BCI (btBCI). We conduct experiments
by applying vibration stimuli to the users back, which allows us to stimulate places at larger distances
on the body in order to test our hypothesis of the full body-based tactile BCI paradigm validity. The
stimulated back areas are both shoulders, the waist and the hips. In order to do so, we utilize a haptic
gaming pad ZEUS VYBE by Disney & Comfort Research. An audio signal pads input allows for
the delivery of a sound pattern activating spatial tactile patterns of vibrotactile transducers embedded
within the device. In the experiments reported in this paper, the users lay down on the gaming pad and
interacted with tactile stimulus patterns delivered in an oddballstyle paradigm to their backs, as shown
in Figure 1. The reason for using the horizontal position of the gaming pad, developed for a seated
setting, is that bedridden users, with intact afferent neural fibers [5], could easily utilize it and it could
also serve as a muscle massage preventing the formation of bedsores.
Methods
In the research project reported in this paper the psychophysical and online EEG experiments were
carried out with ablebodied, BCInaive users. Seven healthy users participated in the study (three
males and four females) with a mean age of 25 years (standard deviation of 7.8 years). The users
were paid for their participation. All the experiments were performed at the Life Science Center of
The author was supported in part by the Strategic Information and Communications R&D Promotion Program no. 121803027
of The Ministry of Internal Affairs and Communications in Japan.
The corresponding author.
Article ID 032-1
DOI:10.3217/978-3-85125-378-8-32
0.0%
0.0%
0.7%
0.7%
98.5%
0.0%
5 0.0%
0.0%
1.5%
1.5%
97.1%
0.0%
0.0%
4 0.7%
0.7%
0.0%
97.8%
0.0%
0.7%
0.0%
3 0.0%
0.0%
98.6%
0.0%
0.7%
0.7%
0.0%
2 0.0%
100.0% 0.0%
0.0%
0.0%
0.0%
0.0%
1 99.3%
0.7%
0.0%
0.0%
0.0%
0.0%
0.0%
90
80
instructed target
70
60
50
40
30
20
4
5
subject response
10
0
TARA, University of Tsukuba, Japan. The procedures for the psychophysical experiments and EEG
recordings for the BCI paradigm were approved by the Ethical Committee of the Faculty of Engineering,
Information and Systems at the University of Tsukuba, Tsukuba, Japan. Each participant signed to give
informed consent to taking part in the experiments. The psychophysical experiments were conducted
to investigate the recognition accuracy and response times to the stimuli delivered from the gaming
Figure 3: Grand mean averaged ERP for target (purple line) and nontarget (blue line) stimuli. The
very clear P300 responses are easy to notice for each EEG electrode in the latencies of 200 1000 ms.
Eye blink contaminated ERPs were rejected in the process of creation of this figure, with a threshold of
80 V.
Article ID 032-2
DOI:10.3217/978-3-85125-378-8-32
pad. The behavioral responses were collected as keyboard button presses after instructed targets. In
the psychophysical experiment, each single trial was comprised of a randomly presented single target
and five nontarget vibrotactile stimuli (120targets and 600nontargets in a single session). The
stimulus duration was set to 300 ms and the interstimulusinterval (ISI) to 700 ms. In the btBCI online
experiments, the EEG signals were captured with a biosignal amplifier system g.USBamp by g.tec
Medical Instruments, Austria. Active EEG electrodes were attached to the sixteen locations Cz, Pz, P3,
P4, C3, C4, CP5, CP6, P1, P2, POz, C1, C2, FC1, FC2 and FCz, as in 10/10 international system.
A reference electrode was attached to the left mastoid, and a ground electrode to the forehead at the
FPz position. The EEG signals were captured and classified by BCI2000 software [6] using a stepwise
linear discriminant analysis (SWLDA) classifier [3] applied to sixteen times downsampled 0 800 ms
ERP latencies. The EEG recording sampling rate was set at 512 Hz, and the high and low pass filters
were set at 0.1 Hz and 60 Hz, respectively. The notch filter to remove power line interference was set
for a rejection band of 48 52 Hz. In each trial, the stimulus duration was set to 250 ms and the
ISI to random values in a range of 350 370 ms in order to break rhythmic patterns of presentation.
Each online experiment comprised of 10 trials used for epochs averaging. Users conducted three runs of
which the first one was used for SWLDA classifiers truing and the remaining for testing. The vibrotactile
spatial pattern stimuli (stimulated body locations used as cues) in the two experimental settings above
were generated using the same MAX 6 program, and the trigger onsets were generated by BCI2000 EEG
acquisition and ERP classification software [6]. During the training run the cues were given in form of
vibrations of the target location before each sequence. In testing phase the spelled digits (representing
the body locations) were given visually on a computer display.
Results
The psychophysical experiment results are summarized in the form of a confusion matrix depicted in
Figure 2, where the behavioral response accuracies to instructed targets and marginal errors are depicted
together with noresponse errors, which were not observed with the users participating in our experiments. The grand mean behavioral accuracies were above 97%, which proved the easiness of the back
vibrotactile stimuli discrimination. The behavioral response times did not differ significantly among
classes as tested with the Wilcoxon rank sum test for medians, which further supported the choice of the
experiment set-up with vibrotactile stimuli to the back. The EEG experiment results are summarized
in Figures 3 and 4 in the form of grand mean averaged ERPs and classification accuracies. The grand
mean averaged ERPs resulted in very clear P300 responses in latency ranges of 200 1000 ms. The
SWLDA classification results in online btBCI experiments of six digit spelling are shown in Figure 4,
depicting each users averaged scores in a range of 16.7% 62.45% and the best grand mean results
in the range of 57.26% 85.71%, both as a function of various ERP averaging scenarios. The chance
level was 16.7%. The best mean scores show very promising patterns for possible improvements based
on longer user training. Mean informationtransferrates ranged from 0.6 bit/min to 3.3 bit/min for
10trials averaging based SWLDA classification to 0.5 bit/min to 10.9 bit/min for single trial offline
analysis cases.
Conclusions
This paper reports results obtained with a novel sixcommandbased btBCI prototype developed and
evaluated in experiments with seven BCInaive users. The experiment results obtained in this study
confirm the general validity of the btBCI for six commandbased applications and the possibility to
further improve the results, as illuminated by the best mean accuracies achieved by the users. The EEG
Article ID 032-3
DOI:10.3217/978-3-85125-378-8-32
Figure 4: The averaged btBCI accuracies as obtained with a SWLDA classifier with different numbers of
averaged ERPs. The left panel presents mean accuracies for each user separately, together with standard
error bars calculated over different runs. The right panel presents the averaged maximum scores (the best
runs) of all the users together in order to illustrate the potential strength of the paradigm in comparison
to average scores shown in the left graph.
experiment with the prototype confirms that tactile stimuli to large areas of the back can be used to
spell six-digit (command) sequences. The results presented offer a step forward in the development of
somatosensory modality neurotechnology applications. Due to the still not very satisfactory interfacing
rate achieved in the case of the online btBCI, the current prototype obviously requires improvements
and modifications. These requirements will determine the major lines of study for future research.
However, even in its current form, the proposed btBCI can be regarded as a possible alternative solution
for lockedin syndrome patients, who cannot use vision or auditory based interfaces due to sensory or
other disabilities.
References
[1] A-M. Brouwer and J.B.F. Van Erp. A tactile P300 brain-computer interface. Frontiers in Neuroscience, 4(19),
2010.
[2] J.E. Huggins, C. Guger, B. Allison, C.W. Anderson, A. Batista, A-M. Brouwer, C. Brunner, R. Chavarriaga,
M. Fried-Oken, A. Gunduz, D. Gupta, A. Kubler, R. Leeb, F. Lotte, L.E. Miller, G. Muller-Putz, T. Rutkowski,
M. Tangermann, and D.E. Thompson. Workshops of the fifth international brain-computer interface meeting:
Defining the future. Brain-Computer Interfaces, 1(1):2749, 2014.
[3] D.J. Krusienski, E.W. Sellers, F. Cabestaing, S. Bayoudh, D.J. McFarland, T.M. Vaughan, and J.R. Wolpaw. A
comparison of classification techniques for the P300 speller. Journal of Neural Engineering, 3(4):299, 2006.
[4] H. Mori, Y. Matsumoto, Z.R. Struzik, K. Mori, S. Makino, D. Mandic, and T.M. Rutkowski. Multi-command
tactile and auditory brain computer interface based on head position stimulation. In Proceedings of the Fifth
International Brain-Computer Interface Meeting 2013, page Article ID: 095, Asilomar Conference Center,
Pacific Grove, CA USA, June 3-7, 2013. Graz University of Technology Publishing House, Austria.
[5] J R Patterson and M Grabois. Locked-in syndrome: a review of 139 cases. Stroke, 17(4):75864, 1986.
[6] G. Schalk and J. Mellinger. A Practical Guide to BrainComputer Interfacing with BCI2000. Springer-Verlag
London Limited, 2010.
Article ID 032-4
DOI:10.3217/978-3-85125-378-8-33
University of Glasgow, Glasgow, UK. 2 NED University Karachi, Pakistan. 3 Sothern General
Hospital, UK. 4 University of Strathclyde, Glasgow, UK. 5 Northeastern University, Boston, USA.
m.hasan.1@research.gla.ac.uk, Aleksandra.Vuckovic@glasgow.ac.uk
Abstract
The aim of the study was to investigate the effect of neurofeedback (NF) training on
Central neuropathic pain in patients with chronic paraplegia. Training consisted of 2040 NF sessions in which patients were asked to voluntarily reduce the theta (4-8 Hz)
and beta (20-30 Hz) band power and to increase the alpha band (9-12 Hz) power over
the sensory-motor cortex. A whole head EEG was recorded before the first and after the
last training day to assess long-term effect of NF. Four out of five patients reported
clinically significant (>30%) reduction in pain. Multichannel LORETA analysis
indicated that reduced pain intensity might be associated with reduced cortical and subcortical activity in the areas known to be involved in processing of chronic pain.
1 Introduction
Central Neuropathic Pain (CNP) is caused by a lesion or disease to the somatsosensory system
having a prevalence of 40 % in spinal cord injured (SCI) population (Werhagen et al. 2004). CNP is
perceived as coming from the body, while it is actually generated in the brain and is mainly associated
with the overactivation of the cortex (Samthein et al. 2006). Noninvasive interventions such as
repetitive Transcranial Magentic Stimulation (rTMS) and transcranial Direct Current Stimulation
(tDCS) reduce cortical and/or sub-cortical activity thus reducing pain (Moseley and Flor 2012).
Similar to rTMS and tDCS, neurofeedback (NF) is also a neuromodulation technique which is
based on voluntary modulation of brain activity. Although NF has been successfully used for different
types of chronic pain such as fibromyalgia and migraine (Kayiran et al. 2010), its efficiency on CNP
has not been confirmed. In a recent study we showed that paraplegic patients with CNP have
increased power in theta band, lower frequency of dominant alpha peak and increased Event Related
Desynchronisation (ERD) in the theta, alpha and beta band (Vuckovic et al. 2014). Based on that
study, we created a NF protocol for treatment of CNP, presented in this paper.
Article ID 033-1
DOI:10.3217/978-3-85125-378-8-33
2 Methods
2.1 Patients
Seven paraplegic patients with CNP larger than 5 (Visual Analogue Scale, range 0 to 10)
participated in the study. All patients had pain in their limbs. Ethical permission was obtained from
the NHS Ethical Committee for the Greater Glasgow and Clyde. Experiments were performed at
Queen Elizabeth National Spinal Injuries Unit, Glasgow, UK.
3 Results
Five out of seven patients completed the study. Two patients dropped out after 3 NF sessions due
to family and transport problems. From the remaining five patients, four patients received 40 sessions
and the fifth patient received 20 sessions. Patients received therapy 1-3 sessions per week.
Article ID 033-2
DOI:10.3217/978-3-85125-378-8-33
All five patients achieved a statistically significant (p0.05) reduction of pain (Table 1) and in four
patients this reduction was clinically significant (>30%). Patients reported immediate reduction of
pain during NF; reduction of pain gradually increased over training sessions and lasted for several
weeks after terminating the therapy. Initially we tested several locations over the primary motor cortex
which included C3, Cz and C4. In patients with incomplete paraplegia (preserved sensation) we
occasionally observed the uncontrolled movements of legs (spasm) which was higher when practicing
from C3 than from C4, so we chose C4 as a preferred training site to reduce discomfort. Interestingly,
NF training from Cz, located over the motor cortex corresponding to painful paralyzed legs resulted
in smallest reduction of pain. Patients received 30 min training in total per session.
Figure 1 shows power in one patient before, during and after NF for a single training day. The
patient increases the alpha and theta power, and decreases the beta power (black thick line on x-axis:
Pre NF versus NF). This effect can also be noticed several minutes after NF training (grey thick line
on x-axis: Post NF versus NF). We present PSD of an individual patient rather than the average value
across the whole group because the exact frequency bands in which EEG was modulated varied
among patients.
Pre NF
NF
Post NF
10
5
No
1
2
3
4
5
0
--5
-10
-1
-15
-1
0
10
0
15
5
Frequency (Hz)
20
0
25
30
Injury level
T5 complete
T6 complete
T6 incomplete
T6 incomplete
T6 incomplete
Pain (before/after)
6/5,
p=0.05
7/5,
p=0.0004
6/2,
p=0.0001
9/5,
p=0.0002
9/5,
p=0.006
Article ID 033-3
DOI:10.3217/978-3-85125-378-8-33
Frequency (Hz)
and reduced ERD. These results are in accordance to previous studies on EEG signatures of CNP
(Sarnthein et al. 2006; Vuckovic et al. 2014).
30
25
20
15
10
5
Before NF
-1
Time (s)
30
25
20
15
10
5
3
2
1
0
-1
-2
-3
After NF
-1
Time (s)
5 Acknowledgements
This work has been partially supported by the MRC grant G0902257/1, the GRPE, NED
University of Pakistan, and GU 68. We thank Dr Purcell and Dr Mclean and to all participants.
6 References
Kayiran Sadi et al. 2010. Neurofeedback Intervention in Fibromyalgia Syndrome; a Randomized,
Controlled, Rater Blind Clinical Trial. Applied psychophysiology and biofeedback. 35(4):293
302.
Makeig, S. 1993. Auditory Event-Related Dynamics of the EEG Spectrum and Effects of Exposure
to Tones. Electroencephalography and clinical neurophysiology 86(4):28393.
Moseley, G. Lorimer, and Herta Flor. 2012. Targeting Cortical Representations in the Treatment of
Chronic Pain: A Review. Neurorehabilitation and neural repair 26(6):64652.
Pascual-Marqui, R. D. 2002. Standardized Low-Resolution Brain Electromagnetic Tomography
(sloreta): Technical Details. Methods Find Exp Clin Pharmacol. 24 (suppl.:512.
Sarnthein et al. 2006. Increased EEG Power and Slowed Dominant Frequency in Patients with
Neurogenic Pain. Brain: a journal of neurology 129(Pt 1):5564.
Vuckovic et al. 2014. Dynamic Oscillatory Signatures of Central Neuropathic Pain in Spinal Cord
Injury. Journal of Pain.
Werhagen et al. 2004. Neuropathic Pain after Traumatic Spinal Cord Injury--Relations to Gender,
Spinal Level, Completeness, and Age at the Time of Injury. Spinal cord 42(12):66573.
Article ID 033-4
DOI:10.3217/978-3-85125-378-8-34
Objective. We have developed an efficient generic machine learning (ML) tool for realtime fMRI whole brain classification, which can be used to explore novel brain-computer
interface (BCI) or advanced neurofeedback (NF) strategies. Approach. We use information gain for isolating the most relevant voxels in the brain and a support vector machine
classifier. Main results. We have used our tool in three types of experiments: motor movement, motor imagery and visual categories. Significance. We show high accuracy results
in real-time, using an optimal number of voxels, with a shorter delay compared to the
previous method based on regions of interest (ROI). Finally, our tool is integrated with a
virtual environment and can be used to control a virtual avatar or a robot.
Introduction
Real time fMRI is a promising risk-free non-invasive method for several reasons. Due to superior
spatial resolution fMRI may be used to classify a much wider set of mental patterns than EEG.
Thus, fMRI-based BCI may facilitate exploring new BCI paradigms. fMRI BCI paradigms can
be used to localize underlying brain patterns and transfer the paradigms back to other, more
accessible signals, such as EEG and near-infrared spectrography (NIRS) [1, 2]. fMRI-based BCI
can also be used for training patients in BCI (e.g., before surgery), for rehabilitation sessions,
or for pattern-based neurofeedback (NF) in all these cases very specific brain areas may be
targeted.
We have developed a generic efficient tool for online whole-brain classification from fMRI
data. In contrast to other research [3], this tool is designed for real-time processing, normalization and multi-classification which can be extended to other algorithms within the Weka ML
API [4]. This tool is also integrated with a virtual environment feedback to allow for engaging
subjects in a wide range of scenarios and tasks, and can also be integrated with external devices
such as a humanoid robot. In this paper, we show how our approach achieves high accuracy in
three dierent mental tasks: motor motion, motor imagery, and visual categories.
The System
Imaging was performed on a 3T Trio Magnetom Siemens scanner as described in [5, 6], with
a repetition time (TR) of 2000ms. Visual feedback is provided by a mirror, placed 11cm from
the eyes of the subject and 97.5cm from a screen, which results in a total distance of 108.5cm
from the screen to the eyes of the subject.
Article ID 034-1
DOI:10.3217/978-3-85125-378-8-34
Dicom files1 from the scanner are preprocessed by Turbo Brain Voyager (TBV, Brain Innovation, Netherlands). Since fMRI data tends to have non linear non-homogeneous drifts, we
introduce a normalization process that has been verified to perform well online; given a raw
value at voxel i and time t, ri,t , and a sliding window of length w we derive a new value for
each raw value:
0
ri,t
= ri,t median(ri,t w+1 , ..., ri,t )
(1)
We have empirically established that a w of 40 TRs (80 seconds) is optimal in our case.
Our tool is integrated with the Unity game engine (Unity Technologies, California); in
other studies this allowed subjects to control an avatar in a virtual environment or a humanoid
robot [5]. The system allows easily configuring classification and interaction parameters during
an experiment and playing back experimental sessions.
Training and applying classifiers in real-time requires that learning be executed faster than
is generally done in the application of ML to fMRI. Our system is optimized for memory
usage, processing speed, and classification speed. To achieve faster processing, we focus on
several areas: (1) feature reduction, (2) feature selection, (3) redundant data reduction, (4)
minimization of computational cycles and RAM consumption by using sparse data handling,
(5) using RAM without having to access the disk drive, and (6) transferring data between
processes using an inter-process communication method.
Our method is divided into two parts. In the first part, the subject is given instructions
based on the experimental protocol and several runs are recorded as input to a machine-learning
tool. In the second part, the subjects perform a task, our system classifies their intentions in
real time, and the classification result is transmitted to an external device. In this study the
task was similar to training, in order to evaluate BCI accuracy.
The 3D matrix of the entire brain area is composed of 204,800 voxels, which hold the raw
blood-oxygen-level dependent (BOLD) derived values. Our TBV plugin transmits the raw data
using an inter-process communication method to our application. The 3D matrix is flattened
into a 1-dimensional vector, and the set of voxels is further reduced by setting an activity
threshold, which removes voxels that belong to the empty space around the head that and are
not part of the brain. We also remove any voxels that belong to the subjects eyes, reducing
the number of voxels to approximately 26-30,000 voxels per TR.
For purposes of learning, we select only those voxels with highest information gain (IG) [7].
Labeled training examples, each represented as a vector recording the values of the selected
voxels, are passed on to our learning algorithm: Wekas [4] implementation of multiclass [8]
SVM [9], using default parameters. The result of the training phase is an SVM classifier that
can classify previously unseen vectors.
In the second online stage we classify a vector every TR (2 seconds) and use the same noise
reduction, eye filtering and normalization methods as in the training stage, selecting the same
voxels based on the IG filtering performed at model training. Finally, the data is passed into
the trained SMO model, and the classification result is then transmitted to the external device.
The training process takes several minutes and the classification process uses 1-5% of the CPU
and lasts approximately 50 milliseconds.
Here we compare our results with our previous study, in which we introduced a simple
classification method used to successfully classify intentions of motor movement or imagery [5].
In that method, we manually localized the subjects relevant brain areas corresponding to
activation of left and right hand and legs, and the algorithm selected online the class that
corresponded to the ROI with maximum Z-score normalized averaged value.
1 http://medical.nema.org/
Article ID 034-2
DOI:10.3217/978-3-85125-378-8-34
Figure 1: A comparison of MM and MI classification accuracy across six (MM) and three (MI) subjects, between ML
and ROI. The ML results were obtained by using (a) all voxels with IG above 0 and (b) the smallest number of voxels
that permit perfect classification of all training examples.
We tested our system in three areas: i) motor motion (MM, 5 subjects), ii) motor imagery
(MI, 3 subjects), and (iii) visual categories (VIS, 5 subjects). In MM and MI, we classified left
hand, right hand, and legs movement or imagery; subjects were allowed to move their fingers
and toes in the MM condition. In VIS we had subjects watch four visual categories: faces,
houses, tools, and a fixation screen that corresponds with idle viewing. In both the MM and
MI studies the subjects see an avatar standing in the center of a three-door room. In each run
of the cue-based experiment, the subjects received equal amount of triggers from each class, in
a pseudo-random order. The total number of triggers was determined based on 10-12 minute
run. For MM, MI, and VIS we used 30, 45, and 40 samples per class respectively (in MI one
subject had performed the runs earlier in the research and had only 30 samples per class). We
used only three runs for model training in MM due to a higher signal to noise ratio, while for
MI and VIS we used four runs; the training and testing data were kept in a chronological order.
Written informed consent was obtained from all volunteers, and the studies were approved by
the Ethics Committee of the Weizmann Institute of Science.
Our results were obtained oine from recorded data. However, the calculation was simulated
by our real-time system as if it were an actual real-time experiment; our system received and
classified a new DICOM file every 2 seconds, as would be the case in a real-time study.
3.1
Results
Figure 1 compares average classification accuracy across subjects between ML and ROI. Classification accuracy coincides with the hemodynamic response: in TRs 1 and 2 the accuracy
is around chance level, then it gradually increases with the best accuracy in TR 3 to TR 5-6
(6-12 seconds after the trigger), and gradually drops to chance level. The results indicate that
identifying the most relevant features by using IG and classifying using SMO is superior to the
ROI method. In MM above 90% accuracy can be achieved even at TR 3 (6 seconds after a
cue), which is better than the 10 seconds delay in the ROI method. In MI, accuracy of 90%
in TR3 and 95% in TR4 can be achieved, which is higher than the highest ROI accuracy at
TR 5. Figure 2 provides the results of the VIS study, indicating 78% in TR3. Accuracy up to
87% can be reached with a longer delay. Finally, by reducing the voxel count (i.e., raising the
IG threshold) to the smallest number of voxels that permit perfect classification of all training
examples, we can achieve similar accuracy to that achieved by using all voxels with positive IG,
but with greatly reduced computation time.
Article ID 034-3
DOI:10.3217/978-3-85125-378-8-34
Figure 2: ML results obtained for VIS using (a) all voxels with IG above 0 and (b) the smallest number of voxels that
permit perfect classification of all training examples
Discussion
Our results indicate that the system is robust across subjects, and efficient in classification time
and CPU usage. We have identified multi-voxel brain patterns in the primary motor cortex,
the fusiform face area (FFA), the parahippocampal place area (PPA), and the lateral occipital
object area (LO). Future experiments may lead to identifying more complex brain patterns.
We show successful classification in three dierent tasks in two several brain areas, using the
same method. The fact that this system works online opens the door to new paradigms in BCI,
NF, and brain rapid brain mapping.
Acknowledgements
This research was supported by the EU project VERE (number 257695), www.vereproject.eu.
References
[1] S.M. Coyle, T.E. Ward, and C.M. Markham. Braincomputer interface using a simplified functional
near-infrared spectroscopy system. Journal of neural engineering, 4:219, 2007.
[2] R. Sitaram, A. Caria, and N. Birbaumer. Hemodynamic braincomputer interfaces for communication and rehabilitation. Neural networks, 22(9):13201328, 2009.
[3] S. Lee, S. Halder, A. K
ubler, N. Birbaumer, and R. Sitaram. Eective functional mapping of fmri
data with support-vector machines. Human brain mapping, 31(10):15021511, 2010.
[4] I.H. Witten, E. Frank, and A.M. Hall. Data mining: Practical machine learning tools and techniques, 3rd edition. 2011.
[5] O. Cohen, S. Druon, S. Lengagne, A. Mendelsohn, R. Malach, A. Kheddar, and D. Friedman. fmri
robotic embodiment: A pilot study. In Biomedical Robotics and Biomechatronics (BioRob), 2012
4th IEEE RAS EMBS International Conference on, pages 314319, June 2012.
[6] O. Cohen, M. Koppel, R. Malach, and D. Friedman. Controlling an avatar by thought using
real-time fmri. Journal of neural engineering, 11(3):035006, 2014.
[7] J.R. Quinlan. Induction of decision trees. Machine Learning, 1(1):81106, 1986.
[8] T. Hastie and R. Tibshirani. Classification by pairwise coupling. In I.M. Jordan, J.M. Kearns, and
A.S. Solla, editors, Advances in Neural Information Processing Systems, volume 10. MIT Press,
1998.
[9] J. Platt et al. Sequential minimal optimization: A fast algorithm for training support vector
machines. 1998.
Article ID 034-4
DOI:10.3217/978-3-85125-378-8-35
1 Introduction
Stroke is the main cause of adult disability in high-income countries worldwide; therefore, several
techniques have been proposed to reverse motoric impairments [1]. Over the past years, braincomputer interfaces (BCIs) have been proposed as tools that can be used in neurorehabilitation [2, 3].
Recently, it was shown that plasticity could be induced by using the movement-related cortical
potential (MRCP) as a control signal in a BCI that provides somatosensory feedback from electrical
stimulation [4, 5]. The MRCP is a low-frequency brain potential associated with executed and
imagined movements [6]. The potential can be observed in the electroencephalogram (EEG) up to two
seconds before a voluntary movement; therefore, it has been proposed for BCIs that are used for
induction of Hebbian-associated-like plasticity [4, 5]. The MRCP also contains kinetic information of
the executed or imagined movement such as the level of force and speed [6, 7]. This kinetic
information may be utilized in designing more sophisticated rehabilitation protocols where variety of
tasks is required. It has been shown that variety of tasks in a rehabilitation paradigm is good for
maximizing the retention of relearned motor skills [8]. To implement this in BCI protocols, we need
to detect the movement intention and classify different task parameters (force and speed) to activate
Article ID 035-1
DOI:10.3217/978-3-85125-378-8-35
correlated somatosensory feedback (through functional electrical stimulation). The detection and
decoding of the MRCP have been performed offline in a previous study [7].
The aim of this study was to implement and test the feasibility of detecting the MRCP and
discriminating between fast movements with a high level of force and slow movements with a low
level of force.
2 Methods
2.1 Subjects
Five healthy volunteers (1 female and 4 males: 295 years old) and one stroke patient with lower
limb paresis (77 years old, male, infarction, right side affected, 46 days since event) participated. All
the subjects gave their informed consent before participation, and the procedures were approved by
the local ethical committee (N-20130081).
Article ID 035-2
DOI:10.3217/978-3-85125-378-8-35
initial negative phase of the MRCP was matched to the surrogate channel. The template was extracted
from an average of the 2x50 movements that were performed in the training session. The length of the
template was 2 s, and it was extracted from the peak of maximum negativity and 2 s prior to this
point. The detection threshold was obtained for each subject using a receiver operating characteristic
(ROC) curve. The ROC curve was generated through 3-fold cross-validation of the training data. The
detection threshold was selected to maximize the true positive rate (TPR), but on the expense of more
false positive detections (FPs). Detections occurred when the cross-correlation, computed between the
template and the surrogate channel, exceeded the detection threshold, and the EOG activity was lower
than 125V. Detector decisions were made every 200 ms. To reduce the number of FPs, the detector
was disabled for 3 s after detection. The detection was evaluated through the TPR and number of
FPs/min. Six temporal features were extracted from the initial negative phase of the MRCP from the
detection onset and 2 s prior this point. These features were: i+ii) slope and intersection of a linear
regression fitted to the entire interval (-2 s until the detection onset), iii+iv) slope and intersection of a
linear regression fitted to the last 0.5 s of the interval (-0.5 s until the detection onset), v) maximum
negative amplitude and vi) mean amplitude.
The features were classified using a support vector machine (SVM) with a linear kernel. All trials
from the training session were used to build the classifier that was used in the testing session. The
classification accuracy was obtained for the correctly detected movements.
3 Results
The results are summarized in Table 1. The TPR was 854 % for the healthy subjects and 85 % for
the stroke patient, and less than 1 FP/min was registered. The number of correctly detected and
classified movement was higher for the healthy subjects (653 %) compared to the patient (51 %).
Healthy/stroke subject
Detection
[%]
Detection and
classification
FPs/min
H1
H2
H3
H4
H5
MeanSD
87
90
82
78
89
854
62
70
65
61
67
653
0.6
0.6
0.2
0.1
2.4
0.80.8
S1
85
51
0.9
Table 1: Performance of the system for healthy subjects and the stroke patient. Detection is the
TPR, and the column to the right is the performance when the detected movement is classified.
4 Discussion
In this study, movements were detected and the kinetic information classified for healthy subjects
with a performance of 653 % correctly detected and classified movements. The proposed techniques
were also tested by a stroke patient where 51 % of the movements were correctly detected and
classified.
The detection performance was slightly higher compared to what has been found in previous
studies where an online system was simulated [7, 9]; this may be explained by the selection of the
Article ID 035-3
DOI:10.3217/978-3-85125-378-8-35
detection threshold. In the previous studies the detection threshold was based on the midpoint of the
upward convex part of the ROC curve to obtain a tradeoff between the TPR and number of FPs. In
this study, the detection threshold was selected to increase the TPR, but on the expense of more FPs.
The number of FPs was accounted for by disabling the detector for 3 s after detection.
The performance of the classifier performed slightly worse than in the offline studies which may
be explained by the lower detection threshold leading to an earlier detection of the movements and
therefore less kinetic information can be extracted from the 2 s of data extracted prior the detection
point. Detection latencies were not calculated in the current study, but it is expected to be in the range
of 100 ms [9].
The findings suggest that the BCI system can be used for neuromodulation where task variability
can be introduced (two classes). The system performance is in the range of what has been reported to
induce plastic changes [4], although it remains an open question what the lower limit is for inducing
plasticity [3]. The effect on the induction of plasticity should be investigated to see if current BCI
protocols for this purpose can benefit from the introduction of task variability.
5 Acknowledgement
The authors would like to thank Physiotherapist Christoffer Wang, Trningsenheden Aalborg, for
recruiting the stroke patient.
References
[1]
[2]
[3]
[4]
[5]
[6]
[7]
[8]
[9]
Article ID 035-4
DOI:10.3217/978-3-85125-378-8-36
Motor-based brain-computer interface (BCI) use is a learned skill that has been
shown to involve multiple cortical areas, though it only explicitly requires modulation
of a small area of cortical tissue. The roles being played by these other areas have not
yet been determined. In this study, using an electrocorticographic (ECoG) model, we
apply a novel computational approach to quantifying the strength and nature of
interactions between putative primary motor cortex and ventral premotor cortex (PMv)
during BCI use and across BCI skill development. Our findings suggest multiple roles
being carried out in PMv that change in strength and nature during skill acquisition.
1 Introduction
Brain-computer interfaces (BCIs) show great promise both as a medical technology and a platform
from which to investigate the adaptive capacity of the nervous system (Wander & Rao, 2014). Though
it has been shown previously that many cortical (Ganguly et al., 2011; Wander et al., 2013) and
subcortical (Koralek et al., 2012) areas are active during use of a motor-based BCI, and that as
subjects gain experience activity in some of these areas lessens, it is still uncertain what role these
areas play in the initial learning and subsequent execution of the neuroprosthetic skill.
Qualitative review of activity patterns in motor cortical networks during BCI use has revealed that
some areas show increased activity only during active execution of motor imagery or overt movement
whereas others show increased activity during the task, regardless of the imagery state of the user
(Wander et al., 2013). In this study, we employ a newly developed computational method to
investigate the roles being played by ventral premotor cortex (PMv) during BCI use. PMv was
selected as a cortical area of interest because of its previous implication in effective motor skill
execution and adaptation (Xiao et al., 2006). We hypothesize that high-gamma (HG; 70-200 Hz)
activity in PMv and HG interactions with the controlling electrode (CTL) will demonstrate multiple
roles being carried out by PMv during BCI use. Further, we hypothesize that some of these PMv-toCTL interactions will lessen in strength as subjects develop automaticity in BCI task execution.
2 Methods
We have previously published methods describing our subject population, ECoG recording
methods and right justified box (RJB) task structure (Wander et al., 2013), but will briefly summarize
and discuss additional points of note.
Article ID 036-1
DOI:10.3217/978-3-85125-378-8-36
The RJB is a one-dimensional BCI task where HG power in a single ECoG electrode is mapped to
velocity of a cursor in the control dimension. The cursor travels across the workspace at a fixed
horizontal velocity while the user controls its vertical velocity to attempt to move it in to a target area.
This is a retrospective analysis of previously collected BCI data. Subject selection criteria were as
follows: subjects needed to (a) have conducted at least 40 trials of RJB task, (b) performed above
chance levels on the task, and (c) have electrode coverage of PMv. This resulted in inclusion of 10
subjects in the current study. Labels for individual electrodes were estimated using the human motor
area template (HMAT) (Mayka, et al., 2006). From all the electrodes recorded (N=792), we selected
only electrodes that were being used for BCI control (N=10) and non-control electrodes determined to
be over PMv (N=39) for subsequent analyses. The classifications of PMv electrodes were as follows:
electrodes were considered control-like if they exhibited significant HG increases during the feedback
period (relative to rest) for up targets, but not for down targets; they were considered effort-like if they
exhibited significant HG increases during the feedback period for both up and down targets; and they
were considered non-modulated if they did not exhibit task-positive modulation.
We assessed transient temporal correlations in HG activity between the CTL and PMv electrodes
using short-time windowed covariance/correlation (STWC) (Blakely et al., 2014). This method is
specifically suited to teasing out amplitude-amplitude interactions (correlations) in neural signals that
are not only transient (i.e. event-driven), but also potentially occur at slightly different points in time
in each of the two signals. STWC maps were calculated on smoothed (~50 msec FWHM) HG power,
using a window width of 500 msec and a maximum lag of 300 msec. STWC maps were then
generated by realigning individual maps based on HG onsets and averaging across all trials. To
isolate interactions occurring near the time period of HG onsets (+/- 500 msec), we finally extracted
the maximum STWC coefficient and corresponding lag from each average map. Significant
interactions were selected using a phase-randomized, surrogate-signal bootstrapping approach where
STWC coefficients were considered significant if they had less than a 5% chance of occurring in the
distributions of maximal STWC coefficients.
Figure 1 - (a) Example average time series HG activity showing the four electrode classes discussed
in this manuscript and (b) anatomical location of all electrodes considered in these analyses. (c) An
example STWC map, thresholded at +/- 1 standard deviation from the mean. Lags (y-axis) denote the
temporal relationship between the two signals being considered. The white (+) denotes an example peak
lag and STWC coefficient that was extracted from the map.
3 Results
Behavioral performance. By design, all subjects performed the BCI task at above chance level,
as was discussed previously. Mean task performance was 74.2% (=8.57%, N=10). For individual
subjects, task performance ranged from 60% to 86.2%.
HG activity at CTL and PMv. All subjects were able to significantly modulate HG power at
CTL for up targets relative to rest periods (two sample students t-test, p<0.05 in all cases, employing
Article ID 036-2
DOI:10.3217/978-3-85125-378-8-36
within-subject false discovery rate [FDR] correction). Each subject had at least one electrode over
PMv that exhibited significant modulation during the feedback period relative to rest. All PMv
electrodes were classified according to the methodology described above. There were 14 electrodes
classified as non-modulated, 13 electrodes classified as control-like, and 12 electrodes classified as
effort-like.
PMv-to-CTL HG interactions. In looking at the STWC maps generated for the multiple PMv-tocontrol interactions, it was clear that there is no single role carried out by PMv during BCI task
execution. Figure 1 gives a clear example of PMv becoming active before CTL, but other examples
exist showing both contemporaneous and lagging interactions. To quantify these relationships, we
used the STWC peak-finding approach described above to determine single lag values relative to HG
onset for all electrodes meeting the STWC threshold criteria. 23 of the 39 PMv electrodes exhibited
significant interactions with CTL (p<0.05; previously described bootstrap approach).
Figure 2 depicts the lags and interaction strengths for each of these 23 electrodes. When
considered together, peak covariance lags are not statistically different from zero (one-sample twotailed t-test, N = 39, p = 0.3157), however, there is an interesting relationship between PMv electrode
classes and covariance lags. Specifically, electrodes classified as being effort-like significantly lag
CTL by an average 81.1 msec (+/- 29 msec s.e.m.; one-sample two-tailed t-test, N=8, p=0.0267) and
at lags significantly lower than control-like electrodes (two-sample two-tailed t-test, N=[9, 8],
p=0.0398). Neither control-like nor non-modulated electrodes demonstrated lags that were
significantly different from zero, nor were they significantly different from each other. We also
observed that effort-like electrodes showing significant interactions were typically located more
anteriorly and superiorly within PMv, though we recognize that the transformations necessary to map
electrode locations to common brain space are approximations and may not be appropriate to uncover
spatial relationships on such a small scale.
The above results suggest that there are multiple meaningful interactions that take place between
PMv and M1 during the execution of a BCI task, yet it remains an open question as to which of these
interactions are involved in the continued execution of the task, and which are specifically associated
with the original acquisition of the neuroprosthetic skill. With that purpose in mind, we divided all
trials in half chronologically into early and late trials, and re-evaluated STWC maps on these
subgroups. Overall, PMv-to-CTL STWC strengths decreased significantly (p=0.0301) from early to
late trials, and lags changed significantly from lagging CTL by 44.2 +/- 19.7 msec (p=0.0305) to not
Figure 2 - STWC results. (a) Scatter plot showing lags and STWC coefficients for all PMv-to-CTL
interactions, taken across all trials. (b-c) By-class comparison of distributions for STWC coefficients and
lags, respectively. (d) Early-to-late comparison of STWC coefficients overall and by electrode class. (e)
Early-to-late comparison of lags overall and by electrode class. See text for summary and discussion.
Article ID 036-3
DOI:10.3217/978-3-85125-378-8-36
statistically lagging. When breaking down electrodes by class, we found significant decreases in
STWC for the control-like PMv electrodes (p=0.0419), a significant change in lags for control-like
electrodes (from lagging to leading CTL, p=0.0139), and no change in lags for effort-like electrodes.
4 Discussion
In this brief manuscript we have demonstrated that there are multiple statistically significant and
meaningfully different roles being performed by PMv during BCI task execution and that the nature
of at least one of these roles changes as BCI users gain task experience. These findings suggest that
previously demonstrated distributed activity changes seen over the course of BCI skill acquisition
may not be solely attributable to optimization of cortical activity, but that they may also play a
meaningful role in task learning and execution.
It is important to note that interactions evaluated by the STWC method are correlative in nature
and not causal. Interventional studies specifically altering function in one or more of these distributed
cortical areas will be necessary to understand the causal interactions between these regions.
Ventral premotor cortex was an excellent first candidate for this type of analysis, but as we have
demonstrated previously, other distributed cortical areas also appear to be task-modulated during BCI
use. A necessary extension to this work will be to evaluate the activity and interaction patterns taking
place in these areas. Additionally, for the sake of brevity, we have limited our analyses simply to HGHG interactions. There may be an excellent opportunity to uncover some of the neural underpinnings
of so-called BCI illiteracy by expanding the field of view to include cross-frequency interactions
between HG and the sensorimotor rhythms commonly used in non-invasive BCIs.
References
Blakely, T., Ojemann, J. G., & Rao, R. P. N. (2014). Short-time windowed covariance: A metric for
identifying non-stationary , event-related covariant cortical sites. J Neurosci Meth, 222, 2433.
Ganguly, K., et al. (2011). Reversible large-scale modification of cortical networks during
neuroprosthetic control. Nature Neuroscience, 14(5), 6627.
Koralek, A. C., Jin, X., Long II, J. D., Costa, R. M., & Carmena, J. M. (2012). Corticostriatal
plasticity is necessary for learning intentional neuroprosthetic skills. Nature, 483, 331335.
Mayka, M., et al. (2006). Three-dimensional locations and boundaries of motor and premotor cortices
as defined by functional brain imaging: a meta-analysis. Neuroimage, 31(4), 14531474.
Wander, J. D., et al. (2013). Distributed cortical adaptation during learning of a brain-computer
interface task. Proceedings of the National Academy of Sciences, 110(26), 1081810823.
Wander, J. D., & Rao, R. P. (2014). Braincomputer interfaces: a powerful tool for scientific inquiry.
Current Opinion in Neurobiology, 25, 7075.
Xiao, J., Padoa-Schioppa, C., & Bizzi, E. (2006). Neuronal correlates of movement dynamics in the
dorsal and ventral premotor area in the monkey. Exp Brain Research, 168(1-2), 10619.
Article ID 036-4
DOI:10.3217/978-3-85125-378-8-37
Echo state networks are good candidates for use as classifiers in brain computer interfaces. They can also be used to explore the data to some extent by manipulating their
training. In this paper similar results were achieved to those in the Brain Computer Interface Competition IV in classifying four-class movements from magnetoencephalography.
Introduction
Artificial neural networks consist of a series of many neurons which are connected to each other
in a network. Data is fed into the network and propagates from neuron to neuron. Each neuron
generates an output from all the combined inputs according to an activation function built into
it. How much influence each input to the neuron has is determined by a weight placed on each
connection. The internal values of a number of special output neurons are the final result of the
network. Most neural networks are feed-forward, meaning the propagation is in one direction
(no feedback). A recurrent neural network (RNN) has a memory because the inputs propagate
in all directions, blending with earlier inputs and being slowly drowned out by new inputs.
In a RNN the weights of all the connections between the neurons are modified. This creates
a very complex non-linear optimization problem which can be very difficult to solve. RNNs
and echo state networks (ESNs) are very similar architecturally, but dier in training. In an
ESN the weights of all the connections between the neurons are randomly generated at the
start of training and set permanently. This creates a large reservoir (i.e. reservoir computing)
of randomly connected neurons with random weights. Only the weights leading to the output
neurons are modified in training, turning a complex non-linear optimization problem into a
simple one solvable with linear regression [4].
ESNs are good candidates for brain computer interfaces (BCIs). They perform well in tasks
such as speech and handwriting recognition where there is high variability in the input, and a
noisy signal. Their memory allows recognition of events over time (e.g. slow rise and fall in
electrical potential), and events in sequence (e.g. propagation of brain activity). ESNs naturally
produce a time series output, allowing continuous BCI output such as arm velocity over time.
Crucially, they are easy to train; a dataset can be plugged into an ESN classifier with little
pre-processing, and produce a classification level above chance. ESNs have performed well when
compared to more commonly used classification techniques in several BCI studies [2, 1, 5].
ESNs are black box classifiers i.e. it is difficult or impossible to determine how the input
signal is manipulated to produce the output. This is not much of a problem in practical
applications, but can make it difficult to optimise the BCI in research. The classifier can also
cheat. For example, the classifier could exploit irregularities in the data, instead of recorded
signals. Non-brain signals could also be exploited e.g. blinking, or bumping EEG cables.
Article ID 037-1
DOI:10.3217/978-3-85125-378-8-37
We propose that with careful manipulation of the training these problems can mostly be
avoided. For example, if the classification accuracy is low when trained only on non-brain
signals and high when trained on signals from anatomically relevant brain areas, we can infer
that cheating is not occurring. The characteristics of the signal the classifier is using can also be
approximated by removing parts of the dataset and evaluating the classification accuracy. The
purpose of this paper is to determine the eectiveness of ESNs in a 4-class BCI classification
problem, also to investigate this manipulation of the training.
2
2.1
Methods
Training Dataset
Recordings from participants were extracted from one of the datasets distributed as part of the
BCI Competition IV [6]. To generate this dataset two right handed subjects moved a joystick
4.5 cm from the centre position in self-chosen cardinal directions. Trial onset was cued with
a grey circle. After a variable delay the grey circle would disappear signalling the subject to
move. During the trial the subjects were instructed to fixate on a red cross and not to blink.
Magnetoencephalography (MEG) was recorded at 625 Hz, using ten channels (LC21, LC22,
LC23, LC31, LC32, LC41, LC42, RC41, ZC01, ZC02) located above the motor areas. The MEG
signals were band-pass filtered 0.5 Hz to 100 Hz and re-sampled at 400 Hz. The recordings
were then cut into 1 s second trials, with movement onset at 200 ms. The training data set
contained 40 trials for each of the four classes per subject, the testing dataset contained 73
trials (true classification unknown during the BCI Competition). As the BCI Competition is
over, true labels are available for the testing dataset, and so the training and testing datasets
were combined to enlarge the training dataset. The trial order was randomised before training.
2.2
Classifier Training
MEG data was processed with the open-source OrGanic Environment for Reservoir computing
(Oger) toolbox that provides a robust implementation of ESNs. To prevent saturation of the
network the mean was shifted to zero for each channel. It was then normalised to 1, to keep
it in the range of the neuron activation function. Data before movement onset was cut because
Waldert et al. [6] found that classification was at chance level until then.
ESNs cannot be used directly for classification as the output neurons produce fluctuating
values. By representing the movement direction in the training data as four outputs from the
ESN (1.0 for the desired class), the channel with the highest output became the classification.
Standard deviation was reported to reflect the varying performance of the randomly generated
reservoirs. The accuracy of the trained ESN was determined by ten-fold cross validation.
The time taken to train a network increases exponentially with the reservoir size (Figure 1).
To achieve sufficient accuracy with a reasonable training time, a reservoir size of 500 neurons
was chosen. The remaining parameters of the ESN were optimised by sweeping through a range
of values and choosing those with the highest classification accuracy. This gave final values of
0.2 for the leaking rate, 0.1 for the input scaling, and 0.9 for the spectral radius.
Results
Figures 1 to 3 show results generated during the optimisation of the classifier, and the final
results in 4. Figure 2 shows the accuracy when the network is trained on each electrode indi-
Article ID 037-2
DOI:10.3217/978-3-85125-378-8-37
Figure 1: Size of echo state reservoir and its Figure 2: Accuracy for each electrode, all elecinfluence on accuracy and training time.
trodes, and left hemisphere only.
Figure 3: Accuracy of unfiltered data com- Figure 4: Final accuracy for subjects individpared to filtering with a 3rd order Butterworth ually and combined, compared to shued and
filter at dierent thresholds.
randomised classifiers.
vidually, all the electrodes, and only the electrodes on the left hemisphere. The left electrodes
all perform similarly at around 35 % accuracy. The electrodes located along the mid-line, and
the right hemisphere (RC41, ZC01, ZC02) perform at close to chance level, suggesting that the
classification is not due to EMG from the right wrist. This dierence in accuracy also indicates
that the classifier is not using EOG artefacts, as these artefacts would eect both hemispheres
equally. An accuracy of 57 % is achieved when all electrodes are combined, and with only the
electrodes of the left hemisphere accuracy increases to 61 %. A dierent subset of the electrodes
may perform better, but there are too many combinations to test exhaustively.
When the MEG signal is high-pass filtered at 10 Hz the classification accuracy drops o to
chance level (Figure 3). When low-pass filtered the accuracy remains the same until it drops
o at 2 Hz to 3 Hz. From these drops in accuracy we can infer that the signal being used by the
ESN for classification is in the range 1 Hz to 3 Hz. However, this may be because the ESN used
is most sensitive to frequencies in this range, and using dierent parameters when creating the
ESN may yield dierent results.
Using the results described in Figures 1 to 3, only the subjects left hemisphere electrodes
were used, and then bandpass filtered (1 Hz to 3 Hz). The final classification accuracy was
compared to shued and random classifiers (Figure 4). Shuing the inputs so they do not
match the outputs should make it impossible for the ESN to produce the correct output, and
Article ID 037-3
DOI:10.3217/978-3-85125-378-8-37
it should function like a random classifier with a mean accuracy of 25 %. In this case the
average accuracy for shued data is 27 % - indicating that there are some irregularities in the
dataset which can be exploited by the ESN. Therefore this is a better benchmark for chance
level accuracy, which the reported results still exceed.
Discussion
Using the methods described, a classification accuracy of 65 % was achieved for subject 1,
and for subject 2 a lower classification accuracy of 36 % (Figure 4). This is higher than, but
consistent with, the winning BCI Competition IV entrant who achieved 59.5 % accuracy for
subject 1, and 34.3 % for subject 2, with a smaller training dataset and without the benefit of
checking their results [3]. The other entrants did not achieve results above chance level.
Manipulating the training dataset showed that brain activity related to the task was localised
to the left hemisphere. More training rounds, and a higher electrode density could potentially
be used to localise it more precisely. The approximate frequency of the activity was also
determined, which shows the potential for using an ESN as a crude way to investigate brain
activity with unknown characteristics, or find activity in new frequency bands.
When the ESN was trained with both subjects data combined it achieved an accuracy of
47 %. This is unusual as typically BCIs must be trained for each individual due to variation in
brain activity. This may suggest that a single ESN BCI can be trained to work with multiple
subjects, or be generalised to work with any subject. However it is impossible to tell without
data from more subjects. A result of 47 % is comparable to the accuracy from simply combining the results from both subjects. This means the ESN may simply have been trained to
dierentiate between subjects and classify them accordingly (this in itself would be interesting).
References
[1] GP Cedric, M Sebag, A Souloumiac, and A Larue. Ensemble learning for brain computer-interface
using uncooperative democratic echo state communities. In Cinqui`eme conference pleni`ere francaise
de Neurosciences Computationnelles, 2010.
[2] Aysegul Gunduz, Mustafa C. Ozturk, Justin C. Sanchez, and Jose C. Principe. Echo State Networks for Motor Control of Human ECoG Neuroprosthetics. 2007 3rd International IEEE/EMBS
Conference on Neural Engineering, pages 514517, May 2007.
[3] Sepideh Hajipour Sardouie and Mohammad Bagher Shamsollahi. Selection of Efficient Features for
Discrimination of Hand Movements from MEG Using a BCI Competition IV Data Set. Frontiers
in neuroscience, 6(April):42, January 2012.
[4] Herbert Jaeger. The echo state approach to analysing and training recurrent neural networks
- with an erratum note. Technical report, German National Research Center for Information
Technology, St. Augustin, 2001.
[5] Pieter-jan Kindermans, Pieter Buteneers, David Verstraeten, and Benjamin Schrauwen. An Uncued
Brain-Computer Interface Using Reservoir Computing. In Workshop: machine learning for assistive
technologies, Proceedings, page 8. Ghent University, Department of Electronics and information
systems, 2010.
[6] Stephan Waldert, Hubert Preissl, Evariste Demandt, Christoph Braun, Niels Birbaumer, Ad Aertsen, and Carsten Mehring. Hand movement direction decoded from MEG and EEG. The Journal
of neuroscience : the official journal of the Society for Neuroscience, 28(4):10008, January 2008.
Article ID 037-4
DOI:10.3217/978-3-85125-378-8-38
Introduction
Brain computer interface (BCI) is a promising new technology with possible application in
neurorehabilitation after spinal cord injury [4]. Motor imagery (MI) based BCI coupled with
functional electrical stimulation (FES) enables the simultaneous activation of the motor cortices
and the muscles they control [4]. When using an MI-based BCI-FES, the subject activates the
motor cortex using MI of limb movement. For rehabilitation purposes the MI can be replaced
with attempted movement (AM). The BCI system detects the motor cortex activation and
activates the FES attached to the muscles of the limb the subject is imaging to move. In this
way the aerent and the eerent pathways of the nervous system are simultaneously activated.
This simultaneous activation encourages Hebbian type learning [2] which could be beneficial in
functional rehabilitation after spinal cord injury (SCI).
FES is already in use in several SCI rehabilitation units but there is currently not enough
clinical evidence to support the use of BCI-FES for rehabilitation.
The aim of this study is to assess outcomes in sub-acute tetraplegic patients using MI-based
BCI-FES for functional hand rehabilitation.
2
2.1
Methods
Subjects
Four sub-acute tetraplegic patients ps1, ps2, ps3 and ps4 (male, 70, 25, 35, and 20 years
old respectively) participated in this study after giving their informed consent. They all had
incomplete cervical injuries respectively at neurological levels, C6, C4, C6, and C5 with ASIA
classification levels C, B, B and C. As the result of the injury the patients lost most of their
hand functions with minimal elbow and shoulder movement available on both hands. They
were recruited into this after about eight weeks following their injuries.
Article ID 038-1
2.2
DOI:10.3217/978-3-85125-378-8-38
At the start and at the end of the study, an occupational therapist assessed the hand functions
of the patients using the manual muscle testing (MMT) [1] grading system. The grades ranges
from 0 to 5 for no observable or palpable muscle contraction and normal muscle function
respectively. The muscles tested included the extensor digitorum (ED), extensor carpi radialis
(ECR), extensor pollicis longus (EPL), flexor carpi radialis (FCR), flexor digitorum profundus
(FDP), and intrinsic hand muscles (I).
The initial and final assessment also involved measurement of multichannel EEG of the
patients with 48 channels while they were performing AM of the left and right hand. EEG
was recorded using the g.USBamp by (GTEC, Austria) at a sample rate of 256 Hz with filters
between 0.5 and 60 Hz and a notch at 50 Hz. Event related desynchronisation/synchronisation
(ERD/ERS) [3] maps used to quantify movement related cortical activities during MI/AM was
compared between before and after the study.
2.3
The patients were scheduled to attend therapy sessions between Monday and Friday until they
completed twenty sessions. This was in addition to their regular physiotherapy. Each session
lasted for about an hour.
BCI-FES: At the beginning of each therapy session, a BCI classifier was computed to
enable the control of the BCI-FES system. To obtain the classifier, a patient was instructed to
attempt closing and opening of the left and the right hand (20 trials for each hand). During
this task EEG was recorded from three pairs of bipolar electrodes to derive C3 (Fc3-Cp3), Cz
(Fcz-Cpz) and and C4 (Fc4-Cp4). The input signal from the amplifier was bandpass filtered
(5th order Butterworth) online between 0.5 to 30 Hz. The time domain parameter (TDP) [6] of
the filtered data was computed. After extensive test on healthy people, we preferred the TDP
to the bandpower feature because it gave better classification accuracy for small number of
trials and eliminated the need to select a specific frequency band. The TDP was computed in a
similar way as that described by Vidaurre and colleagues [6]. The feature was used to compute
linear discriminant analysis classifiers to discriminate the left and right AM from resting state.
This gave a 2-class system comprising a hands AM against the resting state. The classifiers
were saved for online use. These steps were all integrated into graphical user interface under
MATLAB and Simulink. This process and setting up the EEG on the patient typically took 15
minutes.
In the online classification, TDP was estimated using Simulinks dierence blocks to compute
derivatives. Each sample of the signal in the feature space was binary classified either as an
Active or Relaxed state using the classifier computed oine. The Active state occurred when
the subject attempted opening and closing of a hand while the Relaxed state corresponded to
a resting period. The classifier output (i.e Active or Relaxed) was then buered for a variable
length of time up to a maximum of 3 s or 768 samples. An Active state was detected when
the buer was filled with approximately 95% of Active state. The size of the buer, usually
1-2 s long, was determined for the patient and optimized to significantly reduce false positives
which was reported by the patient. When the patient found it difficult to control the system
or reported false positives, the classifier was updated online. The classifier was updated using
fixed rate supervised mean and covariance adaptation methods described elsewhere [5].
The FES device (Rehastim by Hasomed, Magdeburg) was used to assist the patients to
perform grasp functions. Electrodes were attached to sequentially stimulate the extensor digitorum, extensor pollicis longus, flexor digitorum superficialis and the flexor pollicis brevis
Article ID 038-2
DOI:10.3217/978-3-85125-378-8-38
muscles. The frequency of stimulation was set to 26 Hz and the pulsewidth was set to 200 s.
The FES current typically 15-35 mA was chosen for each subject to extend and flex the hands
or cause visible muscle contraction without discomfort.
The patient was instructed to attempt closing and opening of the hand with the electrodes
to activate the FES. Note that this was done on one hand at a time. The FES was activated
for 10 s to repeatedly assist in the opening and closing of the hand when the active state was
detected. A therapy session lasted for 60 minutes with 30-40 trials on each hand separately.
Patients ps1, ps2, and ps4 were able to control the BCI-FES system from the first session
because they had ERD of their EEG during AM. Patient ps3 had to be trained for four sessions
to produce ERD of his EEG before the patient could have a good control of the system. The
patients, ps1, ps2, ps3 and ps4 had a false positive of 10% ,0%,10%,0% respectively at the initial
sessions but this decreased to zero at the end of the therapy.
The MMT grading for the right hand of the patients before and after the therapy is shown
in Table 1. All the patients had improvement in the MMT with patient ps4 showing the most
improvement. Patient ps4 unlike the other patients completed the study in the shortest time
and therefore may have received a better eect. Patient ps1 showed the most improvements in
the FCR which moved up by one grade. Patients ps2 and ps3 who had more severe injuries
according the ASIA scale had improvements in a smaller number of muscles although patient
ps3 achieved the maximum MMT grading for ED and ECR.
Patients
ps1
ps1
ps2
ps2
ps3
ps3
ps4
ps4
(b)
(a)
(b)
(a)
(b)
(a)
(b)
(a)
ECR
44+
1+
3+
4
5
0
4+
EPL
33
0
0
0
0
0
3+
FCR
3
4
0
0
0
0
0
4
FDP
44
0
0
0
1
0
4
I
3
3
0
0
0
0
0
3+
Table 1: MMT gradings before and after the therapy for the right hand. a, after; b, before;
-/+, -0.5/+0.5.
The ERD/ERS maps computed for before and after the therapy are shown on Figure 1.
The data presented is of the right hand AM. ERD is shown with negative values while ERS is
shown with positive values on the figures. The dashed line on the ERD/ERS map show when
the patients were given a cue to begin AM.
Visual inspection of the ERD/ERS maps show that there are more ERDs after therapy for
all patients except for patient ps2. The improvement in the ERDs are within the characteristic
movement related ERD bands typically within 8-12 Hz and 16-24 Hz. These ERD changes are
in support of the MMT grades.
For patient ps2, the ERD seen before the therapy was not present after the therapy. This
patient had the highest injury level (C4) classified as ASIA B. With such a high level and
Article ID 038-3
Frequency (Hz)
ps1 after
ps1 before
40
DOI:10.3217/978-3-85125-378-8-38
ps2 after
ps2 before
30
20
10
-2
-1
0
1
ps3 before
-2
-1
-2
-1
0
1
Time (s)
-2
-1
40
0
1
ps3 after
-2
-1
0
1
ps4 before
-2
-1
-2
-1
0
1
ERS
-2
-1
0
1
ps4 after
30
20
10
0
ERD
1
1
Figure 1: ERD/ERS maps (over channel C3) of right hand AM for before and after the therapy.
Movement cue was given at t=0 ms. (Generated with EEGLAB, http://sccn.ucsd.edu/eeglab).
severe injury, it might be harder for the patient to recover functions. This is reflected in the
MMT grades before and after the therapy in which this patient was graded the least. The
disappearance of the ERD might reflect cortical reorganisation.
Conclusions
The MMT grading system showed improvement for all our patients using BCI-FES. This improvement and the increase in ERD activities in three patients were promising results which
suggested that BCI-FES might be used for functional rehabilitation in SCI patients. However
the improvements may have been a result of natural recovery, conventional therapy and the
use of FES. We will therefore require control patients who will receive only FES therapy. The
study is ongoing and we are recruiting more patients in order to find the statistical significance
of using BCI-FES compared to therapy using only FES for rehabilitation in SCI patients.
References
[1] S. C. Cuthbert and G. J. Goodheart. On the reliability and validity of manual muscle testing: a
literature review. Chiropractic & Manual Therapies, 15(1):4, 2007.
[2] D.O. Hebb. The Organization of Behavior: A Neuropsychological Theory. New York: John Wiley
& Sons, 1966.
[3] G. Pfurtscheller and A. Aranibar. Event-related cortical desynchronization detected by power
measurements of scalp EEG. Electroencephalogr Clin Neurophysiol, 42(6):817826, 1977.
[4] G. Pfurtscheller, G. R. M
uller, J. Pfurtscheller, H. J. Gerner, and R. Rupp. Thought - control of
functional electrical stimulation to restore hand grasp in a patient with tetraplegia. Neuroscience
Letters, 351(1):33 36, 2003.
[5] A. Schl
ogl, C. Vidaurre, and K. Muller. Brain-Computer Interfaces: Revolutionizing HumanComputer Interaction, chapter Adaptive Methods in BCI Research An Introductory Tutorial, pages
131356. Springer, 2010.
[6] C. Vidaurre, N. Kr
amer, B. Blankertz, and A. Schl
ogl. Time domain parameters as a feature for
EEG-based braincomputer interfaces. Neural Networks, 22(9):13131319, 2009.
Article ID 038-4
DOI:10.3217/978-3-85125-378-8-39
1 Introduction
Understanding human emotional process has been regarded as an important research topic in
neuroscience as emotion plays a key role in communication or interaction between humans. Many
different neuroimaging modalities have been used to study the neural substrates of emotion, such as
electroencephalography (EEG), magnetoencephalogram (MEG), and functional magnetic resonance
imaging (fMRI) (Peyk et al., 2008). Among these, EEG has been considered the most suitable tool to
study the temporal dynamics of emotion thanks to its high temporal resolution and reasonable cost
(Millan et al., 2008). In particular, decoding individual emotional states from EEG is attracting
increased attention because of the recent popularization of low-cost, wearable EEG systems and their
potential applications in affective brain-computer interfaces (BCIs).
Two representative methods have been most widely used to evaluate or recognize individual
emotional states, which are frontal alpha-asymmetry and event-related potentials (ERPs) elicited by
emotional stimuli (Degabriele et al., 2011). Despite a number of studies based on these methods, only
a few studies have attempted to continuously track or monitor the emotional changes using EEG data
recorded while continuous and complex emotional stimuli (e.g., movies and video clips) are presented.
In this study, we tried to search for a new index that can effectively estimate temporal changes in
emotional states of a group of individuals while they were watching video clips designed to evoke
different types of emotions, which can be potentially applied to the evaluation of various cultural
Article ID 039-1
DOI:10.3217/978-3-85125-378-8-39
contents including movies, TV commercials, and music videos. Among various possible indices, we
tested the feasibility of EEG synchronization as an index of emotional states because some previous
studies reported that functional connectivity is generally increased during cognitive processing and
decreased during emotional processing especially in beta & gamma frequency bands (Harding et al.,
2012, Northoff et al., 2004). Beta-band global field synchronization (GFS), a well-known measure
shown strong correlation with human cognitive process (Koenig et al., 2001, Lee et al., 2010), was
evaluated to quantify the global synchronization of multichannel EEG data recorded from a group of
participants while they were watching two video clips. Our study hypothesis was that decreased GFS
values might be observed when stimuli eliciting high emotional arousal are presented because
dominant emotional processing would act as a distractor of cognitive processing in the brain. To
verify our hypothesis, results of simple questionnaires inquiring two scenes that were most impressive
and memorable in each clip and the grand-averaged GFS waveforms were compared.
2 Methods
2.1 Experimental conditions and paradigm
EEG data were recorded using a multichannel EEG system (ActiveTwo, BioSemi) with 22 active
EEG channels (Cz, C3, C4, T7, T8, Pz, P3, P4, P7, P8, Fp1, Fp2, Fz, F3, F4, F7, F8, AFz, AF7, AF8,
O1, and O2) and two EOG channels (VEOG and HEOG). A 17-inch LCD monitor was used for the
presentation of visual stimuli. The subjects were seated in a comfortable armchair placed in front of
the monitor. The distance between the monitor and subjects was set to 70 cm. Two short video clips
were presented to the subjects. One clip used to evoke happy emotion was a short video clip titled
Isaacs Live Lip-Dub Proposal, which was watched by more than 25 million people in YouTube
(http://youtu.be/5_v7QrIW0zY) (referred to as a positive clip). In the video of 5:13-min long, a man
named Isaac surprised his girlfriend with the worlds first live lip-dub marriage proposal. The other
clip used to elicit negative emotion was edited from a famous movie The Grudge . In the video of
4:36-min long, sudden appearances of ghosts made subjects feel fearful (referred to as a negative clip).
Twenty-five healthy subjects initially participated in our experiment. The positive clip and
negative clip were presented to the participants one after another, between which a short break time of
about two minutes was given. To investigate the general feeling about each video, another group of
participants (n=21) was recruited and asked to select two most impressive scenes in each clip. They
watched each video clip twice without recording EEG, and then asked to mark two time points during
the second play of each clip.
Article ID 039-2
DOI:10.3217/978-3-85125-378-8-39
strong correlation with Mini-Mental States Examination (MMSE) scores that reflected cognitive
process in the brain. The GFS time-series of each participant was filtered using a ten-second movingaverage filter, and then grand averaged across all participants.
Figure 2: The questionnaire results on two most impressive scenes in each clip
3 Results
Figure 1 shows the grand-averaged GFS waveforms of each video clip and Figure 2 shows the
results of questionnaire (two most impressive scenes in each clip). Horizontal lines in Figure 1
Article ID 039-3
DOI:10.3217/978-3-85125-378-8-39
represent 1.96 standard deviation values (p < 0.05). Interestingly, two time periods in each video clip
at which the GFS values dropped below the lower horizontal line (denoted by A, B, A, and B)
matched fairly well with the time periods most frequently selected in the questionnaire. In the time
periods A and B, the woman was finally proposed by the man, and in A and B, sudden advent of the
ghost made the subjects feel most fearful. In Figure 1, there are some other time points showing
sudden drops of GFS values to around the lower horizontal line (denoted by 1, 2, 3, 4, 1, 2, and 3,
which were chosen objectively), at which time some reported high emotional arousal, as shown in
Figure 2. The average GFS values for all emotional events marked in Figure 1 were compared with
the GFS values averaged over all time points not marked as emotional events using paired t-test. The
statistical analysis results showed significant differences between two conditions (positive clip: p =
0.0068; negative clip: p = 0.0059).
4 Conclusion
The purpose of the study was to continuously track the changes in emotional arousal using a
global EEG synchronization measure. As expected in our study hypothesis, decreased GFS values
were observed when stimuli eliciting high emotional arousal are presented. It is noteworthy that the
decreased GFS values were commonly observed regardless of the types of stimulus valence, which
supports our main hypothesis that emotional processing might act as a distractor of cognitive
processing and thus the GFS value, an indicator of cognitive processing, would decrease during
emotional processing. The results of questionnaires could be elucidated fairly well with the grandaveraged GFS waveforms, suggesting that our study design would be potentially applied to practical
applications to evaluate various cultural contents or broadcasting products. Further studies need to be
conducted in future studies in order to generalize our hypothesis using more experiments and
investigate individual variability in the GFS waveforms.
References
Peyk P., Schupp H. T., Elbert T., Junghfer M. (2008). Emotion processing in the visual brain: a
MEG analysis. Brain Topogr
Millan J. D., Ferrez P. W., Galan F., Lew E., & Chavarriaga R. (2008). Non-invasive brainmachine interaction. International Journal of Pattern Recognition and Artificial Intelligence,, 22, 959972.
Degabriele R., Lagopoulos J., Malhi G. (2011). Neural correlates of emotional face processing in
bipolar disorder: An event-related potential study. Journal of Affective Disorders.
Harding I.H., Solowij N., Harrison B. J., Takagi M., Lorenzetti V., Lubman D. I., Seal M. L.,
Pantelis C., Ycel M. (2012). Functional connectivity in Brain Networks Underlying Cognitive
Control in Chronic Cannabis Users. Neuropsychoparmacology.
Northoff G., Heinzel A., Bermpohl F., Niese R., Pfennig A., Pascual-Leone A., Schlaug G. (2004).
Reciprocal Modulation and Attenuation in the Prefrontal Cortex: An fMRI Study on EmotionalCognitive Interaction. Human Brain Mapping
Koenig T., Lehmann D., Saito N., Kuginuki T., Kinoshita T., Koukkou M. (2001). Decreased
functional connectivity of EEG theta-frequency activity in first-episode, neuroleptic-nave patients
with schizophrenia: preliminary results. Schizophrenia Research
Lee S-H., Park Y-M., Kim D-W., Im C-H. (2010). Global synchronization index as a biological
correlate of cognitive decline in Alzheimers disease. Neuroscience Research.
Article ID 039-4
DOI:10.3217/978-3-85125-378-8-40
1 Introduction
Motivation was shown to have an influence on Brain-Computer Interface performance (e.g. Kleih
et al., 2010; Hammer et al., 2012). When using the event-related potential P300 as BCI input signal, it
was hypothesized that motivation might increase attentional resource allocation and thereby brain
activation and classification accuracy (Kleih et al., 2010). This hypothesis also suggests motivation to
be more influential when the task gets more demanding. The allocation of additional attentional
resources to the stimulation alone might increase brain activation and thereby balance the effect of less
detectable brain signals due to increased complexity of the task. However, this assumption has never
been systematically investigated. In the here presented study we systematically increased participants
motivation by monetary reward and compared performance in the visual and auditory sensory input
channel with the auditory modality representing the more difficult task. We hypothesized an increase
of motivation by monetary reward (H1), an increase of brain activation (P300 amplitude) as a function
of motivation and this effect to be more distinct in the auditory as compared to the visual modality (H2)
and higher performance when being motivated (H3).
2 Methods
2.1 Participants
We recruited N=14 students with an average age of 27.00 years (SD = 7.30, range 19-38). Three
males participated in the study and none of the participants reported a history of neurological or
psychiatric disorder. Participants were paid 8 per hour and all were nave to BCI training prior to
participation. Participants gave informed consent to the study, which had been reviewed and approved
by the Ethical Review Board of the Medical Faculty, University of Tbingen.
Article ID 040-1
DOI:10.3217/978-3-85125-378-8-40
3 Results
The first hypothesis that monetary reward would increase participants motivation was tested with
repeated measures analysis of variance (ANOVA) with the factors modality (visual and auditory) and
Article ID 040-2
DOI:10.3217/978-3-85125-378-8-40
reward (0ct, 5ct, 100ct) and the dependent variable VAS motivation. We found a main effect for reward
(F(2,28)= 4.94; p<.05). Within-subjects contrasts revealed a significant motivation increase between 5ct
and 100ct (F(1,14) = 5.34, p <.05). Neither a main effect for modality nor an interaction effect was found.
H2 stated that increased motivation would lead to increased brain activation as reflected in the P300
amplitude. When calculating repeated measures ANOVA with the factors modality and reward and the
dependent variable P300 amplitude, we found P300 amplitudes in the visual modality to be significantly
higher than the P300 amplitudes in the auditory modality (F(2,28) = 72.48, p < 0.01). However, there was
no effect of monetary reward on the P300 amplitude (F(2,28) = 0.09; p = 0.92) nor an interaction (F(2,28)
= 1.48; p = 0.25). When comparing brain activation in the auditory modality (see figure 1) between the
5ct and the 100ct conditions, it is apparent that there is higher activation in the 100ct reward condition
even though no clear ERP peak is detectable. Therefore, areas under the curve (AUC) in the different
auditory conditions were compared. The measure of AUC represents the integral under the curve in a
defined time interval. With AUC the distance of each data point from zero is ignored while calculating
the integral with reference to the first value of the interval (Pruessner et al., 2003). We compared AUC
values with repeated measures ANOVA for two target electrodes (Cz, Pz, see figure 1) and the reward
conditions (0, 5, 100 Cent). A trend towards a main effect of reward condition was found (F(2,22) = 3.21,
p = .06). Within-subjects contrasts revealed a significant difference between reward conditions 5ct and
100ct (F(1,11) = 4.99, p < .05) but not between 0ct and 5ct conditions (F(1,11) = .09, p = .77). No main
effect of electrode (F(1,11) = .21, p = .65) was found.
Figure 1: Brain activation 1 s after stimulus onset in the auditory modality at Fz, Cz and Pz for the
motivation conditions 0ct, 5ct and 100 ct reward.
Concerning performance (H3), participants were more successful in the visual (M = 95.12%
accuracy, SD = 9.60) as compared to the auditory task (M = 44.88% accuracy, SD = 30.66, F(1,14) =
65.91, p < .001). Motivation did not increase accuracy (F(2,28) = 2.03, p = .15) and no interaction effect
was found between modality and monetary reward (F(2,28) = .87, p = .43).
Article ID 040-3
DOI:10.3217/978-3-85125-378-8-40
4 Discussion
We showed that motivation to perform a BCI task could be successfully manipulated with monetary
reward. Compared to the study from Kleih and colleagues (2010), we used higher reward for a correct
selection (0.50 in Kleih et al., 2010 versus 1 in this study) and the intervals between the monetary
rewards were unequal (0, 5, 100 Cent). We assume this caused a more salient perception of the reward
value as compared to the study by Kleih and colleagues (2010). Our hypothesis of motivation to increase
and thereby balance potentially reduced brain activation in more difficult tasks was supported by the
here presented data. We found an increase of brain activation as measured with the area under the curve
in the auditory task in the 5ct and the 100ct motivation condition as compared to the 0ct motivation
condition. This result suggests the influence of psychological variables, such as motivation, to be higher
when tasks become more challenging while at the same time motivation seems to increase only with
the highest possible reward condition. However, the results should be interpreted with caution as
spelling accuracy was not influenced by motivation and in the visual version of the P300-speller it was
almost twice as high as compared to the auditory. Furdea and colleagues (2009) also found a clear
advantage of the visual over the auditory modality. However, in their study no differences in P300
amplitudes were found while in our study we found significantly lower P300 amplitudes in the auditory
condition compared to the visual one. This was probably caused by our decision to use less sequences
than Furdea and colleagues (2009) as we wanted to avoid a ceiling effect in performances especially in
the visual P300 speller (Furdea et al., 2009; Kleih et al., 2010). In conclusion, we found that motivation
can be manipulated by monetary reward in a BCI task, that motivation does influence brain activation
when the task is demanding and that this brain activation increase does not affect BCI performance in
terms of accuracy, i.e. correctly selected letters.
Acknowledgements
This work is supported by the European ICT Program Project FP7- 224631 (TOBI: Tools for BrainComputer Interaction). This manuscript only reflects the authors' views and funding agencies are not
liable for any use that may be made of the information contained herein.
References
Furdea, A., Halder, S., Krusienski, D. J., Bross, D., Nijboer, F., Birbaumer, N., & Kbler, A. (2009).
An auditory oddball (P300) spelling system for brain-computer interfaces. Psychophysiology,
46(3), 617-625. doi: PSYP783 [pii] 10.1111/j.1469-8986.2008.00783.x
Hammer, E. M., Halder, S., Blankertz, B., Sannelli, C., Dickhaus, T., Kleih, S., . . . Kbler, A. (2012).
Psychological predictors of SMR-BCI performance. Biol Psychol, 89(1), 80-86. doi:
10.1016/j.biopsycho.2011.09.006
Kleih, S. C., Nijboer, F., Halder, S., & Kbler, A. (2010). Motivation modulates the P300 amplitude
during brain-computer interface use. Clin Neurophysiol, 121(7), 1023-1031. doi:
10.1016/j.clinph.2010.01.034
Nijboer, F., Furdea, A., Gunst, I., Mellinger, J., McFarland, D. J., Birbaumer, N., & Kbler, A. (2008).
An auditory brain-computer interface (BCI). J Neurosci Methods, 167(1), 43-50. doi:
10.1016/j.jneumeth.2007.02.009.
Schalk, G., McFarland, D. J., Hinterberger, T., Birbaumer, N., & Wolpaw, J. R. (2004). BCI2000: a
general-purpose brain-computer interface (BCI) system. IEEE Trans Biomed Eng, 51(6),
1034-1043. doi: 10.1109/TBME.2004.827072.
Article ID 040-4
DOI:10.3217/978-3-85125-378-8-41
In this study, we investigated the spatiotemporal dynamics during spatial and nonspatial attention tasks employing human electrocorticographic (ECoG) signals. Ten
epileptic patients who underwent subdural electrodes insertion for epileptic surgery
performed spatial and non-spatial attention tasks. Time-frequency and statistical analysis
for ECoG signals resulted that both spatial and non-spatial attention commonly had event
related desynchronization (ERD) over the low frequency (theta, alpha, beta) and event
related synchronization (ERS) over the high frequency (gamma band). The difference
between two paradigms have been found in the right parietal area showing ERD in
superior parietal area during spatial task and ERD in inferior parietal area during nonspatial task.
1 Introduction
The posterior parietal cortex (PPC) has known to be a main part for cognitive processes including
attention, spatial representation, working memory, and visuomotor control [1]. However, details about
the neural mechanism underlying spatial and non-spatial attention are unclear because of the limitations
of poor spatial and temporal resolution of the method used in previous studies [2]. The
electrocorticographic (ECoG) recording may be the one of solutions because of its high temporal and
spatial resolution and high signal fidelity.
The aim of this study was to investigate the neural mechanisms associated with two types of
attention processing using ECoG signals from epileptic patients.
Article ID 041-1
P1
P2
P3
P4
DOI:10.3217/978-3-85125-378-8-41
P6
P10
P7
P11
P8
P12
P9
P13
P10
P13
P5
P12
Left
Right
P12
P12
Figure 1: Location of electrodes. Left two columns and right two columns illustrate the location of
electrodes for each participants. Four figures in the center show the electrode maps of whole participants
overlaid on the standard brain surface.
experiment. This study was approved by Asan Medical Center, Seoul, Korea. Informed consent was
obtained in accordance with the regulations of the Research Ethics Board of our institution.
Article ID 041-2
Target
Non-target
DOI:10.3217/978-3-85125-378-8-41
Target
Non-target
Spatial
-15
-5
0 5
t-value
15
Non-spatial
Figure 2: T-values calculated from the paired t-test during 400-600ms period after cue onset are
mapped on the MNI brain surface. If the power of an electrode is significantly decreased, the t-value
becomes negative and the region is colored blue. If the power of an electrode is significantly increased,
the t-value becomes positive and the region is colored red.
Spectral analysis was performed using the short-time Fourier transform (STFT), which is one of the
most widely used signal analyzing methods. Briefly, STFT divides the ECoG signal into small
overlapping time segments, analyzes each time segment and provides a time-frequency distribution. In
each 2000msec epoch, we used a 500msec sliding Hanning window with overlapping 7.5msec and
obtained 200 time bins. After that the time-frequency features are reduced into 10 time bins (200ms
window with 100ms overlap) and 5 frequency bands (theta: 4-7 Hz, alpha: 8-13 Hz, beta: 13-30 Hz,
low-gamma: 30-50 Hz, high-gamma: 50-150Hz).
To evaluate the spatiotemporal change during spatial and non-spatial attention tasks, we computed
paired t-test using the features of trials for each channel. P-values of less than 0.01% were considered
statistically significant and the color plot (Figure 2) shows that the p-values following the t-values are
much smaller than this criterion. We compared the spatiotemporal course during each spatial and nonspatial attention task with respect to the baseline interval (from 500ms before the cue to cue onset). In
this analysis, the p-values were used for defining the significance of the feature and t-values were for
identifying whether the power of the feature was increasing or decreasing in comparison with the
baseline.
Electrodes of all participants were projected on the template brain provided by the Montreal
Neurological Institute (MNI). The fractional changes in 5 frequency oscillatory powers were assigned
to each electrode and the signal power was displayed with color. A nearest-neighbor method was used
where a cortical triangular mesh was colored strongest when the closest electrode and a linear faded to
zero as the distance increased [5].
3 Results
Significant activations are displayed over cortices following the presentation of the cue onset during
both spatial and non-spatial attention tasks (Figure 2). Event related desynchronization (ERD) is
common across low frequency bands (theta, alpha, beta), and in gamma bands show event related
synchronization (ERS). Oscillatory power changes were salient at 400 ~ 600msec post stimulus onset
and maintained to 600 ~ 800msec. Especially spatial attention shows focused ERD in right superior
Article ID 041-3
DOI:10.3217/978-3-85125-378-8-41
parietal lobe, whereas non-spatial attention shows stronger ERD in inferior parietal lobe in low
frequency bands. Also, in high gamma band, the stronger ERS pattern is observed at the parietal area
from the spatial attention and at the anterior frontal lobe from the non-spatial attention. According to
the result from the non-target stimulus which the participants dont respond, these activation after 400
~ 600msec post stimulus onset may not be affected by the button press or motor preparation.
4 Discussion
In this study, we analyzed ECoG signals to investigate oscillatory power changes in the processing
of spatial and non-spatial attention in the bilateral cortex. The results demonstrated that both spatial and
non-spatial attention seem like sharing common areas over the cortices for all frequency bands. In the
right parietal area, we found some difference including spatial attention shows decreasing low frequency
power in the superior parietal lobe while non-spatial attention appears to be desynchronized in the
inferior parietal lobe. Although many studies about attention paradigm shows BOLD change using
fMRI and most of them demonstrated that the BOLD signal has increased in the parietal region during
the attention period [1], they were not able to show how the neural activity had been changed. To the
best of our knowledge, this is the first study demonstrating the neural variation with spatiotemporal
oscillatory power changes during spatial and non-spatial attention.
The current study drives a novel result that how the neural activity changes and which areas within
cortices are the main role in both spatial and non-spatial attention tasks. A further study will let us know
much more about the neural mechanisms of attention and it would be applicable to the passive mapping
technique.
Acknowledgement
This work was supported by the National Research Foundation of Korea (NRF) grant funded by the
Korea government (MISP) (No. 2013R1A2A2A04015925)
References
[1] J. Culham and N. Kanwisher, "Neuroimaging of cognitive functions in human parietal cortex,"
Curr Opin Neurobiol, vol. 11, pp. 157-163, 2001.
[2] K. Jerbi, "Task-related gamma-band dynamics from an intracerebral perspective: review and
implications for surface EEG and MEG," Hum Brain Mapp, vol. 30, pp. 1758-1771, 2009.
[3] P. Malhotra, "Role of right posterior parietal cortex in maintaining attention to spatial locations
over time," Brain, vol. 132, pp. 645-660, 2009.
[4] A. Gunduz, "Neural Correlates of VisualSpatial Attention in Electrocorticographic Signals in
Humans," Front Hum Neurosci, vol. 5, no. 89, pp. 1-11, 2011.
[5] D. Englot, "Impaired consciousness in temporal lobe seizures: role of cortical slow activity,"
Brain, vol. 133, pp. 3764-3777, 2010.
Article ID 041-4
DOI:10.3217/978-3-85125-378-8-42
Introduction
Over the last decade, studies more and more suggest ECoG to be the preferred method for
acquiring high spatial and temporal resolution data to be used in brain-computer-interfaces
(BCI) [1]. Besides its many advantages, acquisition of ECoG data turns out to be a difficult
and demanding procedure. It is invasive and provides only limited recording times. Also,
because purely based on medical necessity, the placement of the implanted electrode grids
rarely covers all brain areas essential for BCI studies. Making best use of the limited patient
collective and recording time should therefore be of strong priority. Former studies combined
the high spatial resolution of magnetic resonance imaging (MRI) and high temporal resolution
of electroencephalography (EEG) [2] to achieve similar signal characteristics to ECoG while
being non-invasive. Here we investigate MEG data, which to a certain degree combine the
advantages of both imaging modalities [3], and compare it to ECoG. The comparison is based
on results of a picture category/duration decoding paradigm using state-of-the-art features
and classification routines. Because of its immobility, MEG is not a solution for acquiring
data in real-life BCIs. However, with the limited access to ECoG data it is helpful to test
paradigms with other modalities and make assumptions about the possible outcomes in ECoG.
By showing similarities and differences between MEG and ECoG findings, we conclude that
MEG can provide just that.
Article ID 042-1
DOI:10.3217/978-3-85125-378-8-42
Data ECoG data were recorded from two volunteering patients (right handed males). Both
received subdural electrode implants for pre-surgical planning of epilepsy treatment at Stanford,
CA, USA. Electrode grids were solely placed based on clinical criteria and covered a variety
of cortical areas including lateral occipital and medial temporal areas. All patients gave their
informed consent before recordings started. The ECoG was recorded with a sampling frequency
of 3051.7 Hz. Pre-processing steps included high pass filtering (cut-off: 0.5 Hz) as well as notch
filtering around the power line frequency (60 Hz). The electric potentials of all electrodes
were re-referenced to Common-Average-Reference. Afterwards, the time series were visually
inspected for artifacts (e.g. epileptic activity) and epoched into trials representing the interval
between -100 ms and 2000 ms with respect to picture onset times. MEG data were recorded
with a bandwidth of 100 Hz (sampling frequency: 1017.25 Hz), using a whole-head 248-sensor
BTi Magnes system (4D-Neuroimaging, San Diego, CA, USA). Data have been acquired from
four healthy volunteers (age 25-28, one female, one left-handed). Like the ECoG, MEG data
were epoched into trials containing the interval [-100 ms, 2000 ms].
Task Pictures from four different categories, namely objects, faces, watches and clothing, were
presented to the subjects. Presentation durations were randomly chosen from five different time
spans (300, 600, ..., 1500 ms). Intervals of varying duration (600, 750,..., 1200 ms), in which
only a focus is visible, intersect the pictures. Stimuli were shown either on a projection screen
(MEG data) positioned 1 m away from the subject or on a notebook screen (ECoG data) within
the patients reach. Subjects were requested to respond to the presentation of a piece of clothing
with a button press. Target stimuli (i.e. clothing) accounted for approximately 10 % of the total
trial count.
Feature Extraction and Selection Two different feature types have been extracted for
this study, namely low-frequency time domain (LFTD) and high gamma (HG) features. LFTD
features were obtained by low-pass filtering in Fourier domain with a cutoff frequency of 30 Hz
(MEG: 10 Hz) and subsequent down sampling of the time series. For the computation of HG
features, spectral power was measured using a sliding Hann-window approach (window length
= 250 ms). For each window the square root of the power spectrum was computed by Fast
Fourier Transform (FFT). The resulting coefficients were then averaged in the frequency band
of 70-200 Hz. Feature selection is performed on training data only using an algorithm based on
the Davies-Bouldin index. Full details on feature selection (as well as extraction) routines can
be found in our previous study [4]. The algorithm was employed to select a set of the twenty
(MEG: ten) most informative channels corresponding to the actual class separation problem
(i.e. either discrimination of picture categories or presentation durations).
Article ID 042-2
DOI:10.3217/978-3-85125-378-8-42
Results
In order to restrict information to any kind of visual processing within the brain, target stimuli
(i.e. clothing) have been omitted for the analysis of decoding accuracies. Otherwise, activation
of motor areas during subjects button presses might influence the decoding.
Signal-to-noise ratio in the high gamma band of the MEG data was too low to be used for
single trial analysis. Therefore, only LFTD feature accuracy is plotted for the MEG datasets.
Figure 1: a), b) LFTD features for a dominantly selected channel (trials sorted by duration)
c), d) Decoding accuracies (in %) for both classification tasks (i.e. category and duration) and
both feature types (i.e. LFTD and HG features; ECoG only).
Decoding accuracies have been computed for all datasets by means of a 50-times-5-fold crossvalidation procedure. The results for all ECoG and MEG datasets can be found in figure 1 c)
and d). Figure 2 shows a comparison of the averaged decoding accuracies of both acquisition
modalities. Chance levels for the two classification tasks are 20 % (duration, five classes) and
33 % (category, three classes) respectively.
We also found that the dominantly selected MEG channels were located in the same areas
as the dominantly selected ECoG channels. This holds true for duration detection as well as
picture category decoding.
The results show that MEG can classify picture category and presentation duration with high
accuracy on a single trial basis, though not as stable as ECoG. More importantly, spatial
resolution of the MEG seems to be high enough that only channels from the area expected to
have highest activity for the given task are selected by our routines. By manually restricting
MEG data to certain channels, decoding accuracies of different ECoG grid placements might
Article ID 042-3
DOI:10.3217/978-3-85125-378-8-42
Acknowledgements
The work of this paper is partly funded by the German Ministry of Education and Research
(BMBF) within the Forschungscampus STIMULATE under grant number 03FO16102A.
References
[1] J. A. Wilson, E. A. Felton, P. C. Garell, G. Schalk, and J.C. Williams. Ecog factors underlying multimodal control of a brain-computer interface. IEEE Trans. Neural Syst. Rehabil. Eng.,
14(2):246250, Jun 2006.
[2] L. Ding, C. Wilke, B. Xu, X. Xu, W. van Drongelen, M. Kohrman, and B. He. EEG source
imaging: Correlating source locations and extents with electrocorticography and surgical resections
in epilepsy patients. Journal of Clinical Neurophysiology, 24(2):130136, Apr 2007.
[3] J. Mellinger, G. Schalk, C. Braun, H. Preissl, W. Rosenstiel, N. Birbaumer, and A. Kbler. An
meg-based brain-computer interface (bci). NeuroImage, 36(3):581 593, 2007.
[4] T. Wissel, T. Pfeiffer, R. Frysch, R. T. Knight, E. F. Chang, H. Hinrichs, J. W. Rieger, and
G. Rose. Hidden markov model and support vector machine based decoding of finger movements
using electrocorticography. Journal of Neural Engineering, 10(5):056020, 2013.
Article ID 042-4
DOI:10.3217/978-3-85125-378-8-43
Introduction
Many types of brain computer interfaces (BCIs) have been developed in the last decade, employing various measuring devices and operation principles [6]. P300-speller which is one of
the non-invasive BCIs, is used widely because of its simpleness, high information transfer rate
(ITR), and reliability [1]. Several methods to improve the classification accuracy and ITR of
P300-speller have been investigated [4, 3, 2].
In practical BCI, detection of users input intention is a significant function. When we use
some input interface, we do not always input information. We sometimes think, wait for a
response, or do something else. Although P300-speller can detect users input intention using
P300 response, its detection is not accurate.
In this paper, we investigate new BCI spelling system using N100 as well as P300 in order to
improve ITR and the detection accuracy of the users input intention compared to P300-speller.
In the proposed method, nine kinds of stimulus images including a 23 matrix containing four
commands are used (Fig. 1 (a)). P300 is used to detect the target stimulus image containing
the desired character. N100 is used to detect the target position that the user gazes on. By
using both N100 and P300 features, ITR of the proposed method was 0.25bit/sec higher than
that of P300-speller in our experiment by ten subjects. In addition, the detection accuracy of
users input intention was an averaged of 32.7% higher than that of P300-speller.
2
2.1
Proposed method
Detection of target character
26 alphabets (A, B, , Z) and 10 numerals (0,1, , 9) are used as spelling commands in the
proposed system as well as P300-speller. The characters are arranged in nine stimulus images
having four characters and two blanks in the 23 matrix (Fig. 1 (a)). The desired character
is detected by the following three steps: (i) the target image containing the desired character
is detected by using P300; (ii) the target position that the user gazes on is detected by using
Article ID 043-1
DOI:10.3217/978-3-85125-378-8-43
Figure 1: Stimulus images and how to type. (a) Stimulus images. (b) Stimulations and ERP
responses for the target character K. Circles means the users gazing position. (c) How to
detect the desired character.
N100; (iii) the desired character is detected by the combination of the target image and the
target position.
P300 is elicited by mental tasks such as counting how many times the desired character is
flashed as well as P300-speller. N100 is elicited by any visual stimuli. In order to detect the
users gazing position by using N100, two blank positions are arranged for each stimulus image.
Since N100 is elicited by any visual stimulus without a mental task [5], we can detect the users
gazing position by comparing N100 responses for the blank and non-blank positions. Each
position has six characters and three blanks for the nine images. The position of characters is
informed to the users beforehand by displaying the positioning table to them near the screen.
An example of the detection scheme of the desired character K is shown in Fig. 1-(b),(c).
Two linear support vector machines (SVMs) are independently used to detect N100 and
P300. We respectively denote them by SVM1n and SVM1p.
2.2
We categorize the users condition into two classes, intended and non-intended. N100 and P300
responses are obtained during the intended condition, whereas both of them are not elicited
during the non-intended condition. This dierence is adopted to detect the users intention.
Feature vectors to detect the intention are made by concatenating the output values of
SVM1n and SVM1p. Output values of SVM1n corresponds to the six positions in the stimulus
images. Output values of SVM1p corresponds to the nine stimulus images. These output values
are sorted in ascending order before the concatenation. A linear SVM (denoted by SVM2) is
trained by the training data of SVM1n and SVM1p, and pre-recorded training data that the
user does not intend to input. The procedure is depicted in Fig. 2.
Article ID 043-2
DOI:10.3217/978-3-85125-378-8-43
Ten healthy subjects (22-24 years old males) conducted experiments of P300-speller and the
proposed system 60 times alternately. These experiments include 20 trials that the user does
not input the command without gazing on the display every three times. 40 samples of intended
condition and 20 samples of non-intended condition were recorded for each subject. P300-speller
flashes 24 times (= 12 flashes two loops), and the proposed system flashes 18 times (= nine
flashes two loops) in a trial. Each row and column is presented two times in P300-speller,
whereas nine kinds of stimulus images are presented two times in the proposed method per one
trial. The flash has 125 ms duration with 62.5 ms inter stimulus interval (ISI). The proposed
system takes 3.375 s in a trial, whereas P300-speller takes 4.5 s. The subjects were asked to
gaze on only one position that the target character is presented and silently count the number of
times the target character flashes. If the maximum instantaneous amplitude of EEG is greater
than 100 V, we discarded the trial, and repeated the trial again.
EEG was recorded by using an active EEG (Guger technologies) with a sampling frequency
of 512 Hz and a bio-signal amplifier (Digitec) with 0.5Hz analogue high-pass filter and 100 Hz
analogue low-pass filter. 16 electrodes (FCz, FC2, FC1, Cz, CP1, CP2, Pz, POz, P3, P4, TP8,
TP7, C3, C4, C5, and C6 of the extended international 10-20 system) were used [5]. AFz was
used as the ground and A2 was used as the reference. The second-order Butterworth band
pass filter (1-13 Hz) and the third-order Butterworth band stop filter (49-51 Hz) were used to
extract feature and remove the hum noise, respectively. Signal was downsampled from 512 Hz
to 64 Hz.
P300 and N100 components were extracted from 125 ms to 625 ms and from 100 ms to 250
ms after the stimulus, respectively. P300 signals were averaged for each stimulus image in both
methods. N100 signals were averaged for each position. The soft margin parameter C for SVM,
such that shows highest averaged accuracy over the five validations for each subject, was selected
from {0.1, 1, 10, 100, 1000}. ITR [bit/s] is used as the performance index since the length of one
1P
trial is dierent [6]. ITR is calculated by B = T1 {log2 N +P log2 P +(1P ) log2 ( N
1 )} [bit/s],
where T [s] is the time of one trial, P is the classification accuracy, and N is the number of
commands. The classification accuracy and ITR were evaluated by 5-fold cross-validation.
Hence, the number of training data was 32, and that of test data was eight for SVM1n and
SVM1p. For SVM2, the number of training data was 48, (32 samples was from intended trial
and 16 samples was from non-intended trial.) The number of test data was 12, (eight samples
was from intended trial and four samples was from non-intended trial.)
For P300-speller, since only P300 is used as feature, we obtained the classifier SVM1p
and SVM2 to detect the target character and the users input intention, respectively. SVM1p
outputs 12 values corresponding to flashes of six rows and six columns. We sorted these values
and obtained a 12-dimensional feature vector to detect the users input intention.
Table 1 shows the averaged classification accuracies, standard deviations, and ITR. ITR of
the proposed method is an average of 0.25bit/sec higher than that of P300-speller because i) the
trial length of the proposed system is shorter than that of P300-speller; and ii) the classification
accuracy of N100 is higher than that of P300 because N100 signal is averaged over six times,
on the other hand, P300 signal is averaged over two times. Moreover, the detection accuracy
of the users input intention of the proposed method is an average of 32.7% higher than that
of P300-speller because both N100 and P300 components are used to discriminate the users
intention. The improvement is statistically significant (t-test, p<0.05).
Article ID 043-3
DOI:10.3217/978-3-85125-378-8-43
Table 1: Averaged classification accuracy and ITR; P300 is accuracy of detecting P300 response, and N100 is accuracy of detecting N100 response. Intention is accuracy of detecting
the users input intention. Character is accuracy of detecting the desired character.
Method
P300-speller
Proposed
Accuracy of feature
P300 [%]
N100 [%]
80.9 7.4
Without detection of
the users input intention
Character [%] ITR [bit/sec]
67.8 15.6
0.60 0.22
70.3 17.1
0.85 0.34
With detection of
the users input intention
Intention [%] Character [%] ITR [bit/sec]
57.8 8.5
56.0 21.9
0.46 0.27
90.5 10.6
66.0 18.6
0.78 0.35
Conclusion
We have proposed a novel design for a spelling system using both N100 and P300 to reduce the
number of flashes per trial, increase ITR, and detect the users input intention. The advantages
of proposed system are i) it takes shorter time to input one command since the number of
flashes can be reduced to nine flashes; ii) the performance of detecting the input intention by
using N100 and P300 is 32.7% higher compared to P300-speller in our experiments; iii) each
character never flashes in a row whereas at least one character flashes twice in a row in P300speller; iv) the number of response times can be reduced. That is because the user only has
to respond the desired character only two times per trial in the proposed system, whereas in
P300-speller the user has to respond it four times, that are the row and column corresponding
to the desired character. Hence, the users fatigue caused by mental task can be reduced in the
proposed system.
In future work, we extend the proposed method to improve ITR by extending the size
of matrix. The selection of electrode positions should also be investigated. The comparison
between the proposed BCI and other P300-BCIs needs to be done.
References
[1] L. A. Farwell and E. Donchin. Talking o the top of your head: Toward a mental prosthesis utilizing
event-related brain potentials. Electroenceph. Clin. Neurophysiol., 70:510523, 1988.
[2] J. Hill, J. Farquhar, S. Martens, F. Biemann, and B. Sch
olkopf. Eects of stimulus type and of
error-correcting code design on BCI speller performance. NIPS 2008 online, 2008.
[3] D. J. Kr
usienski, E. W. Sellers, D. J. McFarland, T. M. Vaughan, and J. R. Wolpaw. Toward
enhanced P300 speller performance. Journal of Neuroscience Methods, 167:1521, 2008.
[4] W. L. Lee, T. Tan, and Y. H. Leung. An improved P300 extraction using ICA-R for P300-BCI
speller. 35th Annual International Conference of the IEEE EMBS, pages 70647067, 2013.
[5] E. K. Vogel and S. J. Luck. The visual N1 component as an index of a discrimination process.
Psychophysiology, 37:190203, 2000.
[6] J. R. Wolpaw, N. Birbaumer, W. J. Heetderks, D. J. McFarland, P. H. Peckham, G. Schalk,
E. Donchinand L. A. Quatrano, C. J. Robinson, and T. M.Vaughan. Brain-computer interface
technology: a review of the first international meeting. IEEE Trans. Rehabil., 8:162173, 2000.
Article ID 043-4
DOI:10.3217/978-3-85125-378-8-44
Introduction
Passive error potential (ErrP) detectors may be utilised to correct or negate the output of
active BCI systems, providing output verification. Coupling active and passive BCI systems can
increase training requirements. In this analysis we demonstrate an approach which generalises
information across subjects, allowing poor BCI performance by some subjects to aid other users.
The underlying EEG component of ErrP detection systems is feedback related negativity
(FRN). Key features of FRN make it a primary candidate for the reinforcement learning (RL)
theory of error feedback, since it is thought to originate from the anterior cingulate cortex (ACC)
[6], and to be triggered by phasic changes in dopaminergic signals from the basal ganglia [3].
In our method neurophysiological knowledge is used to inform projection filters, which perform
joint dimensionality and noise reduction [1]. In RL theory, error trials will not necessarily
exhibit FRN, therefore we only target a subset of trials using precision based loss functions [4].
2
2.1
Method
Data
Results from two BCI datasets are presented - detailed descriptions are available for both [2, 5].
The development data set, acquired in Marseille, consisted of 64 channel EEG recorded
using a Biosemi Active2. Eleven subjects performed online motor imagery sessions using a Graz
protocol modified to display discrete feedback. Subjects performed four runs of 40 trials. Three
feedback periods were embedded in each trial. Feedback was manipulated to ensure all subjects
experienced minimum error rates of 20%. Data from two subjects was rejected. After artefact
removal the average error rate was 232%. We refer to this data as the MI set. A second data
set, acquired in Lyon was recorded with 56 EEG sensors using a VSM-CTF compatible system.
This analysis is based on 32 electrodes. Sixteen subjects used a pseudorandom stimulation
procedure P300-speller. Flash duration and stimulus onset asynchrony parameters were selected
to increase the difficulty of spelling. Each subject performed four 12 minute spelling sessions,
a session consisted of 12 five letter words with feedback for each letter. After artefact rejection
the average error rate was 3019%. Four subjects with a error rate below 10% after artefact
rejection were not included in this analysis. This data will be referred to as the P300 set.
Article ID 044-1
2.2
DOI:10.3217/978-3-85125-378-8-44
EEG Preprocessing
EEG data was bandpass filtered between 3 and 12 Hz using 4th order Chebyshev filters. Source
time series were obtained using transformation matrices designed for real time use [1]. Applied
to EEG, these matrices output x, y and z current triplet for each of the voxels in the LORETA
head model. Voxels belonging to Brodmann areas 24, 32, 33, 8 or 6 were included in a fixed
region of interest (ROI), encompassing all FRN activation foci described in meta analysis [6].
The radial, z, value of all voxels in the ROI were averaged to create a single virtual channel, of
sample rate equal to surface channels, on which all further analysis / processing was performed.
Using this approach, the virtual channel retains a proxy to polarity observed in central channels.
2.3
Feature Space
Training data was generated for each subject using data from others in the same set. Each
subjects virtual channel amplitude was normalised by the standard deviation. Subjects virtual
channel ERPs were aligned to the median FRN latency of the group, the normalised latency,
by circular shifting of trial values and peak alignment. A template error FRN was derived from
the average of the normalised error trials. Three features were then extracted from a window
around the normalised latency. Amplitude measured peak negativity within the window. Jitter
measured the distance between peak negativity and the normalised latency. Similarity measured
correlation between single trial activity and the template error FRN when accounting for Jitter.
2.4
Classifier Calibration
perf
SVM , a classification method for multivariate performance measures [4], was trained on the
feature space. A radial basis function kernel and a precision/recall breakeven point loss function
were used. SVM parameters cost, C, and gamma, g, were obtained through grid search. At
each search step nested 210 cross validation was performed and an F-score obtained from
rates of precision and sensitivity. The C and g pair which maximised F was selected. For
P300 calibration five of 12 subjects were excluded from training data (the hold-back group)
after visual inspection of virtual channel data suggested they had few discriminable error trials.
2.5
Online Steps
As subject independent classifiers were calibrated on amplitude and latency normalised feature
spaces, online calculations required equivalent distributions. Amplitude normalisation in online
steps was based on estimates of statistical moments, updated on a trial by trial basis, using
parameterless one pass methods described by Terriberry [7]. Temporal normalisation of the
feature space required an estimate be made of each new subjects FRN latency without use of
class labels. Latency was tracked via estimation of how a temporal window of activity influenced
d
the Similarity feature. A moving average, Similarity,
was maintained on a trial by trial basis:
d t = Similarity.U C + Similarity
d t
Similarity
1 .(1
U C)
(1)
d
The update coefficient, UC, was bounded between 0 and 0.1. The seed, Similarity
0 , was
set to perfect correlation with mean training error FRN, scaled by a mid-range UC value of
0.05. As Similarity was invariant to magnitude, UC was weighted by trial amplitude data in
d
order to only update Similarity
on trials exhibiting negative polarity in the period of interest:
U C = 0.1/1 + e13(Amp+0.5)
(2)
Article ID 044-2
DOI:10.3217/978-3-85125-378-8-44
0.50
1.00
1.00
0.80
0.70
0.99
Specificity
0.90
Sensitivity
Precision
0.40
0.30
0.20
0.98
0.97
0.96
0.10
0.95
0.5
1.5
0.00
0.5
1.5
0.5
1.5
Figure 1: Error detection rates for MI dataset. Influence of F selection on overall group rates
of precision, sensitivity and specificity. Results calculated from 50 simulation runs for each
subject. Whiskers indicate standard deviation. Plus symbol indicates the mean.
Amp being the mean normalised amplitude in a window around the normalised latency for
d
the trial. After each trial the estimate of FRN latency, Latency,
was set to the trial period
estimated to be most similar to the template error FRN:
d =t|
Latency
d
max(Similarity
t)
(3)
After amplitude normalisation, Amplitude, Jitter and Similarity features were extracted
d and trials classified by SVM. An additional step detected
from a window centred on Latency
ERP polarity inversion. Polarity inversion may occur due to dierences in sulci orientation
and the relative position of dipoles. This step was based on sequential calculation of the third
central moment and the standard error of skewness. If significant positive skew was detected
d
at Latency,
data was deemed inverted and the polarity of all succeeding trials reversed.
Results
Article ID 044-3
MI
Prec.
Prec.
80.05.4 +59.3
95.61.4 +68.7
83.86.2 +61.3
93.21.6 +69.1
97.02.8 +74.1
77.45.1 +55.9
95.92.6 +75.5
79.36.2 +53.2
77.44.7 +57.7
S1
S2
S3
S4
S5
S6
S7
S8
S9
S10
S11
S12
86.68.6
+63.9
Data
Sens.
10.81.2
42.21.8
17.61.8
33.42.4
32.12.2
12.31.5
35.93.1
23.75.9
36.82.6
DOI:10.3217/978-3-85125-378-8-44
P300 Data
Spec.
Prec.
Prec.
Sens.
99.30.2 9.312.3
-28.7
2.23.0
99.30.4 85.11.9 +49.4 42.53.5
99.00.3 92.73.0 +29.6 10.51.1
99.20.5 99.90.8 +44.9 16.71.0
99.70.5 11.523.7 -16.3
0.91.9
99.00.3 99.06.9 +39.6
0.80.7
99.60.6 57.55.7 +33.3 18.51.8
97.91.3 62.33.6 +49.4 47.13.3
97.40.5 69.86.3 +41.9 14.31.6
99.52.7 +56.8 18.13.1
44.018.3
-2.1
1.70.7
70.55.3 +39.1
9.92.0
27.211.5 98.90.8 66.831.9 +28.1 15.315.4
Spec.
89.90.8
95.91.1
98.50.3
100.00.3
98.50.4
100.00.2
95.50.4
95.80.4
97.50.4
99.90.9
98.20.3
98.10.5
97.32.8
Table 1: Error-potential detection rates for MI and P300 data sets. Precision denotes dierence between precision obtained and equivalent randomisation of true label distributions. Italics
denote members of the hold-back group. Results obtained using value of 0.5. Simulation runs
were repeated 100 times.
Conclusion
Discrimination rates demonstrate that strong neurophysiological a priori, with inverse solution
methods to apply them, can produce viable training free approaches for ErrP detection. Signal
enhancement provided by inverse methods can enable use of simple descriptive interpretable
features.
Acknowledgments
This work was carried out as part of the CoAdapt project, funded via ANR-09-EMER-002-01.
References
[1] M. Congedo. OpenViBE tools. http://sites.google.com/site/marcocongedo, 2011.
[2] M. Dyson, L. Casini, and B. Burle. Discrimination of discrete feedback during performance of motor
imagery. In Proc. Int. Workshop on Pattern Recognition in NeuroImaging, pages 9396, 2012.
[3] C. B. Holroyd and M. G. H. Coles. The neural basis of human error processing: Reinforcement
learning, dopamine, and the error-related negativity. Psychological Review, 109(4):679709, 2002.
[4] T. Joachims. A support vector method for multivariate performance measures. In Proceedings of
the 22nd International Conference on Machine Learning, pages 377384. ACM Press, 2005.
[5] M. Perrin, E. Maby, S. Daligault, O. Bertrand, and J. Mattout. Objective and subjective evaluation
of online error correction during P300-based spelling. Adv. in Human-Computer Interaction, 2012.
[6] K. R. Ridderinkhof, M. Ullsperger, E.A. Crone, and S. Nieuwenhuis. The roles of the medial frontal
cortex in cognitive control. Science, 306:443447, 2004.
[7] T. B. Terriberry.
Computing Higher-Order Moments Online.
http://people.xiph.org/
~tterribe/notes/homs.html, 2007.
Article ID 044-4
DOI:10.3217/978-3-85125-378-8-45
Abstract
When pathways for normal motor function are interrupted (e.g. after stroke), braincomputer interfaces (BCI) can be used i) as an alternative channel for communication
by translating brain signals measured with electroencephalography (EEG) into a
computer output (Kbler et al., 2005) or ii) for rehabilitation by influencing brain
plasticity processes to induce recovery of motor control (Pichiorri et al., 2011). This
study investigated if a realistic visualization of an upper limb movement can amplify
motor related potentials during motor imagery (MI). We hypothesized that a 3D sensory
richer visualization might be more effective during instrumental conditioning, resulting
in more pronounced event related desynchronisation (ERD) of the mu band (10-12Hz)
over the sensorimotor cortices and can therefore improve sensorimotor rhythm-based
based BCI protocols for motor rehabilitation.
1 Introduction
Over the past years advances in the analysis of EEG signals and improved computing capabilities
have enabled people with severe motor disabilities to use their brain activity for communication and
control of objects in their environment, thereby bypassing their impaired neuromuscular system
(Kbler et al., 2001; Allison et al., 2007). A new potential BCI therapeutic approach is generating
substantial interest in the use of EEG-based BCI protocols to improve volitional motor control, that
has been impaired by trauma or disease: A repetitive movement practice e.g., with non-invasive
sensorimotor rhythm-based (SMR) BCIs should lead to an increase in the motor cortical excitability
(Pichiorri et al., 2011). By inducing a better engagement of motor areas with respect to motor
imagery, BCI protocols might be able to guide the neuroplasticity to promote recovery in the affacted
brain regions to restore motor function (Cincotti et al., 2012).
Since BCI systems use immediate, typically visual feedback of performance, the influence of the
visual feedback presentation should be considered. For instance, there seems to be no difference in the
classification results between groups provided with realistic feedback (moving hand performing an
object-related grasp) and an abstract feedback (moving bar; Neuper et al., 2009) in a SMR basedBCI task. However there is some evidence that instead a rich visual representation of the feedback
*
Masterminded EasyChair and created the first stable version of this document
Created the first draft of this document
Article ID 045-1
DOI:10.3217/978-3-85125-378-8-45
signal, e.g., in the form of a three dimensional video game or virtual reality environment, may
enhance the user`s control of a SMR based-BCI (Pineda et al., 2003; Pfurtscheller et al., 2006).
This study investigated if a 3-dimensional visualization of upper limb movement can amplify
motor cortex activation during motor imagery (MI) and thereby support the use of a sensorimotor
rhythm based BCI. We hypothesize that this realistic sensory richer visualization might be more
effective during instrumental conditioning, resulting in more pronounced event related
desynchronisation of the mu band (10-12Hz) over the sensorimotor cortices.
2 Methods
Fifteen healthy SMR BCI nave participants were recruited for the study. In a within subject
design all participants were instructed to watch attentively randomized videos of three different left
and right upper limb movements (Fig. 1) on a True3Di monitor in 2D and in 3D visualization, by
using stereoscopic glasses. Every session consisted of 3 runs with 12 trials each (3 different
movements for the left and right upper limb in randomized order consecutively in 2D and 3D). After
every video, participants were instructed to replicate subsequently the observed movements by motor
imagery, while the EEG signals were recorded from a grid of 40 Ag/AgCl scalp electrodes. For
statistical analyses, we used the ERD values obtained from the right (recording position CP4) versus
left sensorimotor cortex (recording position CP3), temporally aggregated over imagery period (1-5 s).
The probability of a Type I error was maintained at 0.05
Figure 1: Visualization of three different upper limb movements: rotation of the hand around wrist, rotation of
the whole forearm around elbow and a grasping movement of the whole arm. All movements were shown for the
left and right upper limb randomized in 2D and 3D.
3 Results
Fig. 2 presents the topographical analyses of the mean ERD values for the two visualization
modality groups, separately for the respective task (right and left hand, arm and grasp motor imagery)
and electrode position (CP3 and CP4). Characteristic patterns of ERD of mu band components for left
and right upper limb motor imagery were present mainly contralateral at electrode positions CP3 and
CP4 over the sensorimotor areas. In order to analyze the potential influence of the visualization
modality on the ERD patterns during task performance, we performed a repeated measures ANOVA
on the ERD data using the Visualization Modality (VM, 2 levels: 2D vs. 3D), Electrode Position (2
Article ID 045-2
DOI:10.3217/978-3-85125-378-8-45
levels: CP3 vs. CP4), Task (3 levels: arm rotation, hand rotation and grasping motor imagery) and
Task Side (2 levels: left vs. right) as within-subjects variables. Overall, significant differences were
observed as a function of Visualization Modality. This main effect indicates that largest ERD was
obtained with a 3-dimensional visualization modality. ANOVA revealed a significant interaction of
VM x Task Side x Electrode Position (F3.01 = 8.77, p<0.001), VM x Task (F2.22 = 5.45,
p<0.005) and a significant interaction between electrode position and side of the motor imagery
(Electrode Position x Task Side; F(1,56) = 18.25 . Largest enhancement in the mu band
event related desynchronisation was obtained in the tasks: 3D arm mi left on the right hemisphere at
electrode position CP4 (t = 2.75, p<0.005), 3D arm mi right on the left hemisphere at CP3 (t = 2.62,
p<0.005) and 3D hand mi left on the right hemisphere at CP4 (t = 2.64, p<0.005), compared to the
2-dimensional task.
Figure 2: Maps displaying the topographical distribution of mu band power (10-12 Hz; ERD) during motor
imagery tasks (left and right upper limb) after presenting 2D and 3D limb movement visualizations. The two
graphs show the modulated power spectral density during motor imagery (averaged 15 participants, baseline
corrected).
4 Discussion
Our findings support the assumption that stimulus-rich and realistic feedback conditions can lead
to stronger motor cortex activation during motor imagery (Pineda et al., 2003; Pfurtscheller et al.,
2007). Depending on the visualization modality different activations of the hemispheres could be
detected corresponding to the two different electrode positions CP4 and CP3. Furthermore, this motor
cortex activation was also related to the three different motor imagery tasks depending on the type of
visualization. Participants were able to enhance the desired electrophysiological signals significantly
in three of the 3-dimensional tasks and according to findings of Neuper et al. (2009) a contralateral
dominance of the ERD was detectable. In the tasks where the 2-dimensional visualization lead to
comparable strong ERDs the visualization seemed to become closely associated to the MI task and no
additional benefit could be gained from the sensory richer realistic 3D visualization.
Article ID 045-3
DOI:10.3217/978-3-85125-378-8-45
5 Conclusion
Motor imagery offers a promising technique for motor rehabilitation (Cincotti et al., 2012). With
a realistic visualization of the limb movements, we tried to increase motor cortex activation during
motor imagery and thereby gradually improve performance in SMR based BCI tasks. These
preliminary results suggest that the patients involvement in the motor imagery task and the functional
outcome in the motor related potentials may be improved by the use of the 3-dimensional
visualization modality.
6 Acknowledgement
This work was funded by the Deutscher Akademischer Austauschdienst (DAAD), the Department
of Mechanical Engineering at the Curtin University (Perth/ Western Australia) and partly supported
by the GK Emotion and the EU grant FP7-ICT-2011 CONTRAST project.
7 References
Allison, B.Z., Wolpaw, E.W. and Wolpaw, J.R. (2007) Braincomputer interface systems:
progress and prospects. Expert Review of Medical Devices, Vol. 4, No. 4, 463-474.
Cincotti, F., Pichiorri, F., Arico, P., Leotta, F., de Vico Fallani, F., del R Millan, J., Molinari, M.
and Mattia, D. (2012) EEG-based Brain-Computer Interface to support post-stroke motor
rehabilitation of the upper limb. Proceedings of the 34th Annual International Conference of the IEEE
Engineering in Medicine and Biology Society, San Diego, CA, pp. 4112-4115.
Kbler, A., Kotchoubey, B., Kaiser, J., Wolpaw, J.R. and Bierbaumer, N. (2001) Braincomputer
communication: Unlocking the locked in. Psychological Bulletin, Vol. 127, No. 3.
Kbler, A., Nijober, M.S., Mellinger, J., Vaughan, T.M., Pawlzik, H., Schalk, G., McFarland, D.J.,
Birbaumer, N. and Wolpaw, J.R. (2005) Patients with ALS can use sensorimotor rhythms to operate a
brain-computer interface. Neurology, Vol. 64, No. 10, pp. 1775-1777.
Neuper, C., Scherer, R., Wriessneggerand, S. and Pfurtscheller, G. (2009) Motor imagery and
action observation: Modulation of sensorimotor brain rhythms during mental control of a brain
computer interface. Clinical Neurophysiology, Vol. 120, Issue 2, pp 239-247.
Pfurtscheller, G., Scherer, R., Leeb, R., Keinrath, C., Neuper, C., Lee, F. & Bischof, H. (2007)
Viewing Moving Objects in Virtual Reality Can Change the Dynamics of Sensorimotor EEG
Rhythms. Presence ,Vol. 16, No.1, pp 111-118.
Pichorri, F., De Vico Fallani, F., Cincotti, F., Babiloni, F., Molinari, M., Kleih, S.C., Neuper, C.,
Kbler A. and Mattia, D. (2011) Sensorimotor rhythm-based braincomputer interface training: the
impact on motor cortical responsiveness. J. Neural Eng., Vol.8, No.2, 025020.
Pineda, J.A., Silverman, D.S., Vankov, A. and Hestenes, J. (2003) Learning to control brain
rhythms: making a braincomputer interface possible. IEEE Trans Neural Syst Rehabil Eng, 11, pp.
181184.
Article ID 045-4
DOI:10.3217/978-3-85125-378-8-46
1 Introduction
To allow for BCI based communication even in patients who are unable to control their eye
movement and therefore, cannot benefit from visual input has been one crucial research goal in BrainComputer Interfacing (BCI). In the P300-based auditory paradigm presented by Schreuder and
colleagues (2011) letters of the alphabet were divided into six groups and each group was represented
by one sound (consisting of a base tone ranging between 440 and 1099 Hz and a noise ranging between
3 and 10.5 KHz) presented from spatially distributed loudspeakers. The six loudspeakers were arranged
in a circle around the person (in a distance of 60 from each other) and one could choose a letter group
by focusing attention to the sound as well as the direction from which the sound was presented (spatial
information). After the group was selected, single letters were presented to the participant and the
desired letter could be chosen. Another promising P300-based approach was presented by Hhne and
colleagues who combined sound frequency (high=708Hz, medium=524Hz and low=480Hz) and spatial
information (right ear, left ear, both ears) via headphones in a paradigm that was based on the T9 system
that is used for mobile phones. By focusing attention on the desired group of letters, possible letter
combinations were narrowed down to one single desired word and the user could spell fast and easy.
However, in both systems, the letters or groups of letters were represented by sounds in combination
with spatial information. Therefore, one has to keep in mind which group is represented by which tone
and from which direction. Also the presentation of a visual support matrix decreases external validity
as in case visual input cannot be perceived, the presentation of a support matrix would not represent a
realizable solution for a patient. Thus, in this paper, we present a speller which uses natural words as
stimuli and thereby allows intuitive communication for which no visual support matrix is needed.
Spatial information was not implemented in the here presented speller as we aimed at an intuitive and
easy-to-set-up solution for auditory BCI use in people with motor impairment.
Article ID 046-1
DOI:10.3217/978-3-85125-378-8-46
2 Methods
2.1 Participants
We tested N=2 severely motor impaired patients. Participant A was paralyzed in a wheelchair
because of a traumatic accident, participant B was diagnosed with muscle dystrophy (Duchenne) and
also used a wheelchair. Both participants were male and 43 and 49 years old. Both participants used
assistive devices for communication but could communicate single words by voice. Participants gave
informed consent for the study, which had been reviewed and approved by the Ethical Review Board
of the University of Wrzburg.
Article ID 046-2
DOI:10.3217/978-3-85125-378-8-46
3 Results
Participant A spelled very successfully with the MOPS Speller with an average online accuracy of
90% correctly selected letters. The three times he had to use the undo function, he corrected his previous
false choice properly and spelled correctly all three words BOJE, SYLT and HARZ. His offline
classification accuracy was 100% after 7 sequences (see figure1). Participant B was less successful with
the MOPS-speller and reached an average accuracy of 53% correctly selected letters online. His offline
classification accuracy was 65% with 10 sequences of stimulus presentation (see figure1).
In line with the presented spelling result we found main differences for signal distinction between
targets and non-targets between the two patients. While in participant B, the P300 is very clearly
detectable (see figure 2), targets and non-targets were less distinguishable in participant A (see figure
2). The Information Transfer Rate (ITR) was 0.30 for participant A and 1.04 for participant B.
Figure 2: P300 evoked by targets (red) compared to non-targets (blue). Red and blue shades indicate the
standard deviation (SD) of EEG signals.
Article ID 046-3
DOI:10.3217/978-3-85125-378-8-46
4 Discussion
With the here presented auditory speller, we showed that intuitive auditory communication
independent from a visual support matrix can be used successfully by people with severe motor
impairment. However, we cannot judge general usability of the MOPS-speller as we only included two
patients, and one did not reach sufficient accuracy for meaningful communication (Kbler et al., 2001),
but we demonstrated the proof-of-principle.
Our major goal of using intuitive stimuli in an easy-to-setup-paradigm was very well accepted by
the participants. They reported the auditory stimulation with natural words to be pleasant and simple to
understand, even though after spelling the second word, they started to feel exhausted. Also the stimuli
themselves were long and together with the here implemented ISI, ITR is low. Nonetheless, we believe
that for a patient who is unable to use the visual input channel, the time that is needed for spelling would
still be acceptable.
Acknowledgements
This work is supported by the European ICT Program Project FP7-287320 (CONTRAST). This
manuscript only reflects the authors' views and funding agencies are not liable for any use that may be
made of the information contained herein.
References
Hhne, J., Schreuder, M., Blankertz, B., & Tangermann, M. (2011). A Novel 9-Class Auditory ERP
Paradigm Driving a Predictive Text Entry System. Front Neurosci, 5, 99.
doi: 10.3389/fnins.2011.00099.
Kbler, A., Kotchoubey, B., Kaiser, J., Wolpaw, J. R., & Birbaumer, N. (2001). Brain-computer
communication: unlocking the locked in. Psychol Bull, 127(3), 358-375.
Schalk, G., McFarland, D. J., Hinterberger, T., Birbaumer, N., & Wolpaw, J. R. (2004). BCI2000: a
general-purpose brain-computer interface (BCI) system. IEEE Trans Biomed Eng, 51(6),
1034-1043. doi: 10.1109/TBME.2004.827072
Schreuder, M., Rost, T., & Tangermann, M. (2011). Listen, You are Writing! Speeding up Online
Spelling with a Dynamic Auditory BCI. Front Neurosci, 5, 112. doi:
10.3389/fnins.2011.00112
Article ID 046-4
DOI:10.3217/978-3-85125-378-8-47
Introduction
One of the most convenient applications of brain-machine interfaces is the control of prosthetic
devices. Many approaches have been developed so far, including non-invasive [1] and invasive
[2] control of an artificial upper limb. However, for continuous control, the commonly performed
decoding of motor sensory rhythms, originated in motor imagery, has two drawbacks: due to
only a few distinguishable classes the degrees of freedom to control the eector are limited and
the training phase for users is long. An alternative control signal is the steady state visual
evoked potential (SSVEP) which provides a robust signal for detection and an easy usage. In a
previous study [3] we showed that the magnetic fields induced by SSVEPs (steady state visual
evoked field, SSVEF) can be reliably decoded to select virtual reality objects. The control of
a robotic hand with SSVEP detection was presented by [4] but still the degrees of freedom
which can be controlled are very limited. Here we present an SSVEF based prototype system,
aimed at autonomously grasping and manipulating objects, which is controlled asynchronously
via visual overt attention. The decoding scheme we applied is designed to minimize erroneous
actions of the manipulator which facilitates successful and secure grasping when combined with
an intelligent actuator.
2
2.1
Methods
Robotic Grasping System
We developed a virtual reality scenario designed to simulate the control of a robotic device.
Virtual objects, placed in the working area of the robot, can be grasped autonomously by the
robot [5, 6]. In the current setup we placed four spherical objects on the table serving as
potential grasp targets. Furthermore, we placed a green start button and a red stop button
on the table to send control commands to the robot. All objects were tagged with stimulation
Article ID 047-1
DOI:10.3217/978-3-85125-378-8-47
Figure 1: Robotic gripper in the states start (A), grasped (B) and presenting (C) and the
state diagram (D) of the system which is controllable by only three commands (start, stop and
object#).
surfaces flickering at dierent frequencies (object 14: 6.818 Hz, 7.5 Hz, 8.333 Hz, 9.375 Hz; start
button: 10.714 Hz; stop button: 12.5 Hz). Whenever only the selection of a command button
was reasonable, e.g. during a running grasping process, the stimulation of objects 14 was
stopped. Thus, selection of a new object to grasp was only possible when the gripper was in the
states retract, break and start. This reduces the probability of unwanted selections during the
grasping and manipulation process. As a representative manipulation the object could be picked
up and presented. For resting periods, a non-flickering area, e.g. the fixation cross must have
been focused. For the selection of the object to be grasped next, the corresponding flickering
surface has to be attended. The selection of the start button initiates a grasp to the lastly
selected object, the presentation of the object and the resuming of a stopped grasp movement.
In contrast, a selection of the stop button causes the reverse operations. Figure 1 (D) visualizes
the state transitions of the robot dependent on activation of the start button (green transitions)
and the stop button (red transitions). States marked in grey indicate stimulation of start and
stop button only, states marked in blue indicate stimulation of all six surfaces.
2.2
Decoding Algorithm
We decoded the SSVEF by means of canonical correlation analysis (CCA) [7]. This method
finds the optimal weighting Wx,f and Wy,f of input channels X (here the signals of 59 occipital
MEG sensors) and model channels Yf (sine and cosine waves of the stimulation frequency f
and optionally harmonics) to maximize the correlation CCA (f ) between linear combinations
xf = X T Wx,f and yf = YfT Wy,f . The CCA was applied to MEG data asynchronously every
1000 ms from the preceding 2000 ms interval. To distinguish resting intervals from selection
intervals, we determined a confidence threshold thf for each frequency f from the empirical
Article ID 047-2
DOI:10.3217/978-3-85125-378-8-47
distribution of coefficients CCA which we revealed in calibration runs. Finally, the classification was determined according to the frequency fmax = argmax(CCA (f )) that revealed the
maximum correlation coefficient and exceeded the threshold thf .
Feedback of the instantaneous target detection of the decoder was presented at each decoding
step by coloured rings around the decoded target. Only after three successive predictions of
the same target it was finally selected. The stage of prediction was indicated by stepwise colour
changes from light blue to green for the final selection. After a selection, the next decoding
was performed three seconds after feedback presentation. The detection of a resting period
was displayed at the fixation cross identical to the selectable targets and did not aect the
prediction stage of targets.
2.3
Experimental Setup
Eight healthy subjects (age 2134 years) participated in this study. MEG data were acquired
with a 248 magnetometer 4DNeuroimaging system, at a sampling rate of 678.17 Hz.
The task of the subjects was to control the autonomously grasping robot depicted in Figure 1 (AC) by overtly attending the flickering surfaces. The experiment started with calibration
runs in which one of six targets or the fixation cross was cued by a black ring during each trial.
If the final selection was dierent from the cued target a red ring indicated the erroneous decoding. A trial was counted as false after 30 s of selection failure. The subjects performed 34
calibration runs to obtain a reliable performance estimate of the proposed decoding scheme.
Additional 34 runs were performed to control the robot. In these runs, a high amount of
resting periods was required due to the slow movement of the robot. Consequently, relatively
few selection trials were performed. Subjects were instructed to grasp, pick up and replace the
objects in a predefined order, which was announced by the experimenter directly before the
run. In case of an unwanted selection due to erroneous SSVEF detection, the subject had to
perform the consequential selection to continue the desired workflow. The sequence of control
commands was as follows: Select the object to be grasped, select the start button to grasp
the object, select the start button to present the object, select the stop button to replace the
object, select the stop button to retract the gripper. In intervals of robot movement, subjects
had to prevent selections.
Results
Although feedback was provided from the first run the decoding of resting intervals started
with the second run, when initial calibration data were available. On average, 74% of target
trials (Tcorr /trials) and 73% of resting trials (Rcorr /trials) were decoded correctly while the
guessing level was 14%. Average selection time tselect was 11.7 s while fastest possible selection was 5 s. In robot control runs, the speed of a complete grasp sequence (Gcompl /time)
was 0.52 min 1 on average. In order to safely perform gripper control, the movement of the
robot was relatively slow (grasping: 28 s, picking up: 8 s) such that an optimal complete grasp
sequence, without complete retracting but including selections, was possible in at least 69 s
(maximum Gcompl /time = 0.87min 1 ). As an important result, in our setup false object selections (Serr ) occured reasonably (26%) but accidently initiated robot commands (Cerr ) occurred
rarely (4.7%) although long intervals of preventing erroneous selections were required. Table 1
summarizes the results. The number of trials performed depends on the latency of a final selection and on the number of unwanted selections. One of the subjects was able to perfectly
control the system without any erroneous selection.
Article ID 047-3
Subject
#
1
2
3
4
5
6
7
8
Calibration
Tcorr
Rcorr tselect
/trials /trials
[s]
56 / 75 18 / 25
12.1
11 / 27
4 / 10
16.5
62 / 62 23 / 23
10.9
40 / 63 15 / 23
10.7
46 / 64 21 / 23
10.5
46 / 54 17 / 21
12.8
60 / 69 16 / 25
9.7
45 / 67 16 / 23
10.3
DOI:10.3217/978-3-85125-378-8-47
8.2
7.4
16.6
10.2
12.4
12.6
15.0
10.3
Robot control
Gcompl
Cerr
Serr
/time /trials /trials
/ 1804
3 / 38
3 / 14
/ 2404
8 / 44
2 / 13
/ 2406
0 / 67
0 / 20
/ 2423
2 / 44
6 / 26
/ 2422
1 / 50 13 / 33
/ 2448
1 / 60
4 / 24
/ 2139
1 / 61 14 / 35
/ 2339
1 / 53 21 / 38
Conclusions
As an alternative modality to EEG we probed the MEG for detecting SSVEFs. Provided that
the EEG allows for comparable detection success, our results indicate that an intelligent actuator in combination with noninvasive detection of SSVEPs could help severely paralyzed patients
to initiate complex grasp movements asynchronously. The presented system allows for voluntary robot control with a low error rate even in subjects with moderate selection performance,
indicating that noninvasive BMIs may be suitable for grasping despite its low information transfer rate. It is important to note that the transfer of the system into an EEG-driven real-world
setting requires further development towards more elaborated stimulus presentation techniques
and intelligent prostheses. In addition, a combination with other modalities, for instance tracking of eye movements, could serve to make the system even more robust.
References
[1] G. R. M
uller-Putz, R. Scherer, G. Pfurtscheller, and R. Rupp. EEG-based neuroprosthesis control:
a step towards clinical practice. Neuroscience Letters, 382(1-2):169174, 2005.
[2] L. R. Hochberg, D. Bacher, B. Jarosiewicz, N. Y. Masse, J. D. Simeral, J. Vogel, S. Haddadin, J. Liu,
S. S. Cash, P. van der Smagt, and J. P. Donoghue. Reach and grasp by people with tetraplegia
using a neurally controlled robotic arm. Nature, 485(7398):372375, 2012.
[3] C. Reichert, M. Kennel, R. Kruse, H. Hinrichs, and J. W. Rieger. Efficiency of SSVEF recognition
from the magnetoencephalogram - A comparison of spectral feature classification and CCA-based
prediction. In Proceedings of the International Congress on Neurotechnology, Electronics and Informatics, pages 233237. SciTePress Digital Library, Sep 2013.
[4] G. R. M
uller-Putz and G. Pfurtscheller. Control of an electrical prosthesis with an SSVEP-based
BCI. IEEE Trans Biomed Eng, 55(1):361364, 2008.
[5] C. Reichert, M. Kennel, R. Kruse, H.-J. Heinze, U. Schmucker, H. Hinrichs, and J. W. Rieger.
Robotic grasp initiation by gaze independent brain-controlled selection of virtual reality objects. In
Proceedings of the International Congress on Neurotechnology, Electronics and Informatics, pages
512. SciTePress Digital Library, Sep 2013.
[6] M. Kennel, C. Reichert, U. Schmucker, H. Hinrichs, and J. W. Rieger. A robot for brain-controlled
grasping. In Proceedings of the Human-Robot Interaction Workshop, March 2014.
[7] Z. Lin, C. Zhang, W. Wu, and X. Gao. Frequency recognition based on canonical correlation
analysis for SSVEP-based BCIs. IEEE Trans Biomed Eng, 54(12):11721176, 2007.
Article ID 047-4
DOI:10.3217/978-3-85125-378-8-48
Introduction
The eects of ageing present physical handicaps that all-too-often prevent older people from
maintaining relationships and actively participating in social life, even when they can live safely
and independently in their own homes. Modern communication technologies could help elderly
people with physical impairments if they provided special interfaces that worked independently
of the persons limitations. For those people there is a way to express their needs, wishes and
feelings by using a Brain-Computer Interface (BCI). This technology translates brain signals,
usually acquired non-invasively using electroencephalogram (EEG), in computer commands
without using the brains normal output pathways of peripheral nerves and muscles [3].
The use of BCI as a spelling interface has been one of the main focuses in many BCI studies.
One of the BCI paradigms used for realization of a spelling interface is the SSVEP BCI which
measures the brain responses to a visual stimulation at specific constant frequencies [3]. High
system speed is an essential goal in BCI studies. However, especially regarding elderly users, the
layout of the graphical user interface plays a key role in ensuring user friendliness and eective
control, therefore, the arrangement of the visual stimuli and the desired targets are important.
For older people readability and simplicity are crucial, therefore we decreased the number of
simultaneously displayed targets to reduce cognitive load and misselections.
2
2.1
Seven subjects participated in the study, with a mean age of 23.63.06 years (range 21 30).
Two subjects were female. All subjects were students or employees of the Rhine-Waal University
of Applied Sciences. The EEG recording took place in a standard laboratory room with low
background noise and luminance. None of the subjects had neurological or visual disorders.
Spectacles were worn when appropriate. All persons who volunteered to participate in the study
became research subjects after reading the subject information sheet and signing a consent form.
Subjects did not receive any financial reward for participating in this study.
Article ID 048-1
2.2
DOI:10.3217/978-3-85125-378-8-48
Hardware
The subjects were seated in front of a LCD screen (BenQ XL2420T, resolution: 1920 1080
pixels, vertical refresh rate: 120 Hz) at a distance of about 60 cm. The used computer system
operated on Microsoft Windows 7 Enterprise and was based on an Intel processor (Intel Core
i7, 3.40 GHz). Standard Ag/AgCl electrodes were used to acquire the signals from the surface
of the scalp. The ground electrode was placed over AFZ , the reference electrode over CZ , and
the eight signal electrodes were placed at predefined locations on the EEG-cap marked with
PZ , P O3 , P O4 , O1 , O2 , OZ , O9 and O10 according to the international system of EEG electrode
placement. Standard abrasive electrolytic electrode gel was applied between the electrodes and
the scalp to bring impedances below 5 k. An EEG amplifier g.USBamp (Guger Technologies,
Graz, Austria) was utilized. The sampling frequency was set to 128 Hz. During the EEG signal
acquisition, an analog band pass filter between 2 and 30 Hz and a notch filter around 50 Hz
were applied directly in the amplifier.
2.3
Each subject used two dierent user interface layouts subsequently. The order of the presented
GUIs as well as the spelling tasks were randomized for each subject in order to minimize the
risk of adaption. The 2-steps GUI resembles to an earlier developed GUI layout [2]. In the
2-steps GUI seven commands were represented on the computer screen by flickering boxes of
default sizes (125 x 125 pixels). The size of the boxes varied during the experiment as described
in [1]. Initially the subject faced seven boxes containing A-F, G-L, M-R, S-X, Y-
and Del in a first window (see Figure 1(a)). After initial selection, the GUI changed to the
second window with six individual letters of the initially selected group, so this layout allowed
the user to select every letter with only two commands.
In order to improve user friendliness and to decrease the number of frequencies, in the here
presented 3-steps GUI the number of boxes was reduced to four. This also allows us to choose
the frequency set more freely. The box for the written word and the word to spell was moved
to the centre of the screen. The sizes and appearances of the boxes were the same as in the
2-steps GUI, however, the number of steps needed to select a letter had increased to three
and the position of the boxes changed in every step to increase the user-friendliness. In the
first window there were three boxes in the upper part of the screen, arranged horizontally with
all the letters divided into three groups A-I, J-R and S- , respectively. The additional
4th box, containing the command Del (delete the last spelled character) was located on the
right side of the screen. In the second window the position of three boxes changed from upper
horizontal to left-hand vertical according to the first selection made in the initial window, e.g.
A B C, D E F, G H I or J K L, M N O, P Q R or S T U, V W X, Y Z
. After selecting a desired box in the second window the user faced a third window. Here
each box contained a single letter and their positions changed once more from left vertical to
horizontal bottom position. In both the second and the third window the far right box (Del
in window 1) would contain the command back, giving the user the option to switch to the
previous window. The first window as well as an overview of the three steps necessary to choose
a single letter are shown in Figure 1(b) and 1(c).
In both GUIs every command classification was also followed by an audio feedback with the
name of the selected command or the letter spelled in order to reduce the information load of
the visual channel. For SSVEP signal classification we used a minimum energy combination
method. The SSVEP classification was performed on-line every 13 samples (ca. 100ms) on the
basis of the adaptive time segment length of the acquired EEG data. More details about the
Article ID 048-2
ABC
DEF
DOI:10.3217/978-3-85125-378-8-48
MNO
PQR
GHI
JKL
Del
Start
STU
VWX
Clr
Del
YZ Y
K -
Cursor: E
Target:
Segment: SubjectNr: WordOK
BCI
NextWord
AccY: -K--1
ITR: --K--bpm
VerticalRefreshRate: UW-Hz
BCI
JKL
MNO
PQR
STU
VWX
YZ_
ABC
Del
JKL
MNO
PQR
STU
VWX
YZ_
DEF
Time: 208710ms
Stop
Target: Q
Segment: 4063
SubjectNr: 3
WAXY AND QUIVERING JOCKS FUMBLE PI
ABC
DEF
GHI
WordOK
NextWord
Acc.: 100%
GHI
ITR: 15.52bpm
VerticalRefreshRate: 120Hz
2.4
Procedure
After completing the consent form, each subject completed a brief questionnaire and was prepared for the EEG recording. Subjects participated in a familiarization run spelling words
BCI, BRAIN, and the word of choice (e.g. the own first name) and the experimenters
used the collected data to manually adjust the individual parameters. Next, each subject used
both GUIs to spell the phrase WAXY AND QUIVERING JOCKS FUMBLE PIZZA (commonly used pangram, a sentence using every letter of the alphabet at least once). Each spelling
phase ended automatically when the presented word was spelled correctly. The experiment was
stopped manually in case a subject could not execute a desired classification within a certain
time frame, the subject wished to end the experiment or unintentional repeated misclassifications occurred. Spelling errors were corrected via the implemented Del button. Information
needed for the analysis of the test was stored anonymously during the experiment. After finishing the spelling, subjects were asked to rate how well the BCI-System worked for them.
Results
BCI performance for each subject was evaluated by calculating the commonly used information
transfer rate in bit/min, employing the formula as discussed e.g. in [4]. In the here presented
GUIs, the overall number of possible choices was equal to seven and four for the 2-steps GUI
Article ID 048-3
DOI:10.3217/978-3-85125-378-8-48
and the 3-steps GUI, respectively. The accuracy was calculated based on the number of correct
command classifications divided by the total number of classified commands. The overall BCI
performance is given in Table 1. All subjects were able to complete the pangram. Since for
one subject the data were not stored due to a computer error, this table presents the data from
only six subjects.
Subject
1
2
3
4
5
6
Mean
SD
2-steps GUI
Time
Acc
ITR
sec
%
bpm
894.66 95.06 13.02
430.42
81.51
27.19
702.71
72.00
23.59
557.38
70.00
23.53
325.51
92.39 37.69
623.09 69.48
23.20
588.96 80.07 24.70
201.54 11.47
7.95
3-steps GUI
Time
Acc
ITR
sec
%
bpm
1686.55
92.09
7.28
845.61
85.89
13.71
599.93
85.55
20.27
455.61 76.23
24.34
516.85
88.74
22.97
460.08 95.94 27.08
760.77 87.41 19.27
476.06
6.74
7.43
Subjective impression
2-steps
3-steps
GUI
GUI
2
4
3
4
4
4
3
3
2
4
2
4
1- worked not at all
5- worked very well
Table 1: Results for each subject and layout. Mean values are at the bottom of the table.
Discussion
Four out of six subjects reached higher ITR with the 2-steps GUI, also the spelling time was
faster for three subjects with this layout. However, the majority of subjects (five out of six)
stated that they prefer the 3-steps GUI over the 2-steps GUI (right sight of Table 1). Also,
some subjects stated that they found the overall design of the 3-steps GUI more intuitive. All
in all, the here presented 3-steps GUI did not surpass the 2-steps GUI in terms of spelling time
and ITR. But because of other factors, such as user comfort, user-friendly design and better
accuracies the 3-steps GUI could be a valuable option especially for elderly users. Main advantage of the 3-steps GUI is that only 4 boxes are displayed, which decreases the misselections
and seems to be less stressful for the users.
This research was supported by the German Federal Ministry of Education and Research
(BMBF) under Grant 16SV6364. We also thank all the participants of this research study.
References
[1] Ivan Volosyak. SSVEP-based Bremen-BCI interface boosting information transfer rates. J. Neural
Eng., 8(3):036020, 2011.
[2] Ivan Volosyak, Anton Moor, and Axel Gr
aser. A dictionary-driven SSVEP speller with a modified
graphical user interface. In Advances in Computational Intelligence, pages 353361. Springer, 2011.
[3] Jonathan Wolpaw and Elizabeth Winter Wolpaw, editors. Brain-computer interfaces: principles
and practice. Oxford University Press, 2011.
[4] J.R. Wolpaw, N. Birbaumer, D.J. McFarland, G. Pfurtscheller, and T.M. Vaughan. Brain-computer
interfaces for communication and control. Clin. Neurophysiol., 113:767791, 2002.
Article ID 048-4
DOI:10.3217/978-3-85125-378-8-49
1 Introduction
On brain computer interface (BCI), decoding and predicting intention of the subject is one of main
issues. In primates, saccadic eye movements are often used to fixate objects of interest. The frontal eye
field (FEF), supplementary eye field (SEF), and superior parietal lobule (SPL) including Intraparietal
sulcus (IPS) are known as principal neocortical regions involved in the execution of both saccadic and
pursuit eye movements. However it is not fully understand because of the limited measurement
equipment such as fMRI, which has low time-resolution (LunaB, ThulbornK, StrojwasM, SweeneyJ,
1998) or depth electrode, which has low spatial-resolution (LeeK, AhnK, KellerE, 2012).
Recently, epidural electrocorticogram (eECoG) is widely known to give a great SNR, high spatialresolution, and broad frequency bandwidth. In this study, we verified the usability of implanted eECoG
in non-human primate by predicting saccadic movement using eECoG signals. We used implanted multi
channels micro electrode for non-human primate and eye movement behavioral task for verification.
Article ID 049-1
DOI:10.3217/978-3-85125-378-8-49
electrode patches (32 channels, 4 by 8), positioned in the left hemisphere including dorsal parietal cortex
and posterior part of frontal cortex (Figure 1). It covered the SEF, FEF and SPL including IPS, which
were well-known as related to saccadic behavior. The implanted electrode grids consisted of gold
electrodes that were embedded in poly-imide and spaced at an inter-electrode distance of 3 mm.
2U
#U
('
('(
'(
'
(
5'(
5
%U
+2UU
The performance in the task was monitored by infrared video-oculography with a sampling rate of
500Hz (Eyelink2, SR Research Ltd, Kanata, Ontario, Canada). Saccade behavior was measured off-line
using programs written in MATLAB (The Mathworks, Natick, MA, USA). The onset and offset of
saccades were determined by velocity criteria (30/s radial velocity for onset and 10/s for offset).
eECoG signals were recorded with a sampling rate of 512Hz per channel, using data acquisition
system (Brainbox EEG-1164 amplifier, Braintronics B. V., Almere, Netherlands). 60 Hz analog notch
filter was used during the data acquisition.
Article ID 049-2
DOI:10.3217/978-3-85125-378-8-49
3 Results
We measured the eECoGs impedance to observe the impedance change after implantation. As a
result, in-vivo impedance test showed stable values as time passes. Electrodes had impedance of
5.4920.046k, 5.4700.048k, 5.3920.050k and 5.3580.047k (meanSEM for 64 electrodes at
1 kHz), from 1st to 4th week, respectively.
The eECoG decoding processing was carried on three sessions to predict the monkeys saccadic
movement. The classification accuracies were 87%, 86% and 89% for predicting saccade eye
movements direction, from 1st session to 3rd session, respectively. Calculating the weight contribution
ratio in decoding models revealed that the greater spatial contributions were found in SEF, FEF and
SPL, which are consistent with previous reports and support the feasibility of eECoG based on
neuroscientific background. In addition, Contribution of temporal information was significantly higher
as closer to saccade onset time. The spectral contributions were greater in (18~26Hz) band than other
frequency bands.
Article ID 049-3
DOI:10.3217/978-3-85125-378-8-49
4 Discussion
The 4-direction classification accuracies, which were 53%, 66%, and 56%, were higher than a
chance level (25%). The left/right classification accuracies were significantly higher than a chance level
(50%). It shows that 2-direction classification is much more efficient than 4-direction classification at
BCI. Moreover, the prediction accuracy of eECoG-based decoding showed no significant monotonic
decrease for 2 months. This result indicates that eECoG has durability and that the signal quality of the
eECoG was maintained for several months at least.
As seen in the Figure 3, the weight of -band was increased gradually over time and the highest at
just before the physical eye movement. It seems that the -band played a large role in saccadic plan and
execution.
In the present study, we predicted saccadic eye movement using eECoG and investigated the
feasibility of long term implanted eECoG on BCI. This work demonstrated that it was possible to detect
the eye movements direction before physical eye movement using eECoG, which reflects the
possibility of eECoG for brain signal decoding and BCI study.
Acknowledgments
This research was supported by the National Research Foundation of Korea (NRF) grant funded by
the Ministry of Education, Science and Technology (No. 2010-0020807, No. 2013R1A2A2A04014987).
References
Boser, B. E., Guyon, I. M., & Vapnik, V. N. (1992). A training algorithm for optimal margin classifiers.
ACM Proceedings of the fifth annual workshop on Computational learning theory, 144-152.
Chao, Z. C., Nagasaka, Y., & Fujii, N. (2010). Long-term asynchronous decoding of arm motion using
electrocorticographic signals in monkeys. Frontiers in neuroengineering, 3.
Guyon, I., Weston, J., Barnhill, S., & Vapnik, V. (2002). Gene Selection for Cancer Classification using
Support Vector Machines. Machine Learning, 46(1-3), 389-422.
Lee, K. M., Ahn, K. H., & Keller, E. L. (2012). Saccade generation by the frontal eye fields in rhesus
monkeys is separable from visual detection and bottom-up attention shift. PLoS One, 7(6).
Luna, B., Thulborn, K. R., Strojwas, M. H., & Sweeney, J. A. (1998). Dorsal cortical regions subserving
visually guided saccades in humans: an fMRI study. Cereb Cortex, 8(1), 40-47.
Vapnik, V. N. (1999). An overview of statistical learning theory. IEEE Trans Neural Netw., 10(5), 988999.
Article ID 049-4
DOI:10.3217/978-3-85125-378-8-50
Abstract
Stroke, one of the leading causes for acquired disability in adults affects motor and cognitive
abilities, and greatly increases the risk for depression. In search for new therapies, traditional
biofeedback as well as BCI controlled neurofeedback have been suggested to improve emotional wellbeing and cognitive functions. To provide the best possible therapy for an individual patient, we
developed an easy-to-use online interface for shared decision-making between clinicians and patients.
After data entry, the system presents a visually attractive review of aggregated and individual test
results, relates performance to normative samples, and suggests possible treatment options. This
approach to rehabilitation follows the Chronic Care Model that considers patients as partner and fosters
their empowerment. Patients, health care providers, and developers of technology work closely together
to define problems, set priorities, establish goals and create a training plan. Such a user-centered
approach is the pre-requisite for the implementation of bio-and neurofeedback supported rehabilitation
in clinical practice.
1 Introduction
Stroke is the second leading cause (6.3%) for acquired disability in adults in Europe and affects
motor and cognitive abilities. Most prevalent are cognitive deficits in attention and memory, which
contribute to reduced quality of life, depression and impairments in activities of daily living (Sheldon
& Winocur, 2014). In search for new therapies after stroke we suggest to include bio-and neurofeedback
based interventions, which can support patients convalescence and, thereby, reduce long-term effects
after stroke.
Within the EU funded project CONTRAST, we, therefore, developed specific bio- and
neurofeedback-based modules for rehabilitation training of patients with sub-acute and chronic stroke.
The training modules comprise five different domains: (1) depression, (2) attention, (3) declarative
memory, (4) working memory, and (5) inhibitory control. To provide every patient with the appropriate
training module and to support the shared decision-making between clinician and patient, an algorithm
was developed. In doing so, we followed two major goals: To facilitate shared decision-making by
creating a standardized, comfortable and efficient tool for presenting a structured overview of patient
Article ID 050-1
DOI:10.3217/978-3-85125-378-8-50
test results. This enables the clinician to explain further steps concerning rehabilitation treatments while
patients can follow, comment, discuss and finally co-decide on appropriate interventions. In addition,
we aimed to create an individualized hierarchy of treatments on the basis of patients psychometric test
scores.
2 Methods
In a first step, a comprehensive profile of possible impairments is obtained from a battery of
standardized and well-normed psychometric tests (see Table 1). All included tests fulfilled the quality
criteria of being objective, reliable, and valid and are available in German, English, French and Italian.
Based on the patients demographics, such as age, gender, or education, test scores are automatically
related to the appropriate test norms, and visualized. In a second step, our system proposes a set of
recommendations for rehabilitation training (see Figure 1).
Domain
Depression
Attention
Declarative memory
Working memory
Inhibitory control
Measure
Center for Epidemiologic Studies Depression Scale, Perseverative
Thinking Questionnaire
Test of Attentional Performance (TAP) Alertness, TAP Divided
Attention
California Verbal Learning Test
TAP Working Memory, Digit Span Task, Visuo-Spatial Memory
TAP Go/No-Go
Table 1: Dimensions and respective tests serving as a basis for the algorithm
Figure 1: The algorithm including the bio-and neurofeedback modules for cognitive deficits and if necessary,
treatment for depression.
We suggest to address depression first, since mood disorders have a ubiquitous negative impact on
cognitive training and need to be treated with psychological therapy additionally to HRV biofeedback,
then to focus on attentional problems, which are likely to affect other cognitive functions (Lezak, 1987),
and finally to address specific cognitive functions such as declarative memory, inhibitory control, and
working memory (see Figure 1).
Article ID 050-2
DOI:10.3217/978-3-85125-378-8-50
Figure 2: Online overview of test performance scores including date, cognitive function and possibility to
request test report.
Graphical presentation of test scores follows a hierarchical approach, where the most critical results
and recommendations of the most appropriate training module are summarized on a one-page report. In
addition, a consistent colour coding scheme is used to highlight deviant test scores, allowing the rapid
identification of critical test results. Figure 3 illustrates the presentation of results from a (fictitious)
patient. In this example, working memory is easily identified as the most critical domain, with the
below-average performance in the TAP working memory test being responsible for this result. To
address potential colour vision deficiencies after stroke we plan to provide an additional black and white
high-contrast display of results.
Figure 3: Colour coded overview of patient AD07 test results. Red=highly critical, orange=critical,
yellow=less critical, green=most likely normal.
Article ID 050-3
DOI:10.3217/978-3-85125-378-8-50
clinicians and patients to agree on a treatment choice (a training module) as we provide a common basis
for this decision with the presentation of tests scores indicating deficits in cognitive functions.
Figure 4: The Chronic Care Model (modified from Wagner et al., 1996) requires that Community and Health
Systems contribute to well-prepared practice teams and informed active patients who are in a productive
interaction allowing for shared decision- making. Improved outcome is demonstrated, for example, by increased
patients self-efficacy, which is associated with low levels of depression and feelings of helplessness.
References
Godolphin, W. (2009). Shared decision-making. Healthcare Quarterly, 12, (e186-e190).
Lezak, M. D. (1987). Assessment for rehabilitation planning. In M. J. Meier, A. L. Benton, & L. Diller
(Eds.), Neuropsychological rehabilitation. New York: Guilford Press.
Sheldon, S., & Winocur, G. (2014). Memory Loss After Stroke. In T. A. Schweizer & R. L. Macdonald
(Eds.), The Behavioral Consequences of Stroke (pp. 151176). New York, NY: Springer New York.
Wagner EH, Austin BT, Von Korff M. Organizing care for patients with chronic illness. Milbank Q
1996;74(4 ):511-44.
Article ID 050-4
DOI:10.3217/978-3-85125-378-8-51
Abstract
This paper presents a series of recent improvements made on the P300 speller paradigm
in the context of the CoAdapt project. The flashing sequence is elicited by a new design
called RIPRAND, in which the flashing rate of elements can be controlled independently of
grid cardinality. Element-based evidence accumulation allows early-stopping of the flashes
as soon as the symbol has been detected with confidence. No calibration session is necessary, thanks to a mixture-of-experts method which makes the initial predictions. When
sufficient data can be buered, subject-specific spatial and temporal filters are learned,
with which the interface seamlessly makes its predictions, and the classifiers are adapted
online. This paper, which presents results of three online sessions totalling 26 subjects, is
the first to report online performance of a P300 speller with no calibration.
The P300 speller presented in this work was implemented in C++ with OpenViBE [7], and a
dedicated stimulating software controlled the keyboard display. The software is opensource and
part of OpenViBE release 0.18 We used a single Windows laptop to run all software components.
The P300 speller keyboard was displayed on a separate LCD screen. A TMSi Refa8 amplifier,
synchronized via hardware to the laptop, was used to record from 12 actively shielded electrodes.
The visual stimulations consisted of briefly flashing smiley pictures. The P300 wave was
detected via 3 channels of an xDAWN spatial filter [8], combined with a Regularized LDA
classifier hereforth called RDA, which incorporates a regularisation of the common covariance
matrix. The output of the classifier at each flashing time t is denoted y(t).
To save time, elements are always flashed in groups. Initial design of P300 speller groups
involved rows and columns of a square matrix [2] or their randomizations [1]. The target element
is then found at the intersection of the groups eliciting a P300 response. But repetitively flashing
the same groups causes elements within the target groups to be wrongly selected, because of
visual attention eects, and because of the contamination of all group elements by classification
errors.
Element-wise evidence accumulation avoids these two eects. A dierent random permutation can then be performed at each repetition of the flashes, eectively changing elements
group membership across repetitions. At each flash t, let the binary vector a(t) represent the
set of n flashed elements within the grid of cardinality N . The score (t) of each element
(initialized to 0 at time 0) is updated with the following scheme, in which both target and
non-target flashes contribute to the accumulation:
1
1
(t) = (t 1) + log
a(t)
y (t) +
(1 a(t))(1 y(t))
(1)
n
N n
Article ID 051-1
DOI:10.3217/978-3-85125-378-8-51
Figure 1: Left: template for a 36-element grid and 1-in-4 flash-rate. Each line represents the
flashing elements a(t). Right: example of accumulation based on the randomized grid on the
left, over 60 flashes, with element 1 as target.
The groups definition, encoded in a(t), follows a restricted isometry principle (RIP) [10]. This
principle oers an minimal intersection between groups, while allowing the flash-rate to be
adapted (e.g. 1-in-4, 1-in-6, ...), independently of the number of elements. A random permutation of flashed elements is applied after each repetition (RIPRAND). Figure 1 illustrates
the RIP (left) and RIPRAND (right) flashing strategies. Large display grids had already been
investigated by [11] and [3], but without optimizing the sequences employed for faster search of
the target letter.
Another prominent advantage of element-wise accumulation is that it allows early stopping
of flashes, as soon as the score of one element clearly outperforms the others. This Early
Stopping has been shown to improve user motivation, and in turn, the quality of the P300
signal [6].
RIPRAND with early stopping was first evaluated oine, using simulated data created by
fitting pdfs for P300 detection from a 20-subject database from Inserm Lyon. To test its actual
usage, two online experiments were performed with 10 subjects each, on a 6x6 grid (Exp
1), and on a 9x9 grid (Exp 2).
To remove the need for a calibration session, a transfer learning method has been implemented, which uses a mixture of experts (MOE) classifier learned from the 20-subject data
of Experiments 1 and 2. A decision y(t), bounded between 0 and 1, is taken by taking the mean
of all binary decisions of the experts. This subject-independent decision is then accumulated
using (1).
After the P300 speller has been initialized through the above subject-independent approach,
and enough data has been collected, a subject-dependent learning takes place, which exploits
the notion of target relabelling: decisions made after the accumulation provide labels for supervised learning. xDAWN and RDA parameters are thus learned, and the RDA parameters
are subsequently adapted online, by considering a data buer.
In summary, the MOE classifier, implemented in OpenViBE, thus replaces the classic calibration phase, after which a new individual classifier is calibrated and the experts are ignored.
The individual classifier is updated on the data contained in a buer (a sliding window, which
for the moment has a fixed size).
Article ID 051-2
DOI:10.3217/978-3-85125-378-8-51
An online experiment (Exp 3) was performed with 6 naive subjects to test the zerocalibration and online learning. Experiment
3 used the keyboard layout displayed on the
right. The concepts of zero-calibration and
online adaptation have been suggested earlier,
and validated oine [5], but this is the first
time an online study is presented.
Table 1 (simulated data) shows the advantages of RIPRAND over row-column flashing in reducing the number of flashes required for an accurate selection. Accuracy refers to character
selection accuracy (not target detection accuracy). Table 2 presents results from Online Experiments 1 (6x6 grid) and 2 (9x9 grid), with two types of bit-rate: theoretical bit-rate, and actual
bit-rate with a 5 second break between characters. In Experiment 1, one subject was disregarded, as he was unable to stay focussed during the session, and his performance degraded,
making comparisons delicate. Finally, Table 3 shows results of the session with no calibration:
the session starts with a Mixture-of-experts prediction, which quite high accuracy (86.7%) but
requires many flashes (88.7 target + non-target per character) to reach a decision. Bit-rate for
this initial period is thus quite low (13.8). In the next part of the session, adaptive learning on
the users own data allows the bit-rate to improve to about 30 bits/minute (real bit-rate).
Table 1: Comparison of flashing strategies on simulated data (average over 10000 trials).
Type
6x6 RIPRAND 1/6
6x6 Row-Column
9x9 RIPRAND 1/6
9x9 Row-Column
Accuracy
94.5%
93.7%
93.9%
87.2%
Flashes
34.2
43.7
42.9
66.9
Theo. Bit-rate
62.8
47.9
63.8
37
Table 2:Results of Exp 1 (6x6 grid, averaged over 9 subjects) and Exp 2 (9x9 grid, averaged over 10 subjects).
Character selection accuracy, average number of flashes per character selection before early stopping, bit-rate
accounting for the 5 s pause between characters, and theoretical bit-rate.
Type
6x6 Row-Column
6x6 RIPRAND 1/6
9x9 Row-Column
9x9 RIPRAND 1/6
Accuracy
92.2%
93.3%
74.5
88
Flashes
21.6
21.6
53.8
42.1
Bit-rate
30.1
31.2
18
26.5
Theo. Bit-rate
71.3
72.7
30.8
47.9
Table 3: Results of Exp 3 (averaged over 6 subjects). Character selection accuracy, average number of flashes and
bit-rate per minute including a 5s pause between characters.
Article ID 051-3
Type
Mixture-of-experts
Adaptive learning, buer length 10
Adaptive learning, buer length 30
DOI:10.3217/978-3-85125-378-8-51
Accuracy
86.7%
95.7%
94.6%
Flashes
88.7
37.7
27.2
Bit-rate
13.8
27.2
30.8
We have thus developped a new P300 speller paradigm which does not need any calibration session and boosts user motivation thanks to early stopping. A clinical feasibility study is
currently taking place at Nice University Hospital on 20 ALS patients. Further improvements
considered include the use of Natural Language Models to improve the prediction stage [4, 9]
and further improve the bit-rate thanks to word completion.
Acknowledgment: This work was carried out as part of the CoAdapt project, funded
through the French National Research Agency grant ANR-09-EMER-002-01.
References
[1] Hubert Cecotti and Bertrand Rivet. One step beyond rows and columns flashes in the p300 speller:
a theoretical description. International Journal of Bioelectromagnetism, 2010.
[2] L.A. Farwell and E. Donchin. Talking o the top of your head: toward a mental prosthesis utilizing
event-related brain potentials. Electroencephalography and Clinical Neurophysiology, 70(6):510
523, 1988.
[3] J. Jin, B.Z. Allison, E.W. Sellers, C . Brunner, P. Horki, X. Wang, and C. Neuper. An adaptive
P300-based control system. Journal of Neural Engineering, 8(3), June 2011.
[4] Pieter-Jan Kindermans, Hannes Verschore, David Verstraeten, and Benjamin Schrauwen. A P300
BCI for the masses: Prior information enables instant unsupervised spelling. In NIPS, pages
719727, 2012.
[5] Pieter-Jan Kindermans, David Verstraeten, and Benjamin Schrauwen. A Bayesian Model for
Exploiting Application Constraints to Enable Unsupervised Training of a P300-based BCI. PLOS
One, 2012.
[6] Margaux Perrin. Coadaptation cerveau machine pour une interaction optimale: application au
P300-Speller. PhD thesis, Universite Claude Bernard Lyon 1, 2012.
[7] Yann Renard, Fabien Lotte, Guillaume Gibert, Marco Congedo, Emmanuel Maby, Vincent Delannoy, Olivier Bertrand, and Anatole Lecuyer. Openvibe: An open-source software platform
to design, test, and use braincomputer interfaces in real and virtual environments. Presence:
Teleoperators and Virtual Environments, 19(1):3553, 2010.
[8] B. Rivet, A. Souloumiac, V. Attina, and G. Gibert. xDAWN Algorithm to Enhance Evoked
Potentials: Application to Brain-Computer Interface. Biomedical Engineering, IEEE Transactions
on, 56(8):2035 2043, 2009.
[9] William Speier, Corey Arnold, Jessica Lu, Ricky K Taira, and Nader Pouratian. Natural language
processing with dynamic classification improves p300 speller accuracy and bit rate. Journal of
neural engineering, 9(1):016004, 2012.
[10] Eoin M. Thomas, Maureen Clerc, Alexandra Carpentier, Emmanuel Dauce, Dieter Devlaminck,
and Remi Munos. Optimizing P300-speller sequences by RIP-ping groups apart. In IEEE/EMBS
6th international conference on neural engineering (2013), San Diego, United States, November
2013. IEEE/EMBS.
[11] G. Townsend, B.K. LaPallo, et al. A novel P300-based braincomputer interface stimulus presentation paradigm: Moving beyond rows and columns. Clinical Neurophysiology, 121(7):1109
1120, 2010.
Article ID 051-4
DOI:10.3217/978-3-85125-378-8-52
Introduction
The need for a fast brain switch emerges in the development of brain-computer interface (BCI)
control for assistive robotic devices. While low speed and impreciseness of noninvasive BCIs
can be, to a large extent, compensated by assigning the low level control to robotic intelligence,
the user must be given an opportunity to rapidly stop an action if something goes wrong [3]. A
BCI for this function (a fast brain switch) should posses the following features: (1) be as fast
as possible, (2) be asynchronous, i.e. able to detect the users intention at any time moment,
(3) have a low false activation rate (FA). An ability to detect a command irrespective of the
preceding activity in which the user was involved is also desirable.
Rebsamen et al. [3] proposed a fast asynchronous switch based on a modified P300 BCI.
In their design, stimuli were presented in a 3x3 matrix, but a command could only be issued
using attention to the central stimulus. Robotic wheelchair was stopped with mean response
time (RT) of 6.0 s and FA of 1.2 per min. In [1], the P300 BCI was combined with steady-state
visual evoked potential (SSVEP) based BCI, providing RT of 5.3 s and FA of 0.5 per min.
One may ask if the non-target stimuli (8 of 9 stimuli in [3] design) can be removed from
the P300 BCI stimulus sequence, to improve the conditions for perceiving the target stimuli
and make possible its fast presentation. Although the absence of non-targets contradicts the
traditional view on the P300 BCI, we demonstrated that a simplified and more ergonomic
variant of the P300 BCI without non-targets can be successfully used for calibrating a classifier
for a normal P300 BCI [6]. Using this approach, reminiscent of the single-stimulus paradigm
known in psychophysiology [2], we designed a new single-stimulus switch. In oine simulation
using data from four participants, the BCI classifier exceeded the threshold already about 2 s
after a saccade to the stimulus, with FA rate of 1.2 to 3.4 per minute [5].
Article ID 052-1
DOI:10.3217/978-3-85125-378-8-52
Figure 1: Left: A screenshot from the auditory feedback phase. Right: a view from behind the
participant in the robot control phase.
In the current study, the single-stimulus switch was tested online for the first time. To
demonstrate the eectiveness of the switch in diverse conditions, we tested it in two tasks. In
these tasks, the participants were involved into dierent types of visual activity (still image
inspection task and passive pursuing a moving robot arm) from which they had to switch
urgently to BCI control at unpredictable time moments.
Methods
Five healthy volunteers participated in the study. EEG was acquired from Cz, Pz, POz, P2,
Oz, O1, O2 locations. We used BCI2000 system [4] with our module for single-stimulus
presentation and online processing. Stimuli were stylized animal faces presented for 150 ms
with interstimulus interval varied from 300 to 550 ms.
For calibration, the participants silently counted stimuli (8 control runs, each consisted
of 8 blocks of 8 stimuli with 2 s pauses between blocks) and silently read a text (a 10 min
non-control run). Rebsamen et al. [3] found no dierence between reading and other conditions without stimuli counting when they represented the non-control condition in estimating
classification performance of their BCI in similar settings. We chose the reading task because
it is simple and can be well controlled. 512 epochs of 700 ms length were extracted from the
control EEG starting from the stimulus. From non-control EEG, 598 epochs of the same
length were extracted with fixed 300 ms interval between them. The EEG was downsampled to
50 Hz and the channel data were concatenated, resulting in feature vector length of 245. Fisher
Discriminant Analysis with Tikhonov regularization (ridge regression) was applied to control
and non-control epoch sets to compute the classifier weights.
In the subsequent two phases of BCI online testing, the stimuli (one of the animal faces used
in calibration) were presented with the same interstimulus interval, but without 2 s pauses.
Each time a stimulus-related EEG epoch was obtained after a new stimulus, the classifier was
applied to an average of the last five stimulus-related epochs and its output was compared to
a predefined threshold. Once the threshold was exceeded, the BCI was activated, i.e. caused
actions specific for each testing phase (see below). Data from the first 3 min run (not used in
the analysis of the BCI performance) were used to adjust the threshold so that RT and FA were
minimized.
In the auditory feedback phase, the background task was to guess what cities were shown
on the screen (Figure 1, left). A photo on the screen was changed each 60 s. A sound signal was
Article ID 052-2
DOI:10.3217/978-3-85125-378-8-52
presented at random moments. After this signal event, the participant had to start counting
stimuli until he or she heard a recorded voice saying Yes and the stimulus was replaced by a
green circle. In the robot control phase, the participants watched a robotic arm (AREXX Mini
Robot Arm in the first two experiments and ST Robotics R12-six in the last three experiments)
at 1.5 m distance (Figure 1, right). The robot made fast movements (Mini Robot Arm was
waving its hand and R12-six was drawing a horizontal line) until it suddenly changed movement
direction (a signal event in this phase). When this occurred, the participant had to stop
the robot as fast as possible by counting the stimuli until the green circle appeared and the
robot returned to its normal behavior.
The BCI activation was considered as correct if it occurred between 750 ms and 8 s after the
signal event, and as a false activation otherwise. RT was computed for correct activations as
their time relative to the preceding signal events. FA was computed as the false activation
number divided by duration of testing, and miss rate was computed as the proportion of signal
events not followed by a correct activation. In an additional analysis, a hybrid EEG and EOG
based command detection was simulated, with number of averaged EEG epochs decreased from
five to three. Large saccades in the direction of the stimulus were detected in EOG using a
combined threshold-based and template-based detector and used to confirm activations: BCI
threshold exceeding produced a command only if preceded by such a saccade.
In the auditory feedback phase, mean RT was 3.7 s, FA was 0.3 min
In the robot control phase, mean RT was 4.0 s, FA rate was 0.1 min
subj 1
Ntot = 28
subj 2
Ntot = 14
subj 2
Ntot = 20
subj 3
Ntot = 12
subj 3
Ntot = 31
subj 4
Ntot = 14
subj 4
Ntot = 30
subj 5
Ntot = 10
subj 5
Ntot = 34
1
1
RT, s
FA
Miss
FA
Miss
subj 1
Ntot = 28
subj 2
Ntot = 14
subj 2
Ntot = 20
subj 3
Ntot = 12
subj 3
Ntot = 31
subj 4
Ntot = 14
subj 4
Ntot = 30
RT, s
subj 5
Ntot = 34
subj 5
Ntot = 10
0
RT, s
FA
Miss
RT, s
FA
Miss
Figure 2: Distribution of Reaction Times (RT), False Alarms (FA) and Misses in online (BCI)
and oine (BCI + saccades) tests. Here, FA and Misses data are not normalized. The y-axes
shows the number of observations, while Ntot is the total number of signal events.
Article ID 052-3
DOI:10.3217/978-3-85125-378-8-52
Acknowledgements
Part of this study related to the hybrid EEG and EOG based command detection was supported
by Russian Science Foundation Grant 14-28-00234.
References
[1] Y. Li, J. Pan, F. Wang, and Z. Yu. A hybrid bci system combining p300 and ssvep and its application
to wheelchair control. IEEE Transactions on Biomedical Engineering, 60(11):31563166, 2013.
[2] J. Polich and M. R. Heine. P300 topography and modality eects from a single-stimulus paradigm.
Psychophysiology, 33(6):747752, 1996.
[3] B. Rebsamen, C. Guan, H. Zhang, C. Wang, C. Teo, V. M. H. Ang, and E. Burdet. A brain
controlled wheelchair to navigate in familiar environments. IEEE Transactions on Neural Systems
and Rehabilitation Engineering, 18(6):590598, 2010.
[4] G. Schalk, D. J. McFarland, T. Hinterberger, N. Birbaumer, and J. R. Wolpaw. Bci2000: a generalpurpose brain-computer interface (bci) system. IEEE Transactions on Biomedical Engineering,
51(6):10341043, 2004.
[5] S. L. Shishkin, A. A. Fedorova, Y. O. Nuzhdin, I. P. Ganin, A. E. Ossadtchi, B. B. Velichkovsky,
A. Y. Kaplan, and B. M. Velichkovsky. Towards high-speed eye-brain-computer interfaces: combining the single-stimulus paradigm and gaze translation (in russian). Herald of Moscow University,
Ser. 14: Psychology, (4):419, 2013.
[6] S. L. Shishkin, A. A. Nikolaev, Y. O. Nuzhdin, A. Y. Zhigalov, I. P. Ganin, and A. Y. Kaplan.
Calibration of the p300 bci with the single-stimulus protocol. In Proceedings of the 5th International
BCI Conference, pages 256259, 2011.
Article ID 052-4
DOI:10.3217/978-3-85125-378-8-53
!"#$%&'()*+(*",-./.01(2/.345$/."-6(7%"3(
)"//51-189'(/"(:;;"%/4-./.1'((
)1,"-<$(:/$5=>?(@5".'$(A$%,.4=B($-&(715.;(C.%$551'=(
=(
)$%815"-$(D.,./$5(!18E-"5",<(*1-/%1(F)D.,./$5G?()$%815"-$?(2;$.-(
botal@bdigital.org, evargiu@bdigital.org, fmiralles@bdigital.org
Abstract
@31%,.-,( )*+( H$'1&( 8",-./.01( '/.345$/."-( $;;5.8$/."-'?( 5.91( -14%"I11&H$89( $-&(
,$3.-,?($%1(;%"H$H5<($3"-,(/E1(3"'/(,%"#.-,(3$%91/(";;"%/4-./.1'(/"&$<J()1'.&1'?(/E1(
%.'.-,($,.-,(;";45$/."-($-&(./'($''"8.$/1&(8",-./.01(&185.-1(8%1$/1($(;%1''.-,(&13$-&(I"%(
.--"0$/.01( /18E-"5",.1'( "II1%.-,( $( %1$5( 8E$-81( /"( ;%1'1%01( ,1-1%$5( #155H1.-,J( K15$/1&(
%1'1$%8E(H%1$9/E%"4,E'(/E$/(4'1()*+($'(3$.-(/18E-"5",<(/"(1-E$-81(4'1%L'(;1%I"%3$-81(
$%1(1M$3.-1&?($'($5'"(/E1(8"33"-(H"//51-189'(/E$/('/.55(E.-&1%()*+(1-&(4'1%($&";/."-J(((
= )*+(@0"54/."-(
)%$.-N*"3;4/1%(+-/1%$8/."-(F)*+G(H$'1&(/18E-"5",<(.'(/E%.0.-,($-&(E$'(/E1(;"/1-/.$5("I(';%1$&.-,(
.-/"( '"8.1/<( H<( $&&%1''.-,( /E1( -11&'( "I( 0$%."4'( 4'1%( ,%"4;'( 4-&1%( &.II1%1-/( $;;5.8$/."-( '81-$%."'J(
O"5;$#($-&(O"5;$#(FPQ=PG(.&1-/.I.1&(/E1(I.01(;%.-8.;$5()*+('81-$%."'("%($;;5.8$/."-(/<;1'(/E$/(3$<(
E$01(10"501&('"(I$%?(-$315<?(replace?(restore?(improve?(supplement($-&(enhanceJ(*"334-.8$/."-(.'(
;1%E$;'( /E1( 3"'/( I45I.55.-,( .331&.$/1( 4'1( "I( )*+( '<'/13'( I"%( $( ;$/.1-/?( E1%( I$3.5<( $-&( 8$%1,.01%'(
#E1-( -"( .-/155.,.H51( .-/1%$8/."-( 8$-( "/E1%#.'1( /$91( ;5$81( F).%H$431%( 1/( $5J?( =RRRGJ( +-( /E.'( 8$'1?( /E1(
)*+("4/;4/(851$%5<(replaces(/E1(-$/4%$5(;$/.1-/L'(8"334-.8$/."-(I4-8/."-(5"'/($'($(%1'45/("I(.-S4%<("%(
&.'1$'1J( @01-( '.3;51( .-/1%$8/."-'( /"( 3$91( -11&'( 9-"#-?( $-'#1%( T41'/."-'( #./E( $( '.3;51( <1'( "%( -"?(
$-&('1518/($3"-,($('3$55(3$/%.M("I(8E".81'(3$<(%1.-/1,%$/1(/E1(.'"5$/1&(;$/.1-/(#./E("/E1%'J(2.3.5$%5<?(
$(;1%'"-(3$<(#.'E(/"(replace(5"'/(5.3H(I4-8/."-(4'.-,()*+'($'(#E1158E$.%(8"-/%"551%'(.-(%1$5(FUE.5.;'(
1/( $5J?( PQQVG( "%( 0.%/4$5( 1-0.%"-31-/'( FW11H( 1/( $5J?( PQQVG?( "%( $'( $;;5.$-81( $&S4'/1%'?( $5/1%.-,( H"&<(
;"'./."-( .-( $-( 1518/%.8( H1&( I"%( 8"3I"%/( $'( #155( $'( /"( &18%1$'1( /E1( 8E$-81( I"%( &1015";.-,( $( H1&( '"%1J(
X&&./."-$55<?( )*+'( 8$-( H1( 4'1&( /"( ";1%$/1( ;%"'/E1/.8( "%( I4-8/."-$5( 1518/%.8$5( '/.345$/."-( &10.81'( .-(
.-0$'.01(FY"8EH1%,(1/($5J?(PQQZG("%(-"-N.-0$'.01(FUI4%/'8E1551%(1/($5J?(PQQ[G(restoration("I(5"'/(-$/4%$5(
"4/;4/'?('48E($'(3"/"%("%(H5$&&1%(I4-8/."-(.-(;$%$5<\1&(E43$-'J((
+-/1%1'/.-,5<?( )*+'( 8$-( -"#( H1( 8"-'.&1%1&( $'( -14%"%1E$H.5./$/."-( /""5'( /"( improve( 34'845$%(
$8/.0$/."-( $-&( 5.3H( 3"0131-/'( .-( .3;$.%1&( ;"'/N'/%"91( ;$/.1-/'( .-( 85.-.8$5( '1//.-,'?( I"%( 1M$3;51J(
]1-1%$55<(';1$9.-,?()*+'(8"45&(E15;('/.345$/1(8"%/.8$5(;5$'/.8./<(51$&.-,(/"(/E1(%18"01%<("I('"31(5"'/(
I4-8/."-'(F*$%$H$5"-$(1/($5J?(PQQRGJ(7"55"#.-,(/E.'($;;%"$8E?()*+(H$'1&(8",-./.01(%1E$H.5./$/."-(3$<(
H1($3"-,(/E1(3"'/("4/'/$-&.-,($;;5.8$/."-'(/E$/(8"45&(H1-1I./($(5$%,1(-43H1%("I(;$/.1-/'(%$-,.-,(I%"3(
8"3;51/15<(5"891&N.-(;$/.1-/'(F^_H51%($-&().%H$431%?(PQQ`G(/"(;$/.1-/'(#./E(8",-./.01(.3;$.%31-/?(/"(
.-8%1$'.-,5<(improve(/E1.%(8",-./.01(&1I.8./'J(@0.&1-81(/"('4;;"%/(/E1(4'1("I(8"3;4/1%(H$'1&(8",-./.01(
%1E$H.5./$/."-( ;%",%$3'( E$'( H11-( ,%"#.-,( I"%( /E1( 5$'/( &18$&1( #./E( 1M$3;51'( 1M/1-&.-,( /"( 313"%<(
F!$3( $-&(C$-?(PQQaGb(#"%9.-,( 313"%<(Fc"E$-''"-( $-&(!"%-3$53?(PQ=PGb($//1-/."-(Fd.891I""'1( 1/(
$5J?(PQ=[Gb($-&?(0.'4$5(;1%81;/."-(F^$-,(1/($5J?(PQQRGJ(X(3"%1(I4/4%.'/.8('81-$%."(3.,E/(H1(4'.-,($()*+(
(((((((((((((((((((((((((((((((((((((((((((((((((((((((((((
>
()J(:/$5(.'(/E1(8"%%1';"-&.-,($4/E"%6(H"/$5eH&.,./$5J"%,(
()J(:/$5($-&(@J(A$%,.4(8"-/%.H4/1&(1T4$55<(/"(/E.'(&"8431-/(
Article ID 053-1
DOI:10.3217/978-3-85125-378-8-53
/"( supplement $( -$/4%$5( -14%"34'845$%( "4/;4/( #./E( $-( $&&./."-$5?( $%/.I.8.$5( F.J1J?( %"H"/.8G( "4/;4/(
FO"5;$#($-&(O"5;$#?(PQ=PGJ(W$'/(H4/(-"/(51$'/?($-"/E1%($%1$("I(.-8%1$'.-,(%181-/(%1'1$%8E(.-/1%1'/(.'(
.-( /E1( %18",-./."-( "I( /E1( 4'1%L'( 31-/$5( '/$/1'( F1J,J?( '/%1''( "%( $//1-/."-( 51015'?( f8",-./.01( 5"$&g( "%(
f31-/$5( I$/.,41gG( $-&( 8",-./.01( ;%"81''1'( F1J,J?( 51$%-.-,( "%( $#$%1-1''( "I( 1%%"%'G( /E$/( #.55( I$8.5./$/1(
.-/1%$8/."-($-&('/.345$/1(4'1%L'(.-/1%1'/J(+-(/E1'1(8$'1'?(H<(;%101-/.-,('/%1''("%($//1-/."-$5(5$;'1'?(/E1(
)*+(enhances(/E1(-$/4%$5("4/;4/J()$'.8$55<?(#1(8$-(&1$5(#./E(-14%"I11&H$89("%(,$3.-,($;;5.8$/."-'(/"(
enhance(4'1%L'(;1%I"%3$-81J(+-(5.-1(#./E(/E1(enhance('81-$%."?($-&("01%5$;;.-,(/E1(improve '81-$%."?(
/E1(%1'/("I(/E1(;$;1%(8"-81-/%$/1'("-(1M;5"%.-,(;"/1-/.$5(";;"%/4-./.1'($-&(%1'1$%8E(H%1$9/E%"4,E'(/E$/(
3$<( 4'1( )*+( H$'1&( 8",-./.01( '/.345$/."-( $;;5.8$/."-'( /"( enhance( ;1";51( "01%$55( ;1%I"%3$-81( .-(
85.-.8$5( $-&( -"-N85.-.8$5( '1//.-,'( /"( 3$.-/$.-( ,1-1%$5( #155H1.-,( $-&( T4$5./<( "I( 5.I1J( U%."%( /"( /E$/?( #1(
$.3($/(.&1-/.I<.-,(8"33"-(H"//51-189'(/E$/(;%101-/(84%%1-/()*+('<'/13'(I%"3(1-&(4'1%($&";/."-J(((((
P 2"50.-,()*+()"//51-189'(h(2E"%/8"3.-,'(
!E1( 4'$H.5./<( "I( 84%%1-/( )*+( &10.81'( .'( I$%( I%"3( ;1%I18/J( C1&.8$5( )*+( $;;5.8$/."-'( $%1( '/.55( 01%<(
5.3./1&?( $-&( 3$-<( 8%./.8$5( .''41'( -11&( /"( H1( $&&%1''1&( H1I"%1( /E1<( 8$-( H1( 1II18/.015<( $&";/1&( .-(
85.-.8$5('1//.-,'("%(.-(4'1%'L(E"31'J(*$%$H$5"-$(1/($5J(FPQQRG(E$01(&.'84''1&('101%$5(;".-/'(/E$/(-11&(/"(
H1(8"-'.&1%1&(#E1-(&1'.,-.-,?('1518/.-,($-&(4'.-,($()*+('<'/13(I"%(-14%"%1E$H.5./$/."-(;4%;"'1'J(+-(
;$%/.845$%?(/E1<( E$01( 13;E$'.\1&(/E1( .3;"%/$-81("I(/18E-"5",<($881;/$-81($-&(4'$H.5./<?($'( #155($'?(
.''41'(%15$/1&(/"(/E1(.3;$8/("-(/E1(;$/.1-/L'(13"/."-$5($-&(3"/.0$/."-$5('/$/1'J(X5/E"4,E('"31(4'1%'(
$-&($''.'/.01(/18E-"5",<(1M;1%/'(3$<(H1(T4./1('$/.'I.1&(#./E('"31()*+(&10.81'(Fd.8951%(1/($5J?(PQ==G?(
"/E1%'(8"45&(-"/(.3$,.-1(4'.-,(3"'/("I(/E1(&10.81'(.-(&$.5<(5.I1(#./E"4/(I4%/E1%(.3;%"0131-/'J( i'1%N
81-/1%1&(&1'.,-(.'(8%./.8$5?($-&(/1'/.-,(31&.8$5()*+($;;5.8$/."-'(#./E(E1$5/E<(4'1%'(3$<(H1(.-$&1T4$/1J((
X'($(#E"51?(31&.8$5($-&("/E1%(-"-N31&.8$5(131%,.-,($;;5.8$/."-'(1';18.$55<($&&%1''1&(/"(E1$5/E<(
4'1%'(3$<(1-8"4-/1%('.3.5$%('E"%/8"3.-,'J( C$.-("H'/$851'(0$%<(I%"3(/E1(5"-,(;%1;$%$/."-($-&('1/4;(
/.31'?(/"(/E1(1%,"-"3.8'("I(/E1(1518/%"&1(8$;'?($-&(/E1(5"#( ';11&($-&(5$89("I(%15.$H.5./<("I(/E1()*+(
'<'/13J( !E1( 51$%-.-,( 84%01( .'( $5'"( $( 3$S"%( &%$#H$89J( !E$/( .'?( /E1( 4'1%( 34'/( 51$%-( 8"3;51/15<( -1#(
'9.55'(/"(";1%$/1($()*+('<'/13J(!E1%1I"%1?(/E1%1(.'('/.55($-(.3;"%/$-/(-11&(/"("01%8"31(/E1(I"55"#.-,(
.&1-/.I.1&(f91<(I$8/"%'g(FX55.'"-?(PQ=QG(I"%(H1//1%()*+($&";/."-6(
=J(Cost(FI.-$-8.$5?(E15;?(1M;1%/.'1?(/%$.-.-,?(.-0$'.01-1''?(/.31?($//1-/."-?(I$/.,41G(
PJ(Throughput F$884%$8<?(';11&?(5$/1-8<?(1II18/.01(/E%"4,E;4/G(
[J(Utility(F'4;;"%/?(I51M.H.5./<?(%15.$H.5./<?(.55./1%$8<G(
aJ(Integration(FI4-8/."-$5?(&.'/%$8/."-(T4"/.1-/?(E<H%.&j8"3H.-1&()*+'?(4'$H.5./<G(
kJ(Appearance(F8"'31/.8'?('/<51?(31&.$?($&01%/.'.-,G(
7"55"#.-,($(4'1%N81-/1%1&($;;%"$8E($-&(.-8%1$'.-,(/E1(1-,$,131-/(#./E(/E1($;;%";%.$/1(1-&(4'1%'(
#E.51(&1'.,-.-,?('1518/.-,($-&(4'.-,($( )*+('<'/13( #.55("II1%(/E1( ";;"%/4-./<(/"(.-8%1$'1( /18E-"5",<(
$881;/$-81( $-&( 4'$H.5./<J( !E1( ;"''.H.5./<( "I( 4'.-,( &%<( 1518/%"&1'( I"%( @@]( $8T4.'./."-?( $-&( /E1(
3"-./"%.-,( "I( ;'<8E"5",.8$5( 1II18/'( &4%.-,( )*+( /$'9'( 3$<( ,.01( %.'1( /"( $( H%"$&1%( %$-,1( "I( 8",-./.01(
%1E$H.5./$/."-($-&('/.345$/."-(;%",%$3'(F*$%$H$5"-$(1/($5J?(PQQRGJ((X88"%&.-,(/"('48E(0.1#?(#1(85$.3(
/E$/( /E1( 84%%1-/( ;%"H513'( #"45&( H1( "01%8"31( #E1-( -1#( /18E-"5",.1'( ;%"0.&1( -"-N8"-01-/."-$5(
'1-'"%'( I"%( 51''( "H/%4'.01( H%$.-( '.,-$5( %18"%&.-,?( $-&( $II18/.01( .-/1%I$81'( $H51( /"( $&$;/( /E1( )*+(
$88"%&.-,(/"(13"/."-$5('/$/4'(8E$-,1'(.-(/E1(;$/.1-/J((
[ U"/1-/.$5(:;;"%/4-./.1'(h(K1'1$%8E()%1$9/E%"4,E'(
*",-./.01( &185.-1( .'( "-1( "I( /E1( 3"'/( ;1%0$'.01( 8"-'1T41-81'( "I( $,.-,?( $-&( ./( 8$-( H1( $''"8.$/1&(
#./E(5"''("I($4/"-"3<?(I4-8/."-$5(.3;$.%31-/($-&(&1/1%."%$/."-(.-(T4$5./<("I(5.I1(.-(3"&1%-('"8.1/.1'(
Article ID 053-2
DOI:10.3217/978-3-85125-378-8-53
FW11( 1/( $5J?( PQ=[GJ( !E1( .-8%1$'.-,( 15&1%5<( ;";45$/."-( .'( $( ;"/1-/.$5( -1#( /$%,1/( I"%( )*+( 8",-./.01(
'/.345$/."-( $;;5.8$/."-'?( '48E( $'( -14%"I11&H$89( $-&( ,$3.-,?( -"/( "-5<( I"%( 1-E$-8.-,( 3"/"%( $-&(
8",-./.01(%1E$H.5./$/."-(/E1%$;.1'(.-(85.-.8$5('1//.-,'?(H4/(1';18.$55<($'(8",-./.01(/%$.-.-,(;%",%$3'(.-(
;$/.1-/L'( E"31'J( D1015";31-/( "I( )*+( &10.81'( #./E( '48E( 8",-./.01( '/.345$/."-( 8$;$H.5./.1'( #.55(
&1I.-./15<( H1-1I./( $53"'/( $55( 4'1%( ,%"4;'?( .-854&.-,( 3$-<( ,%"4;'( "I( ;$/.1-/'( "%( &.'$H51&( 4'1%'?( $-&(
101-(E1$5/E<(;1";51(.-(/%$.-.-,(;%",%$3'(FH%$.-N,<3(5.91($;;5.8$/."-'GJ(+-(8",-./.01(%1E$H.5./$/."-?(/E1(
4'1("I(101-/N%15$/1&(;"/1-/.$5'(.'(/%$&./."-$55<(5.3./1&(/"($-($''1''31-/("I(.-S4%.1'(.-84%%1&("%(&.'"%&1%(
'101%./<J( Y"#101%?( $( /1-/$/.01( )*+( H$'1&( -14%"I11&H$89( /E1%$;<( #$'( &1'.,-1&( /"( /%1$/( $//1-/."-N
&1I.8./(;$/.1-/'(#./E(H%$.-(.-S4%<(Fl1'E.,1(1/($5J?(=RRkG6(I.01(;$/.1-/'(#./E(8E%"-.8(31-/$5(&.'/4%H$-81'(
%181.01&( $( )*+( H$'1&( -14%"I11&H$89( /E1%$;<( I"%( $( I"4%( #119( ;1%."&?( $-&( $55( 'E"#1&( %13$%9$H51(
.3;%"0131-/J(l101%/E151''?(/E.'(;."-11%.-,(#"%9(E$'(-"/(<1/(H11-(I"55"#1&(H<($(5$%,1%(85.-.8$5('/4&<J(
l1#( %1'1$%8E( H%1$9/E%"4,E'( .-( )*+( 8",-./.01( '/.345$/."-( 8"45&( ;%"I./( I%"3( /E1( .-854'."-( "I(
f8",-./.01( -14%";%"'/E1/.8'g( .-( .-0$'.01( )*+( %1'1$%8E( ;%",%$3'?( .-( #E.8E( /E1( 8",-./.01( '/$/1( "I( /E1(
'4HS18/?(%$/E1%(/E$-('.,-$5'('/%.8/5<(%15$/1&(/"(3"/"%(1M184/."-("%('1-'$/."-(.'(%18"%&1&(FX-&1%'1-(1/($5J?(
PQ=QGJ(!E.'(9-"#51&,1(8"45&(H1(4'1&(/"(8$;/4%1(4'1%L'(.-/1-/."-'($-&($''1''(13"/."-$5('/$/1'(/"(911;(
3"/.0$/."-($5.01?(1-E$-81(8",-./.01($-&(3"/"%(%18"01%<(3"%1(1II.8.1-/5<?($-&(.-8%1$'1("01%$55(T4$5./<(
"I( 5.I1( .-( /E1( 5"-,( /1%3J( !E$/( .'?( H<( '/.345$/.-,( ;$/.1-/'( /"( $8T4.%1( -1#( '9.55'( m( $8/.0$/.-,( ';18.I.8(
8"%/.8$5($%1$'(m()*+'(3.,E/(H1(4'1&(I"%(.--"0$/.01($-&(3"%1(1II18/.01("884;$/."-$5(/E1%$;.1'(#./E(-1#(
;%"3.'.-,(85.-.8$5($;;5.8$/."-'J(W.91#.'1?()*+'(8"45&(51$&(/"($884%$/1(8",-./.01(/%$.-.-,'($-&(51$%-.-,(
;%",%$3'(I"%(/E1(1-/.%1(;";45$/."-J((((
W11( 1/( $5J( FPQ=[G( 1M$3.-1( /E1( I1$'.H.5./<( "I( 4'.-,( $( )*+( H$'1&( '<'/13( #./E( $( -1#( ,$31( .-(
.3;%"0.-,( 313"%<( $-&( $//1-/."-( .-( $( ;.5"/( '$3;51( "I( E1$5/E<( 15&1%5<J( !E1( '/4&<( .-01'/.,$/1&( /E1(
'$I1/<?(4'$H.5./<($-&($881;/$H.5./<("I(/E1()*+('<'/13(/"(15&1%5<?($-&("H/$.-1&($-(1II.8$8<(1'/.3$/1(/"(
#$%%$-/($(;E$'1(+++(/%.$5J(+-(/E1(@i(;%"S18/()$89Y"31(FE//;6jj###JH$89E"31NI;VJ14jG?($(I%$31#"%9(
/E$/(;%"0.&1'($(H$89&%";(/"(8",-./.01('/.345$/."-(/$'9'(;1%I"%31&(4'.-,($(E"31(H$'1&()*+('<'/13(.'(
84%%1-/5<( 4-&1%( '/4&<J( !#"( '1%."4'( ,$31'( m( "-1( "-( 313"%<?( $-&?( $-"/E1%( "-1( "-( $//1-/."-( $-&(
8"-81-/%$/."-(m(#1%1(&1I.-1&($-&(.3;5131-/1&J(!E.'(.'(;$%/("I($(H%"$&1%(/1513"-./"%.-,(E"31('4;;"%/(
'<'/13(FA$%,.4(1/($5J?(PQ=[G(/E$/($55"#'(85.-.8.$-'(/"(;%1'8%.H1(/$'9'($-&('8$51'?($-&(3"-./"%(./'(%1'45/'(
"-( T4$5./<( "I( 5.I1( /E%"4,E( /E1( 4'1( "I( $( %13"/1( /E1%$;.'/( '/$/."-J( 74%/E1%?( '"8.$5.\$/."-?( 1&48$/."-?(
1-/1%/$.-31-/( $-&( 101-( '4;;"%/( ,%"4;'( $%1( I1$'.H51( 4'.-,( )*+( 0.$( .-/1%-1/( 13$.5?( 8E$/'( $-&( ,$31'(
F^$%.3( 1/( $5J?( PQQZGJ( !E4'?( .-( "%&1%( /"( 1-'4%1( 1T4$5( $881''( /"( +-I"%3$/."-( $-&( *"334-.8$/."-(
!18E-"5",.1'( F+*!'G( I"%( $55( ;";45$/."-( ,%"4;'( $-&( $''4%1( /E$/( 101%<"-1( 8$-( H1-1I./( I%"3( +*!(
&1015";31-/'?( -"/( "-5<( ;1";51( #./E( &.'$H.5./.1'?( H4/( ';18.$55<( /E1( 15&1%5<( 'E"45&( -"/( H1( 51I/( H1E.-&J(
l1#("884;$/."-$5(;%",%$3'($.3.-,($/(8",-./.01('/.345$/."-(/$'9'(/"(.3;%"01(+*!('9.55'(#.55(/4%-(/"(H1(
-1#(;"/1-/.$5(";;"%/4-./.1'(I"%()*+(H$'1&(-14%"I11&H$89(/%$.-.-,($-&(.--"0$/.01('1%."4'(,$31'J((((
X89-"#51&,31-/'(
!E.'( #"%9(.'('4;;"%/1&(H<(/E1( @4%";1$-(+*!(U%",%$331( U%"S18/'(7UVN+*!NPQ==NV(F)$89Y"31?(
l"J(P``kZZG($-&(7UVN+*!NPQ=[N=Q(F)l*+(Y"%.\"-(PQPQ?(l"J(ZQRkR[GJ((
K1I1%1-81'(
X55.'"-?()J(dJ(FPQ=QGJ(!"#$%&(iH.T4./"4'()*+'J(+-()J(]%$.3$--?()J(dJ(X55.'"-?(h(]J(UI4%/'8E1551%(
F@&'JG?(Brain-Computer InterfacesJ(2;%.-,1%()1%5.-(Y1.&15H1%,J(
(((((X-&1%'1-?( KJ( XJ?( Y#$-,?( @J( cJ?( h( C455.91-?( ]J( YJ( FPQ=QGJ( *",-./.01( -14%$5( ;%"'/E1/.8'J( Annual
review of psychology?(Z=?(=ZRN=RQJ(
Article ID 053-3
DOI:10.3217/978-3-85125-378-8-53
).%H$431%?( lJ( ]E$-$<.3?( lJ?( Y.-/1%H1%,1%?( !J?( +01%'1-?( +J?( ^"/8E"4H1<?( )J( F=RRRGJ( X( ';155.-,(
&10.81(I"%(/E1(;$%$5<'1&?(Nature([R`?(PRVNPR`J((
*$%$H$5"-$?( KJ?( *$'/.,5."-.?( UJ?( h( ]%$3$/.8$?( 7J( FPQQRGJ( )%$.-m8"3;4/1%( .-/1%I$81'( $-&(
-14%"%1E$H.5./$/."-J(Stud. Health Technol. Inform(=ak6(=ZQN=VZJ((
Y"8EH1%,?( WJKJ?( 21%%4<$?( CJDJ?( 7%.1E'?( ]JCJ?( C49$-&?( cJXJ?( h( 2$51E?( CJ( FPQQZGJ( l14%"-$5(
1-'13H51(8"-/%"5("I(;%"'/E1/.8(&10.81'(H<($(E43$-(#./E(/1/%$;51,.$?(Nature(PaP?(=ZaN=V=J((
c"E$-''"-?()J?(h(!"%-3$53?(CJ(FPQ=PGJ(O"%9.-,(313"%<(/%$.-.-,(I"%(;$/.1-/'(#./E($8T4.%1&(H%$.-(
.-S4%<6(1II18/'(.-(&$.5<(5.I1J(Scandinavian Journal of Occupational Therapy?(=R?(=VZN=`[J(
^$-,?(XJ2JYJ?(^.3?(DJ^J?(^<4-,?(CJ2J?(*E".?(^JlJ?(n""?(cJnJ?(24-,?(2JnJ?(h(U$%9?(YJcJ(FPQQRGJ(X(
8"3;4/1%.\1&( 0.'4$5( ;1%81;/."-( %1E$H.5./$/."-( ;%",%$331( #./E( .-/1%$8/.01( 8"3;4/1%( .-/1%I$81( 4'.-,(
3"/."-( /%$89.-,( /18E-"5",<( o( $( %$-&"3.\1&( 8"-/%"551&?( '.-,51NH5.-&1&?( ;.5"/( 85.-.8$5( /%.$5( '/4&<J(
Clinical Rehabilitation?(P[?(a[aNaaZJ(
^$%.3?( XJXJ?( Y.-/1%H1%,1%?( !J?( K.8E/1%?( cJ?( C155.-,1%?( cJ?( l143$--?( lJ?( 75"%?( YJ?( ^_H51%?( XJ?( h(
).%H$431%?( lJ( FPQQZGJ( l14%$5( .-/1%-1/6( O1H( '4%I.-,( #./E( H%$.-( ;"/1-/.$5'( I"%( /E1( 8"3;51/15<(
;$%$5<\1&J(Neurorehabil Neural RepairJ(PQFaG6kQ`N=kJ(
^_H51%?(XJ?(h().%H$431%?(lJ(FPQQ`GJ()%$.-N8"3;4/1%(.-/1%I$81'($-&(8"334-.8$/."-(.-(;$%$5<'.'6(
1M/.-8/."-( "I( ,"$5( &.%18/1&( /E.-9.-,( .-( 8"3;51/15<( ;$%$5<'1&( ;$/.1-/'p( Clin NeurophysiolJ?(
==RF==G6PZk`NZZ(
W11?(!J(2J?(]"E?(2J(cJ(XJ?(q419?(2J(nJ?(UE.55.;'?(KJ?(]4$-?(*J?(*E14-,?(nJ()J?(h(^%.'E-$-?(^J(KJ(KJ(
FPQ=[GJ(X(H%$.-N8"3;4/1%(.-/1%I$81(H$'1&(8",-./.01(/%$.-.-,('<'/13(I"%(E1$5/E<(15&1%5<6(X(%$-&"3.\1&(
8"-/%"5(;.5"/('/4&<(I"%(4'$H.5./<($-&(;%15.3.-$%<(1II.8$8<J(PloS one?(`F==G?(1VRa=RJ(
W11H?( KJ?( 7%.1&3$-?( DJ?( C41551%NU4/\?( ]JKJ?( 28E1%1%?( KJ?( 25$/1%( CJ?( h( UI4%/'8E1551%?( ]J( FPQQVGJ(
215IN;$81&(F$'<-8E%"-"4'G()*+(8"-/%"5("I($(#E1158E$.%(.-(0.%/4$5(1-0.%"-31-/'6(X(8$'1('/4&<(#./E($(
/1/%$;51,.8?(Computational Intelligence and Neuroscience?(=N`J(
l1'E.,1?(KJ?(@-&"4?(!J?(h(C.<$3"/"?(!J(F=RRkGJ(U%";"'$5("I(U[QQ(H."I11&H$89(/E1%$;<(.-(;$/.1-/'(
#./E(31-/$5(&.'/4%H$-81'($'(8",-./.01(%1E$H.5./$/."-?(The Japanese Journal of Rehabilitation Medicine(
P?([P[N[PRJ((
UE.5.;'?(cJ?(&15(KJ(C.55r-?(cJ?(A$-$891%?(]J?(W1#?(@J?(]$5r-?(7J?(71%%1\?(UJOJ?(A$-()%4''15?(YJ?( h(
l4//.-?(CJ(FPQQVGJ(X&$;/.01('E$%1&(8"-/%"5("I($(H%$.-N$8/4$/1&('.345$/1&(#E1158E$.%?(Proceedings of
the 2007 IEEE 10th International Conference on Rehabilitation Robotics, Noordwijk?( !E1(
l1/E1%5$-&'?(aQ`Na=aJ(
UI4%/'8E1551%?(]J?(C41551%NU4/\?(]JKJJ(UI4%/'8E1551%?(cJ?(]1%-1%?(YJcJ?(h(K4;;?(KJ(FPQQ[GJ(s!E"4,E/L(
m( 8"-/%"5( "I( I4-8/."-$5( 1518/%.8$5( '/.345$/."-( /"( %1'/"%1( E$-&( ,%$';( .-( $( ;$/.1-/( #./E( /1/%$;51,.$?(
Neuroscience Letters([k=?([[N[ZJ(
!$3?(2J7J?(h(C$-?(OJ^J(FPQQaGJ(@0$54$/.-,(8"3;4/1%N$''.'/1&(313"%<(%1/%$.-.-,(;%",%$331'(I"%(
;1";51(#./E(;"'/NE1$&(.-S4%<($3-1'.$J(Brain Injury?(=`?(aZ=maVQJ(
A$%,.4?( @J?( 71%-r-&1\?( cJCJ?( !"%%155$'?( 2J?( D$4#$5&1%?( 2J?( 2"5t?( CJ?( h( C.%$551'?( 7J( FPQ=[GJ( X(
'1-'"%NH$'1&(/1513"-./"%.-,($-&(E"31('4;;"%/('<'/13(/"(.3;%"01(T4$5./<("I(5.I1(/E%"4,E()l*+J(+-(UJ(
@-8$%-$uv"?( WJ( X\101&"?( ]JcJ( ]15&1%H5"3?( XJ( l1#155?( h( lJN@J( C$/E.$''1-( F@&'JG?( Assistive
Technology: From Research to Practice, AAATE 2013J((
O"5;$#?(cJ?(h(O"5;$#?(@J(OJ(FPQ=PGJ()%$.-(8"3;4/1%(.-/1%I$81'6( 2"31/E.-,(-1#(4-&1%(/E1('4-J(
+-( cJ( O"5;$#( h( @J( OJ( O"5;$#( F@&'JG?( Brain-Computer Interfaces: Principles and Practice( F;;J( [N
=aGJ(:MI"%&6(:MI"%&(i-.0(U%1''J((
d.891I""'1?(2J?(Y4M?(^J?()%"#-?(cJ?(h(O45I?(^J(FPQ=[GJ(W1/(/E1(,$31'(H1,.-6(X(;%15.3.-$%<('/4&<(
4'.-,( X//1-/."-( U%"81''( !%$.-.-,N[( $-&( W43"'./<!C( H%$.-( ,$31'( /"( %131&.$/1( $//1-/."-( &1I.8./'(
I"55"#.-,(/%$43$/.8(H%$.-(.-S4%<J(Brain Injury?(PV?(VQVNV=ZJ(
d.8951%?( *J?( K.88."?( XJ?( W1"//$?( 7J?( Y.55.$-N!%1''?( 2J?( Y$5&1%?( 2J?( Y"5\?( @J?( 2/$.,1%N2w5\1%?( UJ?(
Y"",1%#1%I?(@JcJ?(D1'.&1%.?(WJ?(C$//.$?(DJ?(h(^_H51%?(XJ(FPQ==GJ(X(H%$.-N8"3;4/1%(.-/1%I$81($'(.-;4/(
8E$--15(I"%($('/$-&$%&($''.'/.01(/18E-"5",<('"I/#$%1J(Journal of Clinical EEG & NeuroscienceJ(
Article ID 053-4
DOI:10.3217/978-3-85125-378-8-54
Introduction
During stoke rehabilitation, patients involvement rewarded with performance-dependent feedback has been shown to be crucial in improving compliance-adherence [1]. Also evidence exists
that the level of patients participation in rehabilitation has an impact on the outcome [2].
Several authors have explored the potentialities of Brain Computer Interfaces (BCI) in stroke
rehabilitation [3]. To eectively encourage training and practice the BCI design should incorporate principles of current rehabilitative settings, apt to stimulate patients engagement. In
this proof of concept study, we report on an electroencephalographic (EEG)-based BCI system
intended to support hand motor imagery (MI) training. Our system was designed with the
participation of professional users; it included the presence of a therapist to guide the patient
during training sessions and it was introduced into a conventional rehabilitation setting. The
BCI system was endowed with a visual feedback mimicking movements of patient own hands
to maintain consistency with the MI task and it was intended as an add-on tool to enhance
hand motor functional recovery of hospitalized patients aected by stroke. The intervention
was tested on a small selected hospitalized patient sample, admitted for their rehabilitation
treatment after stroke, in order to describe acceptability and usability. In accordance with a
Article ID 054-1
!"#$%&'()*(+,-
DOI:10.3217/978-3-85125-378-8-54
.')-$
5%.1(
3$#
%34+(
##12
.)*(+0
/%)'#
#.
Methods
Participants. Eight patients with stroke were recruited from a consecutive cohort admitted
to Hospital. Outcomes of rehabilitation ecacy were evaluated at baseline and after training:
arm-section of the Fugl-Meyer (FM) (minimal clinically important dierence - MCID - set to 7
points), National Institute of Health Stroke Scale (NIHSS) and Barthel Index (BI) (MCID set
to 14). A descriptive analysis is reported.
BCI training protocol. Figure 1 illustrates a training session performed with the proposed
EEG-based BCI system. A customized software was implemented to provide patients with
a real-time feedback consisting of a visual representation of their own arms and hands. The
software allowed the therapists to create an artificial image of a given patients hand by adjusting
a digitally created image in shape, colour and size. The digital image was projected over
the patients real hands covered by a white blanket. To drive the two states of the virtual
hand (grasping or finger extension) the BCI system exploited the EEG sensorimotor rhythms
modulation (SMRs) induced by hand MI of the same movements. The BCI2000 software
platform (www.bci2000.org) was used for real-time estimation and classification of the SMRs
state modulation and to drive the instant BCI feedback (i.e. a cursor motion on the therapists
screen) and the corresponding virtual hand action (actuated through a UDP connection).
The BCI-based device was installed in a rehabilitation hospital ward. Training lasted 4 weeks,
with 3 weekly sessions. The EEG features to control the cursor motion during the BCI training
were extracted from a screening session during which patients were asked to perform MI of
their own aected hand in the absence of feedback. Performances were expressed as mean
percentage of correct trials per run; the second and last training sessions were considered for
statistical analysis, conducted by means of a t-test for dependent samples. Scalp EEG potentials
during the screening session were collected from 61 positions (according to an extension of the
10-20 International System) bandpass filtered between 0.1 and 70Hz, digitized at 200Hz and
amplified by a commercial EEG system (BrainAmp, Brainproducts GmbH, Germany). During
the BCI-training, EEG was recorded from a subset of 31 electrodes distributed over the scalp
centro-parietal regions.
Acceptability and usability assessment. Acceptability and usability were explored by
means of participants mood, motivation and satisfaction assessment and participants perceived
Article ID 054-2
DOI:10.3217/978-3-85125-378-8-54
Figure 2: Clinical characteristics and training outcome of patients. Arm section of the FuglMayer scale ranges from 0 (most aected) to 66 (normal); NIHSS ranges from 0 (normal) to
42 (most aected); Barthel Index ranges from 0 (most aected) to 100 (normal). The asterisk
indicates achievement of the Minimally Clinical Important Dierence.
workload. Usability was also assessed by professional users by means of a focus group setting.
Before starting each training session, patients mood and motivation was monitored by means
of visual analogue scales (VAS). Mood was also assessed by means of the Center of Epidemiologic Studies Depression Scale (CES-D) scale, administered once a week across the 4 weeks of
training. Motivation was also assessed by means of an adapted version of the Questionnaire for
Current Motivation (QCM) which was administered at the end of each training session. Satisfaction was reported by users by means of a VAS. The NASA-Task Load Index (NASA-TLX)
was administered at the end of the first and last training session as a measure of workload.
The evaluation of the proposed BCI-based rehabilitation approach was also addressed with
professional users identified as therapists in the context of a focus group. Fifteen therapists attended a training session with the participation of one stroke patient. Professionals users were
administered a slightly modified version of the The Quebec User Evaluation of Satisfaction
with Assistive Technology 2.0 (QUEST2.0), and an open discussion was held. The Spearmans
coecient was applied to explore separately the correlation between BCI performance and each
psychological variable and between psychological variables.
Results
Figure 2 depicts clinical and BCI-control features for each patient. All patients succeeded
in controlling the virtual hand by practicing aected hand MI, with mean performances of
57 24% (n=8 patients; 73 training sessions; chance level of 5%). No significant change in
performance was found between second (62, 3 20, 4%) and last (49, 1 19, 1%) BCI training
sessions (p > .05). Patients score at CES-D scale was 6.86 4.8 on average for all patients
and determinations. Patients rated their mood as good (VAS 7.15 1.91, average of all patients/sessions) and were highly motivated during training (VAS 7.70 1.90, average of all
patients/sessions); satisfaction was also high (VAS 8.36 1.65, average of all patients/sessions).
Article ID 054-3
DOI:10.3217/978-3-85125-378-8-54
VAS mood and motivation scores positively correlated (p=.00,r=.479) and satisfaction was also
positively correlated with motivation (p=.001, r=.393). The Interest QCM factor and the
BCI performance percentage were positively correlated (p=.027, r=.257) as well as the BCI
performance and VAS motivation scores (p=.012, r=.289). Analysis of the NASA-TLX questionnaires revealed no significant dierences in perceived workload obtained at the end of the
first and last training sessions (NASA-TLX: 50.22 21.73 end of first session; 54.17 23 end
of last session). According to the QUEST2.0 results, all therapists (total n=15) stated as the
most important system features: Eectiveness, Ease to Use, Learnability, Safety, and
Reliability. Several strengths and weaknesses of the BCI-assisted MI training design emerged
from the open discussion. Professionals identified as a strength point the potentiality of such
BCI-based system to provide them with a quantitative measure of the patients adherence to
a cognitive-motor rehabilitation session (i.e. SMR modulation induced by MI). They considered as most relevant weaknesses: i) prototype setup and functioning (hardware and software)
which requires technical skills that a therapist might not have and they would not feel confident in being able to carry out a session without some technical assistance (cap and electrodes
adjusting, EEG signal monitoring, software operating); ii) the lack of a goal-directed action
feedback (e.g. holding and releasing a glass of water); iii) the need to monitor a possible increase
in arm spasticity during MI task practice.
Discussion
All patients were highly motivated supporting the idea that the specific BCI training was positively accepted with a good compliance/adherence. In particular, motivation was maintained
across training sessions and correlated with satisfaction. Encouragingly, we also found no significant self-rated workload dierences across training. Our BCI-assisted motor imagery training
has proven to be tolerable and acceptable by patients and although usability still requires
some improvement, also professional users are prone to accept such technology when added
to standard motor rehabilitation during hospitalization. Our preliminary data suggest that
BCI-technology might successfully be adopted to support the practice of MI tasks and thus
positively influence recovery outcome in stroke patients. A clinical trial with a large cohort of
patients is needed to establish the extent to which any clinical improvement might be imputed
to the BCI, and to confirm the current positive results on the acceptability of the system by
patients.
Acknowledgements. This work was partially supported by the Italian Ministry of Healthcare (grant:RF-2010-2319611).
References
[1] M C Cirstea and M F Levin. Improvement of arm movement patterns and endpoint control depends
on type of feedback during practice in stroke survivors. Neurorehabilitation and neural repair,
21(5):398411, October 2007. PMID: 17369514.
[2] S Paolucci, A Di Vita, R Massicci, M Traballesi, I Bureca, A Matano, M Iosa, and C Guariglia.
Impact of participation on rehabilitation results: a multivariate study. European journal of physical
and rehabilitation medicine, 48(3):455466, September 2012. PMID: 22522435.
[3] S Silvoni, A Ramos-Murguialday, M Cavinato, C Volpato, G Cisotto, A Turolla, F Piccione, and
N Birbaumer. Brain-computer interface in stroke: a review of progress. Clinical EEG and neuroscience, 42(4):245252, October 2011. PMID: 22208122.
Article ID 054-4
DOI:10.3217/978-3-85125-378-8-55
Graz University of Technology, Institute for Knowledge Discovery, 8010 Graz, Austria
christoph.pokorny@tugraz.at, gernot.mueller@tugraz.at
2
Albert Schweitzer Clinic, Department of Neurology, 8020 Graz, Austria
gerald.pichler@stadt.graz.at
3
Coma Science Group, Cyclotron Research Centre, University of Li`ege, 4000 Li`ege, Belgium
damien.lesenfants@doct.ulg.ac.be, quentin.noirhomme@ulg.ac.be, steven.laureys@ulg.ac.be
Abstract
In the present study, we aimed to detect the resonance-like frequencies of the somatosensory system in patients in a minimally conscious state using a screening paradigm.
EEG measurements were conducted in seven patients during tactile stimulation of their
left and right wrist. A significant tuning curve could be found in one of the patients. Various reasons that could explain the inconclusive outcome of most measurements, as well as
future perspectives are discussed.
Introduction
2
2.1
Two C-2 tactors (Engineering Acoustics, Inc., USA) were attached to the left and right volar
wrist using elastic wrist bands. The wrists were stimulated with seven frequencies ranging from
1
Article ID 055-1
DOI:10.3217/978-3-85125-378-8-55
2.2
Participants
Seven patients in an MCS participated in this study (one or two sessions) at the Albert
Schweitzer Clinic (Graz, Austria) and the Li`ege University Hospital (Li`ege, Belgium). The
patients were either sitting in a wheelchair or lying in bed with the upper part of their body
slightly elevated. Before or after each EEG measurement, the patients were behaviorally assessed using the Coma Recovery Scale-Revised (CRS-R). Table 1 provides clinical and demographic data together with the CRS-R scores of all patients. Informed consent was obtained
from the patients legal representatives. The study was approved by the Ethics Committees at
the participating institutions and was conducted in accordance with the Declaration of Helsinki.
Patient no.
Location
PA01
PA02
PA03
PA04
PA05
PA06
PA07
Graz
Graz
Graz
Li`ege
Li`ege
Li`ege
Li`ege
Age
(years)
28
58
67
22
15
51
45
Sex
Etiology
male
female
male
male
male
female
female
Traumatic
Anoxia
Traumatic
Traumatic
Hemorrhagic stroke
Hemorrhagic stroke
Traumatic
CRS-R
s1 s2
9 11
8 10
17 17
6
15
4
Table 1: Clinical and demographic data of the patients, together with the CRS-R scores of the
first (s1) and, where applicable, second (s2) session.
Results
Fig. 1 shows the SSSEP screening results of all patients and sessions from three representative
EEG channels contralateral to the stimulated wrist. Only in one patient, PA05 , a significant
tuning curve could be found for right wrist stimulation at the bipolar channel F3-C3. The
frequency with the highest relative bandpower increase (140 %) was found to be 20 Hz. In
all other patients, no significant tuning curves were found at any of the channels contra- or
ipsilateral to the stimulated wrist. To demonstrate that the screening paradigm is suitable
Article ID 055-2
DOI:10.3217/978-3-85125-378-8-55
to identify the individual resonance-like frequencies, the results of a healthy control were
included (same tactor location; reduced channel set only), showing high tuning curve peaks at
23 Hz for left (373 %) and right (363 %) wrist stimulation.
F4C4
PA
01
C4P4
F3C3
PA
01
C3P3
100%
0%
100%
0%
PA
02
PA
02
100%
0%
100%
0%
PA
03
PA
03
100%
0%
100%
0%
PA
PA
100%
0%
PA
PA
100%
0%
PA
PA
100%
0%
PA
PA
100%
0%
04
05
06
07
HS
04
05
06
07
HS
200%
0%
Figure 1: Screening results of all patients and sessions (rows) from three representative (bipolar)
EEG channels contralateral to the stimulated wrist (columns). The bars show the relative
bandpower increase (in %) with 95 % confidence intervals of all seven stimulation frequencies.
The last row shows the results of a healthy subject (HS), using a dierent y axis scaling.
Discussion
Within this work, a screening paradigm was developed with regard to the specific needs and
capabilities of patients in an MCS. The wrists were selected as target location, since some of
the patients suered from hand spasticities, making it not easily possible to use more sensitive
locations like finger tips. Screening results obtained from a healthy control were totally in
accordance with literature (e.g. [3]). However, only in one of the seven patients, a significant
tuning curve could be found. In all other patients, stable SSSEPs were not present. In some
patients, an increase in band power of only certain single frequencies could be found. However,
Article ID 055-3
DOI:10.3217/978-3-85125-378-8-55
it is not yet known if perhaps such frequencies could intentionally be modulated and thus be
sucient to realize a BCI. While technical problems seem unlikely (as shown by the control
experiment), various other reasons could explain the inconclusive outcome of most patient
measurements. First, uncontrolled body movements of the patients resulted in a huge amount
of biological (EOG, EMG) and technical (cable movements, electrodes touching the pillow)
artifacts. Even though trials containing strong artifacts were manually remove, outliers and
huge confidence intervals were still present in the screening results. Second, it was not clear
if the position and contact pressure of the tactors allowed the patients to perceive the stimuli
strong enough at all, as they could not be simply asked about their perception of the stimuli.
Spasticities may have also had a severe influence on the SSSEPs, since the tendons of the
finger flexors are located at the volar side of the hand. Third, maybe SSSEPs were not present
because of an impaired somatosensory system, or could simply not be measured with EEG due
to alterations in the brain topology. Interestingly, the one patient showing significant results
was a stroke survivor with a CRS-R score of 15. In comparison to the others, this patient had
a high score and no traumatic injury. This could be evidence that the structures in his brain
were not that damaged and therefore SSSEPs could be measured.
Similar diculties regarding a paradigm transition from healthy subjects to patients in an
MCS were already reported in [6]. In future, better artifact avoidance or rejection methods,
longer stimulation intervals, or other target body locations could be beneficial. Moreover, a
thorough neurophysiological examination prior to SSSEP measurements may be helpful.
Acknowledgments
This work is supported by the European ICT Programme Project FP7-247919. The text reflects
solely the views of its authors. The European Commission is not liable for any use that may
be made of the information contained therein.
References
[1] C. Chatelle, S. Chennu, Q. Noirhomme, D. Cruse, A. M. Owen, and S. Laureys. Brain-computer
interfacing in disorders of consciousness. Brain Inj, 26:15101522, 2012.
[2] C.-M. Giabbiconi, C. Dancer, R. Zopf, T. Gruber, and M. M. M
uller. Selective spatial attention
to left or right hand flutter sensation modulates the steady-state somatosensory evoked potential.
Cogn Brain Res, 20:5866, 2004.
[3] G. R. M
uller, C. Neuper, and G. Pfurtscheller. Resonance-like frequencies of sensorimotor areas
evoked by repetitive tactile stimulation. Biomed Tech, 46:186190, 2001.
[4] G. R. M
uller-Putz, R. Scherer, C. Neuper, and G. Pfurtscheller. Steady-state somatosensory evoked
potentials: Suitable brain signals for brain-computer interfaces? IEEE Trans Neural Syst Rehabil
Eng, 14:3037, 2006.
[5] C. Pokorny, C. Breitwieser, and G. R. M
uller-Putz. A tactile stimulation device for EEG measurements in clinical use. IEEE Trans Biomed Circuits Syst, (in press), 2013.
[6] C. Pokorny, D. S. Klobassa, G. Pichler, H. Erlbeck, R. G. L. Real, A. K
ubler, et al. The auditory
P300-based single-switch brain-computer interface: Paradigm transition from healthy subjects to
minimally conscious patients. Artif Intell Med, 59:8190, 2013.
[7] D. Regan. Human Brain Electrophysiology: Evoked Potentials and Evoked Magnetic Fields in
Science and Medicine. Elsevier, New York, 1989.
[8] J. R. Wolpaw, N. Birbaumer, D. J. McFarland, G. Pfurtscheller, and T. M. Vaughan. Braincomputer interfaces for communication and control. Clin Neurophysiol, 113:767791, 2002.
Article ID 055-4
DOI:10.3217/978-3-85125-378-8-56
Introduction
Reconstructing gait cycle patterns from brain activity could be a useful technique, e.g., first,
to enable neuromonitoring for therapists, and second, to provide neurofeedback for patients
during rehabilitation after brain injury. Using reconstructed gait related brain patterns for
neurofeedback can potentially enhance training in individuals with impaired motor function.
These signals could, e.g. be used for supportive control of functional electric stimulation or
robotic gait orthosis. Recently, some eorts in gait kinematics reconstruction were made [4]
using frequencies in the delta range (0.1-2Hz) for decoding. However, the usage of low frequency
components for decoding movements was discussed controversially [1] and their cortical origin
were doubted [2]. Previous work from our group [5], [6] showed that low gamma (24-40 Hz)
amplitudes are modulated in relation to the gait cycle. These activities were localized in
central sensorimotor regions. In this work, we suggest that these low gamma modulations can
be recognized on single-trial level and thus enables the reconstruction of gait cycle patterns in
real-time.
2
2.1
Methods
Experiment
Ten able-bodied subjects (S1-S10, 5 female, 5 male, 25.63.5 years) walked with a robotic gait
orthosis (Lokomat, Hocoma, Switzerland). The experiment involved 4 runs (6 min each) of
active and passive walking as well as 3 runs of upright standing (3 min each), which was used
as a control condition. The walking speed was adjusted for every subject ranging from 1.8 until
2.2 km/h, and a body weight support of less than 30% was provided.
Article ID 056-1
2.2
DOI:10.3217/978-3-85125-378-8-56
EEG was recorded form 120 sites with four 32-channel amplifiers (BrainAmp, Brainproducts,
Munich, Germany). The sampling frequency was 2.5 kHz, filter settings were 0.1 Hz and 1 kHz
for high and low pass respectively. The electrodes were arranged in accordance to the 5%
international 10/20 EEG system (EasyCap, Germany). Left and right mastoids were chosen
to place reference and ground electrodes. Electrode impedances were lower than 10 k. The
interval between two right leg heel ground contacts defined one gait cycle. Foot contact was
measured by mechanical switches. Further details can be found elsewhere [6]. EEG data was
filtered and down sampled to 500 Hz. Channels with a variance greater than 5 times the median
variance were rejected from further analysis. If one channel exceeded a threshold of 200 V,
the according gait cycle epoch was removed from the EEG recordings. In total 94.1 % of the
data were used for the analysis.
2.3
We built a model to reconstruct the gait cycle pattern from EEG recordings, based on physiological evidence; namely that the low gamma amplitudes are modulated with the step frequency
in the sensorimotor feet area. To reconstruct the gait cycle pattern from EEG we consequently
focused on the temporal modulations of low gamma (20-40Hz) amplitudes. To test a system
that is in principle able to work in clinical practice we used five channels (FCz, C1, Cz, C2,
CPz) to calculate Laplacian Cz derivation in this work, since we already know the spatial region
of interest from previous work [5], [6].
The EEG data were split into two subsets. First, one third of randomly drawn EEG trials
(max. 8 min) were used as training set to simulate clinically feasible system training durations.
Second, the unseen two thirds of the data were concatenated and used for evaluating the gait
cycle reconstruction. The training set was used to identify the parameters carrier frequency,
mean step frequency and phase lag between low gamma modulations and actual foot contacts.
Spectral analyses were performed using complex Morlet wavelets [3] (center frequency: 1 Hz,
full width half maximum: 3). Low gamma amplitudes are a result of spectral analysis. To determine the frequency of an amplitude modulation (AM) these amplitudes were again wavelet
transformed (center frequency: 1 Hz, full width half maximum: 6).
The frequencyin the low gamma range which amplitude were maximally modulated by the step
frequency was used as optimal carrier frequency. The phase lag between low gamma AM and
the actual foot contact was determined using the training data. Once we know the optimal
carrier frequency for AM and the phase relation between the AM and the actual foot contacts;
it is straightforward to reconstruct the foot contacts and thus the gait cycle patterns from EEG
recordings.
Data of the evaluation set were analysed on single-trial level to simulate a real-time experiment.
The EEG data were wavelet transformed at the optimal carrier frequency and the resulting amplitude were again analysed using a wavelet with the step (modulation) frequency to gain the
phase of ongoing low gamma AM. The phase lag gained form the training set was then used
to estimate the foot triggers. Reconstructed and mechanical measured foot contacts were then
compared and errors were calculated as L1 norm. To test if these errors are significantly smaller
than chance, we performed the same analysis using EEG data from the standing condition to
evaluate the chance level errors. The reconstruction errors for active and passive walking were
then statistically tested to be smaller than these chance level errors using the Wilcoxon rank
sum test.
Article ID 056-2
DOI:10.3217/978-3-85125-378-8-56
Results
In 8/10 subjects the reconstruction of the gait cycle patterns during active walking were significantly better than chance (p<0.05). In the best subject, the mean error (L1 norm) was
120 ms. The reconstruction was also successful during passive walking in every subject. No
significant dierence (p=0.77, Wilcoxon signed rank test) was found between the errors of the
active and passive condition. Detailed error reports are shown in Table 1 with the corresponding carrier frequency(f), p-values and the relative amount of trials in which the error was better
than random (BTR). Note that in 6/10 the BTR percentage was greater than 90%. The error
histograms are illustrated in Fig. 1.
subject
S1
S2
S3
S4
S5
S6
S7
S8
S9
S10
f
Hz
27
34
29
31
30
29
20
27
20
20
errrecon
s
0.12
0.14
0.24
0.22
0.23
0.21
0.46
0.47
0.29
0.43
Active
errchance
s
0.51
0.50
0.60
0.59
0.48
0.52
0.57
0.47
0.54
0.44
p
<0.001
<0.001
<0.001
<0.001
<0.001
<0.001
<0.001
0.54
<0.001
0.16
BTR
%
100
96.84
92.56
92.33
91.80
91.61
63.79
53.75
82.49
48.80
f
Hz
26
35
29
30
27
27
20
34
20
35
errrecon
s
0.09
0.17
0.30
0.28
0.24
0.19
0.39
0.34
0.40
0.22
Passive
errchance
s
0.51
0.50
0.62
0.61
0.53
0.52
0.56
0.48
0.61
0.41
p
<0.001
<0.001
<0.001
<0.001
<0.001
<0.001
<0.001
<0.001
<0.001
<0.001
BTR
%
100
92.91
88.63
88.73
87.56
93.91
76.07
84.54
68.62
88.44
Table 1: Low gamma carrier frequencies, reconstructed and chance level error means with
according p-value and BTR percentage for active (left panel) and passive (right panel) walking.
Discussion
In this work we presented a system that is able to reconstruct gait cycle patterns from noninvasive EEG recordings. We were able to extract the low gamma modulation pattern, the
carrier frequencies, and their phases (time lag relative to the heel strike) using only one third
of the data for training. These values are consistent with previous analysis reported in [5], [6].
These findings suggest that the reconstruction method in this work indeed is based on the
cortical activity not on artefacts.
The proposed method also works for passive walking in every subject while during active walking
it was successful in 8/10. In the analysis we observed that the quality of the reconstruction
is reduced when artefacts are present. Our data suggests that head/neck muscular activity
is lower during passive walking than during active walking. This eect is notable at lateral
and dorsal electrodes. Thus, it is plausible that the lower influence of muscular artefacts
during passive walking is the reason why a good reconstruction is possible in all participants.
However, EEG recordings are contaminated with muscular artefacts in a broad frequency range
during walking [2]. Analysing EEG data on a single-trial level during walking is therefore very
challenging. Further work is needed to evaluate to which extent muscular activities influences
the reconstruction of gait related parameters in comparison to the overall performance. In
this work, the reconstruction performance was found to be driven by physiological meaningful
features. In conclusion, we consider this work as a further step towards analysing and monitoring
cortical activity in real-time during human upright walking.
Article ID 056-3
!"#$%&
DOI:10.3217/978-3-85125-378-8-56
'())$%&
Figure 1: Error histograms for active (left) and passive (right) walking in red, for standing
(chance level errors) in blue (S1-S10, top left to bottom right). Error distributions of the walking
(red) conditions are centred around zero with an ideally small standard deviation for successful
reconstructions. Standing condition errors (blue) seem to follow an uniform distribution.
Acknowledgements
This work was supported by the European Union research project BETTER, BioTechMed and
the Land Steiermark project BCI4REHAB.
References
[1] J. M. Antelis, L. Montesano, A. Ramos-Murguialday, N. Birbaumer, and J. Minguez. On the usage
of linear regression models to reconstruct limb kinematics from low frequency EEG signals. Plos
One, 2013.
[2] T. Castermans, M. Duvinage, G. Cheron, and T. Dutoit. About the cortical origin of the low-delta
and high-gamma rhythms observed in EEG signals during treadmill walking. Neuroscience Letters,
561:166170, 2014.
[3] J. Morlet, G. Arens, E. Fourgenau, and D. Glard. Wave propagation and sampling theorypart I:
Complex signal and scattering in multilayered media. Geophysics, 47:222236, 1982.
[4] A. Presacco, R. Goodman, L. Forrester, and J.L. Contreras-Vidal. Neural decoding of treadmill
walking from noninvasive electroencephalographic signals. Journal of Neurophysiology, 106:1875
1887, 2011.
[5] M. Seeber, R. Scherer, J. Wagner, T. Solis-Escalante, and G.R. M
uller-Putz. EEG beta suppression
and low gamma modulation are dierent elements of human upright walking. Frontiers in Human
Neuroscience, 8:485, 2014.
[6] J. Wagner, T. Solis-Escalante, P. Grieshofer, C. Neuper, and R. Scherer G.R. M
uller-Putz. Level
of participation in robotic-assisted treadmill walking modulates midline sensorimotor eeg rhythms
in able-bodied subjects. NeuroImage, 63:12031211, 2012.
Article ID 056-4
DOI:10.3217/978-3-85125-378-8-57
Sabanci University, Signal Proc. & Info. Syst. Lab, Istanbul, Turkey
delgado@sabanciuniv.edu, mcetin@sabanciuniv.edu
2
Wadsworth Center, Lab of Injury and Repair, Albany, NY, USA.
adrianad@wadsworth.org, mcfarlan@wadsworth.org
Universidad del Norte, Robotics and Intelligent Systems Lab, Barranquilla , Colombia.
jadelgado@uninorte.edu.co
Abstract
Motivated by P300 spelling scenarios involving communication based on a limited vocabulary, we propose a probabilistic graphical model-based framework and an associated
classification algorithm that uses learned statistical prior models of language at the level
of words. Exploiting such high-level contextual information helps reduce the error rate of
the speller. The proposed approach models all the variables in the P300 speller in a unified
framework and has the capability to correct errors in previous letters in a word given the
data for the current one. The structure of our model allows the use of ecient inference
algorithms, which makes it possible to use this approach in real-time applications. Our
experimental results demonstrate the advantages of the proposed method.
Introduction
Recently there has been growing interest in the incorporation of statistical language models into
P300 spellers with the intention to reduce the error rate or to increase typing speed [1, 2, 3, 4, 5].
These approaches learn marginal and conditional probabilities of letters in a language based
on some corpus and use that information in the form of prior models in a P300-based braincomputer interface (BCI) system. In particular, such priors models are combined with measured
electroencephalography (EEG) data in a Bayesian framework to infer the letters typed by the
subject. The probabilistic structure in most of this work can be described by hidden Markov
models, which are one of the simplest forms of probabilistic graphical models. Work under this
theme involves filtering [2, 4], as well as smoothing methods [5]. The latter type of method
allows the EEG data for the current letter to aect the decision on a previous letter through the
language model, and possibly correct an erroneous decision that would be reached in the absence
of such data. In most of this body of work, first a conventional classifier for the P300 speller is
utilized and then the scores of that classifier are turned into probabilities to be combined with
the language model for Bayesian inference.
In this work, we propose taking one further step incorporating higher-level, in particular
word-level, language models into P300 spellers. Our motivation comes from BCI applications
that involve typing based on a limited vocabulary. In a particular context, if it is known that
a user is likely to type words from a dictionary of, say, a few thousand words, that provides
very valuable prior information that can potentially be exploited to improve the performance of
the BCI system. Based on this perspective, we propose a discriminative graphical model-based
framework for the P300 speller with a language model at the level of words. The proposed
model integrates all the elements of the BCI system from the input brain signals to the spelled
Article ID 057-1
DOI:10.3217/978-3-85125-378-8-57
l1
l2
lk
c1,1
c1,2
c1,12
c2,1
c2,2
c2,12
ck,1
ck,2
ck,12
x1,1
x1,2
x1,12
x2,1
x2,2
x2,12
xk,1
xk,2
xk,12
Methods
Proposed Graphical Model: The proposed model is shown in Figure 1. In the bottom
(first) layer, the variables xi,j represent the EEG signal recorded during the intensification
of each row and column of the spelling matrix. The index i denotes the ordinality of the
letter being spelled and the index j represents a row or column (j = {1, ..., 6} for rows and
j = {7, ..., 12} for columns). The second layer contains a set of twelve variables ci,j indicating
the presence or absence of the P300 potential for a particular flash. The third layer contains
variables li representing the letter being spelled. The variables li are related to the variables
ci,j in the same fashion as in traditional P300 speller systems: the presence of a P300 potential
in a particular row-column pair encodes one letter. The fourth layer contains the variable w
which can be any member of a particular subset of valid words in the English language. A
learned probability mass function for this variable constitutes the language model in this work.
The distributions of all the variables in the model (w, l = {l1 , ..., lk }, c = {c1,1:12 , ..., ck,1:12 })
given the observations (x = {x1,1:12 , ..., xk,1:12 }) can be written as a product of factors over all
the nodes and edges in the graph:
12
!
!
1
{2 (j, li , ci,j )1 (j, ci,j , xi,j )}
P (w, l , c|x ) = 4 (w)
(1)
3 (i, w, li )
Z
i
j=1
!d
m=1
(2)
where d is the dimensionality of the data. The parameter 4 is a vector of weights of length
equal to the number of states of the node w (i.e., the number of words in the dictionary). The
Article ID 057-2
DOI:10.3217/978-3-85125-378-8-57
product 4 f4 (w) models a prior for the probability of a word in the language with the feature
function f4 (w) = 1{w} , where 1{w} is a vector of length equal to the number of words in the dictionary, with a single nonzero entry of value 1 at the location corresponding to the argument of
f4 . The product 3i f3 (i, w, li ) models a prior for the probability of a letter li appearing in the position i of a word with the feature function f3 (i, w, li ) = 1{w(i),li } . The product 2j f2 (j, li , ci,j )
measures the compatibility between the binary random variable ci,j and the variable li with the
feature function f2 (j, li , ci,j ) = 1{C(li ,j)=ci,j } where C is a code-book that maps the intersections of rows and columns in the spelling matrix to letters. The product 1j,m f1 m (j, ci,j , xi,j m )
is a measure of the compatibility of the m-th element of the EEG signal xi,j Rd with the
variable ci,j . Here, we use the feature function f1 m (j, ci,j , xi,j m ) = xi,j m 1{ci,j } . Learning in
the model corresponds to finding the set of parameters = {4 , 3 , 2 , 1 } that maximizes the
log-likelihood of the conditional probability density function given in Equation 1. Given that
the structure of the model does not involve loops, inference in the model can be performed using
the belief propagation algorithm
which can eciently provide the posterior probabilities for the
( (
words: P (w|l , c, x ) = l c P (w, l , c|x ). We declare the word by maximizing the posterior
density: w = arg maxw P (w|l , c, x ). Note that the model allows computing other marginals of
interest (e.g,. letters) as well.
Description of Experiments: Two kinds of experiments are reported. In the first experiment, 8 subjects are instructed to spell a number of words one by one. In this scenario (which
we call screening) the number of letters in the words typed in the testing session is known.
In the second experiment 7 subjects write multiple words in a continuous fashion, using the
character - to indicate the end of a word. The EEG signals were recorded at a sample frequency of 240 Hz using a cap embedded with 64 electrodes according to the 10-20 standard.
All electrodes were referenced to the right earlobe and grounded to the right mastoid. From
the total set of electrodes a subset of 16 electrodes in positions F3, Fz, F4, FCz, C3, Cz, C4,
CPz, P3, Pz, P4, PO7, PO8, O1, O2, Oz were selected, motivated by the study presented in
[6]. In total each subject spelled 32 letters (9 words). Training and testing sessions were held
on dierent days for the same subjects. Signal segments of 600ms following the intensification
of each row or column were calculated and filtered between 0.5Hz and 8Hz using a zero-phase
IIR filter of order 8.
The parameters 4 , 3i , 2j in Equation 2 are independent of the brain signals and can be
learned prior to any EEG data collection. The parameters 4 are learned by calculating the
relative frequency of each word in the dictionary of interest. The parameters 3i take a positive
or negative value depending of the presence of absence of a letter in the i -th position of a word.
Statistics about the language were calculated using a text corpus with 450,000,000 words. The
dictionary was then built using the 5000 words with highest frequency. The parameters 2
represent a set of decoding vectors that map rows and columns to letters in the spelling matrix.
The parameters 1 are the set of parameters that maximize the potential function 1 (j, ci,j , xi,j )
in Equation 2 for each class (i.e., P300 vs. not P300).
Results
Figure 2 shows the results for the screening scenario and for continuous decoding where a 3gram method and a classifier based in Stepwise LDA have been used for comparison. The
results show that the proposed method performs better than the other methods in terms of
classification accuracy and requires fewer numbers of repetitions to achieve a particular level
of accuracy. In order to verify the results of correct classification accuracy, a statistical test
Article ID 057-3
DOI:10.3217/978-3-85125-378-8-57
Conclusion
Acknowledgments
This work was partially supported by Universidad del Norte (Colombia), the Scientific and
Technological Research Council of Turkey under Grant 111E056, by Sabanci University under
Grant IACF-11-00889, and by NIH grant EB00085605 (NIBIB).
References
[1] Tanja Mathis and Dennis Spohr. Corpus-Driven Enhancement of a BCI Spelling Component. In
RANLP07: Int. Conf. on Recent Advances in Natural Language Processing, September 2007.
[2] Umut Orhan et al. Oine analysis of context contribution to ERP-based typing BCI performance.
Journal of Neural Engineering, 10(6), 2013.
[3] S.T. Ahi, H. Kambara, and Y. Koike. A Dictionary-Driven P300 Speller With a Modified Interface.
IEEE Transactions on Neural Systems and Rehabilitation Engineering, 19(1):614, 2011.
[4] William Speier et al. Natural language processing with dynamic classification improves P300 speller
accuracy and bit rate. Journal of Neural Engineering, 9(1), 2012.
[5] C. Ulas and M. Cetin. Incorporation of a Language Model into a Brain Computer Interface based
Speller through HMMs. In IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, 2013.
[6] D.J. Krusienski, E.W. Sellers, D.J. McFarland, T.M. Vaughan, and J.R. Wolpaw. Toward Enhanced
P300 Speller Performance. Journal of Neuroscience Methods, 167(1):15 21, 2008.
Article ID 057-4
DOI:10.3217/978-3-85125-378-8-58
Introduction
In electroencephalogram (EEG) based brain computer interface (BCI), it is important to extract the appropriate feature suitable for classifying EEG arising from dierent cognitive tasks.
Typically the bandpower features are extracted from the EEG and used in the online classification. These features are easy to compute and require minimum number of EEG electrodes
minimizing setup time. This advantage makes the bandpower method suitable especially for
therapeutic BCI where setup time must be minimized. The bandpower features are well established and researchers can target physiologically relevant frequency bands when using it as
features for therapeutic BCI. However the need to select a user specific narrow frequency bands
can be an issue. Firstly because the user specific bands are known to vary for the same user.
Secondly due to the uncertainty principle, estimating narrow band spectral powers must be
dubious within short time windows.
The time domain parameters (TDP) [2] are features which do not require the selection of
narrow frequency bands. In addition TDP target the most time varying EEG features. TDP
features have been shown to outperform bandpower [2]. However to be suitable for therapeutic
BCI, TDP must be able to select physiologically relevant frequencies despite the use of a wide
band filtered signal. Furthermore, the current authors could only find one report [2] on the use
of TDP in online BCI; more investigation is required before using TDP on patients.
The aim of this study is to assess the suitability of using TDP for online therapeutic BCI
by analysing the time-frequency components of EEG classified into two classes using TDP. It
is shown that despite using wide band, TDP target the relevant frequency band.
Methods
Eight BCI naive subjects took part in this study after giving their informed consents. The
study was approved by the university ethic committee.
1
Article ID 058-1
DOI:10.3217/978-3-85125-378-8-58
To obtain an initial classifier, a subject performed the motor imagery (MI) of closing and
opening of the left and the right hand (20 trials for each hand). During these tasks EEG was
recorded, using the g.USBamp (GTEC, Austria) from three pairs of bipolar electrodes, namely
Fc3-Cp3, Fcz-Cpz and Fc4-Cp4. The input signal from the amplifier was bandpass filtered (5th
order Butterworth) online between 0.5 to 30 Hz. The TDP of the filtered data were computed.
The TDP features were computed in a similar way as that described by Vidaurre and
colleagues [2]. This is shown graphically on Figure 1. The feature was used to compute initial
u2
smooth
TDP
dx(t)
dt
x(t)
d2x(t)
dt2
d9x(t)
dt9
Feature
Adapt
log
EEG
LDA
[Bias,weight]
PC
[0|1]
MI
1
Score: 10
..
..
Buffer
Feedback
Article ID 058-2
DOI:10.3217/978-3-85125-378-8-58
included an acknowledgment text on the screen and a reward sound played to the subject when
the Active state was detected. If the Active state was not detected after about 6.5 s following
the execution cue onset, a text was shown and a sound was played to reflect the subjects failure
for that trial. The subjects were also provided with a continuous feedback in form of a scale
that moves counter clockwise when imagery was detected.
Of interest were the time-frequency characteristics of the trials selected as Active state
(Detection) and those not selected (No Detection) when using TDP as BCI features. Therefore
ERD/ERS analysis was carried out by first separating the trials from all subject and all runs
into Detection and No Detection groups. The resulting ERD/ERS maps were compared
between the two groups of trials using statistical nonparametric method with Holms correction
for multiple comparisons at p=0.05.
Table 1 shows the initial classifier accuracy, and the detection rate (true positive) per subject.
The naive subjects improved the detection rate from run 1 and 2 to run 3 because they got
better with experience. No false positive was reported because the subjects were imagining as
soon as the execution cue was shown and the false positive was significantly reduced when the
BCI difficulty was set.
Subjects
% rate (run 3)
1
2
3
4
5
6
7
8
83
97
78
75
85
83
83
90
75
0
10
50
40
50
15
61
100
30
40
100
90
100
50
47
Article ID 058-3
Detection
No Detection
Fc3-Cp3
Fc3-Cp3
28
16
10
DOI:10.3217/978-3-85125-378-8-58
4
4
28
10
Fcz-Cpz
16
Fcz-Cpz
Frequency (Hz)
ERS
0 dB
2
ERD
10
Fc4-Cp4
16
Fc4-Cp4
28
4
2
6 2
Time (s)
Figure 2: ERD/ERS maps of the right hand MI for when Active state was detected (column
1) and when it was not detected (column 2). The column 3 shows the time-frequency statistical
dierences between column one and two (p=0.05 with Holms correction). The execution cue
was shown at t=0 ms. (Generated with EEGLAB, http://sccn.ucsd.edu/eeglab).
the most time varying activities during the cognitive tasks making it likely to be selected by the
TDP. The statistical dierence on Fc3-Cp3 at t=4 s is due to the ERD in the low frequencies
in the No Detection group.
There were many failed trials because the subjects were naive, no training was given, they
had short time to perform the MI and only 20 trials were used to compute the initial classifier
although it was updated online to compensate for the low number of trials. However this
is a more realistic BCI as we tend to move it out of the laboratory and also use it in our
rehabilitation programmes.
Conclusions
TDP eliminates the requirement to select a user specific frequency band allowing for a more
generalised BCI classifier. It is easy to compute and require minimum EEG electrodes. The
current result shows that TDP features are suitable in online therapeutic BCI because they
target the physiologically relevant frequency band.
References
[1] Alois Schl
ogl, Carmen Vidaurre, and Klaus-Robert Muller. Brain-Computer Interfaces: Revolutionizing Human-Computer Interaction, chapter Adaptive Methods in BCI Research An Introductory
Tutorial, pages 131356. Springer, 2010.
[2] Carmen Vidaurre, Nicole Kr
amer, Benjamin Blankertz, and Alois Schl
ogl. Time domain parameters
as a feature for EEG-based braincomputer interfaces. Neural Networks, 22(9):13131319, 2009.
Article ID 058-4
DOI:10.3217/978-3-85125-378-8-59
Introduction
Despite that the mechanism used to elicit a response for an asynchronous BCI application
intents to be a natural mental activity (i.e. imagining to move a limb) the modest proportion
of population capable of achieving a satisfactory control of these systems remains as major
drawback [1]. Ono et al. [2] have shown that the mean accuracy values may be improved using a
more realistic feedback for the training process. ERD/ERS responses can also be elicited during
the realization of other activities like imagining a cube rotation, or by another type of cognitive
activities such as performing mental calculation [3]. This last activity has been proposed for
controlling a BCI, combining fNIRS and EEG techniques [4, 5]. Power et al., have reported
accuracy values above 70%, using EEG window lengths of 5-20 seconds. Notwithstanding that
these values suggest that it is possible to use such task in a BCI paradigm, the window length
used produces a low information transfer rate [6]. This paper explores the feasibility of mental
calculation, during shorter time windows, to be used as control for a BCI implementation.
EEG data were recorded from fifteen subjects, seven men and eight women, 25.3 3.47 years
old, all with completed high school studies. A 32 channel (Fp[z,1,2], AF[7,3,z,4,8], F[7,3,z,4,8],
Fc[3,4], T[7,8] C[3,z,4], Cp[3,4], P[7,3,z,4,8], PO[3,4] and O[1,2,z]) montage was used and the
signals were digitized at 512 sps with a g.tec USBamp amplifier system, using a bandpass filter
between [0.1, 60] Hz and a notch filter at 60 Hz. All subjects were recorded for three sessions
on dierent days (the lapse of time between sessions extended from 6-110 days, according to
the time availability of volunteers); each session consisted of two or three runs. In each run
the subject was instructed to pay attention to a screen and solve mentally fourteen sets of
basic concatenated arithmetic operations. A set was formed by the concatenation of four to
six randomly selected simple operations (+,-,,/). The beginning and ending of each
set was visually cued by an X and an = symbol, respectively. At the end of each set, the
subject was asked to vocalize the answer.
Article ID 059-1
DOI:10.3217/978-3-85125-378-8-59
As illustrated on figure 1, each set consisted of five dierent screens: Cue, Begin, Operate,
Answer and Rest, presenting each one for two seconds with a random ISI duration within
[625,750] ms. Subjects performed periods of continuous mental solving through the Begin and
Operate screens, and idle periods at the Rest screens.
For data analysis, signals were conditioned by removing CAR and lowpass filtered at 40
Hz. Afterwards, signals were analyzed using windows of two seconds synchronized with the
screen presentation for the channel selection process, and using sliding windows for an online
BCI simulation. Epochs were labeled in relation to solving or resting periods, and only those
sets with a correct answer were used for estimating the model and evaluating the performance.
Five dierent indexes were extracted from EEG data: the power spectral density (PSD) over
the four classical bands, P SD , P SD , P SD and P SD using the Welch Periodogram method,
and the scaling exponent DF A obtained using Detrended Fluctuation Analysis (DFA) [7, 8].
3
3.1
Methods
Channel Selection
Channel selection was based on a statistical analysis applied to each electrode as follows. Valid
epochs of each session, as synchronized with the screen presentation and correct arithmetic answer, were divided into two sets, Operating and Idle. To avoid including false positive channels
not reflecting a real region of activity, only AF[3,z,4], F[7,3,z,4,8], FC[3,4], C[3,z,4] and CP[3,4]
were considered. For each of these channels and each index ( P SD , P SD , P SD , P SD and
DF A ), the area under the receiver operating characteristic (ROC) curve (AUC) was individually estimated using these two labels. The channels were sorted according to AUC value and
the top five of each index were selected. Hence, the five most repeated channels were chosen as
the final set.
3.2
Performance evaluation
Using DF A and P SD estimated from the five selected channels, evaluation of performance was
carried out by both cross-validation (CV) and inter-session identification (iSI). In both cases,
performance was measured with AUC values using a data set also labeled as subsection 3.1, but
in this case epochs were extracted from sliding windows of two seconds length and overlapped
by 70%. The CV process was applied to find support vector machine (SVM) parameters that
optimized AUC values using repeated sub-sampling method with 20-folds. The mean and
standard deviation were assessed and considered as the final CV performance value. Then, the
classification model corresponding to each session was constructed and tested on the remaining
two sessions. Therefore for each subject three values of CV were obtained and six related to
iSI; Figure 2(a) presents box plots which summarize these results.
Article ID 059-2
DOI:10.3217/978-3-85125-378-8-59
Results
Using sliding windows the mean AUC values for CV assessed for each session, were 0.85 0.079,
0.87 0.071 and 0.87 0.065. The distribution of the individual values was above 0.7 for
all sessions as can be seen on Figure 2(a). iSI evaluation shows a mean AUC value of 0.7
which is below the CV evaluation but acceptable for a BCI implementation; in particular when
considering that for this process, the model used and data identified were recorded in dierent
days. The individual distribution of iSI evaluation shows that about half of the population
achieved a value above 0.7, and only two subjects performed below 0.6.
For illustration purposes, Figures 2(b) and 2(c) show two examples of SVM posterior probability estimates (SVMpp) and periods of mental activity. Given that SVMpp is proposed as
the control signal for a BCI implementation, it should follow as is manifested in these examples
the realization of the mental task, which means that it must be related with the decision to
select the command or, for the case studied here, with the periods of mental activity due to
mental calculation.
1
1
SVMpp
0.8
0.9
0.6
0.4
0.2
0
0.8
50
100
150
200
AUC
t (s)
(b) AUC=0.52
0.7
0.6
SVMpp
0.8
0.5
0.85 0.079
0.87 0.071
0.87 0.065
0.70 0.129
CV S001
CV S002
CV S003
iSI
0.6
0.4
0.2
0.4
0
260
280
300
320
340
360
380
400
t (s)
(c) AUC=0.81
Figure 2: The distributions of the AUC values achieved for the three CV sessions and iSI
are presented in subfigure (a), the mean and standard deviation values are included over the
horizontal axis. Subigures (b) and (c) depict two examples of SVMpp (black line) assessed for
two subjects that achieved dierent AUC values, the light blue and green shadows indicate the
periods of mental calculation realization with either correct or wrong answers, respectively, and
the red ones indicate the periods for cue presentation and vocalization of the answer.
Day-to-day activities could be fitted as stimulation paradigms for an asynchronous BCI. Here
the utilization of mental calculation is demonstrated to be useful for such context. The results
presented in this paper suggest that, for asynchronously controlling a BCI, it seems possible
Article ID 059-3
DOI:10.3217/978-3-85125-378-8-59
to use mental computation with overlapped EEG windows of two seconds length. More experiments are required to assess the potential interference of other factors, such as tiredness,
distraction or low attention. Importantly, the AUC values achieved were above 0.8; which
appear better than the performance reported on previous publications that also use mental
calculation tasks but under dierent experimental conditions [4, 5]. As can be seen on the
examples of Figure 2, the classification output shows a rapid response to the changes of mental
activity which is also a desirable condition for a BCI control. This encourages the utilization
of mental calculation as a BCI activation paradigm.
Acknowledgments
First author acknowledges the National Council of Science and Technology (CONACyT) for
support his PhD studies, grant 16848.
References
[1] C. Guger, G. Edlinger, W. Harkam, I. Niedermayer, and G. Pfurtcheller. How many people
are able to operate an EEG-based brain-computer interface (BCI)? IEEE Transactions on
Neural Systems and Rehabilitation Engineering, 11:145147, 2003.
[2] Takashi Ono, Akio Kimura, and Junichi Ushiba. Daily training with realistic visual feedback
improves reproducibility of event-related desynchronisation following hand motor imagery.
Clinical Neurophysiology, 124:17791786, 2013.
[3] C. Neuper and G. Pfurtscheller. Event-related dynamics of cortical rhythms: frequencyspecific features and functional correlates. International Journal of Psycophysiology, 43:41
58, 2001.
[4] Sarah D Power, Azadeh Kushki, and Tom Chau. Towards a system-paced near-infrared
spectroscopy braincomputer interface: dierentiating prefrontal activity due to mental arithmetic and mental singing from the no-control state. Journal of Neural Engineering, 8:14
pp, 2011.
[5] Sarah D Power, Tiago H Falk, and Tom Chau. Classification of prefrontal activity due to
mental arithmetic and music imagery using hidden Markov models and frequency domain
near-infrared spectroscopy. Journal of Neural Engineering, 7:9 pp, 2010.
[6] Jonathan R. Wolpaw, Niels Birbaumer, Dennis J. McFarland, Gert Pfurtscheller, and Therresa Vaughan. Brain-computer interfaces for communication and control. Clinical Neurophysiology, 113:767791, 2002.
[7] C.-K. Peng, S. V. Buldyrev, S. Havlin, M. Simons, H. E. Stanley, and A. L. Goldberger.
Mosaic organization of DNA nucleotides. Phys. Rev. E, 49:16851689, Feb 1994.
[8] E.R. Bojorges-Valdez, J.C. Echeverria, and O. Yanez-Suarez. Task identification using
fluctuation analysis. In Proceedings of the Fifth International Brain-Computer Interface
Meeting 2013.
Article ID 059-4
DOI:10.3217/978-3-85125-378-8-60
Introduction
Attention is believed to be an important cognitive factor in physical rehabilitation [7]. Conversely, repetitive tasks which are typical of rehabilitation regimes can induce inattentive
cognitive states, termed mind wandering [6]. These considerations are relevant in, e.g., motor
rehabilitation after stroke, where repetitive task-specific practice is a common intervention [3].
Therefore, an EEG-based brain-computer interface (BCI), capable of discriminating between
attention and mind wandering, might prove useful in a number of dierent settings. In classical
rehabilitation, the therapist could use such a device as a window into the patients cognitive
state, guiding the exercise regime. In the robot-assisted rehabilitation scenario, attention level
could be used as an additional control signal, adhering to the human-in-the-loop concept. Additionally, such a device might allow to more rigorously answer research questions on the role
of attention in rehabilitation.
We are building on the work described in [1]. The referenced work has shown that a
distractor task (specifically, counting backwards by threes) modulates desynchronization of
sensori-motor rhythms (SMR) and that a classifier can be designed to discriminate between
attended and unattended passive movements of upper limbs, with an accuracy of around 75%.
Our approach diers from prior work in the way we defined the conditions: instead of
contrasting attention with distraction by an artificial mental task, we contrast attention with
a mind wandering state, more realistically modeling the rehabilitation scenario. Furthermore,
we focus on a specific aspect of attention the kinesthetic sensation of the movement trying
to disentangle it from other aspects like visual attention. Lastly, in this study we focused
on the lower limbs instead of upper, envisioning the application of the developed BCI to a
gait-rehabilitation robot.
2
2.1
Methods
Experimental Protocol and Data Collection
Five healthy male participants (age range: 23 28 years old) volunteered in the experiment.
Subjects were seated in front of a screen with their feet strapped to an electrical mini bike that
Article ID 060-1
DOI:10.3217/978-3-85125-378-8-60
was used to actuate passive movements. The view of participants feet was obscured to prevent
confounds with visual attention. EEG recording was carried out with 30 channels (positioned
according to the 10-10 system). TMSi Refa 32 amplifier was used with 256 Hz sampling rate
and linked ears average reference.
Each subject was exposed to two experimental conditions. In the first condition subjects
were instructed to pay attention to the kinesthetic sensation of the passive movement we
denote this as the Passive Movement with Attention (PMA) condition. In the second condition (denoted PMR Passive Movement with Relaxation) subjects were instructed to relax,
ignore the movement and let their mind wander. For both conditions subjects were instructed
not to make muscle contractions themselves and to not visualize or imagine the movements.
The recording sessions were divided into 6 blocks, each consisting of 10 consecutive trials
of one, and 10 consecutive trials of the other condition. The ordering of conditions within
blocks was semi-randomized, with 3 blocks having PMR, and 3 having PMA trials first. Before
each block the subjects were informed what type of trials follows and could take a pause. The
duration of trials was 15 seconds: for the first 5 seconds (baseline period) the message Rest
was displayed; during the next 10 seconds the mini bike was working and a fixation cross was
displayed. A sound cue, played 500 ms prior to the appearance of the fixation cross, prompted
the experiment operator to turn on the bike. The sound was also audible to the subject.
Recording time was around 50 minutes, and 60 trials per condition were collected.
2.2
Time-Frequency Analysis
To verify whether attention and mind wandering have an eect on sensori-motor rhythms, we
performed time-frequency analysis of collected data. Before the time-frequency decomposition,
data was preprocessed: zero-phase IIR filter with the pass band between 1 and 40 Hz was
applied; data was segmented into 15 s long trials, with 5 s before the appearance of the fixation
cross and 10 s after; artifactual trials were excluded on the basis of visual inspection (on average
20% of trials were rejected).
Event-related spectral perturbation (ERSP) [2] was calculated with Morlet wavelets. For
baseline correction all the spectrum estimations were divided with the mean spectrum of the
3 s to 1 s period of all the trials. Using a divisive baseline, we are assuming a gain model of
task activity: power in the task period is a modulation of power in the baseline period.
2.3
Classification
To check how well PMA trials can be discriminated against PMR trials we utilized a simple
bandpower BCI design. In the preprocessing step signals were filtered with a causal FIR filter
with the pass band from 8 to 35 Hz (capturing the alpha and beta bands that are relevant
for SMR). Next, several dierent spatial filtering variants were applied: no spatial filter (all 30
channels used); surface Laplacian with 4 closest neighbors applied to all the channels; selection
of 7 electrodes over the motor cortex (FC1, FC2, C3, Cz, C4, CP1, CP2); only channel Cz with
a surface Laplacian. In feature extraction the logarithm of variance in the period of 1 to 8 s after
the appearance of the fixation cross was calculated (resulting in 30, 7 or 1 feature, depending
on the spatial filter). No artifact rejection was performed, i.e. all the recorded trials were used.
An LDA classifier was then trained and tested on this data. The classifier was regularized
using covariance shrinkage (with the regularization parameter determined analytically; see [5]).
The classifier performance was then estimated using a 5-fold chronological cross-validation
scheme [4], with 5 trials before and after the testing block omitted from the training set.
Article ID 060-2
DOI:10.3217/978-3-85125-378-8-60
Results
In Fig. 1 we are showing the results of time-frequency analysis for all the subjects, for the electrode Cz which displayed strongest movement-related spectral changes. At a qualitative level
we observe that for all the subjects passive movement produced a prominent desynchronization
in the upper beta band with a spatial distribution concentrated over the central electrodes.
For some subjects a desynchronization in the alpha band could also be observed, but with a
more diuse spatial distribution (possibly caused by the contribution of occipital or temporal
rhythms in the alpha band, and not by the mu rhythm). For subject 3 we could also observe a
synchronization at around 30 Hz that increased in bandwidth during the course of a trial.
Averaging ERSP values over 1-8 s time window and over alpha and beta bands (selected
for each subject by visual inspection) yielded significant dierences between the PMA and
PMR conditions for subjects 3 and 4 in both bands (two-sample t-test at = 0.05, Bonferroni
corrected).
2000
2
3
4
dB
40
Frequency (Hz)
20
10
2000
0 2000 4000
Time (ms)
40
0 2000 4000
Time (ms)
20
10
2000
0 2000 4000
Time (ms)
20
10
2000
Cz, PMA
30
30
40
0 2000 4000
Time (ms)
30
20
10
2000
0 2000 4000
Time (ms)
30
20
10
2000
Cz, PMA
Cz, PMR
40
0 2000 4000
Time (ms)
20
10
0 2000 4000
Time (ms)
Cz, PMR
30
20
10
2000
Cz, PMA
30
2000
40
Frequency (Hz)
10
2000
2000 4000
Cz, PMA
30
20
40
40
Frequency (Hz)
10
30
Subject 5
Subject 4
Cz, PMR
Frequency (Hz)
40
Frequency (Hz)
20
Subject 3
Cz, PMR
Frequency (Hz)
30
Frequency (Hz)
40
Frequency (Hz)
Subject 2
Cz, PMR
Frequency (Hz)
Subject 1
40
0 2000 4000
Time (ms)
Cz, PMA
30
20
10
2000
0 2000 4000
Time (ms)
Figure 1: ERSP maps for all subjects, for electrode Cz. Upper plots show ERSPs for the PMR
condition, and the lower ones for the PMA condition.
In Fig. 2 we present the crossvalidated classification accuracy for different spatial filter choices. The results
show that it was possible to discriminate between the PMA and PMR conditions significantly above chance level
for all the subjects with a suitably chosen spatial filter. The average accuracy
for the best performing design (with 30
channels and Laplacian spatial filter applied) was 75%. Given the small sample
size, the average results should be taken
with caution, but they do seem to suggest that it is beneficial to also include Figure 2: Mean cross-validation accuracy of classichannels other than those over the mo- fiers with dierent spatial filters. The dashed line
marks the significance threshold for a binomial test
tor cortex.
(at = 0.05 significance level and with Bonferroni
correction).
Article ID 060-3
DOI:10.3217/978-3-85125-378-8-60
Discussion
For a sample of five subjects we found that it is possible to discriminate between attention and
mind wandering during passive movement of lower limbs. With a simple BCI design based on
bandpower features we were able to obtain average accuracy of 75% on single trials, in line
with results for upper limbs by [1]. However, time-frequency analysis suggested dierent levels
of SMR (de)synchronization for only two subjects.
While we tried to model realistically attention during rehabilitation, the stop-go nature of
the trial-based experiments might not be very conducive to mind wandering. Also, unlike in our
experiment, mind wandering is usually not intentional. Therefore, the experimental protocol we
propose should be validated as a calibration session for an online BCI with continuous feedback.
Our results also suggest that it is beneficial for classification to include features not only from
the channels over the motor cortex. The question of whether the classifier is using class-specific
information from other brain regions, or is using the additional features to cancel out classunrelated noise, is left to be answered by future studies with source localization techniques.
Another possibility is the existence of an uncontrolled confound in our experimental design
we find this explanation unlikely, due to the fact that the trials had the same external stimuli,
and were semi-randomized.
Our future research eorts are motivated by several observations from this study: the subjects might have difficulties in complying with the protocol (inability to ignore the movement
or to attend to it consistently); there might be discriminatory information in regions other
than the motor cortex, and in frequency bands other than the alpha and beta bands; there
is considerable variation in performance from subject to subject. We intend to address this
questions, respectively, by using the proposed protocol as a calibration session for online BCI
operation, by using optimized spatio-spectral features, and by analyzing subject-to-subject and
session-to-session transfer of knowledge.
Acknowledgements
This work was supported by the European Commission through the FP7 Marie Curie Initial Training Network 289146, NETT: Neural Engineering Transformative Technologies, the FP7 CORBYS
IP project (GA 270219) and the Spanish Ministry of Science projects HYPER-CSD2009-00067 and
DPI2011-25892. The authors would also like to thank Andreea Sburlea for valuable discussions.
References
[1] J. M. Antelis, L. Montesano, X. Giralt, A. Casals, and J. Minguez. Detection of movements with attention
or distraction to the motor task during robot-assisted passive movements of the upper limb. In Engineering
in Medicine and Biology Society (EMBC), 2012 Annual International Conference of the IEEE, pages 6410
6413, 2012.
[2] R. Grandchamp and A. Delorme. Single-trial normalization for event-related spectral decomposition reduces
sensitivity to noisy trials. Frontiers in psychology, 2:114, Jan. 2011.
[3] P. Langhorne, F. Coupar, and A. Pollock. Motor recovery after stroke: a systematic review. The Lancet
Neurology, 8(8):741754, 2009.
[4] S. Lemm, B. Blankertz, T. Dickhaus, and K.-R. M
uller. Introduction to machine learning for brain imaging.
Neuroimage, 56(2):387399, 2011.
[5] J. Sch
afer and K. Strimmer. A Shrinkage Approach to Large-Scale Covariance Matrix Estimation and
Implications for Functional Genomics. Statistical Applications in Genetics and Molecular Biology, 4(1),
2005.
[6] J. Smallwood and J. W. Schooler. The Restless Mind. Psychological Bulletin, 132(6):946958, 2006.
[7] K. P. Tee, C. Guan, K. K. Ang, K. S. Phua, C. Wang, and H. Zhang. Augmenting Cognitive Processes in
Robot-Assisted Motor Rehabilitation. In Biomedical Robotics and Biomechatronics, 2008. BioRob 2008.
2nd IEEE RAS & EMBS International Conference on, pages 698703, 2008.
Article ID 060-4
DOI:10.3217/978-3-85125-378-8-61
Introduction
Thus far, linear machine-learning methods are considered ideal for the application in braincomputer interfaces (BCIs) [3]. Particularly with the main argument that simplicity should be
preferred, especially when limited data are available as in BCIs. However, substantial progress
has been made in the field of machine-learning. Nowadays, certain non-linear methods claim
to generalize well when only limited amount of data are available. One such a method is
the random forests (RF) classifier [2]. Our interest in this classifier is mainly based on his
following properties: (1) RF classifiers provide a complex model which allows non-linear decision
boundaries. (2) RF classifiers are over-fitting resistant, even with a large number of features.
(3) RF classifiers are able to merge features originating from different statistical distribution
into one model. Particularly hybrid BCIs and passive BCIs make use of such features. (4)
RF classifiers are regularized by nature. (5) There exist efficient implementations of the RF
classifiers which enables on-line operation. (6) RF classifiers are multi functional tools for data
analysis. E.g. RF classifiers offer importance ratings of the features, they allow for analysis of
features proximities and provide an estimate of the expected accuracy.
Recently, we conducted an on-line study, where a RF classifier was deployed in an electroencephalography (EEG) based sensorimotor rhythms BCI (SMR-BCI) [4]. Discrete Fourier
transform (DFT) magnitudes were used as features for the classification. The on-line feedback
results in 13 users demonstrate an classification accuracy competitive to other state of the art
SMR-BCIs. Detailed results will be published elsewhere.
In the present work, we address two questions arising from the RF driven SMR-BCI: (1)
The first question addresses the features for classification. Common spatial patterns (CSP)
Article ID 061-1
DOI:10.3217/978-3-85125-378-8-61
filtering is a more powerful feature extraction method than DFT [5]. Hence, we hypothesize
that replacing the DFT features by CSP features will boost the performance of the RF classifier
driven SMR-BCI. (2) The second question addresses the impact of the RF classifiers non-linear
model. According to literature, we hypothesize that the RF non-linear model outperforms a
linear classification model although only a limited amount of data is available. The linear model
is represented by an analytic-shrinkage-regularized linear discriminant analysis (sLDA) as LDA
classifier are commonly used in BCIs [1]. For evaluating this two hypothesis, we conduct BCI
simulations using the data of the on-line study mentioned above.
Methods
Summary of the on-line studies set-up. The paradigm was based on the cue-guided GrazBCI training paradigm [4]. Hence, recording, training, and feedback was performed within a
single session. The session consisted of eight runs, five of them for training and three with
feedback for validation. One run was composed of 20 trials. Taken together, we recorded 50
trials per class for training and 30 trials per class for validation. Participants had the task
of performing sustained (5 seconds) kinaesthetic motor imagery (MI) of the right hand and of
the feet each as instructed by the cue. Feedback was presented in form of a white coloured
bar-graph. The length of the bar-graph reflected the amount of correct classifications over
the last second. EEG was measured with a biosignal amplifier and active Ag/AgCl electrodes
(g.USBamp, g.LADYbird, Guger Technologies OG, Schiedlberg, Austria) at a sampling rate
of 512 Hz. The electrodes placement was designed for obtaining three Laplacian derivations.
Center electrodes at positions C3, Cz, and C4 and four additional electrodes around each center
electrode with a distance of 2.5 cm, 15 electrodes total. The reference electrode was mounted
on the left mastoid and the ground electrode on the right mastoid. The 13 participants were
aged between 20 and 30 years, 8 nave to the task, and had no known medical or neurological
diseases.
BCI simulation. In this work, we want a balance between data-sets from nave and non-nave
participants. We include all 5 data-sets of the non-nave participants and 5 data-sets from nave
participants chosen by random. For the BCI simulation, each data-set is divided in two parts.
The first part is used for CSP and classifier training, the second part for validation. The validation is carried out with a running classifier. The applied signal processing pipeline: (1) A filter
bank of 8th order Butterworth band-pass-filters divides the EEG data into 15 sub-bands. Cutoff frequencies: [i, i + 2] i = 6, 8, 10, 12 in the -band and [i, i + 5] i = 14, 17, 20, 23, 26, 29, 32, 35
in the -band. (2) We calculate a separate set of CSP filters for each sub-band [5]. The spatial
filters according to the three highest and three lowest eigenvalues of each set of CSP filters
are selected. Hence, one CSP calculation per sub-band and six filters per CSP results in 90
virtual channels. (3) The features used for classification are obtained by calculating logarithmic band-power for each of the 90 virtual channels. The logarithm changes the band-power
features distribution to a normal distribution. Normal distributed features are not necessary
for the RF classifier, but for the sLDA classifier. The band-power was estimated by squaring
and subsequent averaging over a sliding window with a length of 1 s. (4) The classification was
performed with a RF classifier on the one hand, and with a sLDA classifier on the other hand.
For training, we picked the features from the 1 s long window starting 2.5 s after the cue
of each trial [6]. This implies a trials-to-features ratio of 100/90 = 1.11. For validation, we
performed a separate classification on each time point of each trial to obtain one course of
classification accuracy per participant.
Article ID 061-2
DOI:10.3217/978-3-85125-378-8-61
Results
For each participant, the peak, mean and median accuracies during the feedback period were
calculated and are presented in Table 1. Peak means highest accuracy during the feedback
period. Mean refereed the mean accuracy over the feedback period and median stands for the
median accuracy over the feedback period. The approaches using CSP features significantly
outperformed the approach using DFT features in terms of peak (82% < 89.67%, 87.83%;
p < 0.05), mean (66.82% < 79.30%, 77.15%; p < 0.01), and median (67.67% < 80.42%, 77.83%;
p < 0.01) performance (paired t-tests, Bonferoni-Holm correction). The combination of CSP
features with a RF classifier significantly outperformed the combination of CSP features with
sLDA in terms of mean (79.3% > 77.15%; p < 0.05), and median (80.42% > 77.83%; p < 0.05)
performance (paired t-test, Bonferoni-Holm correction).
ID
naive?
P1
no
P2
no
P3
no
P4
no
P5
no
P6
yes
P7
yes
P8
yes
P9
yes
P10 yes
average
std
Online DFT+RF
peak mean median
71.67 56.14 56.67
86.67 67.27 68.33
100.00 90.71 91.67
76.67 64.39 65.00
80.00 59.81 60.00
93.33 82.79 83.33
96.67 83.25 86.67
83.33 66.54 66.67
66.67 47.35 48.33
65.00 49.99 50.00
82.00 66.82 67.67
12.27 14.63 15.11
Simulation CSP+RF
peak
mean
median
81.67
74.56
76.67
91.67
80.52
82.50
100.00 99.30
100.00
95.00
81.85
81.67
81.67
65.44
65.00
96.67
88.46
88.33
98.33
88.33
92.50
95.00
83.28
85.83
88.33
76.64
78.33
68.33
54.61
53.33
89.67* 79.30** 80.42**
9.87
12.56
13.40
Simulation CSP+sLDA
peak
mean median
78.33
69.69
70.00
90.00
79.22
80.00
100.00 99.22 100.00
93.33
79.09
77.50
78.33
63.10
61.67
95.00
84.35
85.00
100.00 88.61 92.50
88.33
78.93
81.67
83.33
73.10
73.33
71.67 56.22 56.67
87.83* 77.15** 77.83**
9.69
12.42
13.20
Table 1: Binary validation accuracies of the different BCI systems in %. Best performing
method per participant is highlighted. * significantly better than DFT+RF (p < 0.05). **
significantly better than DFT+RF (p < 0.01). significantly better than CSP+sLDA (p <
0.05).
Article ID 061-3
DOI:10.3217/978-3-85125-378-8-61
By using CSP features instead of DFT features, the average peak, mean and median performance during the feedback period was significantly improved from 82% to 89.7%, from 66.8%
to 79.3% and from 67.7% to 80.4%, respectively when combined with a RF classifier and from
82% to 87.3%, from 66.8% to 77.2% and from 67.7% to 77.8%, respectively when combined with
an sLDA classifier. It is not surprising that an optimized spatial filtering outperforms the DFT
features. CSP features have a higher signal-to-noise ratio and are therefore easier to classify.
For example, the peak classification accuracy of one participant was improved by 22% (Table
1, P9). However, our results show that a RF classifier can make better use of CSP features
than an sLDA classifier, at least for the present data. This is remarkable, as the RF classifier
relies on a complex, non-linear model and the trials-to-features ratio is low (100/90 = 1.11).
The effect of using a RF classifier instead of a sLDA classifier is small (peak 2%, mean 2%,
median 3%), but statistically significant for mean and median performance and consistent
over the participants. For 8 of the 10 participants the combination of a RF classifier with CSP
features is the best performing method. For participant P7 the combination of sLDA with CSP
features performed slightly better, but not in median performance. For participant P10, both
methods failed, since the achieved performance is around 70% only. The enhancement of the
sustained (i.e. mean and median) performance is of particular importance, as the Graz-BCI
paradigm calls for these. Concluding, the present work on performance, in combination with
previous work on RF classifiers as powerful tools for data analysis [7], underlines the potential
of the RF classifier in the field of BCIs. Further, we argue that the widespread view that linear
methods are ideal for BCIs should be reconsidered.
Acknowledgements
This work is partly supported by the FP7 research projects BackHome (No. 288566) and ABC
(No. 287774). This paper only reflects the authors views and funding agencies are not liable
for any use that may be made of the information contained herein.
References
[1] B. Blankertz, S. Lemm, M. Treder, S. Haufe, and K.-R. M
uller. Single-trial analysis and classification of ERP components - a tutorial. Neuroimage, 56:814825, 2011.
[2] L. Breiman. Random forests. Mach. Learn., 45:532, 2001.
[3] K. R. M
uller, C. W. Anderson, and G. E. Birch. Linear and nonlinear methods for brain-computer
interfaces. IEEE Trans. Neural Syst. Rehabil. Eng., 11:1659, 2003.
[4] G. Pfurtscheller and C. Neuper. Motor imagery and direct brain-computer communication. Proc.
IEEE, 5(89):11231134, 2001.
[5] H. Ramoser, J. M
uller-Gerking, and G. Pfurtscheller. Optimal spatial filtering of single trial EEG
during imagined hand movement. IEEE Trans. Rehab. Eng., 8(4):441446, 2000.
[6] R. Scherer, G. Pfurtscheller, and C. Neuper. Motor imagery induced changes in oscillatory EEG
components: speed vs. accuracy. In Proc. of the 4th International Brain-Computer Interface Workshop and Training Course 2008, pages 186190, Graz, Austria, 2008. Verlag der Technischen Universit
at Graz.
[7] D. Steyrl, R. Scherer, and G. R. M
uller-Putz. Random forests for feature selection in non-invasive
brain-computer interfacing. In HCI-KDD 2013, Lecture Notes in Comput. Sci., 7947, pages 207
216, Berlin Heidelberg, 2013. Springer.
Article ID 061-4
DOI:10.3217/978-3-85125-378-8-62
Institute for Knowledge Discovery, Graz University of Technology, 8010 Graz, Austria
2
Albert Schweitzer Clinic, 8020 Graz, Austria
3
Cyclotron Research Center, University of Li`ege, 4000 Li`ege, Belgium
4
Department of Psychology, University of W
urzburg, 97070 W
urzburg, Germany
5
Fondazione Santa Lucia IRCCS, 00179 Rome, Italy
6
BioTechMed-Graz, Austria
g.bauernfeind@tugraz.at
Abstract
In this study we report on the improvement of classification accuracy in an auditory
P300 paradigm, by using stepwise linear discriminant analysis (SWLDA) with an increased
number of channels and analytic shrinkage-regularized LDA (sLDA) as classifier. The
investigations were evaluated on recordings of 10 healthy subjects and 12 patients in a
minimally conscious state. The results for healthy subjects were promising, significant
improvements could be found by increasing the number of channels using SWLDA, as well
as, for using sLDA instead of SWLDA. Single trial classification accuracies up to 85.1%
could be achieved for healthy subjects. For the patients the results were less promising.
However, for one patient improvement reaching up to an accuracy of 71.3% could be
achieved.
Introduction
Article ID 062-1
DOI:10.3217/978-3-85125-378-8-62
Methods
Subjects, Experimental Paradigm and Data Recording: We used the data of the 10
HS (mean age 27.6 3.0 SD years) and 12 PA (45.8 18.2 years) in MCS, recorded in our
previous study, using an auditory based P300 BCI [4]. Briefly summarized, two tone streams
(low, LTS, at 396 Hz, and high, HTS, at 1900 Hz) with infrequently appearing deviant tones
(297 Hz for the LTS and 2640 Hz for the HTS) at random positions were presented. The tones
of both streams were intermixed (LHL LHL , L..low tone, H..high tone, ..silent gap; for details
and a schematic illustration see [4]). As in this way the LTS was twice as fast as the HTS, the
percentage of deviant tones was dierent (20% for HTS and 10% for LTS) to generate the same
absolute number of deviants in both streams. For the experiment, subjects were instructed
visually, for HS, or auditory, for PA, to focus attention on one of the streams during one run.
For the HS 80 runs (40 runs for each stream; with 4000/400 (standard/deviant) tones in the
LTS and 1800/400 in the HTS) were recorded. Taking into account the reduced attention span
of the patients, one to two sessions, each with 20 runs (10 for each stream), were recorded. In
HS the EEG was recorded at 15 positions with a sampling rate of 512 Hz (filter setup: 0.5100
Hz) using active electrodes. For PA recording a reduced channel set (9 positions) was used to
facilitate measurements in a clinical environment. For the electrode setup see [4].
Data Analysis and Classification: Data recorded from HS and PA were analyzed in the
same way. Raw signals were filtered with a 3rd order Butterworth low-pass filter (cut-o
frequency at 10 Hz) and down-sampled to 64 Hz. In our previous work three EEG channels
(Fz, Cz, Pz) were used for classification (SWLDA, with 10x10 cross-validation; for details
see [4]). Only time points between 200 and 800 ms after tone onset were used as features.
In the present work, we performed two classification approaches: Firstly we used the data of
all recorded channels for SWLDA classification. Secondly we investigated also the performance
of sLDA instead of the SWLDA. In general, the classification performance of an LDA crucially
depends on accurate estimates of the class means and the common covariance matrix. If enough
data are available, an accurate estimate is unproblematic. However, in cases where insucient
data are available or the data are contaminated by outlier (e.g. during PA recordings), conventional estimation of the covariance matrix fails. In those cases, the covariance matrix is
ill conditioned. One can improve the conditioning of the covariance matrix by regularization.
An ecient regularization method is analytic shrinkage. This so-called analytic-shrinkageregularized LDA (sLDA) is computationally very ecient and outperforms other methods at
low trial to feature ratios [1]. For both approaches the classification was carried out as follows:
1) To detect the P300 the deviant tones were classified against the standard tones for each
target stream separately, and 2) to investigate the attentional modulation of the P300, the
target deviant tones were classified against the non-target deviant tones for each stream separately. By performing the second investigation, it should be possible to infer which stream was
attended. For the first investigation random subsampling with 100 iterations was applied to
account for the very dierent numbers of deviant and standard segments. Classification results
were compared with the real level of chance [2] to identify random results.
Results
Results are shown for P300 and attentional modulation detection. Figure 1 depicts the averaged
(mean SD) classification accuracies for P300 and attentional modulation detection, using
either SWLDA with 3 channels (SWLDA3 ), all channels (SWLDAall ), or sLDA classifier with
Article ID 062-2
DOI:10.3217/978-3-85125-378-8-62
all channels (sLDAall ) for both groups (HS,PA). The results are shown for LTS and HTS.
Table 1 summarizes the subject specific SWLDA and sLDA classification results of all 10 HS.
The values in the table represent the mean accuracies over all cross-validation folds.
Subj. SWLDA 3
LTS HTS
HS01 57.3 61.2
HS02 70.7 61.8
HS03 77.2 72.0
HS04 62.2 57.2
HS05 63.9 60.7
HS06 71.5 64.4
HS07 61.9 54.4
HS08 67.2 59.7
HS09 56.8 55.4
HS10 56.0 52.6
Mean 64.5 59.9
SD
7.1
5.6
P300
SWLDA all
LTS HTS
58.0 62.5
73.5 63.6
78.2 73.3
64.3 57.9
65.2 61.5
78.7 75.0
62.9 55.6
74.0 65.1
59.6 57.6
60.9 53.6
67.5 62.6
7.8
7.1
sLDA all
LTS HTS
64.0 68.2
76.7 65.2
85.1 77.7
67.2 61.4
69.2 65.8
81.3 79.7
67.2 59.8
80.7 72.1
62.6 58.6
64.4 56.7
71.8 66.5
8.3
7.9
SWLDA 3
LTS HTS
55.2 57.8
62.6 62.8
69.5 70.4
56.9 53.5
63.5 61.0
65.5 65.5
53.3 55.6
58.9 60.2
52.8 52.1
51.4 50.6
59.0 59.0
6.1
6.3
Attention
SWLDA all
LTS HTS
55.8 56.0
61.9 62.9
70.9 69.2
59.4 57.4
65.4 65.5
71.6 76.7
54.1 54.6
61.0 62.2
53.4 53.0
51.4 53.4
60.8 61.1
6.7
7.7
sLDA
LTS
58.6
65.3
75.9
61.1
67.2
76.5
56.7
65.9
54.7
57.0
63.9
7.7
all
HTS
59.3
69.4
74.8
61.2
71.2
78.8
58.1
69.3
55.9
60.3
65.8
7.9
Table 1: SWLA and sLDA classification accuracies (in %) for the HS for P300 and attentional
modulation detection using either 3 or all channels. All results significantly better than random [2]
( = 1%) are indicated in italic. Highest subject specific accuracies are indicated in bold.
In healthy subjects the P300 condition revealed a statistically significant dierence in accuracy depending on which classification method was used, 2 (5) = 39.94, p < 0.001. Post hoc
analysis with Wilcoxon signed-rank tests was conducted with a Bonferroni correction applied,
resulting in a significance level set at p 0.012. There were significant dierences between
the SWLDA3 and sLDAall (LTS: Z = 2.805, p = 0.005; HTS: Z = 2.803, p = 0.005)
and SWLDAall and sLDAall (LTS: Z = 2.803, p = 0.005; HTS: Z = 2.803, p = 0.005).
The attentional modulation condition showed also a significant dierence in classification
accuracies depending on the method used, (2 (5) = 38.11, p < 0.001) and the follow up
Wilcoxon signed-rank test showed significant dierences between SWLDAall and sLDAall (LTS:
Z = 2.805, p = 0.005; HTS: Z = 2.803, p = 0.005) and between SWLDA3 and SWLDAall
(LTS: Z = 2.49, p = 0.012).
P300
SWLDA
SWLDA
all
sLDA
all
LTS
HTS
LTS
HTS
LTS
HTS
Mean
51.9
51.5
51.7
52.4
52.3
53.8
SD
4.0
3.9
4.1
4.2
4.6
5.6
Attention
SWLDA
SWLDA
all
sLDA
all
LTS
HTS
LTS
HTS
LTS
HTS
Mean
50.8
51.1
50.3
51.0
51.2
52.7
SD
5.6
4.4
5.6
4.4
5.1
5.3
Table 2: Mean ( SD) SWLDA and sLDA classification accuracies (in %) for the patient group for
P300 and attentional modulation detection using
either 3 or all channels.
Article ID 062-3
DOI:10.3217/978-3-85125-378-8-62
For the patients (see Table 2 for mean SD accuracies) the Friedman test reported significant
dierences between the classification methods only for the P300 condition, 2 (5) = 13.28, p =
0.021. The follow up Wilcoxon signed-rank test revealed no significant results. Nevertheless,
in one of the patients (PA09; CRS-r score: 18; Cause: Hemorrhagic stroke) a P300 could
be classified above chance level. In more detail, by using sLDA, the accuracy for the HTS
was improved up to 71.3% (SWLDA3 : 63.0%; SWLDAall : 65.3%), which is significantly better
than random [2] ( = 1%). However, for the LTS all classification accuracies (SWLDA3 : 64.8%;
SWLDAall : 65.0%; sLDAall : 65.2%) remained below chance level.
For the HS group the classification accuracies could be significantly improved by using sLDA
and the inclusion of all recorded channels. However, unlike healthy subjects, for the patient
group the results were less encouraging. Although in some PA the single-trial classification
accuracies could be improved, they remained mainly below chance level. However, an accuracy
significantly better than chance level [2] could be reached in one subject (71.3%).
Concluding, as stated by Blankertz et al. [1], the use of a higher number of channels is
potentially advantageous for ERP classification and improves the classification accuracy in HS
using either SWLDA or sLDA. For the PA recordings further studies should aim on investigating
the use of additional electrode positions (especially frontal/fronto-lateral [5]). Furthermore, as
concluded in [4], improvements on the used paradigm (e.g. include spatial information [5] if
possible) are still required to take into account the specific needs and capabilities of the patients.
Acknowledgments
Supported by European ICT Programme Project FP7-247919 and Land Steiermark Project
(ABT08-22-T-7/2013-17). The text reflects solely the views of its authors. The European
Commission is not liable for any use that may be made of the information contained therein.
References
[1] B. Blankertz, S. Lemm, M. Treder, S. Haufe, and K.-R. M
uller. Single-trial analysis and classification of erp components - a tutorial. Neuroimage, 56:814825, 2011.
[2] G. R. M
uller-Putz, R. Scherer, C. Brunner, R. Leeb, and G. Pfurtscheller. Better than random? A
closer look on BCI results. Int. J.Bioelectromag., 10:5255, 2008.
[3] A. R. Murguialday, J. Hill, M. Bensch, S. Martens, S. Halder, F. Nijboer, B. Schoelkopf, N. Birbaumer, and A. Gharabaghi. Transition from the locked in to the completely locked-in state: a
physiological analysis. Clin Neurophysiol, 122(5):925933, 2011.
[4] C. Pokorny, D. S. Klobassa, G. Pichler, H. Erlbeck, R. G. L. Real, A. K
ubler, D. Lesenfants,
D. Habbal, Q. Noirhomme, M. Risetti, D. Mattia, and G. R. M
uller-Putz. The auditory P300-based
single-switch BCI: Paradigm transition from healthy subjects to minimally conscious patients. Artif
Intell Med, 59(2):8190, 2013.
[5] M. Schreuder, B. Blankertz, and M. Tangermann. A new auditory multi-class brain-computer
interface paradigm: Spatial hearing as an informative cue. PLoS ONE, 5(4):e9813, 2010.
[6] C. Zickler, A. Riccio, F. Leotta, S. Hillian-Tress, S. Halder, E. Holz, P. Staiger-S
alzer, E. J. Hoogerwerf, L. Desideri, D. Mattia, and A. K
ubler. A brain-computer interface as input channel for a
standard assistive technology software. Clin EEG Neurosci., 42(4):236244, 2011.
Article ID 062-4
DOI:10.3217/978-3-85125-378-8-63
Introduction
Article ID 063-1
DOI:10.3217/978-3-85125-378-8-63
online data is sufficient to pre-train them (unsupervised trained classifiers, see [4]). These basic
transfer learning methods lead to a reasonably accurate classifier for novel data. However,
transferred models typically can not compete with user-specific models. To tackle this problem,
a combination of transfer learning and online unsupervised adaptation has been proposed.
While this combination has already been investigated oine [4] based on high-SNR visual ERP
data, a test with more challenging data and its application in online experiments is still lacking.
Hence, this work contributes with oine results on challenging auditory ERP data and, more
importantly, by presenting preliminary online results with the spatial auditory AMUSE [7]
paradigm.
2
2.1
Methods
Auditory ERP Paradigm AMUSE
AMUSE utilizes six auditory stimuli originating from a ring of six speakers surrounding the
user. The stimuli can be identified by their direction or pitch, have a duration of 40 ms and
were presented in pseudo-randomized order with a stimulus onset asynchrony (SOA) of 175 ms
and in 15 iterations per trial. To spell one of 36 possible symbols, the user selects a group of six
symbols in a first step by focusing attention to one of the six stimuli. Subsequently the user can
select one of the six symbols from the previously selected group. Contrary to [7], a correction
of wrong letters was not foreseen. Nevertheless, for all other details we closely replicated the
setup detailed by Schreuder and colleagues.
2.2
We evaluated the unsupervised transfer learning model from [4]. This probabilistic model is
applicable for multi-stimulus ERP paradigms. It assumes that only a single one of the six stimuli
can result in a target ERP response, while the other stimuli must elicit a non-target response.
On top of that, it assumes that EEG features can be projected into a single dimension, where
the projected data is Gaussian with class-dependent mean and shared variance.
These assumptions allow for an unsupervised training of the model. Hence a calibration
recording and labeled data are no longer required. The training is based on the Expectation
Maximization algorithm and allows the model to learn how to decode ERP features while the
user is interacting with the BCI. A known limitation is the initially unreliable classification,
which is caused by a random initialization. Therefore, even though this model is an improvement
over supervised systems with a calibration recording, this so-called warm-up period may limit
the usability of the unsupervised system in the presence of tight time constraints.
To alleviate the warm-up phase, we have extended the model with inter-subject transfer
learning [4]. The concept of transfer learning is introduced in the model by placing a shared
hyper-prior on the user-specific mean of the prior on the weight vector. This corresponds to
regularizing the subject-specific model towards the general shared model in place of regularizing
towards a zero vector, which is a common approach to limit the model complexity. Due to space
limitations, we refer the reader to [4] for the update equations and the training procedure of
the transfer learning model.
Article ID 063-2
DOI:10.3217/978-3-85125-378-8-63
80
80
60
LDA
UA
TA
U
TA
40
20
0
9 10 11 12 13 14 15 16 17 18
sub-block
100
100
UA
TA
TAU
UA
TA
TAU
60
40
20
0
nbc
nbd
subjects
Figure 1: Left: Simulated grand average performance over 18 sub-blocks of ten trials each.
Right: Online selection accuracy for two users and three blocks of 30 trials each.
2.3
An oine simulation was conducted on existing data from [3] to assess the feasibility of unsupervised adaptive transfer learning for auditory ERP data with challenging SNR. In the
original study, unsupervised learning was compared to a standard supervised approach (LDA
with shrinkage-regularization on the estimated covariance matrix). Each user had performed
a calibration recording comprising 30 trials and six online evaluation blocks of 30 trials each.
Supervised decoding was used for three blocks, and unsupervised for the remaining three blocks,
resulting in 180 trials. For the simulation in the present work, this data was re-used trial-bytrial to train up a subject-specific unsupervised adaptive model per user (U A). It is compared
with the fully updated unsupervised model (having seen all 180 trials) after processing the
entire dataset (U AU ). Furthermore, data is combined in a leave-one-user-out transfer learning
model. The simulated trial-by-trial application and updating of this transfer model to data of
the left-out user is denoted T A. The final, fully updated version of this model is T AU .
The novel online transfer experiment made use of T A. Two users of the original study
were re-invited for a second session. They each performed three blocks of 30 trials of online
copy-spelling spelling using the transfer learning model T A. Please note that it was reset at
the beginning of each block in order to study the warm-up behavior. In addition, the fully
updated model T AU was available at the end of each block. In addition, the performance of
a non-transferred, randomly initialized unsupervised classifier U A was evaluated by a posthoc simulation. We would like to stress that the transferred models used in this work all are
completely unsupervised. Label information was not used at any point in the models training.
Hence, it is sufficient to use data from free spelling sessions to build the transfer model.
Results from the simulated experiment on nine users are given in the left plot of Figure 1. Oine
performance is estimated on sub-blocks of ten trials (corresponding to five symbols each). As
a baseline, the standard supervised LDA model (trained on data from a calibration session of
30 trials in the original study) is compared to the unsupervised models U A, T A and the fully
adapted model at the end of the session T AU .
Article ID 063-3
DOI:10.3217/978-3-85125-378-8-63
During the first two sub-blocks the subject-specific supervised LDA model performs best.
Furthermore it is relatively stable over most sub-blocks, but it displays a performance drop
at the end of the experiment (sub-blocks 17 and 18), while this is not observed for any of
the unsupervised adapted models. The randomly initiated unsupervised model U A exhibits
typical warm-up behavior during the first two experimental blocks (25 % and 47 % accuracy,
with 16.
6 % chance level). As it observes more and more data, it becomes more reliable. The
transfer learning model T A was able to significantly reduce the warm-up period and achieves
60 % selection accuracy in the first block and 70 % in the second block. While it does not yet
performs at the same level as the supervised LDA model, we must point out that the LDA
model had already observed ten trials more data at this point (30 trials in total). Moreover,
the re-evaluation of the transfer learning model T AU can correct many of the initial mistakes
committed by T A and is as reliable as the LDA model even in the first blocks. In previous
work, we have shown that the revised predictions of the U AU model are quite similar to those
of T AU . The online results given in the right plot of Figure 1 provide details on the warm-up
performance for three repeated blocks of 30 trials (corresponding to 15 symbols each). They
not only show the technical feasibility of the transfer approach, but are a first indication that
the simulation results reported above can be replicated online. Obviously, the online results so
far are limited by the fact that they comprise two users only.
However, the oine simulation in combination with these first online results of unsupervised
transfer learning indicate that it is not only useful to share decoding knowledge between users,
but that an additional subject-specific further online adaptation quickly leads to a very reliable
model for the novel user. This study will be extended in future work and naturally, if successful,
taken to patients.
Acknowledgment
This work was partly supported by: BMBF Germany (FKZ 01GQ0850, 16SV5839); Ghent Univ. Special
Res. Fund (BOF-GOA project Home-MATE); German Research Found. (grant number EXC 1086); Nat. Res.
Found. of Korea. We thank Johannes H
ohne and Martijn Schreuder for support with the experimental setup.
References
[1] J. H
ohne, M. Schreuder, B. Blankertz, and M. Tangermann. A novel 9-class auditory ERP paradigm
driving a predictive text entry system. Front. Neuroscience, 5:99, 2011.
[2] J. Jin, E.W. Sellers, Y. Zhang, I. Daly, X. Wang, and A. Cichocki. Whether generic model works
for rapid ERP-based BCI calibration. J. Neurosc. Meth., 2012.
[3] P.-J. Kindermans, M. Schreuder, B. Schrauwen, K.-R. M
uller, and M. Tangermann. True zerotraining brain-computer interfacing an online study. PLoS One, (in press).
[4] P.-J. Kindermans, M. Tangermann, K.-R. M
uller, and B. Schrauwen. Integrating dynamic stopping,
transfer learning and language models in an adaptive zero-training ERP speller. J. Neural Eng.,
2014.
[5] K. Li, V. N. Raju, R. Sankar, Y. Arbel, and E. Donchin. Advances and challenges in signal analysis
for single trial P300-BCI. In Proc. 6th Int. Conf. Foundations of Augm. Cogn., FAC11, pages
8794, Berlin, Heidelberg, 2011. Springer-Verlag.
[6] Y. Li, C. Guan, H. Li, and Z. Chin. A self-training semi-supervised SVM algorithm and its application in an EEG-based brain computer interface speller system. Pattern Recogn. Lett., 29(9):1285
1294, 2008.
[7] M. Schreuder, T. Rost, and M. Tangermann. Listen, you are writing! speeding up online spelling
with a dynamic auditory BCI. Front. Neuroscience, 5, 2011.
Article ID 063-4
DOI:10.3217/978-3-85125-378-8-64
1
Chair in Non-Invasive Brain-Machine Interface (CNBI), EPFL, Switzerland
inaki.iturrate@epfl.ch,ricardo.chavarriaga@epfl.ch,jose.millan@epfl.ch
Instituto de Investigaci
on en Ingeniera de Arag
on (I3A), DIIS,Universidad de Zaragoza, Spain
montesano@unizar.es,jminguez@unizar.es
3
Bit&Brain Technologies SL, 50018 Zaragoza, Spain
Abstract
Calibration of brain-machine interfaces exploiting event-related potentials has to be
performed for each experimental paradigm. Even if these signals have been used in previous experiments with dierent protocols. We show that use of signals from previous
experiments can reduce the calibration time for single-trial classification of error-related
potentials. Compensating latency variations across tasks yield up to a 50% reduction the
training period in new experiments without decrease in online performance compared to
the standard training.
Introduction
2
2.1
Methods
Experimental protocols
Article ID 064-1
8
Error
Correct
Difference
Amplitude (V)
4
2
0
2
4
2
0
2
6
200
6
200
200
400
600
Time (ms)
800
1000
Error
Correct
Difference
Error
Correct
Difference
6
Amplitude (V)
6
Amplitude (V)
DOI:10.3217/978-3-85125-378-8-64
4
2
0
2
4
200
400
600
Time (ms)
800
1000
6
200
200
400
600
Time (ms)
800
1000
Figure 1: (Top) Experimental protocols. (Bottom) ErrPs at the FCz electrode obtained in
each experimental protocol. (Left) One-dimensional cursor movement (E1). (Center) Twodimensional movements of a simulated robotic arm (E2). (Right) Real robotic arm (E3).
resting periods. In all experiments they were asked to evaluate whether a device moves towards
a given target location. The device moved in discrete steps and the time between movements
was randomly chosen within the range [1.7 4.0] s. There was a probability of moving in the
wrong direction of about 30%. Experiments were always performed in the same order from the
simplest to the most complex one. The first experiment, E1 , consists of a cursor that moves
in discrete steps (either left or right) towards a target [1]. In the second protocol, E2 , the user
monitors a simulated robotic arm that moves on a 2D plane (allowed movement directions were
left, right, up and down). The third experiment, E3 , consists of the same task using a real
robotic arm. A detailed explanation of the protocols and methods is provided in [2].
Each experiment started by a calibration phase. This phase had a variable length depending
on the obtained performance. Calibration stopped whenever the mean accuracy (ten-fold crossvalidation) on the training data exceed 75%. Then the classifier parameters were fixed, and
performance was tested on an online phase lasting 400 trials.
EEG was recorded at 256 Hz with a gUSBAmp amplifier (gTec GmbH, Austria) with 16
active electrodes (Fz, FC3, FC1, FCz, FC2, FC4, C3, C1, Cz, C2, C4, CP3, CP1, CPz, CP2,
and CP4 according to the 10-10 system). Ground and reference were placed on the forehead
and the left earlobe. Data was notch filtered at 50 Hz, and zero-phase band-pass filtered at
[1, 10] Hz. Prior to classification, we applied common-average reference and downsampled the
signal to 64 Hz. Features from eight fronto-central channels were selected in the window [200
800] ms using a spatiotemporal filter [3]. On average 45 10 features were selected based on
their r2 score. Single-trials were classified as erroneous or correct using linear discriminant
analysis (LDA).
2.2
As mentioned above, dierences between ErrPs elicited in dierent experiments can be largely
explained by latency variations. These variations can be easily estimated by computing the
Article ID 064-2
DOI:10.3217/978-3-85125-378-8-64
Figure 2: (Left) Number of calibration trials (mean SEM) required to reach the calibration
criterion (*: p < 0.05). (Right) Mean classifier accuracy during the online control phase.
cross-correlation between the grand-average ERPs for each experiment. In brief, given a previous experiment Ei , from which data is available, and a new experiment Ej , the ERP latency
variation dEi Ej will correspond to the shift that yields the maximum cross-correlation. Then,
ERP data from Ei can be shifted in time by dEi Ej and used along the available (few) trials
from the new experiment Ej to train a classifier.
Complementing previous reports [2], we compare whether this latency correction mechanism
eectively reduces the calibration time. To that end, we defined two groups of participants
depending on the training procedure. The control group (N=6, one female, mean age 27.332.73
years) followed a standard calibration approach, i.e., based only on data from the current
experiment. The experimental group (N=6; two females, mean age 27.17 4.07 years) used
latency-corrected trials from the previous experiment to build the classifier for the current task.
That is, standard calibration was followed for E1 , while data from that experiment was used
during the calibration period of E2 . Similarly, during calibration for E3 the data from E2 was
used. The latency between experiments was estimated based on the cross-correlation of the
dierence ERP (error minus correct condition) of channel FCz within the window [0, 500] ms.
Mixed two-way ANOVAs (within factor: experiments; between factor: group of subjects)
were performed to test whether (i) the number of calibration trials in the experimental group
decreased across experiments; (ii) the number of calibration trials was significantly dierent
between groups; and (iii) the online accuracies of both groups were not dierent. Post-hoc onetailed Bonferroni-corrected t-tests were performed to assess statistically significant dierences.
Results
The obtained ErrPs can be seen in Figure 1. Latency variations were about 60 25ms between
E1 and E2 , and about 4113 ms between E2 and E3 Figure 2 shows the number of calibration
trials needed in each experiment to reach the stopping criteria. The calibration period for the
control group was similar for all experiments. In contrast, the experimental group exhibit a
large reduction on the required calibration trials in E2 and E3 when previous information was
re-used. The ANOVA test revealed a significant interaction between the experiment and group
(F2,20 = 8.65, p = 0.002). Post-hoc tests showed that significant dierences were found between
groups in experiments 2 and 3 (one-tailed unpaired t-tests, p < 0.05), and also significant
dierences within the experimental group between experiments 1 and 2 (one-tailed paired t-
Article ID 064-3
DOI:10.3217/978-3-85125-378-8-64
Conclusion
Our results confirm that compensating for latency variations across protocols allows the use of
previous data to shorten the calibration phase in new applications. In the reported experiments
the use of data from the first experiment enabled users to reach the training criteria for the
second one in about half of the trials required with the standard approach. Importantly, no significant dierence was observed between the two training paradigms in the online performance.
Similar reductions were also observed for the third experiment with the real robot.
The latency correction mechanism used in this work relied on a simple measure based on the
cross-correlation in a single channel. However, ERP variations across experiments may follow
more complex patterns both spatially (i.e. across channels) and temporally (i.e. individual ERP
components). Multiple factors aect the ERP waveforms including the feedback modality, the
inter-stimulus interval, as well as subject dependent variability. It remains to be validated how
suitable this correction mechanism is to other experimental paradigms and signals. Moreover,
further research is required to explore more sophisticated techniques to better model these ERP
variations (e.g. dynamic time warping).
Acknowledgments
This work was supported by the EU Project FP7-224631; the Swiss funded NCCR Robotics and
the Spanish Ministry of Science projects HYPER- CSD2009-00067 and DPI2009-14732-C02-01,
DPI2011-25892, CAI Programa Europa, DGA-FSE, and grupo T04.
References
[1] R. Chavarriaga and J. del R. Mill
an. Learning from EEG error-related potentials in noninvasive
brain-computer interfaces. IEEE Trans Neural Syst Rehabil Eng, 18(4):381388, Aug 2010.
[2] I. Iturrate, R. Chavarriaga, L. Montesano, J. Minguez, and J.d.R. Mill
an. Latency correction of
error potentials between dierent experiments reduces calibration time for single-trial classification.
J Neural Engineering, 11:036005, 2014.
[3] I. Iturrate, L. Montesano, R. Chavarriaga, J. del R. Mill
an, and J. Minguez. Spatio-temporal
filtering for EEG error related potentials. In 5th Int Brain-Computer Interface Conference, 2011,
pages 1215, Graz, Austria, 2011.
[4] S.K. Kim and E.A. Kirchner. Classifier transferability in the detection of error related potentials
from observation to interaction. In IEEE Int Conf Systems, Man, and Cybernetics (SMC), 2013,
pages 33603365, Oct 2013.
[5] F. Lotte and C. Guan. Learning from other subjects helps reducing brain-computer interface
calibration time. In Int. Conf. Audio Speech and Signal Proc (ICASSP), pages 614617, 2010.
[6] W. Samek, F.C. Meinecke, and K.-R. M
uller. Transferring subspaces between subjects in brain
computer interfacing. IEEE Trans Biomed Eng, 60(8):22892298, 2013.
[7] C. Vidaurre, M. Kawanabe, P. von B
unau, B. Blankertz, and K.-R. M
uller. Toward unsupervised
adaptation of LDA for brain-computer interfaces. IEEE Trans Biomed Eng, 58:587597, 2011.
Article ID 064-4
DOI:10.3217/978-3-85125-378-8-65
Recent developments in brain-machine interfaces (BMIs) have proposed the use of errorrelated potentials as cognitive signal that can provide feedback to control devices or to teach
them how to solve a task. Due to the nature of these signals, all the proposed error-based
BMIs use discrete tasks to classify a signal as correct or incorrect under the assumption that
the response is time-locked to a known event. However, during the continuous operation of
a robotic device, the occurrence of an error is not known a priori and thus it is required to
be constantly classifying. Here, we present an experimental protocol that allows to train
a decoder and detect errors in single trial using a sliding window.
Introduction
EEG-measured error-related potentials (ErrPs) are one class of event-related potential (ERP)
elicited in the user brain when the outcome of an event diers from the user expected one.
These potentials have been observed, in particular, when a user observes a machine committing
incorrect actions or operations. Recent works have proposed the incorporation of these signals
into brain-machine interfaces (BMI) to correct the classifier or as rewards to control or to teach
devices [5]. As with all event-related potentials, the neural response associated to ErrPs is
triggered in response to an exogenous event. Consequently, most of the developed works try to
distinguish whether an action is correct or erroneous based on the knowledge of the time of its
occurrence[1, 4]. On the other hand, real applications (such as executing a trajectory with a
robotic arm or a mobile robot) imply the use of continuous actions where the classifier has to
asynchronously dierentiate between erroneous events and the background EEG.
This works presents a method to detect error potentials during the continuous operation of
a device. Showing that using events introduced as abrupt changes of direction, it is possible
to train a classifier to later on asynchronously classify among a complete trajectory achieving
detection rates comparable to those obtained in discrete tasks.
2
2.1
Methods
Experimental Protocol
Two healthy subjects (mean age 28 years) participated in the study. The experimental setup
consisted of a virtual cursor that had to reach a target position by moving at a fixed speed
towards it. The initial cursor and target positions of each trial were randomly generated. One
trial consisted of a trajectory performed by the device and lasted between 3 and 5 seconds.
Trajectories were correct in 70% of the trials, which consisted in a straight line between the
start and goal locations, Figure 1a. The remaining 30% of the trials were erroneous trajectories
that started as the correct ones but executed an abrupt change of direction in a random instant
between the 20% and 80% of the path, Figure 1b. Six rounds composed of 40 trials each were
recorded, obtaining around 70 erroneous and 170 correct trials per participant.
Article ID 065-1
DOI:10.3217/978-3-85125-378-8-65
Figure 1: (a-b) Designed experimental setup. Starting (S) and goal (G) positions of the
device are marked in blue and red respectively. Correct and wrong directions of movements
are shadowed in green and red respectively. (c) Neurophysiology Analysis: grand averages
in temporal (bottom,left) and frequency (bottom,right) domain, topographical representation
(top,left) and source localization(top,right).
EEG and EOG activity were recorded at 256 Hz using a gTec system with 32 electrodes
according to the 10/20 international system (including 8 fronto-central channels), with the
ground on FPz and the reference on the left earlobe; for the EOG, 6 monopolar electrodes
were recorded [3], with the ground on FPz and the reference on the left mastoid. Recorded
data were power-line notch filtered, and band-pass filtered at [1, 10] Hz. The EEG was also
common-average-reference (CAR) filtered. Additionally, EOG was removed from the EEG
using a regression algorithm, and those trajectories where EOG magnitude higher than 40 V
was detected, were rejected.
2.2
Temporal domain features have been widely used to detect ErrPs under discrete tasks. However,
for continuous detection, temporal features result in a large number of false positives as EEG
oscillations resemble ErrP patterns. On the other hand, it has been shown that frequency
features can be more robust under specific changes in the ErrP signals [7]. In this work, we
propose the combination of temporal and frequency features extracted from the most relevant
common spatial patterns associated (CSPs) for the continuous detection of ErrPs. To train
a classifier, the onset of the erroneous events was selected at the instant in which the device
performed the abrupt change of direction. On the other hand, onsets of correct trials were
selected at random instants of time within the execution of a correct trajectory. Training data
from error and correct conditions was used to extract the CSPs, and the two first CSPs were
retained. For each retained CSP, the temporal features were the EEG voltages within a time
window of [0, 1000] ms downsampled to 64 Hz, forming a vector of 128 features. The power
spectral density (PSD) was computed taking a window interval from 0 to 1000 ms as it gave
the best trade-o between frequency resolution and capturing the signal of interest. Frequency
features were selected as the power values of each channel from the theta band ([4, 8] Hz) 1
Hz [2] leading to a vector of 14 features. Finally, both set of features were concatenated and
normalized within the range [0, 1].
Article ID 065-2
DOI:10.3217/978-3-85125-378-8-65
40 TRIALS
Events
Subject 1
Subject 2
Figure 2: Representative example of the sliding window results. The error events are plotted
as red spikes indicating a change in direction, while the probability of detecting an error (pe )
for each of the 2 subjects is plotted in blue. Black dots over the probability values indicate the
time instant when the classifier detected an error (pe > 0.8).
After balancing the error and correct datasets, a RBF-SVM was trained using the previous
features [6]. During classification, we retained the classifier output for each window evaluation
as the probability estimate that the current EEG was an error, pe . For the asynchronous
classification we performed a 6-fold cross-validation where each fold was composed by a complete
recorded round. We continuously classified every 62.50ms over the testing set using a sliding
window of one-second width. To ensure a low false positive rate, the detection of error events
was only considered when pe > 0.8, value selected through cross-validation. For the sliding
window, all inter-trial data was removed. Classification performance was computed as follows:
the erroneous trials where the classifier detected an error, in a one second window after the
direction change, were considered as true positives. When an error was not detected the trial
was a false negative. True negatives were those correct trials where no error was detected. And,
when an error was detected on correct trials, they were considered false positives.
Results
Article ID 065-3
DOI:10.3217/978-3-85125-378-8-65
than those obtained with standard ErrP protocols that only classify in the window of the
event [1]. This decay is reasonable considering that classifying with a sliding window conveys
a higher chance of detecting a false positive during non event intervals. Trajectories executed
by the device according to their classification as false positives or false negatives are depicted in
Figure 3. Furthermore, it can be seen that correct trials are mostly well detected independently
of the direction and distance covered by the device, and only few of them are detected as
erroneous. However, the number of erroneous trials not detected was higher, since it was
preferable to miss the detection of an error than detect errors where was not intended. Also,
notice that most of the true negatives ended up very close to the goal. Indeed, 40% of them
ended less than 50 pixels from it, which lead us to think that the subjects may have not
interpreted them as erroneous.
This paper studies the on-line asynchronous detection of error potentials during continuous
trajectories. The results obtained for the proposed experimental protocol show that the error
potentials appear when the user monitors a continuous target reaching task and that they
can be detected in single trial using a sliding window, obtaining results comparable to those
achieved in discrete tasks. These promising results are a first step towards the use of this type of
cognitive information to control or teach robotic devices in realistic and complex tasks. There
exist several opportunities for future work. Currently, we are extending the study to more users
and more types of trajectories containing errors. Also we are testing the usage of this kind of
events on real devices obtaining promising results.
Acknowledgments
This work has been supported by Spanish projects DPI2011-2589, and DGA-FSE (grupo T04).
References
[1] R. Chavarriaga and J.d.R. Mill
an. Learning from EEG error-related potentials in noninvasive
brain-computer interfaces. IEEE Trans Neural Syst Rehabil Eng, 18(4):381388, 2010.
[2] Michael X. Cohen. Error-related medial frontal theta activity predicts cingulate-related structural
connectivity. NeuroImage, 55(3):13731383, 2011.
[3] RJ Croft and RJ Barry. EOG correction of blinks with saccade coecients: a test and revision of
the aligned-artefact average solution. Clinical neurophysiology, 111(3):44451, March 2000.
[4] I. Iturrate, R. Chavarriaga, L. Montesano, J. Minguez, and J. del R Millan. Latency correction of
error potentials between dierent experiments reduces calibration time for single-trial classification.
In Int Conf of the IEEE Engineering in Medicine and Biology Society (EMBC), 2012.
[5] I. Iturrate, L. Montesano, and J. Minguez. Shared-control brain-computer interface for a two
dimensional reaching task using eeg error-related potentials. In Int. Conf. of the IEEE Engineering
in Medicine and Biology Society (EMBC). IEEE, 2013.
[6] F. Lotte, M. Congedo, A. Lecuyer, F. Lamarche, and B. Arnaldi. A review of classification algorithms for EEG-based brain-computer interfaces. J Neural Eng, 4(2):R1R13, June 2007.
[7] J. Omedes, I. Iturrate, L. Montesano, and J. Minguez. Using frequency-domain features for the
generalization of eeg error-related potentials among dierent tasks. In Int. Conf. of the IEEE
Engineering in Medicine and Biology Society (EMBC). IEEE, 2013.
Article ID 065-4
DOI:10.3217/978-3-85125-378-8-66
Femke Nijboer 1
This preliminary study describes the assessment of BCI technology by people with
neuromuscular disorders (N=7) and high spinal cord injuries (N=9) from a technical,
ethical, legal and societal perspective. It becomes clear that people with disabilities
could and should greatly contribute to the design of assistive technology.
1 Introduction
The field of brain-computer interfacing (BCI) is rapidly expanding its application areas. Scientific
and technological endeavors typically focus on feasibility, validity and reliability of such emerging
technologies. However, the transfer of BCI technology to the clinic may be slow or difficult if
practical issues as well as ethical, legal and social issues are not properly and timely addressed.
Previously, Nijboer and colleagues interviewed rehabilitation specialists (N=28) what type of
users could be potential target users for BCIs as access technologies and what design requirements
BCIs should fulfil to be usable and pleasant technologies for such users [1]. Recommended target
users are only those who can hardly or not at all use alternative access technologies. People in the
locked-in state (resulting from late-stage amyotrophic lateral sclerosis, multiple sclerosis, spinal
muscular atrophy type II or classical or total locked-in syndrome) and people with high spinal cord
injury (C1/C2) could be target users. Specialists caution engineers and developers that these users
may have many concurring problems such as sensory or cognitive impairments and epileptic seizures.
In addition, transferring BCIs from the lab to the daily life of such target users will need a grounded
consideration for ethical, legal, social and cultural issues.
In this study we investigate the experience and opinions of people with neuromuscular and
muscular diseases (N=7) and people with high spinal cord injuries (N=9) on such issues after they 1)
have been educated about BCIs, 2) have had the opportunity to experience a BCI and 3) had the
opportunity to discuss the technology with each other. These user groups are interesting to compare.
One group, with disorders which have progressed sometimes from childhood, has a lifelong
experience with assistive technologies. The people with SCI have had years of experience with access
technologies for able-bodied people (keyboard and mouse) and now find themselves having to use
access technologies for disabled people. In addition, people with SCI do not have to anticipate further
functional decline, whereas people with progressive disorders do.
Article ID 066-1
DOI:10.3217/978-3-85125-378-8-66
2 Methods
Seven participants with progressive neuromuscular or muscular diseases (NM group) and nine
participants with high spinal cord injuries (SCI group) were recruited through the Dutch
Neuromuscular Diseases Association and the Association Spinal Cord Injury (see Table 1).
Participants were invited to attend workshops (one workshop for each group of users) entitled The
possibilities and impossibilities of Brain-Computer Interfaces. The two workshops were prepared
together with the directors of the associations and one of the participants to ensure that the program
was relevant and satisfactory for participants. Workshops were free of charge and held in accessible
buildings in a central location of the Netherlands to ensure equal access. Participants were reimbursed
for travel costs and provided informed consent before the workshop for their opinions to be recorded
on audio and video and pictures to be made.
Partici
pant
Diagnosis
Age
SMA type II
ALS
52
3
4
5
6
7
SCI
group
1
2
SMA type II
SMA type II
Distal SMA
SMA type II
SMA type II
3
4
5
6
7
8
9
NM
group
1
2
Verbal
Communication
Wheel
chair
Artificial
ventilation
Access technology
in use
Yes
No
Yes
No
38
33
60
23
21
Slurred
Speech
synthesizer
Normal
Slurred
Normal
Normal
Normal
Yes
Yes
Yes
Yes
Yes
Yes
Yes
No
No
Nighttime
Finger switch
Eye tracking
Keyboard/mouse
Keyboard/mouse
Keyboard/mouse
T5
C4/C5
61
45
Normal
Normal
Yes
Yes
No
No
C3/C4
C3/C4
C4/C5
C3/C5
C5/C6
C5/C6
C6/C7
39
51
61
38
30
52
60
Normal
Normal
Normal
Hushed
Normal
Normal
Normal
Yes
Yes
Yes
Yes
Yes
Yes
Yes
No
Nighttime
No
No
No
No
No
Keyboard/mouse
Keyboard/mouse,
mouth switch
Head switch
Chin joystick
Keyboard/mouse
Keyboard/mouse
Adapted keyboard
Keyboard/mouse
Keyboard/mouse
Table 1: Overview of participants. NM = neuromuscular; SCI = Spinal Cord Injury; SMA = Spinal Muscular
Atrophy; ALS = Amyotrophic Lateral Sclerosis. Hushed = cannot produce normal voice loudness.
The workshop followed a 3-step format which we dubbed the give-and-take approach. First,
participants were educated about the technical components of BCIs, available neuroimaging
techniques and types of applications. They were also presented with some of the major challenges the
field of BCI faces (e.g. universal design, reliability issues, limitations to information transfer rate,
sensors). Second, two participants in each workshop were offered to try out a BCI as an access
technology to operate a commercially available computer access software (The Grid 2 from Sensory
Software, see for a description [1] and for a video of a participant trying the demo:
http://youtu.be/gf3C_lAHT8U). The rest of the group watched and asked questions. Third and finally,
Article ID 066-2
DOI:10.3217/978-3-85125-378-8-66
we held a focus group interview with the participants to assess the technology they had seen in the
demonstration and the BCI field at large. Participants were prompted with predefined questions, but
also encouraged to bring up other topics they found (more) important. The interviews were
transcribed verbatim. Here we offer a preliminary descriptive overview of take home messages given
by participants.
3 Results
The focus group interviews provided recommendations from the participants on 4 different levels:
technical, ethical, social and legal (policy) issues. In addition, a philosophical outlook on human
identity was discussed.
In general, participants were positively surprised by the state of the art of the BCI field, the
effectiveness and the feeling of the BCI prototype. On a technical level, participants in both groups
agreed that they could not yet see the added functional value of current BCIs over existing
communication aids, although some participants in the NM group could imagine that BCIs could have
added value for them in about a year, because they anticipated a further decline in function such that
operation of their current aid would no longer be possible. Both groups indicated that the usability of
the system needs improvement. In both groups there was a strong interest for operation of a robot arm
rather than a communication aid, which seemed to be motivated, firstly, by the availability of reliable
communication aids in the NM group and the ability to speak in the SCI group and, secondly, by an
interest to expand or enhance current functionality rather than just repair functionality.
This was often accompanied by a wish to have electrodes implanted in the brain for practical
reasons: 1) sensors would be always in place (no need to bother caregivers), 2) the BCI would be less
bulky and less prone to get damaged by outside factors, and 3) invisible sensors are more esthetically
appealing.
Many participants seemed reluctant to discuss ethical issues and rather discussed practical issues,
but when prompted with questions such as what would you be afraid of? participants recognized
that ethical problems can be practical as well and indicated that agency was important. They want to
feel sure that the BCI does not go out of control or takes over control. Also, the surgery for
implanted sensors was perceived as risky, in particular for people in the NM group, since anesthesia is
often very complicated if not impossible for these people. Nevertheless, a few participants said they
would risk it if it gave them better functionality. Participants with neuromuscular disorders advise
to do the surgery in an early stage of the disease and, if possible, in combination with the surgery for
other life-sustaining measures, such as the tracheotomy.
Participants did not identify legal issues such as liability problems or see the need for special laws,
but they did discuss reimbursement policies. For example, mainstream technologies, such an iPad, are
not reimbursed even if these technologies would sometimes be the best solution for some people with
disabilities. Instead, they have to make a choice out of special technologies for disability. Participants
foresee difficulty in obtaining financial reimbursement for BCI technology. One participant said: It
is also a money thing of course. As long as I can use a puff-and-sip device, well use that. It is cheaper
than a BCI.
Societal issues related to BCI technology and assistive technology were discussed at length. Many
participants indicated that they did not want to wear a cap with sensors on their head unless it was
disguised as conventional headwear. The technology that surrounds people with disability, such as
communication aids and wheelchairs often scare people in society to such an extent that they literally
turn their back when a person with a disability comes in sight. Hence, any BCI technology that would
attract even more attention to persons with disabilities could potentially exclude them further rather
Article ID 066-3
DOI:10.3217/978-3-85125-378-8-66
than include them in society as intended. Participants stress the fact that esthetical BCI design is
therefore of utmost importance. On a more philosophical level we shortly discussed if BCIs could
change the boundaries between humans and technology, but participants mostly agree that such
boundaries have long changed for them since they are so dependent on technology anyway. They are
not afraid of the cyborg idea. Rather they would rather embrace it.
4 Discussion
Compatible with previous studies [2-4], the preliminary study presented here showed that
participants have stringent technical requirements concerning the usability of the overall BCI system
(efficiency, robustness, esthetics of the cap, multiple users). After reviewing the current state of the art
of BCI technology, participants did not yet see an added value of BCIs for themselves. However, they
strongly endorse further development and would like to be involved in the process.
User involvement in technology design should not necessarily be restricted to the definition of
technological requirements. This study shows that participants also have ethical requirements about
agency and (timing of) risks. Agency is essential for feeling safe, for self-determination and for
dignified living, especially for users with severe disabilities whose lives literally depend on
technology. This study also shows that users with disabilities are interested in implanted electrodes.
However, people with progressive neuromuscular disorders recommend that surgery should happen
at an early stage in disease onset when the risk of anesthesia is lower. This view contrast the view of
most scientists and ethical committees that surgery for a BCI should be a last option. A
recommendation could be to investigate scenarios in which surgeries happen at an early stage.
Compatible with [5], most participants felt no need for special BCI regulations. Legal and policy
recommendations focus on reimbursement issues (cost-effectiveness and robustness related to
maintenance costs). Finally, end-users know as no other - how society reacts to disability and
assistive technology. When the appearance of BCIs does not improve, BCI technology risks to
exclude people from society rather than to include them.
In conclusion, it becomes clear that people with disabilities could greatly contribute to the design
of assistive technology and provide expertise into the ethical, policy and societal issues that must be
addressed for successful technology transfer to the market. Thus, it is strongly recommended that
potential users are involved as experts in the earliest stages of research and development of BCIs.
References
1.
2.
3.
4.
5.
Nijboer, F., et al., Design requirements and potential target users for brain-computer
interfaces recommendations from rehabilitation professionals. Brain-Computer Interfaces,
2014: p. 1-12.
Zickler, C., et al., Brain Painting: Usability testing according to the user-centered design in
end users with severe motor paralysis. Artificial intelligence in medicine, 2013. 59(2): p. 99110.
Zickler, C., et al., A Brain-Computer Interface as Input Channel for a Standard Assistive
Technology Software. Clinical Eeg and Neuroscience, 2011. 42(4): p. 236-244.
Blain-Moraes, S., et al., Barriers to and mediators of brain-computer interface user
acceptance: focus group findings. Ergonomics, 2012. 55(5): p. 516-25.
Grbler, G., et al., Psychosocial and Ethical Aspects in Non-Invasive EEG-Based BCI
ResearchA Survey Among BCI Users and BCI Professionals. Neuroethics, 2013: p. 1-13.
Article ID 066-4
DOI:10.3217/978-3-85125-378-8-67
Ecole
Polytechnique Federale de Lausanne, CH-1015 Lausanne, Switzerland
jose.millan@epfl.ch
1
Abstract
Operating brain-actuated devices requires split attention between the interaction of the
device with its environment and the Brain-Computer Interface (BCI) feedback. In case
of screen-based applications it is possible to merge BCI feedback and application, but not
in case of controlling devices, like wheelchairs or exoskeletons. Recently we demonstrated
that BCI feedback could be provided via tactile stimulation and no performance dierence
between tactile and visual BCI feedback could be found. In this work we want to present a
platform for characterizing these multi-tasking capabilities during BCI operation. Thereby,
the subjects will have to perform a visual engaging and complex task in a game like setup
while interacting with the BCI in a coordinated multi-tasking manner.
Introduction
Controlling a brain-actuated device like a wheelchair requires the participant to look at and to
split his attention between the interaction of the device with its environment and the status
information of the Brain-Computer Interface (BCI). Such parallel visual tasks are partly contradictory, with the goal of achieving a good and natural device control. Recently we demonstrated
that it is possible to free the visual channel from one of these tasks, by providing feedback for a
motor imagery based BCI via a spatially continuous tactile stimulation [1]. Thereby, we could
demonstrate that the participants were able to perceive this type of tactile feedback well and
no statistical degradation in the online BCI performance could be identified between visual and
tactile feedback conditions. Nevertheless, an open issue is to demonstrate, that the freed visual
channel can now be fully devoted to the device while controlling the BCI.
In this work we want to present a platform for analyzing these multi-tasking capabilities
during BCI operation. Thereby, the subjects will have to perform a visual engaging and complex
task in a game like setup while interacting with the BCI in a coordinated multi-tasking manner
with the presented stimuli. This platform is giving us the possibility to evaluate dierent
parameters in a reproducible design, which would not be possible with a brain-actuated device.
Methods
The goal of this work is the evaluation of multi-tasking capabilities while exploiting visual and
tactile BCI feedback. Therefore, the participant will have to perform 2 tasks in parallel.
Article ID 067-1
DOI:10.3217/978-3-85125-378-8-67
Figure 1: Experimental setup including the multi-object tracking game and the BCI feedback.
Visual and tactile BCI feedbacks are used alternately.
The first task is the use of a 2-class BCI, which allows the delivery of a right or left BCI
command via the imagination of hand movements (MI). E.g. the imagination of a left hand
movement moves the BCI feedback to the left and the imagination of a right hand movement
to the right. The instructions when to move the BCI feedback and in which direction (cue) is
embedded in the parallel task described below.
The second task is to track several ball like objects on the screen (multi-object tracking
game [3], see Figure 2). At the beginning of each trial one, two or three targets out of 10 possible
balls are highlighted and the participant is instructed to keep track of them. Furthermore, only
the target(s) is/are slightly moving twice for 1 second either to the left or right, to indicate the
target direction (the BCI cue for the motor imagery) for this run.
After that, all targets are starting to move around in a physical realistic way. They can
bounce at the wall or at each other, but are never occluded. After approximately 10 seconds half
of the balls are highlighted (changing the color) for 4 seconds. The participants should perform
a BCI command during the time (while still keeping track of the targets). If one of the tracked
targets is highlighted, the participant should perform the BCI command, which was indicated
by the cue during the first 2 seconds of the trial. If the tracked target is not highlighted the
opposite BCI command should be applied. After these 4 seconds all balls return to white.
After another random period of around 10 seconds again half of the balls are highlighted and
the same actions as mentioned above have to be performed, dependent if one of the targets is
highlighted or not.
The trial finishes after around 30 seconds and all balls stop moving. They are immediately
numbered and the participant has to mark the one(s), s/he thinks are the targets. Furthermore,
they can report at which time the lost (if at all) one of the targets, and/or delivered a wrong
BCI command. After that the users receives feedback about his/her tracking performance.
Finally a new trial with new targets and BCI cue direction is starting.
The visualization contains two parts (see Figure 1). The balls are displayed in the top center
part of the screen. But also information about the current state of the 2-class BCI is presented
to the participant, which gives information of how much the BCI thinks that the participant
is applying e.g. the imagination of a left or right hand movement. Two dierent feedback
conditions are performed: This feedback is either displayed as a visual bar on the bottom part
of the screen (as used in normal BCI runs). Or in the second conditions this BCI feedback is
presented via 6 tactile stimulators on the neck (see Figure 1), whereby dierent stimulators are
activated to present the information similar to the position of the bar before. In this condition
no visual BCI feedback is shown, but the tracking task is performed in the same visual manner.
Article ID 067-2
DOI:10.3217/978-3-85125-378-8-67
2.1
The brain activity was acquired via 16 EEG channels placed over the sensori-motor cortex
(Fz, FC3, FC1, FCz, FC2, FC4, C3, C1, Cz, C2, C4, CP3, CP1, CPz, CP2 and CP4 with
reference on the right ear and ground on AFz). The EEG was recorded with a g.USBamp (g.tec
medical engineering, Austria) at fs =512 Hz, band-pass filtered 0.5100 Hz and notch filter set to
50 Hz. From the Laplacian filtered EEG, the power spectral density (448 Hz) was calculated.
Canonical variate analysis was used to select subject-specific features, which were classified with
a Gaussian classifier. Decisions with low confidence on the probability distribution were filtered
out and evidence was accumulated over time. More information about our BCI is given in [2].
In online experiments the output of the BCI is translated in a movement of the feedback,
which informs the subjects about their current brain status. In the case of visual feedback, the
horizontal bar moves on a screen. In the case of tactile feedback, the motors vibrate accordingly.
Results
Three experienced BCI participants (between 27 and 40 years, all male, all participated already
in study [1]), performed the first tests with the goal of tracking 1 or 2 balls out of 10 with either
visual or tactile feedback (conditions are called v1, v2, t1 or t2). As performance measures
we define the tracking score (TS) as the percentage of correctly tracked balls, the BCI true
positive rate (TP) as the percentage of correct BCI commands delivered in the highlighting
period (maximum 1 command possible per period) and the BCI false positive rate (FP) as the
ratio between BCI commands delivered in the free moving period (intentional non-control, where
no commands should have been delivered) compared to the number of highlighting periods.
Exemplary for one subject the detailed results are: TS varied between 100 % in t1, 75 %
in v1, 55 % in t2 and 40 % in v2, with a TP of 73 % in t1, 75 % in v1, 65 % in t2 and 50 % in
v2, and with a FP of 40 % in t1, 55 % in v1, 15 % in t2 and 70 % in v2. The other subjects
showed similar behaviors, but with dierent performance levels (e.g. TP of 30 %). Overall in all
subject, the TS was reduced in the visual conditions compared to the tactile ones, and further
reduced if the number of targets was increased. The TP ratio was stable over the two feedback
types, but seems to be reduced if more targets have to be tracked.
Article ID 067-3
DOI:10.3217/978-3-85125-378-8-67
Participants reported that especially an existing bias in the BCI classifier, was dragging
too much attention away from the search task, since the subject had to fight all the time
against the misclassification. Otherwise if the tracking task was performed with higher priority
the number of FP or wrong commands increased. Furthermore, the BCI decision thresholds
and the duration of the highlighting period have to be mutual adjusted, so that there is more
and longer chance to deliver correct commands. Currently some correct commands have been
delivered shortly after the end of the highlighting period (which would be FP) since it took
longer to identify the correct command side and then bring the BCI feedback towards it.
Nevertheless, we realized that the original goal of tracking 3 balls out of 10, while delivering
BCI commands in (randomized) defined orders is very challenging and mostly too complex.
Subjects even reported that they forgot the target class (cue) within one run, since the workload
was too high, which of course is contra productive to the goal of the game.
Therefore, we are currently modifying our setup according to following parameters:
1. Keeping the BCI target class (which has to be delivered if a tracked ball is highlighted)
predefined for the whole run or even session and not changing it every trial. This will not
have any eect on the balance of BCI commands to be delivered, since anyway just half
of the time the tracked balls are highlighted.
2. Characterization of the maximum possible balls to be tracked and the maximum speed
of the balls, without BCI control but while delivering faked BCI commands. The idea
thereby is, that the subject is experiencing the same BCI feedback characteristics (as it
would be during brain control), but the subjects are not delivering mental commands.
Instead they are sending the commands via the keyboard (to have a 100 % success rate),
which are then influencing the BCI feedback towards the left or right side.
The presented platform is giving us the possibility to evaluate dierent multi-tasking parameters. The strong link between the challenging visual multi-object tracking task and BCI
feedback in coordination with the presented stimuli is a perfect simulation environment of a
real brain-actuated device.
Unfortunately, the initial experiments were too challenging and complex, so that modifications and a deeper characterization of the game parameters are necessary. Furthermore, an
unbiased classifier should be applied and an additional standard BCI online recording before
and after the experiment should be performed. Based on the outcome of the characterization,
we will modify the tracking time and paradigm, the number of balls and the used speed, so that
there is a chance to succeed in the task during easier levels, while it will be very challenging at
higher levels, and then re-do the experiments.
References
[1] R. Leeb, K. Gwak, D-S. Kim, and J.d.R. Mill
an. Freeing the visual channel by exploiting vibrotactile
BCI feedback. In Conf Proc IEEE Eng Med Biol Soc, 2013.
[2] R. Leeb, S. Perdikis, L. Tonin, A. Biasiucci, M. Tavella, A. Molina, A. Al-Khodairy, T. Carlson, and
J.d.R. Mill
an. Transferring brain-computer interfaces beyond the laboratory: Successful application
control for motor-disabled users. Artif Intell Med, 59(2):121132, 2013.
[3] Z.W. Pylyshyn and R.W. Storm. Tracking multiple independent targets: evidence for a parallel
tracking mechanism. Spatial Vision, 3(3):119, 1988.
Article ID 067-4
DOI:10.3217/978-3-85125-378-8-68
Introduction
Brain-Computer interfaces (BCI) rely on the single-trial recognition of neural signals corresponding to dierent conditions. Currently, the most common BCI implementations use electroencephalography (EEG) signals elicited after presentation of dierent stimuli (i.e., eventrelated potentials, ERP). However, EEG-based applications have low spatial resolution of the
signals due to poor skull conductivity which smears the electrical activity originated in cortical
sources. This reduces the signal-to-noise ratio (SNR) posing serious challenges for BCI use.
Inverse solution methods, allowing to estimate the intra-cranial sources that generate the
scalp measured potentials, have been proposed as a potential method to improve classification
performance compared to surface EEG [3, 8, 7]. The rationale is to increase the signal spatial resolution by projecting scalp potentials onto a higher dimensional (source) space. The
estimated activity of these sources is expected to have better SNR as they represent unmixed
scalp potentials captured by the EEG electrodes. Correspondingly, from a classification perspective, the features extracted from the estimated cortical activity are expected to yield better
discrimination between the BCI classes.
Previous studies on the use of inverse solutions for BCI typically compared the performance
using source-related features against features from individual or a small number of EEG channels. However, since sources are estimated by linearly combining information from all the EEG
channels, is not surprising that source-based features provide higher discriminant information
compared to individual scalp-electrodes. Therefore, results may be biased against against surface EEG. In this paper we address this issue by comparing source-based classification with
features based on linear combination of scalp electrodes aiming to provide a fair assessment of
their capabilities in ERP-based BCIs.
Methods
We compared the classification performance using scalp- and source-based features. The first
classifier, henceforth termed Cortical, uses the estimated cortical current density (CCD) of
Article ID 068-1
DOI:10.3217/978-3-85125-378-8-68
intracranial sources [3]. Features are extracted from the temporal activity of each source within
a given time window (see below) and at each time-point the 100 most discriminant CCD features
are selected using the Fisher score. We train individual classifiers for each of these selected
features that compute the probability of likelihood to the means and covariance computed on
the training data. The output of these classifiers are combined using nave Bayes rule. The
motivation for this ensemble method is that using individual classifiers per feature reduces the
possibility of overfitting when limited number of training trials are available
In the case of EEG features, we compute the classification performance using a Fisher linear
discriminator (FLD). This classifier combines linearly the activity from all the electrodes. Thus
mathematically, it is comparable to the projection used to compute one intracranial source.
However, projection weights in the FLD are optimized for separating the classes while the
inverse solution optimize localization capabilities. Since the Cortical classifier combines several
projections (i.e. sources) at each time point while the FLD only uses one, we also tested two
more classifiers that combine multiple linear projections of the scalp activity. In the first one
these projections were obtained through bagging, i.e. multiple FLD (N=100) are built from
subsets of the training data. Each subset was obtained by randomly selecting 50% of trials
from the training set. This classifier is referred to as EEG N-FLD to signify multiple FLD
projections whereas the earlier classifier is referred to as EEG 1-FLD. Furthermore, since the
bagging approach may lead to similar projections, we also tested a method, termed Orthogonal,
that enforces the use of orthogonal FLD projections [5]. Last but not least, as it has been done
in previous works on decoding using inverse solutions [3, 8, 4], we also tested classifiers trained
on a small subset of electrodes, here referred to as EEG-Channel classifier.
These classifiers were compared oine in three standard ERP-based BCI experiments:
Rapid Serial Visual Presentation (RSVP) [10], P300-speller [6] and Error-related potentials
(ErrP) [2]. In the RSVP study (N=15), sequences of images are rapidly presented (4 images
per second) and the evoked EEG activity is decoded looking for signatures of target and distractor images. Training data was acquired during 4 search tasks, each one composed of two
sequences of 200 images. The testing phase was performed on three search tasks using dierent
target objects than for training. Both training and test phases were performed on the same
day. 64 EEG channels were acquired at 2048 Hz, then filtered in the range 1-10 Hz, spatially
filtered using CAR and downsampled to 32Hz. Classification was based on the data in the time
window [200-700] ms. Following previous works, channels Cz, Pz and Oz were used for the
EEG-Channel classifier [9].
The second experiment (N=8) consists on the standard P300 matrix speller. The experiment
was performed over two days (average separation of 12 days) and each day an average of 5 runs
were performed per subject. Each run requires the writing of 5 characters. Data of the first
day was used for training and classifiers were tested on the data for the second day. 61 EEG
channels were recorded at 250Hz, then filtered in the [1 20] Hz range, downsampled to 50 Hz
and CAR referenced. Signal in the window [100 600] ms was selected for classification. The
EEG-Channel classifier used channels Fz, Cz and Pz [1].
Finally, in the ErrP (N=6) experiment subjects monitor a cursor that moves horizontally in
discrete steps towards a target location. 20% of the time it goes in the opposite direction to the
target. The experiment was performed over two days with a separation ranging from 2 months
to 2 years. Classifiers were trained on day 1 and tested on the second day. 64 EEG channels
were acquired at 2048 Hz, then filtered in the range 1-10 Hz, spatially filtered using CAR and
downsampled to 32Hz. Classification was based on the data in the time window [200-450] ms.
Features from channels Fz, Cz and Pz were used by the EEG-Channel classifier [2].
Article ID 068-2
DOI:10.3217/978-3-85125-378-8-68
Figure 1: (a-c) Event-related potentials at discriminant electrodes. Topographic Localization of discriminant sources (Top view) at selected time points is shown in the top row. (d)
Classification performance on the three ERP protocols.
Results
Fig 1(a-c) shows representative ERPs and discrminant intracranial sources for the three protocols. Sources are color coded from green to red to show how often they were found to be
discriminant across subjects (red color indicates features that were discriminant in all subjects).
Classification performance (area under the ROC curve, AUC) is shown in Figure 1(d). In general EEG-Channel classifier exhibits the lowest performance. In the RSVP protocol the Cortical
classifier outperforms the EEG-Channel but failed to show statistically significant dierences
(p = 0.08, Wilcoxon). It also performs comparably to the other three classifiers, all using linear
combinations of EEG channels. The performance obtained with inverse solution is similar to
a previous study using a full brain head model based inverse solution [9]. In contrast, in the
P300 experiment the Cortical and Orthogonal methods significantly outperform the other three
classifiers (p <0.05, Wilcoxon). The results with our classifiers are comparable to previous
EEG-based studies [6]. In the case of the ErrP protocol, on average the Cortical, 1-FLD, and
N-FLD show higher performance. Nonetheless, no significant dierences were found with respect to the Orthogonal and EEG-Channel. Overall, performance of the 1-FLD and N-FLD
classifiers was comparable across protocols. This implies that the bagging procedure yielded
similar projections and therefore no improvement with respect to the initial classifier.
Article ID 068-3
DOI:10.3217/978-3-85125-378-8-68
Conclusion
We found that inverse solution based classifiers have consistent classification performance for
all the experimental protocols and the results matches with the state of the art methods.
Contrasting with previous reports, most of them on SMR-based BCIs, surface EEG based
classifiers yielded similar performance, in particular when multiple channels are considered.
There is thus, at least for ERP-based BCIs, the risk of overestimating the advantages of sourcebased classification if the proper validation is not performed. Noticeably, bar a few exceptions
[7], previous works on SMR classification do not report such comparison.
Nevertheless, the Cortical classifier outperformed the FLD scalp-based classifiers. This
suggests that these classifiers can indeed bring some advantages. Remarkably, testing data
in two of the experiments were obtained on a dierent day than for training, suggesting that
source estimation can still be reliable despite removal and relocation of the EEG electrodes
across sessions. Further studies are required to better identify those cases where the use of
inverse solutions may be more suitable for BCI systems than scalp-recorded signals.
Acknowledgments
This study is supported by the Swiss National Science Foundation (grant 200021-120293,
www.snf.ch). We thank Fabio Aloise and the IRCCS Fondazione Santa Lucia, Rome for providing the P300 dataset.
References
[1] C Bledowski, D Prvulovic, and K Hoechstetter. Localizing P300 generators in visual target and distractor processing: A combined event-related potential and functional magnetic resonance imaging
study. Journal of Neuroscience, 24:93539360, 2004.
[2] R. Chavarriaga and J. del R. Mill
an. Learning from EEG error-related potentials in noninvasive
brain-computer interfaces. IEEE Trans Neural Syst Rehabil Eng, 18(4):381388, 2010.
[3] F. Cincotti, D. Mattia, F. Aloise, S. Bufalari, L. Astolfi, F. Fallani, A. Tocci, L.Bianchi, M.G. Marciani, S.Gao, J.dR. Mill
an, and F. Babiloni. High-resolution EEG techniques for brain-computer
interface applications. J Neurosci Methods, 167(1):3142, 2008.
[4] M. Congedo, F. Lotte, and A. Lecuyer. Classification of movement intention by spatially filtered
electromagnetic inverse solutions. Physics in Medicine and Biology, 51(8):19711989, 2006.
[5] U. Homann, J.-M. Vesin, and T. Ebrahimi. Spatial filters for the classification of event-related
potentials. In 14th European Symposium on Artificial Neural Networks, 2006.
[6] D.J Krusienski, E.W Sellers, F. Cabestaing, S.Bayoudh, D.J. McFarland, T.M Vaughan, and J.R
Wolpaw. A comparison of classification techniques for the P300 speller. J Neural Eng, 3(4):299
305, 2006.
[7] F. Lotte, A Lecuyer, and B. Arnaldi. FuRIA: an inverse solution based feature extraction algorithm
using fuzzy set theory for brain-computer interfaces. Trans. Sig. Proc., 57(8):32533263, 2009.
[8] Q. Noirhomme, R. I. Kitney, and B. Macq. Single-trial EEG source reconstruction for braincomputer interface. IEEE Transactions on Biomedical Engineering, 55:15921601, 2008.
[9] P. Poolman, R. M Frank, P. Luu, S. M Pederson, and D. M Tucker. A single-trial analytic
framework for EEG analysis and its application to target detection and classification. Neuroimage,
42(2):787798, 2008.
[10] M. Uscumlic, R. Chavarriaga, and J.dR. Mill
an. An iterative framework for EEG-based image
search: Robust retrieval with weak classifiers. PLoS ONE, 8:e72018, 2013.
Article ID 068-4
DOI:10.3217/978-3-85125-378-8-69
Abstract
Brain Computer Interfaces (BCIs) rely on robust detection and classification
algorithms for stable and accurate device control. Here we investigate the effects of
skilled motor learning on the stability of movement related cortical potentials (MRCP).
Eight subjects were trained for 32 min on a task known to induce plasticity. Significant
increases in performance were accompanied by significant increases in the variability of
the EEG signal from two to one second prior to movement onset. The implications of
these results for the use of the MRCP for online device control are discussed.
1 Introduction
Current research on brain computer interfaces (BCIs) has been almost exclusively limited to a single
session trial where the performance is of acceptable accuracy (Wolpaw, 2013). Essentially, the user is
trained to fit into the performance of the BCI governed by a classifier or a detector. But, the brain is
undergoing continuous adaptation through learning and shifts in attention to task (Sanes & Donoghue,
2000). While the human nervous system is adaptable, current BCIs lack this capability. There is a need
for adapting the BCI to the user, thus paving the way for bidirectional adaptation of either the BCI or
the user, depending on the reasons for alterations in brain activity. Any design of a BCI must use the
changing brain and thus the principles related to how the brain acquires, improves and maintains its
natural function as a guide.
For the past 10 years, our research group has successfully implemented the slow cortical potential
arising during movement or its intention as a control signal, also called the MRCP. We have shown that
a MRCP-based BCI can detect movement or movement intention at high (above 75%) accuracy in the
online mode (Xu et al., 2014) and due to the possibility of such detection prior to movement onset, it
provides a fast device control. In this study we aim to quantify the alterations in the MRCP morphology
after healthy subjects participate in a novel skill learning motor task. We predict that since skill-learning
is associated with significant plasticity at the level of the motor cortex, MRCP will be affected
significantly.
Article ID 069-1
DOI:10.3217/978-3-85125-378-8-69
2 Methods
Eight healthy, volunteers (3 females, 5 males; 25-43 years) with no prior history of neurological
conditions participated in this study. All procedures were approved by the Scientific Ethics Committee
of Northern Jutland (Reference number: N-20130039) and subjects gave their written consent.
Article ID 069-2
DOI:10.3217/978-3-85125-378-8-69
Figure 1: The 6 traces (A) Trace and subject performance during the first (B) and last (C) training run.
Relationship between trace and subject data for all data point during the first (D) and last (E) training run (n=1)
3 Results
3.1 Motor performance
The effect of the training for one subject is illustrated in Figure 1B-E. Figure 1B and 1C show the
data for one of the traces selected for analysis during training 1 and training 8, while Figure 1D and 1E
shows the scatter plot between the exerted TA activation and the target level for each interval of time
during training runs 1 and 8. For this subject the correlation between shown traces and actual subject
performance increased from 0.84 to 0.91. Across all subjects there was a significant increase in
correlation between the first and last session (p=0.0005) indicating an improved performance.
3.2 Movement related cortical potential (MRCP) during the tracking task
Four parameters were extracted from the MRCPs generated during training session 1 and 8 (Figure
2A). Figure 2B shows the average MRCP for training session 1 and 8 for one subject. The average EEG
amplitude within 1-2 s prior to movement onset remained relatively stable (session 1: -4.23 V vs
session 8: -3.39 V; p=0.42). However, the variability of the EEG activity during this time (the SD
within the time window), increased significantly (session 1: 1.96 V vs session 8: 2.54 V; p=0.01).
Both the time of peak negativity in relation to task onset (session 1: -40 ms vs session 8: -50 ms, p=0.35)
and its amplitude (session 1: -17.07 vs session 8: -17.92, p=0.36) did not change significantly.
Article ID 069-3
DOI:10.3217/978-3-85125-378-8-69
Figure 2: A. Illustration of four variables extracted from individual MRCP traces for each movement
trace performed in the first and last training session. B. MRCP data from one representative subject during
the first (black trace) and last (red trace) training session.
4 Discussion
Results demonstrate that significant improvements in task performance are accompanied by a
significantly greater variability of the MRCP 2-1 s prior to movement onset. This has important
implications in the design of any MRCP-based BCI for online detection of movement. Here the
algorithm has a role in identifying when the EEG activity attains a specific threshold level. Once
detected, the algorithm proceeds to implement device control. For a BCI designed for neuromodulation
this may take the form of turning on an orthotic device that performs a desired movement or an electrical
stimulator that triggers nerve stimulation to induce contraction of specific muscles. Typically such a
use of BCI intends to induce plasticity at the level of the motor cortex. Data presented here suggested
that the performance of the algorithm could be affected as plasticity is induced. Previous studies have
shown significant plasticity within the motor cortex and likely also connected sites during the
acquisition of a new motor skill. Such plasticity results in an expansion of the motor maps i.e. those
areas of the motor cortex that when stimulated will result in a contraction of the target muscle. As the
motor skill manifests itself, the motor maps tend towards their normal size. However evidence also
exists that motor maps differ between skilled versus novice athletes. Any BCI designed to induce
neuromodulation must ensure that shifts in EEG patterns are taken into account with the acquisition of
new or indeed the relearning of old motor skills. The MRCP seems to be robust to effects of motor
learning at least in healthy subjects. However, since online detecting relies on threshold values for the
period prior to movement onset, the findings here suggest that BCI performance will decline with skill
acquisition unless the algorithm is adapted to the new level of activity. This is one of our current areas
of investigation.
References
Sanes, J. & Donoghue, J. (2000). Plasticity and Primary Motor Cortex. Rev Neurosci 23, 393-415.
Wolpaw, JR (2013). Brain-computer interfaces. Handb Clin Neurol 110, 67-74.
Xu R., Jiang N., Mrachacz-Kersting N., Lin C., Asin G., Moreno J., Pons J., Dremstrup K., & Farina
D. (2014). A Closed-Loop Brain-Computer Interface Triggering an Active Ankle-Foot Orthosis for
Inducing Cortical Neural Plasticity. IEEE Trans Biomed Eng. In press. (DOI: 10.1109/TBME.2014.2313867)
Article ID 069-4
DOI:10.3217/978-3-85125-378-8-70
1 Introduction
Steady-State Visual Evoked Potentials (SSVEP) have received much attention in Brain-Computer
Interface (BCI) research. However, a challenge still faced in SSVEP BCIs is the limited number of
unique frequencies available for stimulation [1].
To counteract this problem, researchers have begun to investigate the use of dual-frequency
stimulation, since this allows more targets to be created using a small number of frequencies. This has
been done either by overlaying two targets oscillating at different frequencies [1, 2] or displaying two
separate targets at different frequencies [3, 4].
In this paper, a novel method of providing dual-frequency SSVEP stimulation is investigated,
which involves using a stereoscopic display, i.e. 3-D screen and stereoscopic glasses. Stereoscopic
vision lends itself well for use with dual-frequency stimulation since the views to the left and right
eyes can be controlled independently, which may make the concept visually more transparent to the
user. Furthermore, the method is easily implementable due to the availability of low-cost equipment
and easily extendable to three-dimensional videogames and virtual reality BCIs [6].
2 Method
To test the concept of stereoscopic dual-frequency SSVEP stimulation, an offline experiment was
conducted.
Article ID 070-1
DOI:10.3217/978-3-85125-378-8-70
Population: Ten participants (aged 24-33, 8 male, 2 female) participated in the experiment. All of
these participants reported normal or corrected-to-normal vision. One participant was excluded from
the final analysis as no SSVEP response was evident in the EEG trace.
Experimental Setup: The software used for creating and running the experiments was
OpenViBE (http://openvibe.inria.fr/). An LCD 3-D television screen was used for displaying the
SSVEP targets. Stereoscopic polarized glasses were worn by the participants for all of the
stereoscopic conditions. For acquiring data the g.Tec USBamp was used to record data from thirteen
electrodes (CPz, POz, PO7, PO8, O1, O2, Oz, Iz, Pz, PO3, PO4, O9 and O10).
Procedure: Participants were seated 1.5 m in front of the screen. They received six different forms
of stimulation, as shown in Table 1. Frequencies of 6 Hz and 15 Hz were chosen as both showed high
SSVEP responses in pre-testing and were possible frequencies of the refresh rate of the screen. Two
normal vision and two stereoscopic vision single-frequency conditions were carried out as control
conditions (Conditions C6N, C15N, C6S and C15S), in which a block on the screen was inverted from
black to white at a single frequency. For the stereoscopic single-frequency conditions (C6S and C15S)
half of the image was displayed to the right eye and the other half to the left eye through the use of the
3-D display as illustrated in Figure 1. To simulate previous work in dual-frequency stimulation, the
block was divided into lines, with half flashing at 6 Hz and half at 15 Hz (C6_15N). To achieve dualfrequency stimulation with stereoscopic vision, the block was again divided into lines but each eye
was only exposed to half of the lines (and thus only one of the frequencies) through the use of
stereoscopic glasses and the 3-D mode of the screen. This was repeated with the frequencies to the left
and right eyes reversed to overcome any biasing due to ocular dominance (C6_15S and C15_6S).
For each condition, eight runs of stimulation were performed for eight seconds each, with three
second breaks. The order of the conditions was rotated pseudo-randomly among participants to
prevent a fatigue effect. Between each condition there was a rest period of approximately two
minutes. The conditions were also divided into stereoscopic and normal vision blocks and rotated.
Half of the participants started with a normal vision block while the other half started with a
stereoscopic block. After each condition participants were asked to rate the condition using a 10-point
Likert scale based on four different criteria: Fatigue, Annoyance, Discomfort and Strangeness.
Table 1: Stimulation frequencies for the different conditions of the experiment.
Condition
Target
C6N
6 Hz normal vision
C15N
15 Hz normal vision
C6S
6 Hz stereoscopic vision
C15S
15 Hz stereoscopic vision
C6 15N
6 Hz and 15 Hz normal vision
C6 15S
6 Hz (left) and 15 Hz (right) stereoscopic vision
C15 6S
15 Hz (left) and 6 Hz (right) stereoscopic vision
Article ID 070-2
DOI:10.3217/978-3-85125-378-8-70
3 Results
Amplitude (mV)
EEG Data Results: The signals were divided into non-overlapping two-second epochs on which a
1024 point Fast Fourier Transform (FFT) was carried out. The FFTs for all the epochs were then
averaged. Figure 2 shows the spectrum averaged over nine participants and averaged over electrodes
O1, O2 and Oz for the dual-frequency stereoscopic conditions C6_15S and C15_6S. The two spectrums
were averaged in order to prevent any biasing due to ocular dominance.
Power values were calculated from the FFT and then converted to relative power values, in order
to cancel differences in magnitude, as follows:
PowerRelative = 3*Power(f)/(Power(f-1)+Power(f)+Power(f+1))
where is the average power at the relevant frequency and and
are the average power at frequencies 1 Hz above and below the relevant frequency
respectively.
12 Hz
15 Hz
Frequency (Hz)
Figure 2: Averaged spectrums for the stereoscopic dual-frequency conditions (C6_15S and C15_6S) using 6 Hz and
15 Hz stimulation, with peaks at 12 Hz and 15 Hz indicated.
Paired t-tests were carried out between the relative power at each frequency of interest and the
average relative power of the two neighbouring frequencies. The results are shown in Table 2. The
single-frequency control conditions show statistically significant increases in power at the stimulation
frequencies. For the dual-frequency stereoscopic stimulation (C6_15S and C15_6S), the 6 Hz stimulation
still shows a statistically significant increase in power (p < 0.05) at 12 Hz (the first harmonic). For the
15 Hz stimulation, the increase in power is tending towards statistical significance (p < 0.1). This
suggests that the two stimulation frequencies can still be identified in the dual-frequency stereoscopic
case. A possible reason that not all power differences were significant is the decreased luminosity due
to the use of stereoscopic glasses with tinting, or the fact that each eye was only viewing half of the
lines of the image as opposed to all lines in the normal vision cases, as shown in Figure 1.
Table 2: Results of paired t-tests using relative power at relevant frequencies and neighbouring frequencies.
Condition
Comparison
t value
Significance
C6N
12 Hz to average of 11 and 13 Hz
6.611
< 0.001*
C15N
15 Hz to average of 14 and 16 Hz
5.634
0.001*
C6S
12 Hz to average of 11 and 13 Hz
5.608
0.001*
C15S
15 Hz to average of 14 and 16 Hz
6.844
< 0.001*
12 Hz to average of 11 and 13 Hz
1.587
0.164
C6_15N
15 Hz to average of 14 and 16 Hz
2.802
0.031*
12 Hz to average of 11 and 13 Hz
2.954
0.025*
C6_15S
15 Hz to average of 14 and 16 Hz
2.151
0.075
12 Hz to average of 11 and 13 Hz
3.492
0.010*
C15_6S
15 Hz to average of 14 and 16 Hz
2.211
0.063
Article ID 070-3
DOI:10.3217/978-3-85125-378-8-70
Participant Questionnaires: The averaged participant ratings are shown in Figure 3, grouped
according to the type of stimulation. A Friedman test revealed no significant differences in any ratings
(p > 0.05). This suggests that the participants did not find the stereoscopic dual-frequency stimulation
more fatiguing, annoying, uncomfortable or strange than the other forms of stimulation. The
strangeness rating did show an increase tending towards significance (p < 0.1), thus the participants
may have noticed a slight difference in appearance but not one that affected their other perceptions.
10
Single frequencies normal
vision
Single frequencies
5
stereoscopic vision
Dual frequencies normal
vision
Dual frequencies stereoscopic
0
vision
Fatigue
Annoyance
Discomfort
Strangeness
Figure 3: Participant ratings of Fatigue, Annoyance, Discomfort and Strangeness of each form of stimulation.
4 Conclusions
The aim of the experiment was to determine whether stereoscopic dual-frequency stimulation is
feasible. The results indicated that the two stimulated frequencies are evident in the SSVEP response.
Changes in the experimental setup may be required to achieve more significant responses, for
example, by increasing the luminosity of the targets. In addition, further combinations of frequencies
should be tested. The concept could then be extended to 3-D videogames and virtual reality setups.
The participants did not rate the stereoscopic dual-frequency stimulation as more fatiguing, annoying,
uncomfortable or strange, suggesting that this form of stimulation is also feasible from the point of
view of participant preferences.
Acknowledgements: This work was made possible in part by funding from a UCT Scholarship
for International Travel and the National Research Foundation (NRF), South Africa.
References
[1]
[2]
[3]
[4]
[5]
[6]
D. Zhu, J. Bieger, G. Garcia Molina, and R. M. Aarts, A survey of stimulation methods used
in SSVEP-based BCIs., Comput. Intell. Neurosci., vol. 2010, Jan. 2010.
M. Cheng, X. Gao, S. Gao, and D. Xu, Multiple Color Stimulus Induced Steady State Visual
Evoked Potentials, in 23rd Annual International Conference of the IEEE Engineering in
Medicine and Biology Society, 2001.
H. Hwang, D. H. Kim, C. Han, and C. Im, A new dual-frequency stimulation method to
increase the number of visual stimuli for multi-class SSVEP-based brain computer interface
(BCI), Brain Res., 2013.
S. T. Morgan, J. C. Hansen, and S. A. Hillyard, Selective attention to stimulus location
modulates the steady-state visual evoked potential., Proc. Natl. Acad. Sci. U. S. A., vol. 93,
no. 10, pp. 47704, May 1996.
S. P. Kelly, E. C. Lalor, R. B. Reilly, and J. J. Foxe, High-Density SSVEP Data for
Independent Brain Computer Communication, IEEE Trans. Neural Syst. Rehabil. Eng.,
vol. 13, no. 2, pp. 172178, 2005.
A. Lcuyer, L. George, and M. Marchal, Toward Adaptive VR Simulators Combining
Visual, Haptic, and Brain-Computer Interfaces, IEEE Comput. Graph., vol. 33, no. 5, pp.
1823, 2013.
Article ID 070-4
DOI:10.3217/978-3-85125-378-8-71
University of W
urzburg, W
urzburg, Germany
2
University of Oslo, Oslo, Norway
andreas.herweg@uni-wuerzburg.de
Abstract
Tactile ERP-BCIs provide unique advantages over visual ERP-BCIs but suer from
lower classification accuracies due to lower discrimination between target and non-target
ERPs. In this study we analyzed whether spatial masking can reduce non-target ERPs
to improve discrimination. Two dierent tactile stimulation setups designed to produce
simultaneous stimulation with and without masking were compared to a classic oddball
stimulation in three healthy subjects. Spatial masking decreased the average number of
sequences required to achieve 100% classification accuracy. Preliminary data indicate that
spatial masking might have a beneficial eect on classification accuracy.
Introduction
Whenever possible assistive technology should strive to be unobtrusive for the end-user. Visual
ERP-BCIs generally restrict the end-users perception of his environment and are difficult to
conceal due to the necessary display devices. Tactile ERP-BCIs on the other hand allow the
user to retain his visual and auditory senses and tactile stimulation devices (tactors) can easily
be hidden below layers of clothing. While information transfer rates of most tactile BCIs are
lower than those of comparable visual BCIs [1], the unobtrusive nature of tactile ERP-BCIs
might be a crucial advantage for certain applications such as wheelchair control [3]. In daily
life, vibrotactile stimulioccur rarely compared to visual and auditory stimuli. Therefore, tactile
stimuli are difficult to ignore and are most likely perceived as odd stimuli. Consequently tactile
stimulation elicits significant ERPs for target and non-target stimuli [3]. While tactile ERPs
from target and non-target stimuli are sufficiently dierent for classification, reduced nontarget ERPs may increase classification accuracies. For this purpose, we investigated whether
we can accustom a user to the tactile stimulation to reduce non-target ERPs. We applied
spatial masking to allow for continuous stimulation on all tactors while still maintaining an
oddball task. When multiple tactile stimuli are applied at the same time, but one stimulus
is significantly stronger than the remaining ones, the strongest stimulus can mask the other
stimuli [7]. Therefore, only one stimulus is perceived despite all locations being stimulated. In
this study, we investigated the eect of spatial masking on classification accuracy.
Methods
Dierent stimulation patterns were applied to N=3 healthy subjects using 8 vibrate transducers (C2 tactors; Engineering Acoustic Inc., USA). EEG signals were recorded from 16 passiv
Ag/AgCl electrodes and amplified using a g.USBamp amplifier (g.tec Engineering GmbH, Austria). Stimulation was applied to the lower and upper arms (see Figure 1) with a stimulusduration of 240ms and an inter-stimulus-interval of 400ms.
Three dierent stimulations setups were compared in this study (see Figure 1):
Article ID 071-1
DOI:10.3217/978-3-85125-378-8-71
Figure 1: Overview of the three dierent stimulation setups. Tactors were placed along both
(right and left) lower and upper arms (red and black dots). The High vs None setup was
a classical oddball where the target stimulus (red) was vibrating with high intensity and nontarget stimuli (black) were inactive. For the High vs Low setup the target stimulus (red) was
vibrating with high intensity and non-target stimuli (black) were vibrating with low intensity.
For the High vs Medium setup the target stimulus (red) was vibrating with high intensity and
non-target stimuli (black) were vibrating with medium intensity. Target stimuli were presented
in randomized order and all stimulations were above detection threshold.
High vs None
Classical oddball where the target stimulus (odd) is vibrating with high intensity while
non-targets (frequent) are inactive.
High vs Low
Modified oddball where the target stimulus (odd) is vibrating with high intensity while
non-targets (frequent) are vibrating with low intensity.
High vs Medium
Modified oddball where the target stimulus (odd) is vibrating with high intensity while
non-targets (frequent) are vibrating with medium intensity.
Subjects were not informed about the dierence between the three stimulation setups, but it
was ensured that low stimulation was above detection threshold for all subjects. Participants
had to perform a calibration-task for each stimulation setup, using 15 sequences, meaning each
tactor was the target stimulus 15 times. Afterwards they had to perform three copy-spelling
tasks, selecting each of the 8 body locations once, with each setup, using 10, 5 and 3 sequences.
Article ID 071-2
DOI:10.3217/978-3-85125-378-8-71
Figure 2: Calibration task results for dierent stimulation setups. Left: classification accuracy
as a function of number of sequences is shown. Right: averaged ERPs for position Cz. Red:
Targets, Blue: Non-targets.
Results
After the experiment, participants were asked to report the felt dierence between the High vs
None and High vs Low setup. All participants stated that there was either no dierence at all
or that they could feel an additional active tactor from time to time. None reported that they
could reliably feel multiple active tactors. For the High vs Medium setup all participants
reported feeling multiple tactors simultaneously.
All participants achieved 100% classification accuracy in 9 sequences or less with the High vs
None setup (see Figure 2). In the High vs Low setup all participants were able to reach 100%
classification accuracy with 6 sequences or less. Two participants achieved 100% classification
accuracy in the High vs Medium setup (4 and 8 sequences needed), one participant was
unable to achieve 100% classification accuracy. All stimulation setups elicited event-relatedpotentials in all participants. Simultaneous activation of all 8 tactors resulted in stimulation
artifacts in the High vs Low and the High vs Medium setup (see Figure 2). Performance
in the copy-spelling tasks appeared to be equal for all stimulation setups, but due to the small
sample size no statistics were calculated (see Table 1).
Article ID 071-3
DOI:10.3217/978-3-85125-378-8-71
Discussion
These preliminary results suggest that spatial masking (High vs Low) might have a beneficial
eect as it reduced the required number of sequences to reach 100% classification accuracy
below that of the default setup (High vs None). However no performance dierence could be
observed in the copy-spelling task for this limited sample. Due to stimulation artifacts a direct
comparison between the ERPs of High vs None and High vs Low condition was difficult.
The Source of the artefacts was identified as an unintended flow of current between tactors and
skin and will be eliminated for future studies. More data is needed to verify whether tactial
masking provides beneficial eects on target or non-target ERPs. Spatial masking is only one
out of a multitude of dierent spatial, temporal and spatio-temporal eects relevant for tactile
stimulation [5]. Further research is needed to assess whether additional eects such as apparent
location [4], temporal summation [6] and tactile illusions [2] have to be circumvented when
designing a tactile BCI or might even prove to be useful for good control of a BCI driven
application.
4.1
Acknowledgments
This study was funded by the European Community for research, technological development and demonstration activities under the Seventh Framework Programme (FP7, 2007-13), project grant agreement
number 288566 (BackHome). This paper reflects only the authors views and funding agencies are not
liable for any use that may be made of the information contained herein.
References
[1] F. Aloise and I. Lasorsa. Multimodal stimulation for a p300-based BCI. International Journal of
Bioelectromagnetism www.ijbem.org, 9:128130, 2007.
[2] F. A. Geldard and C. E. Sherrick. The cutaneous Rabbit: a perceptual illusion. Science,
178(4057):178179, October 1972.
[3] T. Kaufmann, A. Herweg, and A. K
ubler. Toward brain-computer interface based wheelchair control
utilizing tactually-evoked event-related potentials. Journal of NeuroEngineering and Rehabilitation,
11(1):7, January 2014.
[4] C. E. Sherrick, R. W. Cholewiak, and A. A. Collins. The localization of low- and high-frequency
vibrotactile stimuli. The Journal of the Acoustical Society of America, 88(1):169179, July 1990.
[5] J. van Erp. Guidelines for the use of vibro-tactile displays in human computer interaction. In
Proceedings of Eurohaptics 2002, 2002.
[6] R. T. Verrillo and G. A. Gescheider. Enhancement and summation in the perception of two successive vibrotactile stimuli. Perception & Psychophysics, 18(2):128136, March 1975.
[7] R. T. Verrillo, G. A. Gescheider, B. G. Calman, and C. L. Van Doren. Vibrotactile masking: eects
of one- and two-site stimulation. Perception & psychophysics, 33(4):37987, 1983.
Article ID 071-4
DOI:10.3217/978-3-85125-378-8-72
Fond Santa Lucia, Rome, Italy; 2Dep of Psychology, Sapienza University of Rome, Italy;
Institute of Cognitive Sciences and Technologies, CNR, Rome, Italy.; 4DIAG, Sapienza University of
Rome , Italy; 5Department of Neurology and Psychiatry, Sapienza University of Rome, Italy.
a.riccio@hsantalucia.it
Abstract
The purpose of this study was to investigate the support of attentional and memory
processes in controlling a P300-based brain-computer interface (BCI) in people with
amyotrophic lateral sclerosis (ALS). Eight people with ALS performed two behavioral
tasks: i) a rapid serial visual presentation (RSVP) task, screening the temporal filtering
capacity and the speed of the update of the attentive filter, and ii) a change detection
task, screening the memory capacity and the spatial filtering capacity. The participants
were also asked to perform a P300-based BCI spelling task. We found that only the
temporal filtering capacity was a predictor of both the P300-based BCI accuracy and of
the amplitude of the P300 elicited performing the BCI task. We concluded that the
ability to keep the attentional filter active during the selection of a target influences
performance in BCI control.
1 Introduction
The Brain Computer Interface (BCI) exploits neurophysiological signals to control external
devices for a range of applications (Wolpaw & Wolpaw 2012). Reasons for performance variability
across people with Amyotrophic Lateral Sclerosis (ALS) and performance predictors of P300-based
BCI were not fully investigated. Indeed, the knowledge about the cognitive capabilities reflecting a
successful use of P300-based BCI is limited. In particular, people suffering from ALS, included in the
range of potential users of BCIs, could have cognitive dysfunctions in association with motoneuron
failure, mostly regarded as attention, concentration and verbal fluency (Ringholz et al. 2005). In this
study we investigated the influence of the attentive and memory features of people with ALS on
performances in a P300-based BCI task.
2 Methods
2.1 Participants
We recruited the participants at the ALS center of the Policlinic Umberto I of Rome. We
included in the study a total of nine volunteers, all nave to BCI training, (3 women; mean
age=59.712.3) with definite, probable, or probable with laboratory support ALS diagnosis (mean
ALSFRS scores: 32.4 8.2; Brooks et al. 1996). Inclusion criteria required that participants were able
Article ID 072-1
DOI:10.3217/978-3-85125-378-8-72
to communicate with or without the help of an AT device which also included an eye tracker, thus all
participants had their eye-gaze control preserved. Due to the fact that one participant did not perform
the behavioral tasks, only the data of 8 participants out of nine (3 women; mean age=5812; mean
ALSFRS sores: 31.88.6) were reported in this article.
The study was approved by the ethic committee of Fondazione Santa Lucia, Rome and all
participants provided an informed consent.
3 Data analysis
EEG data was high pass and low pass filtered with cut off frequencies of 0.1 Hz and 10 Hz
respectively using a 4th order Butterworth filter. In addition, a notch filter was used to remove 50 Hz
contamination. Data was divided into 1000 ms long epochs starting with the onset of each stimulus.
The amplitude of the P300 potential in Cz was defined as the highest value of the difference between
target and non-target average waveforms in the time interval 250-700ms (P300amp).
To provide an estimate of the classifier accuracy we considered the binary classification problem
target vs. non-target (binary accuracy, BA; Blankertz et al. 2011). A 7-fold cross-validation was used
to evaluate the binary accuracy of the classifier on each participants dataset by applying a Stepwise
Linear Discriminant Analysis (SWLDA) on the testing dataset (6 words) and assessing the binary
accuracy on the training dataset (the remaining word).
Article ID 072-2
DOI:10.3217/978-3-85125-378-8-72
The detection accuracy of T1 (T1%) in the RSVP task was considered as an index of the temporal
attentional filtering capacity of the participants. Because the detection accuracy of T2 (T2%) was
considered as an index of the capability to adequately update the attentive filter, only trials in which
T1 had been correctly identified were selected in order to determine T2%.
To investigate the memory capacity, according to Cowan (2001), we defined the number of items
held in memory (K) as K=S(H-F), where S is the size of the array (highest number of item to
memorize, S=4), H is the observed hit rate and F is the false alarm rate. We calculated the K index for
the baseline task (Kb) and for the selection task (Ks). To screen for the attentional spatial filtering
capacity (), of the participants (that is the capacity to efficiently filter the distracters) we subtracted
the Ks from the Kb (= Kb- Ks).
Because variables were normally distributed, Pearsons correlation coefficient of T1% and T2%
with the BA and the P3amp was computed. Because Kb and violated the assumption of normality,
they were correlated with BA and P3amp by means of the non-parametric Spearman correlation test.
For the parameters whose correlation was statistically significant we performed two regression
analyses in which attentional parameter (T1%) was considered as the independent variable and the BA
and the P300amp variables were considered as dependent variables.
4 Results
A significant positive correlation was observed between T1% and the offline BA, r=.79, p<0.05.
To estimate the predictive value of T1% on the binary accuracy we computed a regression analysis
which resulted in an F=8.341 with a p<0.05, indicating that the variance of the binary performance
was predictable by the participant temporal filtering capacity, with =0.79. A significant positive
correlation was found between T1% and P300amp (r=.84.5, p<0.05) showing that participants with
higher T1% had a larger P300amp. As a result of the linear regression, T1 accuracy was significantly
predictive of P300amp (F=16,23 with a p<0.05) with =0.87. Offline mean value of BA obtained
during the BCI task was 87.4% (SD = 2.4%, range = 84.592.3 %). The mean amplitude for P300 was
3.3 V (SD = 1.6, range = 1.16.5 V). Mean accuracy of detection was 77.2% (SD = 10.4%, range =
6596.25%) and 67.7% (SD= 14.1%, range = 50.387.1%) for T1 and T2, respectively. No significant
correlation was found between T2%, Kb, the offline binary performance and P300amp.
Article ID 072-3
DOI:10.3217/978-3-85125-378-8-72
failure observed in potential user groups. The awareness about the processes and the clinical features
of BCI potential end-users influencing the BCI performance, would allow developing flexible
systems, adaptable to different clinical profiles. We can conclude that top-down mechanism to keep
an attentive map active (Huang & Pashler 2007) is crucial to control a BCI speller task, allowing the
user to set up and maintain the proper attentional map throughout a trial and thus to select the desired
letter.
References
Farwell, L.A. & Donchin, E., 1988. Talking off the top of your head: toward a mental prosthesis
utilizing event-related brain potentials. Electroencephalography and Clinical
Neurophysiology, 70(6), pagg.510523.
Huang, L. & Pashler, H., 2007. A Boolean map theory of visual attention. Psychological review,
114(3), pagg.599631.
Kranczioch, C. et al., 2007. Temporal dynamics of access to consciousness in the attentional blink.
NeuroImage, 37(3), pagg.947955.
Riccio, A. et al., 2013. Attention and P300-based BCI performance in people with amyotrophic lateral
sclerosis. Frontiers in Human Neuroscience, 7:, pag.732.
Ringholz, G.M. et al., 2005. Prevalence and patterns of cognitive impairment in sporadic ALS.
Neurology, 65(4), pagg.586590.
Vogel, E.K., McCollough, A.W. & Machizawa, M.G., 2005. Neural measures reveal individual
differences in controlling access to working memory. Nature, 438(7067), pagg.500503.
Wolpaw, J. & Wolpaw, E.W. a c di., 2012. Brain-Computer Interfaces: Principles and Practice 1
ed., Oxford University Press, USA.
Article ID 072-4
DOI:10.3217/978-3-85125-378-8-73
Introduction
Brain-computer interface (BCI) has been developed as a possible solution for enabling communication capabilities to people who lose motor functions [9]. The use of motor imagery (MI)
is a common approach for BCI systems to send mental command by imagining the movement
of limbs [9]. Currently, most motor imagery BCI systems adopt power spectral density (PSD)
or common spatial filter as discriminative features between tasks [7]. Recently, some studies
have explored the possibility of using the brain interaction patterns between EEG channels as
classification features, e.g. phase dierence or directed transfer function (DTF) [8] [2]. The
objective of the present work is to study the use of DTF features for BCI, as well as the modulations of brain connectivity during MI. We compare the performance of features extracted
by DTF and the PSD features, as well as their combination for decoding BCI commands.
Moreover, we assess the modulations of brain connectivity to evaluate the interhemispheric and
intrahemispheric interaction patterns.
2
2.1
Methods
Experimental protocol
In this study, we focus on a two-class motor imagery task, left hand versus right hand [6].
Subjects were instructed to move a horizontal bar by imaging the movement of their hands.
We analyzed oine experiments, involving nine subjects that perform two runs of BCI training.
Each run comprised 30 trials of motor imagery for each class. Before each trial, a visual cue was
presented to show the target, and the bar started moving after one second, as shown in Figure
1.A. During each trial a feedback bar that moved to the target continuously for 4 seconds was
shown to the subject. EEG signals were recorded using a 16-channel g.USBamp amplifier (g.tec
medical engineering, Schiedelberg) with a sampling rate of 512 Hz. The signal was band-pass
Article ID 073-1
DOI:10.3217/978-3-85125-378-8-73
Figure 1: Experimental protocol of motor imagery and the montage of the EEG cap. Each trial
includes fixation (1000 ms), cue (1000 ms) before the bar starts moving.
filtered between 0.1 Hz and 100 Hz and notch filtered at 50 Hz. The 16 EEG channels were
placed over the sensorimotor cortex as shown in Figure 1.B.
2.2
To compare the performance between PSD and DTF, features were computed 16 times per
second. The PSD of each channel was computed using the Welch method for the past one
second, with the window (Hanning) size 500 ms and overlapping 50%. PSD features were
collected between 2-50 Hz with the step size 2 Hz. DTF was also computed in the same period
of one second, by estimating the coefficients of multivariate autoregressive (MVAR) model
based on Yule-Walker equation, which included all 16 EEG channels with order 8. The system
transfer matrix in frequency domain was obtained by applying fast Fourier transform on the
MVAR coefficients, as the so-called non-normalized directed transfer function [5]. The step size
of DTF was 2 Hz, resulting in a three dimensional matrix 161625 (channel channel
frequency) for each time step of 1/16 s. The log values of either PSD or DTFS were then used
for classification.
We compare the performance with three types of features: (1) PSD features; (2) DTF
features; (3) Combination of both PSD and DTF. We computed the Fisher score on the oine
runs to select the most informative features, defined by f s = |m1 m2 |/(s21 + s22 ), where mk
and s2k represent the mean and variance of class k. In the combined case we used the features
that are selected in PSD and DTF separately. Linear discriminant analysis (LDA) was used
for classification (left vs right MI), assuming Gaussian distribution of the data samples. The
classification performance was evaluated by 10-fold cross validation and the area under the
curve (AUC) of the testing sets.
3
3.1
Results
Brain connectivity patterns
We compute the interaction between and within hemispheres to explore the modulation during
motor imagery. The left hemisphere is defined as the channels FC3, FC1, C3, C1, CP3 and
CP1, and the right hemisphere corresponds to FC2, FC4, C2, C4, CP2 and CP4. The brain
interaction from left to right hemisphere is defined as the mean DTF value from the channels
in the left hemisphere to the channels in the right, and vice versa for right to left interactions.
Figure 2.A shows the dierence of DTF between left and right hand motor imagery
(DT Flef t DT Fright ) for four brain interaction patterns: Interhemispheric (left to right and
right to left) and intra hemispheric (left and right) connectivity. The curves illustrate the mean
Article ID 073-2
DOI:10.3217/978-3-85125-378-8-73
value across all subjects. As shown in the figure, the connectivity levels within hemispheres are
highly modulated in the mu rhythm (blue and red curves), around 10-14 Hz, indicating evident
disassociation between the two motor imagery tasks. In particular, the connectivity within left
hemisphere is much higher in the condition of left hand motor imagery, cf., the positive peak
in red curve, and vice versa (negative peak in the blue curve). This might be due to the higher
mu rhythm desynchronization in contralateral than in ipsilateral areas, which causes the lower
interaction level in the left hemisphere when people are perform right hand motor imagery. The
modulation of intrahemispheric connectivity disappear in the high frequency bands, i.e. above
35 Hz.
The green and black curves represent the interhemispheric connectivity patterns, which are
almost zero in the low frequency bands, indicating no dierence between two motor imagery
tasks. Dierences are noticeable in high frequency bands, particularly 26-32 Hz, in which the
information flow from left to right hemisphere is higher when subjects are performing left hand
motor imagery (green curve), and vice versa (black curve).
Figure 2.B and C represent the mean values of the curve in Figure 2.A in two selected
frequency bands, 10-14 Hz and 26-32 Hz. Significant dierences (Wilcoxon signed-rank test)
could be found between within left and within right hemisphere interactions in the mu band.
The modulation in higher frequency bands is not as obvious as the low band, however. It
is also significant between two interhemispheric interactions. These results indicate that the
interhemispheric, or distant, brain interactions are more modulated in high frequency bands
than local connections, and motor related brain signals are originated from the contralateral
brain regions.
3.2
Classification performance
10-fold cross validation was used to evaluate the oine training performance. We tested the
eect of feature quantity by varying the number of selected features, from 1 to 50. Results of
each subject were averaged for all testing folds. Rapid increase of AUC can be observed when
the feature number is very low, i.e. below 10 features for all the feature sets, after which the
increasing trend stops and keep constant with more features, shown in Figure 2.D. This is caused
by the level of redundancy between features, i.e. no more novel information is contributed by
new features. In average, the PSD features show higher AUC (0.58) than the DTF method
(0.57). However, the combined feature set (BOTH) yields the highest performance, which
is around 0.6 after using more than 12 features. Wilcoxon signed-rank test (uncorrected) was
applied and found that below 20 features, the performance of combination is significantly better
(p < 0.05) than PSD, except using 8 features, Figure 2.D.
The present study verified the potential application of using DTF to decode motor imagery
tasks, as well as the the modulation patterns of brain connectivity between the two tasks.
Results of oine anaylsis of 9 subjects, show modulation of EEG rhythms that are consistent
with previously reported results. In particular activity in higher frequency bands related to the
interhemispheric interactions [4], as well as the reduced mu band in intrahemispheric patterns
with contralateral sites. Further experiments will be performed to confirm the current results
including online evaluation.
One should notice that an essential issue of the online DTF method is the computational
consumption, since heavy computation is required to obtain the coefficient matrix of the MVAR
Article ID 073-3
B
0.25
0.2
0.2
0.15
0.1
0.05
0
0.05
0.1
Left>Left
Left>Right
Right>Right
Right>Left
0.15
0.2
0.25
10
20
30
Frequency (Hz)
40
DOI:10.3217/978-3-85125-378-8-73
1014 Hz
p = 0.015
2632 Hz
C
0.25
0.62
0.2
0.15
0.15
0.1
0.1
0.05
0.05
0.05
0.05
0.1
0.1
0.15
0.15
0.2
0.2
0.25
0.25
50
p = 0.027
0.6
A
0.25
DTF (left right hand)
0.58
0.56
p < 0.05
PSD
CON
BOTH
0.54
0.52
10
20
30
40
50
Feature number
Figure 2: A: The modulation of DTF during left and right hand motor imagery. B and C: Mean
and variance of brain interaction patterns in two frequency ranges across subjects (B: 10-14 Hz;
C: 26-32 Hz). D: Oine performance of classifying two motor imagery tasks with three types
of features. Gray area show the results of statistical tests between PSD and BOTH.
model [3], for our case 16 16 8 (8 is the order of MVAR model, determined by AIC criteria)
parameters are estimated with 1 s EEG data, and the computation should be shorter than 1/16
second, since we update the DTF feature with 16 Hz to obtain BCI control. The use of higher
order models or more EEG channels will increase the time consumption. In our experiment, it
took around 0.01 s to do the computation of DTF and PSD, short than 1/16 s. Furthermore,
future work may try to improve the stability of DTF features by averaging MVAR coefficients
in a sliding window within past 1 s EEG signal with certain overlap, as the Hanning window
in Welchs method. For online implementation, the other option could be applying dynamic
updating of MVAR coefficients other than re-compute for each 1 s, as adaptive MVAR model,
which might be helpful to decrease the computation [1].
References
[1] M. Arnold, X.H.R. Milner, H. Witte, R. Bauer, and C. Braun. Adaptive AR modeling of nonstationary time series by means of Kalman filtering. IEEE Trans Biomed Eng, 45(5):553562, 1998.
[2] M. Billinger, C. Brunner, and G.R. M
uller-Putz. Single-trial connectivity estimation for classification of motor imagery data. J Neural Eng, 10(4):046006, 2013.
[3] I. Daly, S.J. Nasuto, and K. Warwick. Brain computer interface control via functional connectivity
dynamics. Pattern Recogn, 45(6):2123 2136, 2012.
[4] M. Grosse-Wentrup. Understanding brain connectivity patterns during motor imagery for braincomputer interfacing. In NIPS, pages 561568, 2008.
[5] M.J. Kaminski and K.J. Blinowska. A new method of the description of the information flow in the
brain structures. Biol Cybern, 65(3):203210, 1991.
[6] R. Leeb, S. Perdikis, L. Tonin, A. Biasiucci, M. Tavella, M. Creatura, A. Molina, A. Al-Khodairy,
T. Carlson, and J.d.R. Mill
an. Transferring brain-computer interfaces beyond the laboratory: Successful application control for motor-disabled users. Artif Intell Med, 59(2):121 132, 2013.
[7] H. Ramoser, J. M
uller-Gerking, and G. Pfurtscheller. Optimal spatial filtering of single trial EEG
during imagined hand movement. IEEE Trans Rehabil Eng, 8(4):441446, 2000.
[8] Y. Wang, B. Hong, X. Gao, and S. Gao. Phase synchrony measurement in motor cortex for
classifying single-trial EEG during motor imagery. EMBC, 1:7578, 2006.
[9] J.R. Wolpaw, N. Birbaumer, D.J. McFarland, G. Pfurtscheller, and T.M. Vaughan. Brain-computer
interfaces for communication and control. Clin Neurophysiol, 113(6):767791, 2002.
Article ID 073-4
DOI:10.3217/978-3-85125-378-8-74
Abstract
The multi-way decoding algorithms are adapted to incomplete wirelessly transmitted
data and integrated to CLINATEC BCI platform. The platform includes wireless 64channels ElectroCorticoGram (ECoG) recording implant WIMAGINE and BCI
software environment associated to a 4-limbs exoskeleton EMY.
1 Introduction
Multi-way (tensor-based) analysis was recently reported as an effective tool for neuronal signal
processing. It was applied for Brain Computer Interface (BCI), e.g., (Li & Zhang, 2010). Movementrelated BCI aims to provide an alternative non-muscular communication pathway for individuals with
severe motor disability to send commands to the external world using measurements of brain activity.
Generally, a common approach for brain signal analysis consists in the extraction of the information
from spatial, frequency and/or temporal domains. The commonly used time-frequency decomposition
of the signals leads to a matrix valued process. Introducing delays in time leads to the observations
stored in a 4th order tensor. For the decoding model identification, a projection of the high
dimensional tensor of observation to low dimensional space is generally applied using unfolding
procedure or various tensor decomposition technics. The multi-way decoding was chosen for BCI
project in CLINATEC, CEA, Grenoble (Eliseyev, et al., 2011; Eliseyev & Aksenova, 2013). The
goal of the project is to allow a tetraplegic subject to control external effectors, such as an
exoskeleton. Despite encouraging success, movement related BCIs have yielded limited application
outside of the laboratory, mainly due to unsolved problems of efficiency and long-term stability. The
key criterion for clinical applications is reliable BCI devices and their stable and robust functioning.
Moreover, the self-paced BCI system should be activated by users whenever they want. The important
requirement is fast and easy calibration of the BCI system. In addition, to be comfortable for users, the
high decision rate and low latency are desirable. The latency comprises algorithm delay, as well as
*
Corresponding author
Article ID 074-1
DOI:10.3217/978-3-85125-378-8-74
data transmission and computation time. To approach the requirements of the clinical applications, a
set of tensor-based algorithms were developed in CLINATEC, CEA. The Recursive block-wise Nway PLS (Eliseyev & Aksenova, 2013) provides the progressive adaptive learning of the BCI system.
L1-penalized N-way PLS algorithm performs the slice-oriented tensor decomposition that allows the
selection of the groups of informative features (Eliseyev, et al., 2012). Sparse models reduce the
computation time, increase the decision rate and, thus, improve the latency of the BCI system. The
algorithms were tested in self-paced mode in series of preclinical experiments in animals, namely, the
brain switch in freely moving rats (Eliseyev, et al., 2011), as well as brain switch and upper limb
trajectory reconstruction in minimally restricted nonhuman primates (Eliseyev, et al., 2012; Eliseyev
& Aksenova, 2013). In parallel, to estimate the performance, the publically evaluable data (Shimoda,
Nagasaka, Chao, & Fujii, 2012) were analyzed (Eliseyev & Aksenova, 2013). The algorithms delays
were evaluated in series of experiments in rats and nonhuman primates. The negative delay of the
decoder was observed in the brain-switch BCIs (Eliseyev, et al., 2011; Eliseyev, et al., 2012). The
next step is integration and testing of the decoding algorithm at clinical BCI platform.
The CLINATEC BCI platform includes ECoG recording implants WIMAGINE which
wirelessly transfer neural activity of the brain to a PC and the software environment associated to an
effector. The wireless connection may introduce temporal loss of the signals. Moreover, the important
point of the real time functioning of BCI system is the delay of the signal processing: signal
acquisition, transmission, and the commands generation. To ensure stable system functioning, a set of
gaps filling algorithms was investigated. Then, preclinical experiments were carried out to study the
latency and the performance of the system in a realistic context.
2 Methods
2.1 CLINATEC BCI Platform
For real-life applications, the decoder is to be integrated to BCI environment. The CLINATEC
clinical movement related BCI platform is based on a wireless 64-channels ECoG recording implant
WIMAGINE, designed for the long-term clinical application (Charvet, et al., 2013). Two implants
will record simultaneously the neural activity of the brain for wireless transfer to the base station, then
a PC. The BCI software environment is associated to a 4-limbs exoskeleton EMY (Enhancing
MobilitY) dedicated to medical purposes (Perrot, Verney, Morinire, & Garrec, 2013).
Article ID 074-2
DOI:10.3217/978-3-85125-378-8-74
all the cases, the algorithm has demonstrated significant robustness. The biases of the prediction
(relative root mean squares error) were, 1.4%, 1.4%, 1.4%, 3.6%, 1.4%, 1.8%, 1.3%, and 1.3%
respectively. Thus, even in the simplest case of the filling, namely, Zero-Order Hold, the influence of
the data loss on prediction was negligible.
Figure 1: Preclinical experiments using CLINATEC BCI Platform. Recording of ECoG activity of the
brain is transmitted wirelessly by the WIMAGINE implant. ECoG data are analyzed in real-time to send the
commands to the external effector.
Figure 2 illustrates the relative weights of the linear model coefficients in the spatial and frequency
domains for both full and sparse models. In the frequency domain, high frequencies 80-130 Hz have
the highest contribution. At the same time, the frequencies 35-40 Hz as well as around 7 Hz are
informative.
During the online experiments, the control commands were generated with decision rate
DR=10 Hz. The estimated delay of the signal processing, including the signal acquisition,
transmission, as well as the commands generation, was approximately equal to 300 ms. The next step
of the CLINATEC project is optimization and adjusting of all the components of the BCI system.
Additional tests in animals are in progress to better estimate system performance (more animals,
various experiment protocols etc.).
Article ID 074-3
DOI:10.3217/978-3-85125-378-8-74
3 Acknowledgment
Research is supported by French National Research Agency, Fondation Motrice, Fondation
Nanosciences, Fondation de lAvenir, Rgion Rhne-Alpes, Philanthropic Foundation Edmond
J. Safra.
Figure 2: Relative weights of the linear model coefficients in the spatial and frequency domains for both
full (A) and sparse (B) models.
References
Charvet, G., Sauter-Starace, F., Foerster, M., Ratel, D., Chabrol, C., Porcherot, J., et al. (2013).
WIMAGINE: 64-channel ECoG recording implant for human applications. Engineering in
Medicine and Biology Society (EMBC), 2013 35th Annual International Conference of the
IEEE, (pp. 2756-2759).
Cowpertwait, P. S., & Metcalfe, A. V. (2009). Introductory time series with R. Springer.
Eliseyev, A., & Aksenova, T. (2013). Recursive N-way partial least squares for brain-computer
interface. PLoS One, 8(7), e69962.
Eliseyev, A., Moro, C., Costecalde, T., Torres, N., Gharbi, S., Mestais, C., et al. (2011). Iterative Nway partial least squares for a binary self-paced brain-computer interface in freely moving
animals. J Neural Eng, 8(4), 046012.
Eliseyev, A., Moro, C., Faber, J., Wyss, A., Torres, N., Mestais, C., et al. (2012). L1-penalized N-way
PLS for subset of electrodes selection in BCI experiments. J Neural Eng, 9(4), 045010.
Li, J., & Zhang, L. (2010). Regularized tensor discriminant analysis for single trial EEG classification
in BCI. Pattern Recognition Letters, 31(7), 619-628.
Perrot, Y., Verney, A., Morinire, B., & Garrec, P. (2013). EMY: Full-body Exoskeleton. ACM
SIGGRAPH Emerging Technologies. Anaheim, US.
Shimoda, K., Nagasaka, Y., Chao, Z., & Fujii, N. (2012). Decoding continuous three-dimensional
hand trajectories from epidural electrocorticographic signals in Japanese macaques. J Neural
Eng, 9(3), 036015.
Article ID 074-4
DOI:10.3217/978-3-85125-378-8-75
IRCCS Fondazione Santa Lucia, Rome, Italy. 2Dept. of Computer, Control, and Management
Engineering, Univ. of Rome Sapienza, Italy. 3Dept. of Psychology, Univ. of Graz, Austria.
jlenia.toppi@uniroma1.it
Abstract
The efficacy of rehabilitative interventions in stroke patients is routinely assessed by
means of a neuropsychological test battery. Nowadays, more evidences indicate that the
neuroplasticity which occurs after stroke can be better understood by investigating
changes in brain networks. In this pilot study we applied advanced methodologies for
effective connectivity estimation in combination with graph theory approach, to define
EEG derived descriptors of brain networks underlying memory tasks. In particular, we
proposed such descriptors to identify substrates of efficacy of a Brain-Computer
Interface (BCI) controlled neurofeedback-based intervention to improve cognitive
function after stroke. EEG data were collected from two stroke patients before and after
a neurofeedback-based training for working memory deficits. We show that the
estimated brain connectivity indices were sensitive to different training intervention
outcomes, thus suggesting an effective support to the neuropsychological assessment in
the evaluation of the changes induced by the BCI-based rehabilitative intervention.
1 Introduction
In the post-stroke subacute time frame, the estimated proportion of patients having cognitive
impairment ranges from below 50% to over 90%. Currently, the diagnosis of cognitive impairments
after stroke and their treatment efficacy relies upon a neuropsychological assessment battery.
Nowadays, evidences form neuroimaging studies indicate that the neuroplasticity which occurs after
stroke might be better understood by investigating changes in brain networks (Cramer et al., 2011).
In this study we proposed the application of advanced methodologies for effective connectivity
estimation (Milde et al., 2010), combined with a graph theoretical approach (Astolfi et al., 2013) to
extract indices describing the topology of the brain networks as derived from EEG signals. The aim
was to seek for neurophysiological descriptors which could serve as a sensitive outcome measures and
thus, support the neuropsychological assessment in evaluating the efficacy of a Brain-Computer
Interface controlled neurofeedback training to promote recovery of memory function after stroke. As
such the BCI-controlled neurofeedback training module has been implemented within the EU funded
project CONTRAST (www.contrast-project.eu) which is currently deploying a Brain Neural
Computer Interface based technology to provide cognitive training modules to improve cognitive
rehabilitation outcomes in institutionalized and at home stroke patients.
Article ID 075-1
DOI:10.3217/978-3-85125-378-8-75
Article ID 075-2
DOI:10.3217/978-3-85125-378-8-75
Figure 1: a-b) Bar diagrams reporting the equivalent scores achieved for RAVLT and CBTT tests administered
to patient A (panel a) and B (panel b) before (PRE, green bars) and after (POST, orange bars) the SMR training.
Equivalent scores below 2 (in yellow) highlight a pathological condition. c-f) Anterior Density and Left
Temporal Degree indexes estimated in alpha band in PRE (green bars) and POST (orange bars) sessions relative
to the two stroke patients A (panels c and e) and B (panels d and f). The symbol (*) denotes a significance
difference between PRE and POST session scores (unpaired t-test; p<0.05).
3 Results
3.1 Patient A
The Patient A was able to learn the modulation of her SMR as indicated by the increase of SMR
amplitude from 7.7 +V2 to 8.4 +V2 across the 10 training sessions.
Memory Assessment. As reported in Fig.1a, the neuropsychological tests revealed a significant
improvement of the tested memory function after the neurofeedback-based training. Equivalent scores
for both CBTT and RAVLT tests increased from 1 to 3 and 4 respectively, thus indicating a transition
from a pathological (PRE) to a physiological (POST) condition.
Behavioral Data. Analysis of the behavioral performance obtained at the Sternberg task revealed
an increase of correct answers and a significant decrease (df=68, t=2.16, p=0.034) of the reaction time
after training (PRE-POST comparison, unpaired t-test).
EEG derived Brain Network. Analysis of the connectivity patterns revealed a significant POST
training increase of Anterior Density index (Fig.1c) estimated in the alpha band only for Storage
(df=198, t=2.87, p=0.0045) and Retrieval (df=198, t=3.97, p=0.0001) phases of the Sternberg task
associated with an increase of Left Temporal Degree index (Fig.1e) in alpha band for all the three
memory phases (Encoding (df=198, t=1.99, p=0.048), Storage (df=198, t=2.08, p=0.039) and
Retrieval (df=198, t=3.05, p=0.0026)).
3.2 Patient B
The Patient B did not show changes in the amplitude of his SMR across the 10 training sessions.
The amplitude value remained stable around 2.3 +V2.
Article ID 075-3
DOI:10.3217/978-3-85125-378-8-75
Memory Assessment. In this Patient (Fig.1b) we did not find significant changes in the memory
functions as evaluated by means of neuropsychological assessment. Equivalent scores for both
RAVLT and CBTT tests remained around 1 and 2 respectively, indicating a persistency of the
pathological profile.
Behavioral Data. Similar negative outcome was found for the behavioral assessment. Data analysis
revealed a decrease of the percentage of correct answers and no significant difference (df=68, t=0.76,
p=0.46) in reaction time between PRE and POST sessions of Sternberg task (unpaired t-test).
EEG derived Brain Network. Connectivity pattern analysis revealed in Patient B an opposite
profile of changes in the POST training analysis with resect to what observed in Patient A. In fact, a
significant decrease in the Anterior Density index (Fig.1d) for Storage (df=198, t=3.07, p=0.0024) and
Retrieval (df=198, t=2.11, p=0.036) phases and of Left Temporal Degree index (Fig.1f) for the
Retrieval (df=198, t=2.01, p=0.046) memory phase both estimated in the alpha band, were observed.
5 Acknowledgment
Research supported by the European ICT Program FP7-ICT-2009-4 Grant Agreement 287320
CONTRAST.
References
Astolfi, L., Toppi, J., Wood, G., Kober, S., Risetti et al., (2013). Advanced methods for time-varying
effective connectivity estimation in memory processes. Conf. IEEE Eng. Med. Biol., 29362939.
Baccal, L. A., and Sameshima K., ( 2001). Partial Directed Coherence: A New Concept in Neural
Structure Determination. Biological Cybernetics 84: 46374.
Cramer, S.C., Sur, M., Dobkin, B.H., OBrien, C., Sanger, T.D., Trojanowski, J.Q., Rumsey, et al.,
(2011). Harnessing neuroplasticity for clinical applications. Brain. 134, 15911609.
Kober, S. E., Witte M., Stangl M., Vljame A., Neuper C., and Wood G. (2014). Shutting down
Sensorimotor Interference Unblocks the Networks for Stimulus Processing: An SMR
Neurofeedback Training Study. Clinical Neurophysiology, S1388-2457(14)00186-2.
Milde, T., Leistritz, L., Astolfi, L., Miltner, W., Weiss, T., Babiloni, F., Witte, H., (2010). A new
Kalman filter approach for the estimation of high-dimensional time-variant multivariate AR
models and its application in analysis of laser-evoked brain potentials. Neuroimage 50, 960969.
Sternberg, S., (1966). High-speed scanning in human memory 53, 652654.
Article ID 075-4
DOI:10.3217/978-3-85125-378-8-76
Brain-computer interfaces (BCI) have been investigated for more than 40 years. P300based BCIs can nowadays control very complex applications such as spelling applications
and even web browsers. To do this in a practical way, it is essential to have the possibility
to pause the command selection of the BCI. We implemented and evaluated an automatic
pause detection system for P300-based BCIs based on artifact detection with an inverse
filtering method. Experimental results of an oine study (9 healthy participants) demonstrate the feasibility of the proposed approach and its high performance.
Introduction
2
2.1
Methods
Participants, Data Acquisition, and Experimental Design
Ten volunteers participated in this study. All participants stated that they have no history of
neurological or psychiatric disorders. Due to a technical problem the data of one participant
was not useable for this study. The final sample comprised 9 participants (3 female; mean age
Article ID 076-1
DOI:10.3217/978-3-85125-378-8-76
Figure 1: (A) Laptop displaying the user interface for feedback and P300 stimulation.
(B) Sketch of the experimental design. The angle between the participant, the laptop, and
the monitor was 30.3o . (C) Monitor for the web browser.
23.9 1.3 years).
EEG was acquired with a wireless EEG amplifier with dry electrodes (g.Nautilus, Guger Technologies OG, Graz, Austria). Signals from Fz, Cz, Pz, PO7, PO8, and Oz were used in this
study with a sampling rate of 250 Hz. The channels were referenced to the right mastoid and
grounded at the left mastoid. The raw signal of the Wi-Fi headset was filtered with a 0.530 Hz
bandpass.
The participants were seated in a comfortable chair approximately 65 cm away from two
screens (39.5 cm and 43 cm diameter), see Figure 1 (B). One screen was centered in front of the
participants. At this screen a P300 matrix was displayed to control a special web browser (see
Halder et al., in preparation), which was shown on a second screen placed right beside the first
one, see Figure 1 (A) and (C).
The P300 user interface and signal processing in Matlab (MathWorks, Natick, USA) was presented in [3].
Calibration was performed with fifteen highlightings per row and column and ten letters as described in [3]. The online task for the participants was to write an email to a given address and
reply to an automatically generated email. The whole task needed a minimum of 52 selections
and was aborted if the goal was not reached within 78 regular selections.
2.2
Manual Pause
A PAUSE/RUN element was selectable with the matrix. If the user selected this element, no
further selections were sent to the web browser until the same element was selected again. The
participants had to select the PAUSE/RUN element in the study when they were waiting for
the reply of the first email and they could select it whenever they needed a pause.
2.3
The automatic pause detection was performed on the oine data by detecting artifacts in the
EEG during the flashing time periods. The principle of inverse filtering was applied to detect the
artifacts, cf. [5]. For this method autoregressive filter model parameters have to be estimated
Article ID 076-2
DOI:10.3217/978-3-85125-378-8-76
out of clean (i.e., artifact free) EEG data. Our assumption was that the participants generated
few artifacts during the P300 calibration period. Consequently, we used the data of the P300
calibration to estimate autoregressive filter model parameters (model order p = 10) by using
the Burg method.
The created filter model was applied inversely to the EEG data of every online task selection.
An artifact detection threshold was set to three times the standard deviation of values calculated
with this inverse filter from the calibration EEG data. If in more than 1 percent of the online
task data artifacts were detected, the selection was marked as pause state related.
Results
The participants needed on average 11.8 (SD 2.9) highlighting sequences to select a command.
Eight participants completed the task within the maximum allowed value of 78 selections. Only
participant S7 did not complete the task. They had an average selection accuracy of 88.7% (SD
9.4) and needed an average time of 63.1 (SD 16.4) minutes including pauses to complete the
whole task.
3.1
Manual Pause
Two selections were necessary to go manually into pause mode and leave the pause mode.
The time the participants needed to perform these two selections was between 58 and 100
seconds depending on the number of highlighting sequences and the actual number of rows and
columns. Seven participants had no problem to switch between pause and control mode. Two
participants (S6, S7) needed more than one attempt to leave the pause mode. The probability
that the pause mode was left by chance was 1/N with N being the actual number of matrix
elements. This occurred once (S7) in this study.
3.2
All
Selectionsa
TP
S1
61
5 ( 6)
S2
62
7 ( 7)
S3
61
3 ( 5)
S4
66
2 ( 2)
S5
93
26 (29)
S6
101
8 ( 9)
S7
94
9 (12)
S8
73
10 (12)
S9
64
6 ( 6)
incl. selections during pause.
TN
52
46
56
63
63
92
74
60
64
Pause Detection
FP FN
TPR
4 (3)
0
100.0%
9 (9)
0
100.0%
2 (0)
0
100.0%
1 (1)
0
100.0%
4 (1)
0
100.0%
1 (0)
0
100.0%
6 (3)
5
64.3%
3 (1)
0
100.0%
0 (0)
0
100.0%
TNR
92.9%
83.6%
96.6%
98.4%
94.0%
98.9%
92.5%
95.2%
100.0%
Cohens
0.68
0.54
0.73
0.79
0.90
0.94
0.55
0.85
1.00
Table 1: Automatic pause state detection results. The true positive (TP), true negative (TN),
false positive (FP), and false negative (FN) detections as well as the true positive rate (TPR)
and the true negative rate (TNR) are presented for every subject. Values in parentheses indicate
prevented wrong item selections during the control state. Cohens Kappa was calculated to give
a measure of agreement.
Article ID 076-3
DOI:10.3217/978-3-85125-378-8-76
In Table 1 the oine simulation results of the automatic pause detection are shown. The
number of selections of some participants was higher than 78 because selections during the
pause were also counted for evaluation. The sensitivity (TPR) of the automatic pause detection
was 100 % for eight participants and 64.3 % for one participant. Consequently, the overall mean
TPR was 96.0% (SD: 11.9). The specificity (TNR) was between 83.6 % and 100% with a mean
of 94.7 % (SD: 4.9). At eight participants no false negative (FN) detections were made, see 6th
column in Table 1. This is very important because false negative detections would result in
unintended, random selections. The measured Cohens Kappas for our results were between
0.54 and 1.00, indicating moderate to strong agreement.
Discussion
In this study we provide evidence that our suggested automatic pause detection method works
comparable to the manual pause selection method without its disadvantages.
The introduced automatic pause detection method detected the pause state with 100% accuracy at eight participants and the control state with an accuracy between 83.64% and 100%.
Unintended selections in the pause state are almost non-existent and the number of prevented
selections in the control state is low and acceptable. Considering the prevented wrong selections
during the voluntary control periods by the automatic pause detection the number of wrong
classifications would be even lower (numbers in parentheses beside the TPs and FPs in Table3).
In conclusion, this study shows that detecting artifacts in a P300-based BCI can be used as a
very reliable and eective automatic P300 pause state detection method.
Acknowledgments
This work is supported by the FP7 research project BackHome (No. 288566). This paper only
reflects the authors views and funding agencies are not liable for any use that may be made of
the information contained herein.
References
[1] L. A. Farwell and E. Donchin. Talking o the top of your head: toward a mental prosthesis utilizing
event-related brain potentials. Electroencephalography and Clinical Neurophysiology, 70:510523,
1988.
[2] F. Piccione, F. Giorgi, P. Tonin, K. Priftis, S. Giove, S. Silvoni, G. Palmas, and F. Beverina. P300based brain computer interface: reliability and performance in healthy and paralysed participants.
Clinical Neurophysiology, 117(3):531537, 2006.
[3] A. Pinegger, S. Wriessnegger, and G. M
uller-Putz. Introduction of a universal P300 brain-computer
interface communication system. In Biomedical Engineering, volume 58, 2013.
[4] C. Pokorny, D. S. Klobassa, G. Pichler, H. Erlbeck, R. G. L. Real, A. K
ubler, D. Lesenfants,
D. Habbal, Q. Noirhomme, M. Risetti, D. Mattia, and G. R. M
uller-Putz. The auditory P300-based
single-switch BCI: Paradigm transition from healthy subjects to minimally conscious patients. Artif
Intell Med, 59(2):8190, 2013.
[5] R. Scherer, A. Schl
ogl, F. Lee, H. Bischof, J. Jansa, and G. Pfurtscheller. The self-paced Graz
brain-computer interface: methods and applications. Computational Intelligence and Neuroscience,
2007.
Article ID 076-4
DOI:10.3217/978-3-85125-378-8-77
Institute for Knowledge Discovery, BCI Lab, Graz University of Technology, Graz, Austria
petar.horki@tugraz.at, gernot.mueller@tugraz.at
Abstract
1 Introduction
In (Horki and Mller-Putz, 2013) we reported frontal EEG theta band oscillations for a mental task
related to working memory, but found no such event-related changes for a control task. Based on the
current literature, we assumed these EEG changes were not due to eye-movement artifact, such as
eye-blinks. However, having performed a more detailed analysis, precluded by a rigorous artifact
rejection and removal procedure, we were surprised to find these EEG changes absent in the cleaned
data. Thus, it seems that some users exhibited spontaneous eye-blinks correlated to the presentation of
the target stimuli, even after being instructed differently.
A closer visual inspection of the original EEG data revealed that, while performing the working
memory tasks, eight out of eleven participants blinked with their eyes more often than every second
time. Was this merely a coincidence, or where these eye-blinks somehow task-related? The latter case
could lead to a classifier adapted to eye-movements, and thus to a biased evaluation.
Spontaneous eye-blinks have been observed at breakpoints of attention during reading, listening to
speech, and while viewing videos (Nakano et al., 2013). In a recent work (Nakano et al. 2013), it was
suggested that spontaneous eye-blinks play an active role in the release of attention from external
stimuli while attentively engaging in a cognitive task. Based on these findings, we investigated
whether the eye-blinks observed during our experiment are somehow related to the working memory
task. To that end, we have redone the analysis from (Horki and Mller-Putz, 2013), this time using the
onset of eye blinks following the presentation of the target letter as triggers.
2 Methods
2.1 Subjects
Eleven healthy subjects (5 male, 6 female; 22 to 29 year old, mean age 26) participated in this
experiment. They were recruited through university publice notice boards (i.e. newsgroup, forum).
Participants gave informed consent prior to the beginning of the experiments and received monetary
compensation afterwards. Half of the participants had no previous exposure to EEG experiments. The
experiment was undertaken in accordance with the Declaration of Helsinki.
Article ID 077-1
DOI:10.3217/978-3-85125-378-8-77
2.2 Recording
The EEG was recorded with 29 active electrodes (g.tec, Guger Technologies, Graz, Austria)
overlying the frontal, central, and parietal scalp areas. In detail, the electrodes were placed at positions
AFz, F3, F1, Fz, F2, F4, FC2, FC1, FCz, FC4, C5, C3, C1, Cz, C2, C4, C6, CP3, CP1, CPz, CP2,
CP4, P3, P1, Pz, P2, P4 and POz according to the international 10 / 20 electrode system. The EEG
electrodes were referenced to the left ear lobe with the ground electrode placed on the right ear lobe.
The electrodes were integrated into a standard EEG cap (Easycap GmbH, Herrsching, Germany) with
an inter-electrode distance of 2.5 cm and connected to EEG amplifiers (g.tec, Graz, Austria).
The electrooculogram (EOG) was recorded with three active electrodes (g.tec, Guget
Technologies, Graz, Austria), positioned above the nasion, and below the outer canthi of the eyes. The
electromyogram (EMG) was recorded with four electrodes from both legs (musculustibialis anterior).
The EEG amplifiers were set up with a bandpass filter between 0.5 and 100 Hz, and a notch filter at
50 Hz. The EEG and EOG were sampled with 512 Hz, the EMG with 2000 Hz. Participants were
seated in an electrically shielded room.
2.3 Stimuli
Spoken letters of the English alphabet, generated by a text-to-speech program (AT&T Natural
Voices, AT&T, USA), were presented sequentially in alphabetical order through a right head phone
for one of several predefined words. Presenting acoustic cues through one ear only, keeps the other
ear free for incoming communication from surroundings. The task irrelevant acoustic cues were
presented in either male or female voice, and were balanced across all the subjects. Stimulus onset
asynchrony was set to 550 ms, including a 50 ms pause. Thus, it took 14.3 s for a single presentation
of the whole alphabet. For each target letter, indicated through a verbal cue, the alphabet was repeated
one to three times, for a total of two to four alphabet presentations, followed by a short break of
random length (i.e. four to six seconds).
Article ID 077-2
DOI:10.3217/978-3-85125-378-8-77
3 Results
In Table 1 the number of epochs containing eye-blinks during the 2 s following the target letter
presentation, the most frequent eye-blink trigger latency relative to the target letter presentation onset,
and the accuracy of reporting, for eight participants, are shown for the COG condition. The maximum
number of epochs per participant possible was 60, and only the participants with 30 eye-blink epochs
or more were included in the analysis. The distribution of the aforementioned eye-blink trigger latency
Article ID 077-3
DOI:10.3217/978-3-85125-378-8-77
could be approximated by a normal distribution peaking at around 0.8 s relative to the target letter
presentation onset.
#Eye-blink epochs
Trigger latency [s]
Reporting acc [%]
S1
44
0.7
90
S2
40
0.8
100
S3
47
0.5
100
S4
30
0.8
90
S5
38
0.6
100
S6
41
0.8
100
S7
43
0.6
40
S8
35
0.8
90
405
0.70.1
8920
Table 1: Number of epochs containing eye-blinks during the 2s following the target letter presentation
(COG condition), the most frequent eye-blink trigger latency relative to the target letter presentation onset,
and the accuracy of reporting, for eight participants. The maximum number of epochs per participant possible
Reanalysis of eye-blink triggered data revealed no significant (p=0.01) frontal oscillatory EEG
changes. The only time oscillatory EEG changes were observed, was when the original triggers (i.e.
onset of the target letter presentation) were employed instead, indicating that these EEG changes were
due to eye-blink artifacts. The grand average EEG event-related potentials were, as expected,
dominated by the eye-blinks used to trigger the data.
5 Acknowledgements
Contribution of Daniela S. Klobassa and C. Pokorny is gratefully acknowledged. The authors would
like to thank C. Breitwieser and M. Billinger for advice and assistance in the development of aspects
of the software used in this work. This research was supported by the European ICT Programme
Project FP7-247919 (DECODER) and the "Land Steiermark" Project ABT08-22-T-7/2013-17
(EEG@Wachkoma). This paper only reflects the authors views and funding agencies are not liable.
References
P. Horki, and G. Mller-Putz. (2013) A Novel Approach to Auditory EEG-Based Spelling.
Proceedings of the Fifth international BCI meeting, pp. 316 - 317.
P. Horki, D. S. Klobassa, C. Pokorny, and G. Mller-Putz. (2014) Evaluation of Healthy EEG
Responses for Spelling Through Listener-Assisted Scanning, IEEE Journal of Biomedical and Health
Informatics, in press.
T. Nakano, M. Kato, Y. Morito, S. Itoi, and S. Kitazawa. (2013) Blink-related momentary
activation of the default mode network while viewing videos. PNAS 110 (2), pp. 702-706.
Article ID 077-4
DOI:10.3217/978-3-85125-378-8-78
Introduction
Cerebral palsy (CP) is a non-progressive condition caused by damage to the brain during early
developmental stages (annual incidence of 2 per 1000 live births in Europe). Individuals with
CP may have a range of problems related to motor control, speech, comprehension, or mental
retardation. The majority of children with CP have normal intelligence, however, due to lack of
appropriate communication means they are classified as educational abnormal. Communication
solutions are available; however, they strongly depend on motor activity and assistance of others.
The aim of the ABC project (http://abc-project.eu) is to develop an interface for individuals with CP that improves independent interaction, enhances non-verbal communication
and allows expressing and managing emotions. Both motor activity and (electro)physiological
signals will serve as input for the interface. In fact, one key component of the ABC system
is a non-invasive electroencephalogram-based (EEG) brain-computer interface (BCI). In this
paper, we outline the basic structure of the ABC system and briefly report on the functionality
of individual modules, which were developed following user-centered design principles, i.e., in
accordance with user needs and requirements defined in terms of usability and functionality.
Article ID 078-1
DOI:10.3217/978-3-85125-378-8-78
The ABC system has three main functions: (i) Intentional communication (by use of EEG,
electromyography (EMG) and inertial measurement units (IMUs)), (ii) emotion expression and
management (by EEG, and electro-dermal activity (EDA) and blood volume pulse (BVP) sensors placed at the wrist), and (iii) health monitoring (by use of a sensing chest band with
electrocardiogram (ECG), respiration and accelerometer sensors). The Communicator application is the graphical user interface (GUI) for accessing all features of the ABC system
2.1
Communicator
The Communicator application provides dierent pointing and scanning techniques (dependent
on input signal) for selecting available options. Menu options (pictograms) describing the
actions to be executed (e.g. access to video, music or social network) are arranged in a grid
(matrix). Audio and visual feedback informs the user about selected choices and the action
being performed. Changes in the aective state of the user and vital health parameters are
shown in a status window. As impairments and capabilities in CP users vary considerably
between individuals, the communication with the system can be tailored to the specific needs
and capabilities of each individual user.
2.2
2.2.1
Communication
EEG-based interface
Developing BCIs for CP user is challenging for several reasons. Firstly, more or less frequent
spasms and involuntary movements negatively impact on the EEG signal quality. Secondly,
BCI paradigms have mostly been developed for able-bodied individuals. Hence, experimental
paradigms need to be adapted to the end-users capabilities following user-centered design principles. Thirdly, data collection is time consuming and tedious for the user. Hence, besides the
need to develop novel motivating training paradigms, machine learning methods need to be
improved in a way to handle the lack of training data and infer robust models that predict the
users intend from EEG signals [1, 2].
An imagery-based BCI (brain-switch) was developed to give CP users on-demand access
to the Communicator by row-column scanning protocols. Spontaneous EEG oscillations were
selected instead of evoked-potentials to encode messages to avoid the risk of epileptic seizures
in younger CP users. Note, that the prototype will be most useful to children. To enhance
robustness and reliability of mental imagery detection during early training, i.e., when activity
patterns are usually not well established and correct detection may be challenging, selection
redundancies were integrated. Currently user have to confirm a selection n = 3 out of m = 5
times. Test in healthy users confirm that, at the cost of an increased selection time, the
developed strategy enables users to make robust selections even when binary classification
performance was around 80%. Experiments in CP users are currently in progress.
One core component of the BCI system is a novel fully-automatic artifact reduction method
[3]. The method, based on online Wavelet decomposition, independent component analysis and
thresholding, automatically removes a number of dierent artifacts while preserving information
that is useful for BCI. Experiments in healthy users suggest that BCI operation is still possible
while users for example walk and hold objects with their hand [4].
Article ID 078-2
2.2.2
DOI:10.3217/978-3-85125-378-8-78
A head-mounted interface based on inertial technology has been developed. This interface
translates head movements and posture of the user with CP into mouse pointer positions following an absolute control (based on angular orientation). The inertial interface consists of a
headset with an inertial measurement unit (IMU). The IMU integrates a three-axis gyroscope,
accelerometer and magnetometer. The first study was focused on characterizing the motor
capabilities and abnormalities of users with CP. The task consisted in reaching targets on the
screen using the head motion. We selected some outcome metrics to evaluate the performance
of the users. Three healthy subjects participated as control group. 1) Throughput (TP), a
metric proposed by the ISO9241 that measures the usability of the interface, 2) Frequency (f)
of the head motion, aiming to identify some abnormal movements such as tremor or spasms, 3)
Range of motion (ROM) of the head, aiming to identify difficulty to maintain the posture to
control the interface. Results showed that there are no significant dierences between users with
CP and healthy control participant in the frequency domain of the head movement. However,
there are significant dierences for posture control between healthy users and subjects with CP.
This result suggests that posture control leads to a lack of usability more than high frequency
movements. The results and conclusions of this work are considered an objective description
of user needs, which should be used to optimize the inertial interface. Based on these results,
a new control mode based on angular velocity (relative) instead of orientation (absolute) has
been implemented. Additionally, the EMG signal is being studied to generate the click in a
natural way instead of dwell click that resulted complicated for users with CP.
2.3
Auditory aect induction paradigms based on short emotionally-laden sounds from the International Aective Digitized Sounds (2nd edition) database were developed to investigate
electro-dermal activity (EDA) and EEG activity of emotional processing in individuals with
CP in the absence of the ability to visually fixate on traditional emotional imagery.
An EDA-based emotion detection module was developed that discriminates between five
distinct aective states: high arousal positive, low arousal positive, neutral, low arousal negative
and high arousal negative. An average 5-class classification performance of 80% was computed
over 12 CP users. To implement a reaction of the system to the detected emotional state e.g.
play music, we have furthermore developed an emotion management system (EMS). EMS allows
the caregiver of CP users to define how the system reacts to the current detected emotional
state. EMS was evaluated with 15 CP users. All users appreciated the EMS and gave many
suggestions for improvements, the majority of these suggestions has been implemented.
EEG-based emotion detection is work in progress. Time and frequency domain EEG features of emotion, such as the late positive potential (LPP), as well as valence-dependent interhemispheric alpha and event-related de/synchronization eects were analyzed. Promising preliminary LPP and inter-hemispheric alpha activity results are comparable to literature from a
healthy population. Movement artifact-contaminated EEG, however, resulted in low accuracies
in CP users. The custom developed artifact rejection method is currently being adapted to
clean the data.
2.4
Health Monitoring
The main health-related problem of individuals with CP is dyspnea. In fact, 59% of the immediate causes of death in this population are diseases related to respiratory system; other
Article ID 078-3
DOI:10.3217/978-3-85125-378-8-78
21% are related to infections and inflammations of the lungs (pneumonia). Accordingly, a
health monitoring module was integrated into the ABC prototype. Conceptually, it is divided
into clinical health monitoring for severely aected people and physical activity monitoring for
mildly aected ones. Movement artifacts make clinical health-state monitoring challenging. The
system, however, provides helpful information on vital parameters during sleep or periods of
reduced movement activity. The accelerometer embedded in the chest band provides important
information on the quantity of locomotion performed by the individual during the day. This
information can be therapeutically important for mildly to moderately aected individuals.
The ABC prototype is a modular and flexible system that can be adapted according to the
users need. This is especially important for CP users as impairments and capabilities vary
considerably between individuals. Pilot studies with the head mounted IMU interface and
emotion detection by EDA activity show that these methods can be applied reasonably well
and are accepted by CP users. Besides improving robustness and performance of the developed
functions, in a next step individuals modules will be tested in daily use by CP users.
Movement artifacts are one major issue for the sensor network and EEG signals are most
severely aected. To ensure that correlates of cortical activity are used for BCI control, focus
was put on the characterization of clean EEG and the development of artifact reduction and
detection methods. A novel robust protocol for imagery-BCI based interaction has been tested
in healthy users and is currently being evaluated in CP users. Note, results in healthy users
cannot directly be transferred to real life situation in users with CP. In a next step the prototype
will be updated according to the CP users comments and training paradigms will be adapted.
The updated ABC prototype system will provide an excellent basis to study advantages and
limitations of imagery-BCI control in CP. We expect that, the system will allow us to identify
scenarios and applications where CP users truly benefit from the use of BCI technology.
Acknowledgments
This work was supported by the FP7 Framework EU Research Project ABC (No. 287774).
This paper only reflects the authors views and funding agencies are not liable for any use that
may be made of the information contained herein.
References
[1] I. Daly, M. Billinger, J. Laparra-Hern
andez, F. Aloise, M. Lloria Garcia, J. Faller, R. Scherer, and
G. M
uller-Putz. On the control of brain-computer interfaces by users with cerebral palsy. Clinical
Neurophysiology, 124(9):17871797, 2013.
[2] I. Daly, J. Faller, R. Scherer, C.M. Sweeney-Reed, S.J. Nasuto, M. Billinger, and G.R. M
uller-Putz.
Exploration of the neural correlates of cerebral palsy for sensorimotor bci control. Front Neuroeng,
7:20, 2014.
[3] I. Daly, R. Scherer, M. Billinger, and G. M
uller-Putz. Force: Fully online and automated artifact
removal for brain-computer interfacing. IEEE Trans Neural Systems Rehab Eng, accepted.
[4] R. Scherer, J. Wagner, M. Billinger, and G. M
uller-Putz. Online artifact reduction and sequential
evidence accumulation enhances robustness of thought-based interaction. In Proceedings of the 6th
International BCI Conference, Sep. 16-19, 2014, Graz, Austria, 2014.
Article ID 078-4
DOI:10.3217/978-3-85125-378-8-79
INSERM U1028, CNRS UMR5292, Lyon Neuroscience Research Center, Brain Dynamics and
Cognition Team, Lyon, F-69000, France, 2 University Lyon 1, Lyon, F-69000, France
Jeremie.mattout@inserm.fr
Abstract
A major challenge in Brain-Computer Interfaces (BCI) is the optimization of performance,
regardless of the within and between user variability. A promising option is to move towards
adaptive BCI that can accommodate fluctuations over time and subjects. An obvious criterion to
be optimized is the speed-accuracy trade-off. We implemented a BCI whose decision speed or
reaction time depends on the reliability of accumulated evidence. We instantiated a probabilistic
classifier whose outcome can be up-dated online based on new incoming information. An
entropic measure then enables us to derive an optimal stopping strategy, in a user-dependent
fashion. We evaluated the proposed approach on 11 participants, during online P300-Spelling.
We thus quantified the beneficial effect of this method. Importantly, we show that the adaptive
mode creates a virtuous circle such that: the higher the spelling accuracy, the more the
participant engages in the task and, in return, the higher the motivation the higher the BCI
performance.
1 Introduction
A central question in BCI is how fast the system can produce a reliable command. Although BCI
performances are often evaluated and reported using measures of bit rate, this one is rarely explicitly
maximized online by the system. The vast majority of P300-Speller studies do fix the time for spelling
a letter in a way that is supposed to optimize speed given some expected level of accuracy. Yet it is
clear that the same strategy might not be optimal for every individual, at any time. In a given subject,
adaptation would consist of varying the number of flashes per trial, given some measures of the users
level of engagement. Such a system should stop earlier whenever the user is very well focused on the
task, and it would keep acquiring data whenever the user is unfocused and produces ambiguous
signals. Hence for a given averaged trial duration, we expect an increase in accuracy with the adaptive
approach compared to the traditional one. Moreover, in the case of BCI, such an adaptive approach
might also trigger up the users motivation. If so, we expect the online results to reflect not only the
improvement due to the adaptive method but also some further positive effect due to the ensuing boost
in motivation. This issue of optimizing the stopping criterion has already been addressed in a couple
of studies relying on different measures of how robust is the decision about to be made (Serby et al.
2005; Jin et al. 2011). Remarkably though, the authors used a still fairly rigid approach instead of a
fully flexible one. Their approaches were restricted to stop acquiring new observations after some
varying amount of repetitions (blocks), rather than considering the possibility of deciding after an
arbitrary amount of flashes (trials). As a consequence, the machines reaction time can only take a few
discrete values. Our approach proposes an information theoretic and probabilistic criterion, which
enables us to generalize this optimal strategy, by allowing the machine to stop at any time during the
evidence accumulation process.
Article ID 079-1
DOI:10.3217/978-3-85125-378-8-79
2 Method
2.1 Data acquisition
Eleven healthy subjects took part in this study (4 men, mean age = 26.96.4 (SD), range 19-40).
They all signed an informed consent approved by the local Ethical Committee. We used a traditional
6x6 matrix made of letters (A-Z), digits (1-9) and an additional symbol for blanks (_). Pseudo-random
groups of letters (adapted from Townsend et al, 2010) were flashed alternatively for 80ms, while the
SOA was set to 150ms. The whole experiment included one training followed by three test sessions.
The former consisted of 15 characters spelled with 10 repetitions and each test session was made of
20 5-letter words. Target letters were defined prior to the experiment and indicated to the subject by a
green circle. Subjects were instructed to visually fixate the target and count how many times it was
flashed.
We here report the online and offline comparison between two experimental conditions, which
differed in the way the decision was made: one used a time-based decision (called the fixed condition)
and the other involved an accuracy-based decision (called the adaptive condition). In the fixed
condition, each trial consisted of 60 flashes (5 repetitions). In the adaptive condition, there was a
maximum of 180 flashes by item (15 repetitions), but the actual number of flashes varied from one
trial to the next, depending on the pre-determined threshold and the entropy of the current posterior
probability distribution. The threshold was determined from each individual training set so that the
average number of flashes equalled roughly 60 (5 repetitions). Participants spelled 4 blocks of five 5letter words per condition and conditions were presented pseudo-randomly over time. The experiment
lasted about an hour in total for each subject.
EEG data were recorded from 9 electrodes (Pz, P7, P8, P3, P4, PO9, PO10, O1, O2) following the
extended 10-20 system and referenced to the nose. Data were digitized at 1000 Hz, band-pass filtered
between 1 and 20 Hz and down-sampled at 100 Hz.
Article ID 079-2
DOI:10.3217/978-3-85125-378-8-79
3 Results
In the fixed condition, the online spelling accuracy was 71% 16 (SD), which corresponds to 18.8
bits/minute. In the adaptive condition, it was 80% 14 (SD), for an average of 57 4 (SD) flashes,
which corresponds to 24.1 bits/minute. (Figure 1.A).
Wilcoxon tests revealed that both the spelling accuracy and the bit rate are significantly higher in
the adaptive condition compared to the fixed condition (p<0.01 for both tests). Importantly, the
number of flashes was not significantly different between the two conditions (p=0.1), it was even
slightly lower in the adaptive condition.
To evaluate the effect of motivation, data from both conditions were reanalyzed offline, using the
same time-based stopping criterion: a decision was made after the 24 first flashes. The obtained
spelling accuracy proved significantly higher in the adaptive than in the fixed condition (p<0.01)
(Figure 1.B). Since the number of observations was the same for both conditions, the ensuing bit rate
proved also significantly higher in that same condition (p<0.01).
Figure 1. A/ Online spelling accuracy as a function of the averaged number of flashes for each condition: fixed (blue
diamond), adaptive (red square). Error bars indicate the standard error of the mean. B/ Offline spelling accuracies obtained with
the same datasets reanalyzed using a time-based criterion (decision made after 24 flashes). P-value: ** p<0.01.
4 Discussion
In this P300-Speller study, we first developed a new classification approach which up-dates the
belief of the machine about target location, after each single electrophysiological observation. This
single-trial based classification enabled us to propose and evaluate an adaptive decision making,
which consist of implementing an optimal reaction time strategy in the machine, allowing for short
spelling when the first few incoming pieces of evidence are strong enough and vice versa.
Adaptive decision making was proposed to overcome the limitation of the traditional time-based
decision criterion used in the P300-speller and BCI in general. Indeed, a machines adaptive decision,
based on some information or accuracy criterion, allows for an optimal stopping strategy. In other
words, the reaction time of the machine can be optimized in a way that mimics the reaction time of
human beings, which highly relies on the amount and quality of accumulated evidence from incoming
sensory information. What is expected from such a strategy is to produce a short reaction time,
whenever the accumulated evidence in favor of a given single choice is strong.
Article ID 079-3
DOI:10.3217/978-3-85125-378-8-79
Conversely, reaction time should be longer, whenever evidence is noisy and ambiguous, since
more data will be needed to make a reliable decision. Compared to a time-based criterion, this can
accommodate the slow intrinsic fluctuations of the electrophysiological signals, which might be due to
fluctuations in attention.
In the P300-Speller, this is particularly relevant, since sustained attention is what is required from
the subject to keep performing the task efficiently. To implement adaptive decision making, we used a
classical entropic measure, which efficiently summarizes and quantifies the uncertainty about our
belief, the latter being represented by a probabilistic distribution.
The first significant effect we indeed observed with this new criterion is that, for the same spelling
duration, the user is able to spell letters more accurately. The time saved by stopping the flashes
earlier, whenever possible, was efficiently reallocated to letters that required longer stimulation time
in order to be accurately identified. Equivalently, given an objective in terms of accuracy, fewer
flashes should be required with adaptive decision making, on average.
Secondly, a very interesting and significant effect of motivation could be observed online. Indeed,
spelling accuracy was found higher for adaptive sessions than for fixed ones when these datasets were
reprocessed offline with the same stopping criterion. This suggests that the subjects were on average
more engaged into the task during the adaptive session, thus producing electrophysiological responses
with a larger signal-to-noise ratio, which resulted in higher spelling accuracies. Indeed, the N1 and
P300 responses, which are the electrophysiological responses used to identify the target, are known to
reflect the participants involvement in the task (Treder and Blankertz, 2010). The P300 has also been
shown to increase with motivation in a BCI context (Kleih et al., 2010). The fact that spelling
accuracy is optimized by continuously and explicitly adapting the stimulation to the users need
appears to create a virtuous cycle by boosting the users motivation.
5 Acknowledgements
This work was carried out as part of the CoAdapt project, funded via ANR-09-EMER-002-01.
References
Jin J., Allison B.Z., Sellers E.W., Brunner C., Horki P., Wang X. and Neuper C. 2011 An adaptive
P300-based control system. J Neural Eng 8, 3:036006.
Kleih S.C., Nijboer F., Halder S. and Kbler A. 2010 Motivation modulates the P300 amplitude
during brain-computer interface use. Clin Neurophysiol 121, 7:1023-31.
Rivet B., Souloumiac A., Attina V. and Gibert G. 2009 xDAWN algorithm to enhance evoked
potentials: Application to brain-computer interface. IEEE Trans Biomed Eng 56, 8:2035-43.
Serby H., Yom-Tov E. and Inbar G.F. 2005 An improved P300-based brain-computer interface. IEEE
Trans Neural Syst Rehabil Eng 13, 1:89-98.
Townsend G., LaPallo B.K., Boulay C.B., Krusienski D.J., Frye G.E., Hauser C.K., Schwartz N.E.,
Vaughan T.M., Wolpaw J.R. and Sellers E.W. 2010 A novel P300-based brain-computer
interface stimulus presentation paradigm: Moving beyond rows and columns. Clin
Neurophysiol 121, 7:1109-20.
Treder M.S. and Blankertz B. 2010 (C)overt attention and visual speller design in an ERP-based
brain-computer interface. Behav Brain Funct 6, 1:28.
Wolpaw J.R., Birbaumer N., Heetderks W.J., McFarland D.J., Peckham P.H., Schalk G., Donchin E.,
Quatrano L.A., Robinson C.J. and Vaughan T.M. 2000 Brain-computer interface technology:
A review of the first international meeting. IEEE Trans Rehabil Eng 8, 2:164-73.
Article ID 079-4
DOI:10.3217/978-3-85125-378-8-80
Introduction
Language models can optimize ERP-based BCI speller performance by incorporating the probabilistic information about how letters are ordered in words when making character selection
decisions [13]. We developed a dynamic stopping algorithm for the ERP-based P300 speller
that uses a Bayesian approach to determine the amount of data collection based on a probabilistic level of confidence that a character is the target. The dynamic stopping algorithm
with uniform initialization priors resulted in significant increase in bit rate from static data
collection [4], with additional improvements observed with the inclusion of a bigram language
model in the character initialization process [5]. In this study, we expand to a higher order
model that uses all of the users spelling history, not just the previous character selection, with
the hypothesis that it will improve the predictive capacity of the algorithm due to an increasing
number of selected characters reducing the set of possible intended characters. However, the
utility of a higher order model is dependent on the number of preceding characters that are
selected correctly as these are used to obtain the correct target word prefix for initializing subsequent character probabilities. We consider a solution for misspellings by using information
about character likelihoods post-data collection [6] to weight possible prefixes.
Methods
Participants (n = 20) were recruited from the student and work population at Duke University,
who gave informed consent prior to participating in the study. Participants performed word
copy-spelling tasks online with a 9 8 matrix speller grid using the Bayesian dynamic stopping
algorithm (detailed in [5]) with dierent language models, with order counter-balanced.
Prior to the Bayesian update process, the language model is used to initialize character
probabilities given a sequence of previously selected characters, ATT 1n+1 = aT n+1 , ..., aT 1 :
!
X 1
1
T 1
CT )
where P (Ci,T =
is the initialization probability of character Ci being the T th character in the target word, CT ; P (Ci,T |ATT 1n+1 ) is the probability that the next character is Ci
Article ID 080-1
DOI:10.3217/978-3-85125-378-8-80
given the previously selected characters ATT 1n+1 , which is based on the order of the language
model; denotes the weight of the language model; 1 denotes the P
weight of a uniform
distribution, which is an error factor to account for possible misspellings; N AC N1 is the sum
of the non-alphabetic character (NAC) probabilities, which is subtracted from 1 to normalize
the probabilities. The language model probabilities were derived from a corpus compiled by
Norvig [7].
2.1
Bigram model
2.2
1
n+1 ),
In the n-gram model, the conditional probability, P (Ci,T |ATT 1n+1 ), depends on all the previously spelled characters, ATT 1n+1 = a1 , ..., aT 1 . However, when using the n-gram model, there
is the possibility of an erroneously selected character generating an invalid prefix (e.g. VIS2
for the word VISUAL), or an incorrect valid prefix which can lead to the wrong initialization
probabilities (e.g. ANC for the word INCOME), unless the erroneous character is revised.
ERP-based P300 classifier confidences post-data collection can provide some information about
the likelihood of being the target character [6]. We denote the P300 classifier confidences with
a QT = [Q1 , Q2 , ..., QT ] matrix, where Qt = [qt1 , ..., qtN ]| is a column vector of P300 classifier
confidences post-data collection for all N grid characters for the tth selected character. An example Q matrix is shown in Figure 1. The user intended to spell the word BEHIND but the
simulation yielded the word BLGIND. While the target character may not have the highest
probability post-data collection, one of the next most probable characters usually is the target.
The Q matrix can thus be used to select the k most likely prefixes from a dictionary to
calculate initialization probabilities prior to spelling the T th character. The set D kT consists of
valid
in the dictionary with the top k values of the product of their character likelihoods
Q prefixes
j
T 1 l(Dt )
qt
, and the prefixes are retained from one character to the next to generate the
1
next k most likely prefixes. The conditional probability in the initialization step to select the
T th character thus also depends on the P300 classifier confidences via the QT 1 matrix:
j
w(D ) = P
P (Ci,T |AT1
, QT
QT
1
D j 2D k
T
)=
j
1 l(Dt )
qt
Q
j
T 1 l(Dt )
q
t
1
D j 2D k
T
(2)
(3)
where w(Dj ) is the weight of the prefix Dj ; l(Dtj ) is the label for the tth character in prefix Dj ;
P (Ci,T |Dj ) is the initialization probability that the T th character is Ci , given prefix Dj .
Results
Figure 2 shows each participants selection accuracy. For most participants, the accuracy from
the bigram to n-gram model was similar or noticeably improved, (p < 0.09). The Q matrix
tends to be sparse, meaning that only a few characters are considered likely to be the target.
Article ID 080-2
DOI:10.3217/978-3-85125-378-8-80
These characters typically have at some point been in the same flash group as the target, and
this is often the source of most character selection errors. We hypothesize that the strong priors
introduced by a higher order language model give the target character an added advantage to
prevent an erroneous character selection, thereby sometimes improving accuracy.
Figure 3: Participant performance comparison between on-line dynamic stopping with dierent language models. (A) Average character selection time, with standard error bars, (B) Bit Rate.
.
Figure 3A shows the average amount of time per character selection for each participant.
A significant decrease was observed in character selection time with the n-gram DAPS model
(p < 0.002). In Mainsah et al. [5], o-line analysis of participant EEG data revealed that the
rate of convergence to the threshold probability in dynamic data collection increased with the
inclusion of the bigram language model. We hypothesize that the stronger priors introduced
Article ID 080-3
DOI:10.3217/978-3-85125-378-8-80
by the higher order model further causes the character probabilities to converge faster.
The accuracy and average character selection time were used to calculate bit rate, including
the time pauses between character selections [8]. Figure 3B shows participant bit rates with
both algorithms. Due to similar accuracy levels and a significant reduction in character selection
time, most participants observed significant improvement in their performance (p < 0.007),
with on average 26% increase in bit rate. Performance improvements with then-gram model
are consistent with o-line analysis performed on EEG data from [5].
Discussion
The relatively slow communication rates of ERP-based BCI speller systems can be improved
by exploiting the predictability of language. However, sometimes the manner of integration of
language information in the ERP speller can lead to a decrease in performance due to increased
task difficulty e.g. selecting from a drop-down menu in a predictive speller as in [2]. Our
online results indicate there is potential to improve performance with a higher order language
model in dynamic data collection, with additional consideration to minimize erroneous character
revisions when used in combination with a dictionary. Further development includes adapting
the algorithm for sentence spelling tasks, where word-space boundaries are important. There
is the potential to further enhance performance using natural language processing tools such as
word prediction and/or dictionary-based spelling correction. For example, the algorithm can be
adapted to include likely word alternatives generated from the prefixes which can be displayed
directly in the speller matrix [3], as this has been shown to not negatively aect performance.
Acknowledgments
This research was supported by NIH/NIDCD grant number R33 DC010470-03.
References
[1] U. Orhan, D. Erdogmus, B. Roark, B. Oken, and M. Fried-Oken. Oine analysis of context
contribution to erp-based typing bci performance. Journal of Neural Engineering, 10(6), 2013.
[2] D. Ryan, G. Frye, G. Townsend, D. Berry, G. Mesa, N. Gates, and E. Sellers. Predictive spelling
with a p300-based brain-computer interface: Increasing the rate of communication. Int J Hum
Comput Interact, 27(1):6984, 2011.
[3] T. Kaufmann, S. Volker, L. Gunesch, and A. K
ubler. Spelling is just a click away - a user-centered
brain-computer interface including auto-calibration and predictive text entry. Front Neurosci, 6(72),
2012.
[4] C. Throckmorton, K. Colwell, D. Ryan, E. Sellers, and L. Collins. Bayesian approach to dynamically
controlling data collection in p300 spellers. IEEE Trans Neural Syst Rehabil Eng, 21(3):50817,
2013.
[5] B. Mainsah, K. Colwell, L. Collins, and C. Throckmorton. Utilizing a language model to improve
online dynamic data collection in p300 spellers. IEEE Trans Neural Syst Rehabil Eng, 22(4), 2014.
[6] B. Mainsah, K. Morton, L. Collins, and C. Throckmorton. Achieving spelling correction in p300
spellers without error-related potentials. Society for Neuroscience Conference, 2013.
[7] Peter Norvig. How to write a spelling corrector. http://norvig.com/spell-correct.html, 2007.
[8] D.J. McFarland, W.A. Sarnacki, and J.R. Wolpaw. Brain-computer interface (bci) operation: Optimizing information transfer rates. Biological Psychology, 63(3):237251, 2003.
Article ID 080-4
DOI:10.3217/978-3-85125-378-8-81
Brain Embodiment Lab, School of Systems Engineering, University of Reading, Reading, U.K.
m.n.wairagkar@student.reading.ac.uk
Abstract
Various complex oscillatory processes are involved in the generation of the motor
command. The temporal dynamics of these processes were studied for movement
detection from single trial electroencephalogram (EEG). Autocorrelation analysis was
performed on the EEG signals to find robust markers of movement detection. The
evolution of the autocorrelation function was characterised via the relaxation time of the
autocorrelation by exponential curve fitting. It was observed that the decay constant of
the exponential curve increased during movement, indicating that the autocorrelation
function decays slowly during motor execution. Significant differences were observed
between movement and no moment tasks. Additionally, a linear discriminant analysis
(LDA) classifier was used to identify movement trials with a peak accuracy of 74%.
1 Introduction
Neural correlates of movement have been increasingly explored for applications in brain-computer
interfacing (BCI) as they enable very intuitive control [1]. Previous studies have suggested the
possibility of the involvement of various complex oscillatory processes in motor command generation
[2]. Most research focuses on the spectral domain of the EEG for detecting movement [1]. This
project takes a different approach on understanding the motor commands by studying the temporal
dynamics of the EEG using novel features.
The principle of Event Related (De)synchronization (ERD/S) corresponding, respectively, to
attenuation and increase predominantly in mu power and beta power [3], is widely used for detecting
movements. Single trial analysis is important for online BCI implementation. Although these spectral
features are able to reliably detect the motor command, they may not completely describe all aspects
of motor command generation available in the EEG and do not indicate how one part of the EEG
depends on another. Moreover, it is challenging to compute accurate instantaneous frequency
distributions without compromising the temporal resolution and inducing delays in the motor
command detection. Utilizing EEG signals high temporal resolution, we have developed a novel
method of detecting motor commands on a single trial basis by performing time domain analysis.
Continuous autocorrelation analysis has been used for extracting temporal features from EEG.
In this study, different correlation based time domain analysis methods were explored for
understanding the neural basis of motor command generation. Previous studies report that the first
zero-crossing time, the time at which the autocorrelation function crosses 0, increases before and
during voluntary movement [2]. Autocorrelation analysis was motivated by looking at temporal
dependencies in the EEG. This approach was expanded by considering the evolution of the
autocorrelation function over time and studying changes in relaxation time of the decay of the
autocorrelation.
Article ID 081-1
DOI:10.3217/978-3-85125-378-8-81
2 Methods
2.1 Experimental Paradigm
EEG was recorded from three participants. All the participants were males (2 right handed and 1
left handed) with ages 25, 23 and 29 years. An experimental paradigm was developed for recording
self-paced index finger tapping of the right and left hand using tools from the BioSig toolbox [4]. A
fixation cross was displayed on the screen placed at eye level for 2 sec at the beginning of each trial
and followed by a textual cue for right or left hand finger tapping or resting. Participants were asked
to perform a self-paced single finger tap at a random time of their choice within the 10 sec window
following the cue. Each trial was followed by a random break of 1 to 1.5 sec. The experiment was
broken down into separate runs of 12 trials with 4 cues per class displayed in random order to avoid
pattern learning by the participants. The experimental setup is illustrated in Figure 1.
A bespoke tapping device was developed using a programmable microcontroller to record the
tapping signals from both the fingers. In order to mark the exact onset of the movement in EEG, both
EEG and finger tapping signals were recorded simultaneously and co-registered using tools developed
as part of the TOBI framework [5]. EEG from 19 electrodes (impedances kept below 8k) was
recorded using a Deymed TruScan amplifier with a sampling frequency of 1024 Hz. Forty trials for
each of the three conditions were recorded for each participant.
Article ID 081-2
DOI:10.3217/978-3-85125-378-8-81
3 Results
Increases in the value of around the onset of movement were clearly observed in most trials.
The values of the resting state trials appeared stable throughout the trial Features around the onset of
the movement showed statistically significant differences between tap vs. rest conditions (see Figure
5). The most responsive channels for right and left hand tap differed between participants. Using the
autocorrelation function decay constant movement could be detected from single trials.
Figure 6: Classification
accuracy for left hand,
participant 3 in Fz. The
horizontal line indicates
statistical significance (p<0.05).
LDA classification accuracies for all the participants were plotted for the classification of
movement of right vs. rest and left vs. rest. The accuracy obtained was considered statistically
significant at p<0.05. A peak accuracy of 74% was achieved for participant 3 (shown in Figure 6).
Article ID 081-3
DOI:10.3217/978-3-85125-378-8-81
Table 1 shows classification accuracies for all the participants. Significant accuracies were obtained
for all the participants, except for participant 1 right hand tapping condition. Its interesting to note
that participant 3, who exhibited the smallest ERD response, gave the best results using this method.
Participant
Right hand tapping
Left hand tapping
classification accuracy (%)
classification accuracy (%)
1
58.0
68.4
2
66.0
69.0
3
68.0
74.0
Table 1: Movement classification accuracies. Statistical significance (p<0.05) is indicated in bold.
References
[1] H. Hwang, et al., EEG-based Brain-Computer Interfaces (BCIs): A Thorough Literature
Survey, International Journal of Human-Computer Interaction, 29(12), 2013.
[2] Y. Hayashi, et al,Analysis of EEG Signal to Detect Motor Command Generation Towards
Stroke Rehabilitation, in Converging Clinical and Engineering Research on
Neurorehabilitation, Berlin Heidelberg, 2013.
[3] G. Pfurtscheller and F. Lopes da Silva, Event-related EEG/MEG synchronization and
desynchronization: basic principles, Clin. Neurophysiol., 110(11) pp. 1842-1857, 1999.
[4] BioSig for Matlab and Octave an open source software library for biomedical signal
processing, in Available online: http://biosig.sf.net, 2005.
[5] C. Breitwieser, I. Daly, C. Neuper and G. Muller-Putz, Proposing a Standardized Protocol for
Raw Biosignal Transmission, {IEEE} Trans. Biomed. Eng , 59(3), pp. 852-859, 2012.
[6] T. Jung, S. Makeig , C. Humpharies and T. Lee, Removing electroencephalographic artifacts
by blind source separation, Psychophysiology, 37, p. 163178, 2000.
Article ID 081-4
DOI:10.3217/978-3-85125-378-8-82
!"#$%&'()*+,"-+./012134"516".1307-78+"
6+/)(727-)-710")5-+6"*-619+:"*;68+4"10";*)(727-4
!"#$%& '$()**$++,-."#/&-01-23)$45,-.$36$&-78$9&-2":)8;,-08$(*$#&-0&<=$33">,
?3)#$&-@"==$+,A,-B&C8&-D(*"3E$+,A,-03&C(-F"G)3)>,-HC$34)8<)-I""J;,-D#J8)&
2K:3)85-&#J-L"#&*)33&-!&**$&+5+-
M"#J&N$"#)-.&#*&-BC%$&,-O'00.,-'"<),-O*&3P
5
Q#$6)8($*P-"E-IK8N:C8G,-H)8<&#P
;
Q#$6)8($*P-"E-H8&N,-DC(*8$&
>
Q#$6)8($*P-"E-BCR)<:"C8G,-BCR)<:"C8G
A
L)=*1-"E-0"<=C*)8,-0"#*8"3-&#J-!&#&G)<)#*-7#G$#))8$#G-S-TB&-.&=$)#N&T-Q#$6)8($*P-"E-'"<)
m.risettisantalucia.it
!(*-6).@4) - *8)&*<)#* - "E - %"G#$*$6) - $<=&$8<)#* - &#J - =8)6)#*$"# - "E - EC8*4)8 - J)%3$#) - &8))(()#*$&3-&(=)%*(-"E-(*8"9)-8)4&:$3$*&*$"#1
D-#)U-V8&$#-W)C8&3-0"<=C*)8-O#*)8E&%)-XVW0OY-(P(*)<-U&(-J)6)3"=)J-&J"=*$#GC()8S%)#*)8)J - J)($G# - XQ0LY - =8$#%$=3)(, - *" - "=)8&*) - :$"S - &#J - #)C8"SE))J:&%9 - :&()J%"G#$*$6) - *8&$#$#G - <"JC3)( - U$*4 - *4) - &$< - "E - $<=8"6$#G - ="(*S(*8"9) - %"G#$*$6)8)4&:$3$*&*$"#-"C*%"<)(1-@4)-C(&:$3$*P-"E-(C%4-VW0OS:&()J-(P(*)<-E"8-%"G#$*$6)-*8&$#$#GU&( - *)(*)J - :"*4 - U$*4 - 4"(=$*&3$N)J - =&*$)#*( - XE$8(* - *)(*$#G - =4&()Y - &#J - *4"() - &* - 4"<)
X()%"#J-*)(*$#G-=4&()Y1-ZC)(*$"##&$8)(-U)8)-&J<$#$(*)8)J-*"-+A-)R=)8*(-$#-8)4&:$3$*&*$"#,[-(*8"9)-=&*$)#*(-&#J-;-%&8)G$6)8(1-Q()8(\-#))J(-&#J-8)]C$8)<)#*(-4&6)-:))#,-&#J-U$33EC8*4)8 - :) - %"33)%*)J, - JC8$#G - &33 - *4) - )R=)8$<)#*&3 - =4&()( - *" - $#*)G8&*) - &#J - 8)E$#) - *4)(P(*)<-&%%"8J$#G-*"-*4)-$*)8&*$6)-%P%3$#G-"E-*4)-Q0L1
@4)-=8)3$<$#&8P-E$#J$#G(-"E-*4$(-(C86)P-$#J$%&*)J-*4&*-*4)-#)U3P-=8"="()J-VW0OS
&(($(*)J-*8&$#$#G-E"8-%"G#$*$6)-8)4&:$3$*&*$"#-U&(-U)33-&%%)=*)J-:P-*4)-<&/"8$*P-"E-(*8"9)=&*$)#*( - X^_`Y - &#J - =8"E)(($"#&3( - X[a`Y1 - @4) - 3)6)3 - "E - C(&:$3$*P - &#J - &%%)=*&:$3$*P8)6)&3)J-:P-*4)-(C86)P-$(-)#%"C8&G$#G-E"8-*4)-(P(*)<-*8&#(3&*$"#-$#-%3$#$%&3-8"C*$#)-C(&G)C="#-*4)-*8&$#$#G-=8"6)(-)EE)%*$6)-E"8-$<=8"6$#G-%"G#$*$"#1
+ O#*8"JC%*$"#
0"G#$*$6)-J)E$%$*(-"%%C8-$#-*4)-<&/"8$*P-"E-(*8"9)-=&*$)#*(-Xb&8$#G,-5aa5Y-&#J-4&6)-&-4$G4-$<=&%*"# - *4)$8 - ]C&3$*P - "E - 3$E) - &#J - *4&* - "E - *4)$8 - E&<$3$)( - X0&8"JSD8*&3 - )* - &31, - 5aacY1- 0C88)#*3P,#)C8"=(P%4"3"G$%&3-8)4&:$3$*&*$"#-3&%9(-$#-$#*)#($*P-&#J-JC8&*$"#-"E-(=)%$E$%-8)4&:$3$*&*$"#-(*8&*)G$)(1
D(-P)*,-=&*$)#*(-J"-#"*-4&6)-*4)-"=="8*C#$*P-*"-%"#*$#C)-*4)-8)4&:$3$*&*$"#-)#J)&6"8-&E*)8-J$(%4&8G),&#J-*4)P-&8)-3)E*-U$*4-#"-"=*$"#(-E"8-&-=8"E)(($"#&3-*8&$#$#G-&#J-<"#$*"8$#G-"E-"C*%"<)(-&*-4"<)1-@4) +-!&(*)8<$#J)J-7&(P04&$8-&#J-%8)&*)J-*4)-E$8(*-(*&:3)-6)8($"#-"E-*4$(-J"%C<)#*
5-08)&*)J-*4)-E$8(*-J8&E*-"E-*4$(-J"%C<)#*
Article ID 082-1
DOI:10.3217/978-3-85125-378-8-82
7Q - EC#J)J - =8"/)%* - 0dW@'D.@- XUUU1%"#*8&(*S=8"/)%*1)CY - $( - %C88)#*3P- J)=3"P$#G - & - V8&$# - W)C8&30"<=C*)8 - O#*)8E&%) - XVW0OYS:&()J - *)%4#"3"GP - *" - =8"6$J) - %"G#$*$6) - *8&$#$#G - <"JC3)( - *" - $<=8"6)%"G#$*$6)-8)4&:$3$*&*$"#-"C*%"<)(-$#-$#(*$*C*$"#&3$N)J-=&*$)#*(,-&#J-&3("-*"-(C=="8*-=&*$)#*e(-*8&$#$#G-&*4"<)-:P-8)<"*)-%"#*8"33)J-(C=)86$($"#1-@4)-*)8<-VW0O-J)#"*)(-E"8-*4)-$<=3)<)#*&*$"#-U$*4$#-*4)0dW@'D.@-(P(*)<-"E-()6)8&3-*8&$#$#G-<"JC3)(-E"8-&**)#*$"#,-J)%3&8&*$6)-<)<"8P,-U"89$#G-<)<"8P&#J - )R)%C*$6) - EC#%*$"#(- *4&* - &8) - "=)8&*)J - :P - :$"<)*8$% - ($G#&3( - (C%4 - &( - 4)&8*:)&* - Xb'FY,
73)%*8""%C3"G8&<-X7dHY-&#J-73)%*8"<P"G8&=4P-X7!HY-&#J-*4)-73)%*8")#%)=4&3"G8&<-X77HY1-V&()J"#-*4)-Q0L-=8$#%$=3)(-X!&GC$8),-+cc^Y,-U)-$*)8&*$6)3P-$#6"36)J-J$EE)8)#*-C()8(-(C%4-&(-=8"E)(($"#&3(,=&*$)#*(-X$#-*4)-4"(=$*&3-&#J-&*-4"<)Y-&#J-%&8)G$6)8(-*"-)6&3C&*)-*4)-(P(*)<-!"#$%&%'(1-b)8)-U)-8)="8**4) - =8)3$<$#&8P - 8)(C3*( - "E - *4) - (C86)P - :&()J - "# - ]C)(*$"##&$8) - J&*& - %"33)%*)J - $# - *4) - 8)4&:$3$*&*$"#E&%$3$*$)(1
5 !&*)8$&3(-&#J-!)*4"J(
51+ D(()((<)#*-"E-(P(*)<-C(&:$3$*PD*-*4)-%C88)#*-(*&G)-"E-*4)-0dW@'D.@-=8"/)%*,-+A-=8"E)(($"#&3(-X=(P%4"3"G$(*(,-#)C8"3"G$(*(,#)C8"=(P%4"3"G$(*( - &#J - *4)8&=$(*( - E8"< - ()6)8&3 - 7Q - %"C#*8$)(Y, - [ - (*8"9) - =&*$)#*( -X&J<$**)J - *" - *4)#)C8"8)4&:$3$*&*$"#-J)=&8*<)#*-E"8-(*&#J&8J-8)4&:$3$*&*$"#-*8)&*<)#*-&#J-*4)-)R=)8$<)#*&3-$#*)86)#*$"#Y&#J-;-%&8)G$6)8(-4&6)-:))#-$#6"36)J-*"-)6&3C&*)-*4)-C(&:$3$*P-"E-*4)-*8&$#$#G-<"JC3)-E"8-J)%3&8&*$6)<)<"8P1 - .C%4 - <"JC3) - 8)3$)( - "# - 77H - ()#("8$<"*"8 - 84P*4< - X.!'Y - ()3ES<"JC3&*$"#1 -@4) - *8&$#$#G
%"#($(*(-"E-&-;S<$#C*)-:&()3$#)-*8$&3-JC8$#G-U4$%4-*4)-=&8*$%$=&#*(-&8)-$#(*8C%*)J-*"-8)3&R-E"33"U)J-:P-&*"*&3-"E-($R-;S<$#C*)-E))J:&%9-8C#(-U4)8)-=&*$)#*(-$#(*8C%*$"#(-&8)-*"-$#%8)&()-*4)$8-.!'-&<=3$*CJ)XE)J-:&%9-:P-<)&#(-"E-&-:&8-=3&%)J-$#-*4)-<$JJ3)-"E-&-(%8))#-U4"()-($N)-4&(-*"-:)-9)=*-&:"6)-&=8)J)E$#)J-*48)(4"3JY-U4$3)-8)JC%$#G-7dH-&#J-7!H-&8*$E&%*(-XE)J-:&%9-:P-<)&#(-"E-5-:&8(-"#-*4)-3)E*&#J-*4)-8$G4*-($J)-"E-*4)-(%8))#-U4"()-($N)-4&(-*"-:)-9)=*-:)3"U-=8)J)E$#)J-*48)(4"3J(Y,-8)(=)%*$6)3P1.C%%)((EC3-*8$&3(-&8)-&3("-8)U&8J)J-:P-<)&#(-"E-6$(C&3-&#J-&CJ$*"8P-E))J:&%9-X="$#*-(%"8)-&#J-&-!$J$@"#)Y1-@4)-77H-($G#&3(-&8)-8)%"8J)J-E"8-*4)#-0N-)3)%*8"J)-X5-<&(*"$J-)3)%*8"J)(-E"8-8)E)8)#%)-&#JG8"C#JY1-@4)-V3""J-F"3C<)-fC3()-XVFfY-&#J-*4)-4)&8*-8&*)-Xb'Y-&8)-<"#$*"8)J-:P-<)&#(-"E-&-E$#G)8%3$=-()#("81-D33-<"JC3)(-4&6)-:))#-$<=3)<)#*)J-U$*4$#-*4)-W7gQ.-*)%4#"3"GP-XW)gC(S+a-!2OO,
!$#J - !)J$& - VFY1 - .$R - "C* - "E - [ - $#6"36)J - =&*$)#*( - C#J)8U)#* - &33 - *4) - *8&$#$#G - =8"*"%"3 - X+a - .!' #)C8"E))J:&%9 - *8&$#$#G - ()(($"#(Y - &#J - > - =&*$)#*( - U)8) - &3(" - &:3) - *" - 3$#)&83P - $#%8)&() - *4)$8 - .!' &<=3$*CJ)-"6)8-*4)-#)C8"E))J:&%9-*8&$#$#G-8C#(1
@4)-&J"=*)J-]C)(*$"##&$8)-U&(-J)8$6)J-E8"<-*4)-8)(C3*(-"E-"=)#-$#*)86$)U(-=8)6$"C(3P-%"33)%*)JE8"<-&-G8"C=-"E-)R=)8*(-$#-*4)-8)4&:$3$*&*$"#-E$)3J(-E8"<-J$EE)8)#*-7Q-%"C#*8$)(,-&#J-&J&=*)J-*"-)&%4%&*)G"8P-"E-C()8(1 -@4)-E"33"U$#G-<&$#-J"<&$#(-X&-*"*&3-"E -[a-]C)(*$"#(-8)3&*)J-*"-*4)-*U"-J$EE)8)#*()**$#G(Y- U)8) - $#6)(*$G&*)Jh - =8"JC%* - C(), - $#*)#($*P - "E - *4) - *8&$#$#G, - E))J:&%9 - E"8 - =8"E)(($"#&3 - &#J=&*$)#*(,-<&89)*-E)&($:$3$*P,-$#J$6$JC&3$N&*$"#,-J)($G#-&#J-)&($#)((-*"-C(),-(&E)*P,-&#J-6$8*C&3-8)&3$*P1 M$#&33P,-*4)-=8"E)(($"#&3( -U)8)-&(9)J-*"-8	-*4)-&(=)%*(-"E-*4)-*)%4#"3"GP-*4)P-%"#($J)8)J-<"(*$<="8*&#*1-D-(4"8*)8-E"8<-"E-*4$(-]C)(*$"##&$8)-U&(-*4)#-&J<$#$(*)8)J-*"-=&*$)#*(-&#J-%&8)G$6)8(-&#J-$*J$J-#"*-J)&3-U$*4-%"G#$*$6)-EC#%*$"#(,-<&89)*-E)&($:$3$*P,-$#J$6$JC&3$N&*$"#-&#J-6$8*C&3-8)&3$*P-E$)3J(
U4)8)&(-$*-=8"6$J)J-]C)(*$"#(-8)3&*)J-*"-(*&#J&8J-8)4&:$3$*&*$"#-=8"G8&<-&#J-*4)-#))J(-E"8-$#*)8&%*$"#U$*4-*4)-*4)8&=$(*1-D33-$*)<(-"E-*4)-]C)(*$"##&$8)-8)]C$8)J-&-]C&3$*&*$6)-8)(="#(),-*4C(-*4)-=8)()#*)J8)(C3*(-&8)-"#3P-J)(%8$=*$6)1-
Article ID 082-2
DOI:10.3217/978-3-85125-378-8-82
; ')(C3*(
;1+ ZC)(*$"##&$8)(-E"8-=8"E)(($"#&3-C()8(I$*4-8)G&8J-*"-*4)-VW0OS:&()J-(P(*)<-C(&G),-*4)-<&/"8$*P-"E-*4)-+A-$#*)86$)U)J-)R=)8*(-X[;`Y8)="8*)J-$*-<$G4*-:)-C()J-$#-*4)-E&%$3$*$)(-"#3P-&E*)8-&#-$#$*$&3-=)8$"J-"E-%"#6)#*$"#&3-%"G#$*$6)-*4)8&=P1 M"8-*4)-4"<)S:&()J-()**$#G,-[a`-"E-*4)-=8"E)(($"#&3(-8&*)J-*4)-$#J)=)#J)#*-(P(*)<-C(&G)-X:P-*4)=&*$)#*-&#Ji"8-*4)-%&8)G$6)8Y-&(-*4)-<"(*-8)3)6&#*-&(=)%*1-@4)-*8&$#$#G-()(($"#-JC8&*$"#-"E-;a-<$#C*)(U&(-$#J$%&*)J-&(-=8)E)88)J-:P-ca`-"E-*4"()-$#*)86$)U)J1-@4)-&CJ$*"8P-E))J:&%9,-*4)-=)8E"8<&#%)
(%"8) - &#J - *4) - (<$3)P - E&%) - E"8 - ="($*$6) - 8)$#E"8%)<)#* - U)8) - 8	)J - :P - =8"E)(($"#&3( - &( - *4) - <"(*&==8"=8$&*)-*P=)-"E-E))J:&%9-E"8-=&*$)#*(,-U4$3)-*4)P-U$(4)J-*"-4&6)-E)J-:&%9-*4)-=)8E"8<&#%)-&#J3)&8#$#G-%C86)(-*"-)(*$<&*)-=&*$)#*(e-=8"G8)((1
@4)-#))J-E"8-&#-)&(PS&%%)((-U&(-$#J$%&*)J-:P->A`-"E-*4"()-$#*)86$)U)J-U4"-$#J$%&*)J-*4&*-()6)8&3("3C*$"#(-*"-&%%)((-*4)-J)6$%)(-&8)-#)%)((&8P-&#J-(4"C3J-:)-=8"6$J)J-*"-=&*$)#*(-$#-"8J)8-*"-E&%$3$*&*)*4)-:&($%-"=)8&*$"#-"E-*4)-(P(*)<-X$1)1,-()3)%* -&#J-$#$*$&*$"#-"E-*4)-8)]C$8)J-*8&$#$#G-<"JC3)j-(&6) J&*&kY1-I$*4-8)G&8J(-*"-*4)-(&E)*P-"E-*4)-VW0OS:&()J-*)%4#"3"GP,-*4)-(P(*)<-#))J)J-*"-:)-%)8*$E$)J-&( <)J$%&3-=8"JC%*-E"8-[;`-"E-*4)-=8"E)(($"#&3(1-M$#&33P,-*4)-=8"E)(($"#&3(-U)8)-&(9)J-*"-8	-*4)-()6)8&3&(=)%*(-"E-(P(*)<-U$*4-*4)-$#J$%&*$"#,-$E-*4)P-4&J-%"#($J)8)J-"#)-$((C)-&(-$<="8*&#*-&(-&#"*4)8,-*" $#J$%&*)-:P-G$6$#G-*4)<-*4)-(&<)-#C<:)81- M"C8-=8"E)(($"#&3(-4&J-*4)-"=$#$"#-*4&*-&33-&8)&(-U)8))]C&33P-$<="8*&#*1-D(-$33C(*8&*)J-$#-M$GC8)-+,-*4)-<"(*-8)3)6&#*-&(=)%*(-U)8)-)EE)%*$6)#)((,-)EE$%$)#%P,
(&E)*P-&#J-)&($#)((-"E-C()1
<73;6+: " "@4)- 4$(*"G8&<- $33C(*8&*)( - *4) - VW0OS:&()J - *8&$#$#G- (P(*)< - 8)3&*)J- $*)<( - XR- &R$(Y - &(- 8	)J - :P- *4) =8"E)(($"#&3-C()8(-XP-&R$(-l-*4)-#C<:)8-"E-=8)E)8)#%)(-":*&$#)J-E"8-)&%4-$*)<Y1
-
Article ID 082-3
DOI:10.3217/978-3-85125-378-8-82
U4)8)&(->;`""E-*4)<"*4"CG4*-*4)P-U"C3J-#))J-4)3=-E8"<-E&<$3P-<)<:)8(-X_a`Y-"8-*4)8&=$(*(-X>a`Y1!"(*-"E-*4)-=&*$)#*(-8)]C)(*)J-&-*4)8&=$(*-*"-:)-=8)()#*-&*-3)&(*-"#%)-&-U))9-X[a`Y-"8-5-*$<)(-=)8<"#*4-X;a`Y1-O#*)#($*P-&#J-JC8&*$"#-"E-*4)-$#*)86)#*$"#-U&(-8&*)J-&(-($<$3&8-*"-=8"E)(($"#&3(-X&6)8&G);-*$<)(-=)8-U))9-U$*4-)&%4-()(($"#-JC8&*$"#-"E-;a-<$#C*)(j-A[`Y1-@4)-<&$#*)#&#%)-"E-&-4$G4-3)6)3-"E=&*$)#*e(-<"*$6&*$"#-U"C3J-8)]C$8)-*"-:)-$#-%"#*&%*-U$*4-*4)-*4)8&=$(*-X>;`Y-&#Ji"8-*"-=8"6$J)-C()8S
E8$)#J3P-E))J:&%9(-X5c`Y1-7EE)%*$6)#)((-&#J-(&E)*P-U)8)-8	)J-&(-*4)-<"(*-$<="8*&#*-&8)&(,-E"33"U)J
:P-)EE$%$)#%P, -)&($#)((-"E-C()-&#J-E))J:&%91-!"(*-E&<$3P-<)<:)8(-X;-E&<$3$)(-U)8)-$#*)86$)U)JY U"C3J - 3$9) - *" - %"#*$#C) - *4)$8 - 8)3&*$6)(e- 8)4&:$3$*&*$"# - &* - 4"<)1 - @4)P - %"C3J - 4)3= - =&*$)#*( - $# - *4)$88)4&:$3$*&*$"#-E"8-&:"C*-5-4"C8(-&-J&P-$E-#)%)((&8P,-:C*-*4)P-U"C3J-3$9)-*"-:)-*8&$#)J-C=-*"-5-4"C8(-*" 3)&8#-*4)-(P(*)<-C()1-@4)P-&3("-U"C3J-3$9)-*"-4&6)-%"#*&%*-U$*4-*4)-*4)8&=$(*-5S;-*$<)(-X_[`Y-"8->SA*$<)(-X;;`Y-=)8-<"#*41-D33-E&<$3P-<)<:)8(-$#J$%&*)J-&(-="*)#*$&3-3$<$*(-*4)-%"(*(-&#J-*4)-J$EE$%C3*$)(*"-C()-*4)-(P(*)<1
> 0"#%3C($"#-&#J-"C*3""9
@4) - =8)3$<$#&8P- E$#J$#G( - "E - *4$( - (C86)P- $#J$%&*)J - *4&* - *4) - =8"="()J - VW0OS:&()J - &==8"&%4 - *"%"G#$*$6)-8)4&:$3$*&*$"#-U&(-U)33-&%%)=*)J-:P-*4)-<&/"8$*P-"E-*4)-(*8"9)-=&*$)#*(-&#J-=8"E)(($"#&3(1-@4)E))J:&%9(-":*&$#)J-E8"<-:"*4-%3&(()(-"E-C()8(-&8)-:)$#G-%C88)#*3P-$#%"8="8&*)J-$#-*4)-J)($G#-"E-*4) $#*)8E&%)-:)*U))#-=&*$)#*(-&#J-*4)8&=$(*(,-&(-<"8) -$#*C$*$6)-$%"#(-=8"6$J)-&-J$8)%* -&%%)(( -*"-:&($%"=)8&*$"#( - (C%4- &( - ()3)%*$#G-&#J -8C##$#G- *4)- J)($8)J- *8&$#$#G-<"JC3),- &C*"<&*$%- J&*&- (&6$#G, -*4) ="(($:$3$*P-E"8-*4)-*4)8&=$(*-*"-%"#*8"3-*4)-()(($"#-XU4)#)6)8-$(-#))J)JY-&(-(""#-&(-*4)-=&*$)#*(-(U$*%4
"#-*4)-(P(*)<1-@4)-3)6)3-"E-C(&:$3$*P-&#J-&%%)=*&:$3$*P-8)6)&3)J-:P-*4)-(C86)P-$(-*4)-:&($(-E"8-*4) (P(*)<-&==3$%&*$"#-$#-%3$#$%&3-8"C*$#),-=8"6$J)J-*4)-*8&$#$#G-=8"6)(-)EE)%*$6)-*"-)#4&#%)-%"G#$*$"#1-@4)&==8"&%4-$(-&3("-*$<)3P-&(-8)<"*)-=&*$)#*e(-(C=)86$($"#-U$33-:)%"<)-<"8)-8)3)6&#*-$#-*4)-EC*C8),-JC)-*"$#%8)&($#G-="=C3&*$"#-<)&#-&G)-X$1)1-)3J)83PY-&#J-*4)-3$E)-)R=)%*&#%P-"E-*4"()-U4"-4&6)-*"-3$6)-U$*4*4)-:C8J)#-"E-%48"#$%-J$()&()1-!%=$>?@ABCDA$EF
@4$(-U"89-U&(-(C=="8*)J-:P-*4)-7C8"=)&#-O0@-f8"G8&<<)-f8"/)%*-Mf[S5a++S[-X0dW@'D.@Y1@4$(-=&=)8-"#3P-8)E3)%*(-*4)-&C*4"8(e-6$)U(-&#J-EC#J$#G-&G)#%$)(-&8)-#"*-3$&:3)-E"8-&#P-C()-*4&*-<&P-:)<&J)-"E-*4)-$#E"8<&*$"#-%"#*&$#)J-4)8)$#1-
')E)8)#%)(
0&8"JSD8*&J,-M?-&#J-7G$J",-?D1-X5aacY1-ZC&3$*P-"E-3$E)-&E*)8-(*8"9)h-*4)-$<="8*&#%)-"E-&-G""J8)%"6)8P1-)*+*$+,-#"./0%"1/234567289:259;/<,%7/58;55=>?8882889@5;
0$%)8"#),-2L,-B&#G)#:&4#,-L!,-V8&J)#,-0,-!&3)%,-?M,-2&3<&8,-2,-M8&&(,-!,-M)3$%)**$,-@,-B&&*(%4,
B,-b&83)P,-?f,-V)8G]C$(*,-@,-DNC3&P,-?,-0&#*"8,-?-&#J-D(4<&#,-@1-X5a++Y1-76$J)#%)S:&()J-%"G#$*$6)8)4&:$3$*&*$"#h-C=J&*)J-8)6$)U-"E-*4)-3$*)8&*C8)-E8"<-5aa;-*48"CG4-5aa^1-A+.B/CB("/D*</E*B#$%&%1/>2 /
4967/=5>:=F8;
b&8$#G,-bf-X5aa5Y1-0"G#$*$6)-$<=&$8<)#*-&E*)8-(*8"9)1-)!++/GH%I/J*!+,&1/5=7/3>:K9;/!&GC$8),-!0X+cc^Y1-Q()8S%)#*)8)J-')]C$8)<)#*(-b&#J:""91-b1'1-O#(*$*C*),-L<%',+;/ELMCL)N/),I",+'%!O;
Article ID 082-4
DOI:10.3217/978-3-85125-378-8-83
While being very promising, brain-computer interfaces (BCI) remain barely used outside laboratories because they are not reliable enough. It has been suggested that current
training approaches may be partly responsible for the poor reliability of BCIs as they do
not satisfy recommendations from psychology and are thus inadequate [3]. To determine to
which extent such BCI training approaches (i.e., feedback and training tasks) are suitable
to learn a skill, we used them in another context (without a BCI) to train 20 users to
perform simple motor tasks. While such approaches enabled learning for most subjects,
results also showed that 15% of them were unable to learn these simple motor tasks, which
is close to the BCI illiteracy rate [1]. This further suggests that current BCI training
approaches may be an important factor of illiteracy, thus deserving more attention.
Introduction
Brain-computer interfaces (BCIs) are communication systems allowing users to interact with
the environment, using only their brain activity [6]. BCIs, although very promising, remain
barely used outside laboratories because they are not reliable enough [6]. Two main reasons
have been identified. The first one, extensively investigated, concerns brain signal processing,
with current classification algorithms being still imperfect [1]. The second one concerns the
users themselves. Indeed, many users seem unable to acquire good BCI skills (i.e. the capacity
to generate specific and stable brain activity patterns): around 20% cannot control a BCI
at all (the so-called BCI illiteracy), while most of the remaining 80% have relatively modest
performances [1]. An appropriate training is needed to acquire these skills, especially for Mental
Imagery-based BCI (MI-BCI). It has been suggested that currently used training and feedback
protocols, which do not take into account recommendations from psychology to optimise human
learning, might be partly responsible for BCI illiteracy and poor user performance [3]. For
instance, it has been shown that, for efficient learning, training protocols have to fit the user
learning style and propose an increasing and adaptive difficulty [3]. Yet standard BCI training
protocols are the same for all users [3]. While instructive, these studies only provide theoretical
considerations about training approaches. It is therefore necessary to concretely assess whether
training approaches used in BCI are appropriate to train a skill. Moreover, it is necessary to
perform this evaluation independently of BCI, to rule out possible biases due to BCI complexity,
non-stationarity and poor signal-to-noise ratio. Thus in this work, we propose to study these
BCI training approaches without using a BCI: participants were asked to learn specific and
simple motor tasks using the same feedback and training tasks used for MI-BCI. We then studied
how well they could learn such motor tasks to assess the quality of the training approaches,
independently of BCI use. We studied here two dierent approaches: 1) the training approach
Article ID 083-1
DOI:10.3217/978-3-85125-378-8-83
used in standard MI-BCI [5] and 2) a variant of it which provides some autonomy to the
user. Indeed, with the standard approach, no autonomy is given to the user, who always has
to perform the tasks required by the protocol. Yet, autonomy is known to increase motivation
and learning efficiency in general [3]. Interestingly enough, the study described in [4] obtained
promising results when providing more autonomy to a single BCI user.
Methods
Participants were asked to learn to perform two motor tasks: drawing triangles and circles with
a pen on a graphic tablet (see Figure 1(b)), using standard MI-BCI training approaches [5].
Indeed, as with MI-BCI, in which users have to learn a suitable movement imagination strategy,
the participants here had to learn the strategy which allows the computer to correctly recognise
their drawing, e.g., they had to identify the suitable shape size, angles or speed of drawing.
Figure 1: (a) Outline of a trial from a standard run; (b) Experimental setup.
2.1
Experimental protocol
Participants had to learn to draw circles and triangles that can be recognised by the computer
during dierent runs, which were either standard (s) or self-paced (sp). S-runs were composed
of 20 trials per task. As shown in Figure 1(a)), at the beginning of each trial a green cross
was displayed. After 2s, an auditory cue (a beep) announced the beginning of the task. Then,
after 3s, a red arrow was displayed, indicating which task the participant had to perform:
continuously drawing circles or triangles upon appearance of a left or right arrow, respectively.
After 4.25s, a blue feedback bar appeared and was updated continuously for 4s. Its direction
indicated the shape recognized by the classifier (left: circle, right: triangle) and its length was
proportional to the classifier output (i.e., the distance to the classifier separating hyperplane),
as with MI-BCI. During sp-runs, no instructions were given: the participants were asked to do
the motor tasks in an autonomous and free way. Half of the participants were asked to learn
using a Standard (S) training approach: they did 4 seven-minute-long s-runs. The other half
learned using a training approach with increased autonomy, denoted Partially Self-Paced (PSP)
approach: the 1st and 4th runs were s-runs, while the 2nd run was replaced by a 3.5 minute
long sp-run followed by a shortened s-run (10 trials per task, 3.5 minutes), and the 3rd run
was replaced by a shortened s-run followed by a 3.5 minute long sp-run. The training duration
was the same in both conditions. We studied the impact of the condition, S vs. PSP, on the
recognition accuracy of triangles and circles over runs (i.e., learning eects) and on subjective
experience (using a questionnaire). 20 participants (10 per group) took part in our experiment.
Article ID 083-2
2.2
DOI:10.3217/978-3-85125-378-8-83
Signal Processing
In order to discriminate triangle from circle pen gestures, we used a pattern recognition approach
as in BCI. From the past 1s-long time window (in a sliding window scheme, 937.5ms overlap)
of the 2D pen position (16Hz sampling rate), a histogram of angles was computed. More
precisely, the angles between each consecutive segment of the time window were first computed.
Then the number of angles falling in the ranges 0-30o , 30-75o , 75-105o , 105-150o and 150-180o
were counted, and these 5 count values were used as input features for a Linear Discriminant
Analysis (LDA) classifier. The (subject-independent) LDA classifier was trained on 60 trials
from each gesture, from 2 persons (1 left-handed, 1 right-handed). The resulting classifier could
discriminate triangles from circles with 73.8% classification accuracy (10-fold cross-validation
on the training set), which is an accuracy equivalent to the average accuracy of a MI-BCI [2].
Classification accuracy was measured as the average number of 1s-long time windows correctly
classified during the feedback period from each trial (see Figure 1(a)).
2.3
Analyses
In order to analyse the interaction between the Condition (2 modalities: S and PSP; independent measures) and the performance obtained at each Run (4 modalities: run1, run2,
run3 and run4; repeated measures), we performed a 2-way ANOVA. Morevoer, we asked the
participants to complete a Usability Questionnaire (UQ) which measured 4 dimensions: learnability/memorability (LM), efficiency/eectiveness (EE), safety (Saf.) and satisfaction (Sat.).
Thus, we did a two-way ANOVA to analyse the interaction between the Condition and the
Evaluated Dimension (4 modalities: LM, EE, saf. and sat.; repeated measures).
(c) Questionnaire
Figure 2: (a) (a) Performance (pen gesture classification accuracy) over runs in the S Condition;
(b) Performance over runs in the PSP Condition; (c) Average UQ scores.
Irrespectively of the condition, classification accuracy results (see Figures 2(a) and 2(b))
showed that 17 out of 20 participants managed to learn the task (best run accuracy 82.6%),
with accuracies increasing with the number of runs on average, as can be observed in BCI.
However, 3 participants did not manage to learn the tasks (best run accuracy < 70%). This is
particularly interesting considering the simplicity of the motor tasks which ensured the users
could technically perform them. Such training approaches thus seem suboptimal. Moreover,
this rate of 15% of people who did not manage to learn is close to the BCI-illiteracy rate
(20% [1]). Overall, this suggests that BCI illiteracy may not be due to the user only, but
also substantially to the training protocol. Then, results showed neither a main eect of the
Article ID 083-3
DOI:10.3217/978-3-85125-378-8-83
Conclusion
This study aimed to concretely assess how well one could learn a given skill with BCI training
approaches. To do so, we proposed to study BCI training approaches without using BCI,
i.e., we used feedback and training tasks from MI-BCI to train participants to draw triangles
and circles (i.e., simple motor tasks) so these can be recognized by the computer. Half of the
participants did so using a S training approach while the other half used a PSP one. In terms
of learning eects, results unfortunately showed no relevant dierences between conditions (S
vs. PSP), due to initial performances that diered between conditions, by chance. However,
irrespectively of the condition, 15% of the participants (3 out of 20) seemed unable to learn
the motor tasks, despite their simplicity. This suggests that such training approaches are not
optimal for learning and thus that they may be an important factor of BCI illiteracy. Concerning
user experience, UQ showed a tendancy towards a better feeling of learnability/memorability,
efficiency/eectiveness and safety with the PSP than with the S approach, while the satisfaction
appeared similar for both. Overall, this study confirmed in practice the theoretical analyses
from [3] suggesting that current BCI training approaches were suboptimal and need to be
changed. In the future, we will increase the number of participants, explore the PSP approach
with actual BCIs, and propose new training approaches that consider the users cognitive style
and motivational states to improve both the learning experience and performance.
References
[1] B.Z. Allison and C. Neuper. Could anyone use a BCI? In Desney S. Tan and Anton Nijholt, editors,
Brain-Computer Interfaces, pages 3554. Springer London, 2010.
[2] B Blankertz, C Sannelli, S Halder, EM Hammer, A K
ubler, K-R M
uller, G Curio, and T Dickhaus.
Neurophysiological predictor of SMR-based BCI performance. NeuroImage, 51(4):13031309, 2010.
[3] F. Lotte, F. Larrue, and C. M
uhl. Flaws in current human training protocols for spontaneous BCI:
lessons learned from instructional design. Frontiers in Human Neurosciences, 7:568, 2013.
[4] C. Neuper, G. M
uller, A. K
ubler, N. Birbaumer, and G. Pfurtscheller. Clinical application of an
EEG-based BCI. Clinical Neurophysiology, 114:399409, 2003.
[5] G. Pfurtscheller and C. Neuper. Motor imagery and direct brain-computer communication. proceedings of the IEEE, 89(7):11231134, 2001.
[6] J. R. Wolpaw and E. W. Wolpaw. BCI: Principles and Practice. Oxford University Press, 2012.
Article ID 083-4
DOI:10.3217/978-3-85125-378-8-84
Introduction
Quantifying interactions in dynamic large-scale brain networks is an important and useful tool
in neuroscience. The source connectivity toolbox (SCoT1 ) [3] is a Python package for estimating
spectral eective connectivity between brain sources. SCoT extracts connectivity measures from
vector autoregressive (VAR) models fitted to source signals. Typically, the sources are obtained
by performing MVARICA [5] or CSPVARICA [3], which are based on independent component
analysis (ICA) decomposition of VAR residuals.
Several brain-computer interface (BCI)-related studies have included connectivity features
for classification [4, 2]. Although the tools in SCoT were originally designed for single-trial BCI
feature extraction, they also support multi-trial data and are useful for functional and eective
connectivity analysis of electroencephalogram (EEG) signals.
In this article, we demonstrate several ways to use SCoT on a motor imagery (MI) data set.
Figure 1: Multi-trial joint source/connectivity estimation. Left: hand MI, right: foot MI. Each
plot shows the PDC spectrum of a source pair. The area shaded in gray is the one-sided 95 %
confidence interval of the PDC under the null-hypothesis of no connectivity.
1 https://github.com/SCoT-dev/SCoT
Article ID 084-1
DOI:10.3217/978-3-85125-378-8-84
SCoT Workflow SCoT provides routines for EEG source and connectivity estimation. Two
estimation approaches are possible in SCoT. One approach is joint estimation, where sources
and connectivity are estimated together. This is performed by applying MVARICA or CSPVARICA to data where sources can be assumed to be spatially and temporally stationary.
Alternatively, a two-step approach estimates sources and connectivity separately, possibly on
dierent data sets. MVARICA or CSPVARICA can be employed in the source decomposition
step by discarding their VAR estimates. In the second step, the unmixing matrix is used to
obtain source activations. Connectivity measures are estimated from VAR models fitted to
these source activations.
Example Data Set An example data set is available with SCoT. This data set contains a
recording of 45 EEG channels from one healthy subject performing hand and foot MI.A total of
180 trials (90 trials per MI task) were recorded. In each trial the subject was cued to perform
either MI task by an arrow pointing up (hand) or down (foot). The beginning of a trial was
indicated by a fixation symbol appearing on the screen, and the motor imagery period started
with the cue after 2.5 s. The motor imagery period was 4.5 s long and was followed by a 2.53.5 s
break.
Usage Examples In this article, we demonstrate how to use SCoT to estimate and visualize
connectivity on the example data set. We show how to perform joint estimation, multi-trial
two-stage estimation, single-trial connectivity, and circular plots. In each case we measure
connectivity with the PDC [1] in a 1 s window starting 2 s after the cue. For clarity of demonstration, we only use four sources in the first three examples and seven sources for the circular
plots. We manually removed sources that were clearly related to artifacts (such as eye movement
or neck muscle activity).
Figure 2: Multi-Trial Two-Stage Estimation. Left: MVARICA, right: CSPVARICA. Each plot
shows PDC spectra of a source pair for hand MI (blue) and foot MI (green). The shaded areas
correspond to the 95 % confidence intervals obtained by bootstrapping.
Article ID 084-2
DOI:10.3217/978-3-85125-378-8-84
located along the diagonal. A high connectivity value in a plot generally means that there
is causal interaction from the source in the same column to the source in the same row, at a
certain frequency.
Results
Figure 3: Single-trial estimation. Left: least squares fitting (LSQ), right: ridge regression
(RCV). Lines correspond to individual trials, and the colors indicate the dierent classes (red:
hand, blue: foot).
Article ID 084-3
DOI:10.3217/978-3-85125-378-8-84
Figure 4: Circular plots. Left: hand MI, right: foot MI. These plots show the relative strength
of connectivity (PDC in the alpha (812 Hz, red) and beta (1624 Hz blue) bands. The width
and intensity of the arrows indicates the strength of the connection.
References
[1] L. A. Baccal
a and K. Sameshima. Partial directed coherence: a new concept in neural structure
determination. Biological Cybernetics, 84:463474, 2001.
[2] Martin Billinger, Clemens Brunner, and Gernot R M
uller-Putz. Single-trial connectivity estimation
for classification of motor imagery data. Journal of Neural Engineering, 10:046006, 2013.
[3] Martin Billinger, Clemens Brunner, and Gernot R M
uller-Putz. SCoT: A python toolbox for EEG
source connectivity. Front. Neuroinform., 8, 2014.
[4] C. Brunner, R. Scherer, B. Graimann, G. Supp, and G. Pfurtscheller. Online control of a braincomputer interface using phase synchronization. IEEE Transactions on Biomedical Engineering,
53:25012506, 2006.
[5] German G
omez-Herrero, Mercedes Atienza, Karen Egiazarian, and Jose L. Cantero. Measuring
directional coupling between EEG sources. NeuroImage, 43:497508, 2008.
Article ID 084-4
DOI:10.3217/978-3-85125-378-8-85
Introduction
Extensive training in gait rehabilitation after stroke may be provided by using a robotic gait
orthosis. Robotic rehabilitation requires little eort from the individual and can lead patients
to move passively. However active contribution to a movement, has been shown to be crucial
for motor learning [3].
Neural correlates of active participation during gait training have previously been shown by
our group. We showed significant dierences between active and passive walking in EEG and
sensorimotor rhythms over the foot area of the sensory cortex [6]. The Authors showed that
it is possible to distinguish between active and passive walking in single trials (mean accuracy:
68%) in 6 subjects using EEG band power features [5].
In the current manuscript, we show single trial connectivity analysis from the EEG recorded
during active and passive walking in a gait robot. The goal of this analysis is twofold: First, we
want to evaluate whether single trial connectivity measures allow to distinguish between active
and passive walking. Second, we want to investigate the cortical networks related to active
participation in gait training. This may help to better understand the underlying mechanisms
and the optimal activation patterns for gait recovery.
Article ID 085-1
DOI:10.3217/978-3-85125-378-8-85
Participant
1
2
3
4
5
6
mean
Figure 1: Experimental setup. Walking in
the robotic gait orthosis. Speed (1.8-2.2 km/h)
and body weight support ( 30 %) were adjusted for each participant.
mean accuracy
76.4
83.7
75.7
82.3
75.6
92.8
81.0
std
6.0
7.9
4.7
4.5
5.9
3.8
6.7
Methods
We recorded the electroencephalogram (EEG) from 120 sites from 6 healthy volunteers (24
2 years, 5 male) during active and passive walking (4 runs of 6 min each) with a robotic gait
orthosis (Lokomat, Hocoma). In the active walking condition, participants were instructed to
walk independently in the gait robot at the speed of the treadmill supporting their own weight.
Passive walking demanded participants to let their legs be moved by the robot. Foot contact
was measured by electrical foot switches placed on the heels of both feet. Figure 1 summarizes
the experiments. For a more detailed description of the experiment see [6]
2.1
EEG Analysis
Preprocessing
Due to the high artifact contamination of the EEG during walking the data was pruned using Independent Component Analysis (ICA) prior to single trial analysis. Preprocessing for
ICA included filtering from 1 to 200 Hz, resampling at 500 Hz, and manual rejection of nonstereotyped artifacts. Typical stereotyped artifacts (eye movements, muscle tension) were kept
in the analysis as they are separated by ICA into only a few independent components (ICs). Infomax ICA [4] decomposed the EEG into ICs, representing brain, muscle, and artifact sources.
The ICs were then categorized into cortical sources and artefact components considering scalp
map, power spectrum, and event-locked time course. This procedure left on average 14 cortical
sources per participant (range: 6-18 ICs). These cortical sources were then backprojected to
the EEG data. To minimize computational eort, only 62 channels equally distributed over
the scalp were selected for further analysis. PCA (Principal Component Analysis) was applied
to the remaining 62 channels to reduce the dimensions of the remaining EEG channels to the
number of backprojected ICs. The data was segmented from -0.5 to 2.5 seconds around the
right heelstrike (time between contralateral steps 1 s). Around 250 trials were used for each
class.
Article ID 085-2
DOI:10.3217/978-3-85125-378-8-85
CSPVARICA
30
15
0
30
15
15
0
30
15
0
30
0
30
15
0
0 15 30 0 15 30 0 15 30 0 15 30
fre q u e n c y (Hz)
15
0
0 .8
0 .4
0 .0
m a g n it u d e
m a g n it u d e
0 15 30 0 15 30 0 15 30 0 15 30
0 .8
0 .4
0 .0
30
CSPVARICA
0 15 30 0 15 30 0 15 30 0 15 30
0 .8
0 .4
0 .0
30
15
0
30
15
0
30
15
0
0 15 30 0 15 30 0 15 30 0 15 30
fre q u e n c y (Hz)
30
15
0
30
15
0
30
15
0
0 .8
0 .4
0 .0
Figure 2: Directed connectivity with CSPVARICA for participant 1 (left) and 6 (right). The
DTF between four components is plotted for active (blue) and passive (green) walking. The
x-axis corresponds to the frequencies from 0 to 40 Hz, the y-axes to the magnitude of the DTF
(in arbitrary units). Columns represent sources and rows sinks. The power spectral density of
each source is plotted along the diagonal.
Single trial connectivity analysis
For single trial connectivity analysis, the Python-based source connectivity toolbox SCoT [1]
was used. Single trial analysis is performed in two steps. In the first step independent components are estimated with CSPVARICA [1]. The method transforms the EEG with Common
Spatial Patterns (CSP) to find components that maximize the variance between conditions.
Then a VAR model is fitted to the CSP components, and the residuals of the VAR model are
decomposed by ICA to estimate the final unmixing matrix U. In the second step single trial
component activations are obtained by multiplying the EEG with U. Then an autoregressive
model is fitted for these component activations. The DTF (full frequency directed transfer
function) [2] was calculated from the model to measure connectivity. This method estimates
the direction of causal influences between sources. The DTF was averaged in the frequency
bands (7-12Hz), (15-21Hz) and a subject specific frequency band ranging from 24 to 40 Hz.
We have previously shown in [6] that these frequency bands account for dierences between
active and passive walking. The number of components was four and model order was set to
100. This resulted in 48 features for classification. Classification was performed with linear discriminant analysis. The whole procedure was applied in a 10 fold cross-validation to estimate
the performance. For a more detailed description of the method see [1].
Results
Connectivity measures allowed dierentiation between active and passive movements with an
average classification accuracy over subjects of 81.0% 6.7. Classification accuracies for single
subjects are displayed in Table 1. Visual inspection revealed that connectivity magnitude in
classification relevant frequency bands and was higher during passive compared to active
walking. Most participants had at least one contributing source in the central midline area and
in posterior areas. For the results of two exemplary subjects see Figure 2.
Article ID 085-3
DOI:10.3217/978-3-85125-378-8-85
Discussion
Our results show that it is possible to separate active and passive walking in single trial EEG
data based on connectivity measures. Classification accuracies were above chance in all subjects,
and were on average 10% higher compared to our previous results using band power features
[5]. However, it still has to be evaluated whether this improvement is due to the method used,
the features or to the pruning of the EEG with ICA prior to feature selection. Furthermore, a
major challenge for the online application of these methods during gait rehabilitation remains
automatic artifact correction.
From a neurophysiological perspective, the analysis revealed that connectivity magnitude
in and bands between sensorimotor sources is higher during passive compared to active
walking. The magnitude of the DTF depends on spectral power, thus the results fit with
our previous findings showing a suppression of and bands related to active participation.
Suppression of and rhythms has previously been related to the activation of sensorimotor
areas. Thus, our results suggest that active walking increases the activation of sensorimotor
regions and reduces information flow between these areas. Further analysis should evaluate
in more detail the sources contributing to dierences in connectivity over subjects. This may
reveal neural networks underlying active participation in gait training.
Acknowledgements
This work was partly supported by the European Union research project BETTER (ICT2009.7.2-247935), the Land Steiermark project BCI4REHAB and the FWF project (P-20848).
The authors thank G. Schahauser and P. Holper for assistance during the experiments.
References
[1] M. Billinger, C. Brunner, and G. R. M
uller-Putz. SCoT: a Python toolbox for EEG source
connectivity. Frontiers in neuroinformatics, 8, 2014.
[2] A. Korzeniewska, M. Manczak, M. Kaminski, K. J. Blinowska, and S. Kasicki. Determination of information flow direction among brain structures by a modified directed transfer
function (dDTF) method. Journal of Neuroscience Methods, 125:195207, 2003.
[3] M. Lotze, C. Braun, N. Birbaumer, S. Anders, and L. G. Cohen. Motor learning elicited by
voluntary drive. Brain, 126:866872, 2003.
[4] S. Makeig, A. J. Bell, T. P. Jung, and T. J. Sejnowski. Adv Neural Inf Process Syst, chapter
Independent Component Analysis of Electroencephalographic Data, pages 145151. MIT
Press Cambridge, MA, 1996.
[5] T. Solis-Escalante, J. Wagner, R. Scherer, P. Grieshofer, and G. M
uller-Putz. Assessing participation during robotic assisted gait training based on EEG: feasibility study. Proceedings
of the 3rd TOBI Workshop, pages 6162, 2012.
[6] J. Wagner, T. Solis-Escalante, P. Grieshofer, C. Neuper, G. M
uller-Putz, and R. Scherer.
Level of participation in robotic-assisted treadmill walking modulates midline sensorimotor
EEG rhythms in able-bodied subjects. NeuroImage, 63:12031211, 2012.
Article ID 085-4
DOI:10.3217/978-3-85125-378-8-86
K. Tsuru 1, M. Nagaki 2*
1 Introduction
We have studied biometrics on brain waves to develop a diverse and secure authentication system.
Brain wave biometrics has two advantages over prevailing biometrics. One is the difficulty of
eavesdropping on personal brain wave data. The second advantage is that brain waves can reflect
individual mental activities. This property leads to many possibilities for diverse uses of biometrics.
Traditional biometric methods used single fixed templates. In contrast, brain wave biometrics could
identify people based on templates that reflect different brain activities, such as cognitive processes.
The biometric using the brain wave approach is used to assess waves. Poulos et al. [1] first tried
to identify individuals based on the EEG. They analyzed waves of four subjects EEGs, using a
neural network classification method. Paranjape [2] also used brain wave data based on waves
during eyes open/closed for biometric analysis. These data based on waves reported consistent
classification results, and this work required only a few electrodes. However, it was necessary for
subjects to sit quietly for a relatively long period. A newer method utilized an event-related potential
from a cognitive human brain process. Palaniappan [3] investigated the wave band of the visualevoked potential elicited during a mental task for personal identification, and Mercel [4] studied
personal authentication based on motor images of left or right hand movement and word generation.
In addition we previously studied the approach to the discrete cosine transform (DCT) of motor
imageries for features, extracting the best individual features [5]. However, all these studies measured
brain signals using the wired electrode system. A major drawback is that preparation for using this
system takes time and care, and it is also difficult to realize a brain wave biometric system.
In this paper, we investigate brain wave biometrics using a wireless measurement device on brain
signals. This device has not previously been explored for this task. We attempted to classify five
healthy subjects based on four situations.
*
Article ID 086-1
DOI:10.3217/978-3-85125-378-8-86
2 Methods
Five healthy subjects (4 males and 1 female, age: 20 0.32) provided their informed consent to
participate in the experiment on several different days.
2.2 Measurement
We employed an Emotiv EEG neuroheadset [6] to measure brain waves. The Emotiv has 14 saline
electrodes with 2 reference electrodes to wirelessly transmit brain wave signals to a computer. The
sampling frequency of the brain wave signals was 128 Hz. The brain data was removed as artifacts
using digital notch filters (50 Hz and 60 Hz) and a low-pass filter (~43Hz). To avoid the effect of
impedance difference, we prepared the low impedance (~ 2 k) of each electrode with an impedancemeasuring program. We used C++ program using Emotiv API for storing the brain wave data in a
computer. The brain wave data was divided into 8-second lengths, and the average was subtracted.
The number of each subjects brain wave data was between 208 and 320 in every case. So we put the
data from each case together for all subjects.
2.4 Classification
We selected three classifiers: linear discriminant analysis (LDA), support vector machine (SVM), and
neural network (NN). The probability of personal identification was called an authentication rate. We
estimated the rate using a 10-fold cross-validation method.
Article ID 086-2
DOI:10.3217/978-3-85125-378-8-86
3 Results
Table 1 shows the results of the average authentication rate that evaluated four features of the DCT
data per electrode at 040 Hz. Also the authentication rate was calculated by several classification
methods. The authentication rates are the average for the five subjects. As NN classification is
dependent upon initial values, their authentication rate is calculated based on an average of 10 trials.
Taking into account the characteristics of the small subject group, all authentication rates showed
consistent results. Those results of the authentication rates were not different among the four cases, as
we anticipated.
LDA
linear quadratic
Mahalanobis
I
0.95
0.96
0.96
II
0.99
0.99
0.98
III
0.98
0.98
0.96
IV
0.98
0.98
0.97
-: no convergence
Case
rbf
0.81
0.80
0.81
0.81
SVM
polynomial quadratic
0.98
0.99
0.99
0.99
0.96
0.99
0.97
0.98
NN
linear
0.99
0.99
0.97
0.92
0.94
0.83
0.93
Next, we estimated the average authentication rates for each electrode by LDA classifier for
finding the area of the head that distinguished our subjects. Table 2 shows average authentication rates.
Their rates ranged from 0.47 to 0.76; the average authentication rates of all electrodes in the cases
were 0.58, 0.59, 0.60, and 0.60 respectively. The rate of the electrodes used on the occipital region
gave consistent results on the whole. Meanwhile, the rate of the electrodes on the temporal cortex
regions depends on the activity.
Case
AF3
F7
F3
FC5
T7
P7
O1
O2
P8
T8
FC6
F4
F8
AF4
0.56
0.58
0.50
0.55
0.51
0.56
0.67
0.70
0.61
0.55
0.55
0.58
0.58
0.60
0.53
0.58
0.55
0.56
0.52
0.62
0.69
0.71
0.65
0.57
0.56
0.61
0.58
0.50
0.53
0.64
0.55
0.55
0.64
0.62
0.65
0.62
0.59
0.60
0.64
0.54
0.59
0.60
0.47
0.58
0.52
0.53
0.57
0.64
0.76
0.68
0.65
0.64
0.69
0.48
0.61
0.59
Table 2: The average authentication rates for each electrode by LDA classifier. The light gray areas
are over 0.6 and the dark gray areas are over 0.7
Article ID 086-3
DOI:10.3217/978-3-85125-378-8-86
Furthermore, setting them requires help from a few other people. Consequently it is a more
challenging process when compared to the existing authentication methods, such as fingerprint, iris,
and facial recognition. By contrast, the preparation for using the wireless EEG neuroheadset took little
time, and an examinee can prepare for authentication by him/herself. However, we must develop
ones positioning method for the location compensation in the next step. Hence, this represents a
potential first step towards the practical application of a biometrics system with brain waves.
The authentication rates by DCT feature extraction showed consistent results by all classification
methods though five subjects. A previous study showed the authentication rate to be 0.79 for 23
subjects using the wired measurement system [7]. The wireless device indicated the performance of
the wired measurement system.
The authentication rate on the occipital electrode was better than that on the temporal electrode.
These results were the same for every situation. Under the centered state, such as solving a puzzle or
reading a book, the authentication rate on the temporal electrode gave better results. Those results
show that brain wave biometrics with several selected electrodes provides a consistent authentication
rate.
We investigated the brain wave biometric approach using a wireless brain wave device. The
results indicate that the authentication rate of brain wave biometrics was over 0.9. In addition,
biometrics using a wireless brain wave device is suitable for a personal authentication system because
of its high level of accuracy and short setup time. Moreover, future research should increase the
number of subjects in authentication experiments to improve reliability.
Acknowledgements
This study was supported in part by a Grant-in-Aid for Scientific Research (C) 25330163 from the
Japan Society for the Promotion of Science (JSPA).
References
[1]
[2]
[3]
[4]
[5]
[6]
[7]
Article ID 086-4
DOI:10.3217/978-3-85125-378-8-87
Abstract
To examine the presences of electrophysiological indicators, by measuring eventrelated potentials (ERPs), is one solution to detect residual cognitive functions in
patients with disorders of consciousness (DOC). To investigate such a patient, different
kinds of stimuli like tones, words, and sentences were used with and without instruction
and were hierarchically ordered according to the needed processing steps. In addition,
the patient performed a semantic computer-brain-interface (BCI) while
electroencephalography (EEG) and near-infrared spectroscopy (NIRs) signals were
recorded. We found evidence that the DOC patient was able to follow active
instructions and to shift his attention, but had only limited control of the BCI
performance.
1 Introduction
One way to detect the residual cognitive functions in brain-damaged patients, who has lost the
ability for behavioural performance, is to measure their brain responses, for example using eventrelated brain potentials (ERPs). This approach has been successfully reported in a number of studies
(e.g., Cruse, Chennu, Chattele, C, Bekinschtein, Fernndez-Espejo, Pickard et al., 2011; Daltrozzo,
Wioland, Mutschler, Lutun, Calon, Meyer et al., 2009, Kotchoubey, Lang, Mezger, Schmalohr,
Schneck, Semmler et al., 2005; Schoenle & Witzke, 2004) and indicates that the brain of such patients
might be able to process information at various levels of complexity including semantic information
as well as understanding active instructions like silent counting or even attentional shifts (e.g., Boly,
Garrido, Gosseries, Bruno, Boveroux, Schnakers et al. 2011; Monti ; Vanhaudenhuys, Coleman, Bol,
Pickard, Tshibanda, et al., 2010).
In addition, brain responses can be used to set-up a communication channel by a so called Brain
Computer Interface (BCI) for patients who are severe paralysed and cannot communicate by any other
means. The brain responses of those patients can be controlled by using electroencephalography
(EEG; e.g., Birbaumer, Ghanayim, Hinterberger, Iversen, Kotchoubey, Kbler, et al., 1999) or
functional-near infrared spectroscopy (fNIRs; e.g., Gallegos-Ayala, Furdea, Takano, Ruf, Flor,
Birbaumer, in press). Here, we wanted to investigate whether a good outcome in the cognitive
assessment using ERPs leads to good BCI performance.
Masterminded EasyChair and created the first stable version of this document
Article ID 087-1
DOI:10.3217/978-3-85125-378-8-87
2 Methods
A 60 year old, male patient, diagnosed was minimally conscious state (MCS: Giacino, Ashwal,
Childs, Cranford, Jennett, Katz, et al., 2002) and having a CRS-R score of 23 participated in four
sessions. In the first and second sessions he was assessed by using several passive and active ERP
assessment paradigms. In the other two sessions he performed a semantic BCI with two answer
categories.
Article ID 087-2
DOI:10.3217/978-3-85125-378-8-87
EEG data were preprocessed similarly to the method used above and then t-CWT analysis for feature
classification (Bostanov et al. 2004). For fNIRs data we used the mean amplitude for each channels of
the oxyhemoglobin. For both ERP and fNIRs data, we used a support vector machine to classify the
correct and incorrect answers using a grid searching to find the best parameter und using a five-fold
cross-validation. We trained the model using the first 3 blocks and tested the last block. In addition,
we calculated the coincidences of the correct detected classification results of EEG and fNIRs.
3 Results
3.1 ERP Assessment
In Table 1 we show the summarized results of the all expected ERP components. In the (1)
multifeature paradigm 3 deviant elicited the expected ERP component, the mismatch negativity
(MMN). In the frequency oddball (2) we also find the expected P3 component as well in the counting
oddball task (5). However, the P3 component in the active task was not larger than in the active
condition, thus, it is not clear whether the P3 component of the active counting task includes really
active counting contingence. In addition, we found a N400 component in the sentence understanding
paradigm but not in the word-priming paradigm. Further, we obtained a side specific effect in the
dichotic listening task, where the P3 component for the attended side was enhanced as compared with
the unattended side.
(1)
(2)
(3)
(4)
(5)
(6)
mutifeature
frequency
word-prime
sentence
counting
dichotic
d f
i
l
c
left
right
+
+
+(-)
+ + + +
+
Table 1: Results of the ERP assessment paradigms: + is expected ERP component was present; expected ERP component was absent.
Figure 1: (A) Experimental design of the EEG/fNIRs paradigm; (B) ERP/fNIRs classification
results
Article ID 087-3
DOI:10.3217/978-3-85125-378-8-87
References
Birbaumer, N., Ghanayim, N., Hinterberger, T., Iversen, I.,Kotchoubey, B., Kbler, A., Perelmouter, J.,
Taub, E. and Flor, H. (1999). A spelling device for the paralyzed. Nature,398, 297298.
Boly M, Garrido M.I, Gosseries O, Bruno MA, Boveroux P, Schnakers C, et al Laureys S, Friston K.
Preserved feedforward but impaired top-down processes in the vegetative state. Science 2011: Vol. 332
no.6031 pp 858-862.
Bostanov V, Kotchoubey B. The t-CWT: A new ERP detection and quantification method based on the
continuous wavelet transform and Student's t-statistics. Clinical Neurophysiololgy. 2006;117:2627-44.
Cruse, D., Chennu, S., Chatelle, C., Bekinschtein, T. A., Fernndez-Espejo, D., Pickard, J. D., . . .
Owen, Adrian M. (2011). Bedside detection of awareness in the vegetative state: a cohort study. The
Lancet, 378(9809), 2088-2094.
Daltrozzo, J., Wioland, N., Mutschler, V., Lutun, P., Calon, B., Meyer, A., . . . Kotchoubey, B. (2009).
Cortical information processing in coma. Cognitive and behavioral neurology: official journal of the
Society for Behavioral and Cognitive Neurology, 22(1), 53-62..
Gallegos-Ayala G, Furdea A, Takano K., Ruf CA, Flor H, Birbaumer N. Brain communication in a
completely locked-in patient using bedside near-infrared spectroscopy. Neurology (in press).
Giacino, J., Ashwal, S., Childs, N., Cranford, R., Jennett, B., Katz, D. I., . . . Zasler, N. D. (2002). The
minimally conscious state: definition and diagnostic criteria. Neurology, 58(3), 349-353
Monti M.M, Vanhaudenhuyse A, Coleman M.R, Boly M, Pickard J.D, Tshibanda L, Owen A.M,
Laureys S. Willful modulation of brain activity in disorders of consciousness. N Engl J Med 2010;362:57989.
Kotchoubey, B., Lang, S., Mezger, G., Schmalohr, D., Schneck, M., Semmler, A., . . . Birbaumer, N.
(2005). Information processing in severe disorders of consciousness: Vegetative state and minimally
conscious state. Clinical Neurophysiology, 116(10), 2441-2453. doi: DOI 10.1016/j.clinph.2005.03.028
Mller-Putz G, Scherer R, Brunner C, Leeb R, Pfurtscheller G. Better than random: a closer look on
BCI results. International Journal of Bioelectromagnetism. 2008; 10(1): 52-55.
Schoenle, P. W., & Witzke, W. (2004). How vegetative is the vegetative state? Preserved semantic
processing in VS patients - Evidence from N400 event-related potentials. Neurorehabilitation, 19(4), 329334.
Article ID 087-4
DOI:10.3217/978-3-85125-378-8-88
Milenas Functional Brain Mapping and BCI Lab, Florida Hospital for Children, Orlando, FL,
USA; 2MEG Lab, Florida Hospital for Children, Orlando, FL, USA; 3Comprehensive Pediatric
Epilepsy Center, Florida Hospital for Children, Orlando, FL, USA; 4g.tec Guger Technologies OG,
Schiedlberg, Austria
*Corresponding author: milena.korostenskaja@gmail.com
Abstract
Brain-Computer Interfaces (BCIs) are expected to become a game-changing
modality for the communication and control of both devices and virtual environments in
humans whose direct neuromotor response needs to be augmented, bypassed, or
replaced. Recent developments in the area of BCI research suggest that subdural
electrocorticography (ECoG)-based BCIs might be more efficient than scalp
electroencephalography (EEG)-based ones. In this study we compared the performance
of an EEG-based P300 speller BCI (which allows typing without utilizing muscular
activity) to an ECoG-based one. Three different approaches (i-iii) were used to select
the best electrode sites from acquired ECoG signals. We found that the ECoG-based
P300 speller performance accuracy varied widely between the different approaches.
Approach iii, based on the choice of the longest duration of significant difference
between TARGET and NON-TARGET stimuli, provided the most accurate ECoGbased results. Those results were comparable to and even surpassed the EEG-based
results. However, the invasive nature of ECoG-based electrode selection has its own
disadvantages. Therefore, as a future perspective, we propose to explore the potential
contribution of non-invasive methods (for example, magnetoencephalography MEG)
to augment the subdural/depth electrode placement process.
1 Introduction
Brain-Computer Interfaces (BCIs) are powerful tools for enabling communication between people
and the surrounding world using direct brain activity recorded non-invasively from the scalp (EEG),
invasively from the brain surface (ECoG), or from deep within the brain itself using depth electrodes
(Shih & Krusienski, 2012). The need for BCIs is immense as it can benefit a number of clinical
populations, such as people with stroke, locked-in syndrome, amyotrophic lateral sclerosis (ALS), and
other severe disorders involving deterioration or damage of muscular system (Silvoni, et al., 2013)
(Birbaumer, Gallegos-Ayala, Wildgruber, Silvoni, & Soekadar, 2014). The P300 speller is currently
the most popular BCI system enabling communication by spelling words or phrases with direct braincontrolled selection from menus presented on a computer screen (Farwell & Donchin, 1988). People
achieve up to 91% accuracy with a speed of 2-3 characters per minute using a scalp EEG-based P300
speller (Guger, et al., 2009). However, some questions still remain: (1) Are ECoG-based BCIs
superior to the EEG-based ones? and (2) How can the performance of ECoG-based BCIs be further
improved? Recent, though limited, reports (Shih & Krusienski, 2012) suggest that invasive ECoGbased BCIs can be faster and more accurate than non-invasive EEG-based BCIs. Approaches to
Article ID 088-1
DOI:10.3217/978-3-85125-378-8-88
improve the accuracy of ECoG-based P300 speller have been proposed (Speier, Fried, & Pouratian,
2013), however they still need further development. Therefore, in this study we sought (1) to compare
ECoG- and EEG- based BCI performance; and (2) to maximize performance of the ECoG-based P300
speller by applying different approaches to the selection of electrodes chosen for ECoG signal
detection.
2 Methods
One female and 3 male patients (mean age 23 +/-10.6) diagnosed with intractable epilepsy and
undergoing evaluation for resective epilepsy surgery were recruited. Two patients (#1 and #2)
underwent both EEG- and ECoG-based recordings with P300 speller; while two other patients (#3 and
#4) underwent only an ECoG-based P300 speller session. Participants received both character- and
face-based stimuli in the testing sessions.
2.1 Training
Each participant was presented with three 5 character words to train the linear classifier to
distinguish the P300 response. The row/column speller that flashes an entire column or row of
characters was utilized. Flashing rows and columns were each presented 15 times (15x15) and an
individual classifier was calculated before the free spelling experiment began. All active electrodes
(8 total) were used for free spelling based on EEG recordings, whereas the 8 best were chosen for
each subject from all possible ECoG electrodes. The criteria for selecting the 8 best in each
approach are detailed below.
Article ID 088-2
DOI:10.3217/978-3-85125-378-8-88
NON-TARGET trials. The ECoG electrode locations for P300 speller varied between study
participants. The main regions of overlap included frontal and central cortex (patients #1-3). In
addition, patients #3 electrodes were located in occipital cortex.
Figure 1. Best eight ECoG P300 responses using approach iii in subject #4 Average signal amplitude over
time for response to TARGET (blue) and NON-TARGET (green) stimuli.
Article ID 088-3
DOI:10.3217/978-3-85125-378-8-88
Importantly, with approach (iii) for the selection of electrodes, the accuracy of the ECoG-based
P300 speller surpassed results from approaches i-ii, as well as scalp EEG-based P300-speller by
providing with accuracy ranging from 85% to 100% (again, for a session with flashing rows and
columns each presented 15 times).
4 Conclusions
Intracranial grid placement may provide users with unique opportunity to control real and virtual
worlds with high speed and accuracy. ECoG-based P300 speller can provide comparable results to the
EEG-based one in terms of accuracy. In fact, it can surpass EEG-based P300 speller accuracy.
Moreover, this accuracy can be further improved. Indeed, the identification of the best locations of
ECoG electrodes plays very important role in ensuring high accuracy of P300 speller performance.
Our proposed signal processing algorithm based on the duration of the significant difference between
TARGET and NON-TARGET responses may be of high value to achieve this goal. It is important
note, that this approach is based on information from invasively implanted ECoG electrodes. Because
the location of the BCI implantation site may play a critical role in performance, we believe that
future approaches should explore non-invasive procedures, such as magnetoencephalography (MEG),
to enhance ECoG electrode placement. We suggest that one of the optimal ways to achieve maximal
P300 speller performance is: (1) use intracranially/subdurally implanted electrodes; (2) navigate
electrode implantation with non-invasive methods; and (3) utilize quantitative approaches/algorithms
based on available ECoG information for the choice of the best electrodes.
References
Birbaumer, N., Gallegos-Ayala, G., Wildgruber, M., Silvoni, S., & Soekadar, S. R. (2014). Direct
brain control and communication in paralysis. Brain Topogr, 27, 4-11.
Farwell, L. A., & Donchin, E. (1988). Talking off the top of your head: toward a mental prosthesis
utilizing event-related brain potentials. Electroen Clin Neuro, 70, 510-523.
Guger, C., Daban, S., Sellers, E., Holzner, C., Krausz, G., Carabalona, R., . . . Edlinger, G. (2009).
How many people are able to control a P300-based brain-computer interface (BCI)?
Neurosci Lett, 462, 94-98.
Shih, J. J., & Krusienski, D. J. (2012). Signals from intraventricular depth electrodes can control a
brain-computer interface. J Neurosci Meth, 203, 311-314.
Silvoni, S., Cavinato, M., Volpato, C., Ruf, C. A., Birbaumer, N., & Piccione, F. (2013). Amyotrophic
lateral sclerosis progression and stability of brain-computer interface communication.
Amyotroph Lateral Scler Frontotemporal Degener, 14, 390-396.
Speier, W., Fried, I., & Pouratian, N. (2013). Improved P300 speller performance using
electrocorticography, spectral features, and natural language processing. Clin Neurophysiol,
124, 1321-1328.
Article ID 088-4
DOI:10.3217/978-3-85125-378-8-89
Introduction
Despite encouraging improvements over the last decade, the P300 Speller remains primarily a
research device rather than a home-based communication aid. One limitation that inhibits its
widespread use outside the research lab is the complexity of setting up the system. Standard
P300 systems employ gel based electroencephalographic (EEG) electrode caps to record electrical activity along the scalp. These caps require a conductive gel to be applied to each electrode
in order to ensure electrical contact with the users scalp, and the caregiver must check each
electrode to ensure that low impedances levels have been achieved. The process can require
thirty minutes or more for an experienced lab technician. For people with locked-in syndrome
who need to use the system on a daily basis, minimizing this set up time is highly desirable.
In an eort to improve the set-up time of the system, the use of dry electrode caps for P300
Spellers has been proposed. Dry electrode caps are much faster and easier to apply since they
do not require any gel to be applied to the users scalp. However, despite sophisticated preamplification, recording EEG with dry electrodes tends to introduce a greater amount of noise
into the system. The lower signal-to-noise ratio (SNR) can decrease the detectability of the
P300 potential which may negatively impact both the spelling speed and accuracy of the P300
Speller. To compensate for the lower SNR, we propose using a data-driven dynamic stopping
algorithm that relies on a Bayesian update process to determine the amount of data collection
needed based on a probabilistic level of confidence that a character is the target [1].
In this study, we first compare the dierences in performance of gel and dry based electrode
caps in the standard static data collection environment for online testing. We then present the
preliminary data performance results of each cap using a dynamic stopping algorithm.
Article ID 089-1
DOI:10.3217/978-3-85125-378-8-89
Methods
Data collection for this study took part in two separate experiments. EEG responses using the
standard static stopping criterion were collected from seventeen healthy participants at East
Tennessee State University (ETSU), while the results from the dynamic stopping criterion were
collected from ten healthy participants at Duke University. Participants in both experiments
completed two P300 Speller sessions; one with a gel based electrode cap and one with a dry
based electrode cap.
EEG responses were measured using electrodes positioned on standard 32-channel caps
according to the International 10-20 system and connected to a computer via a 16-channel
GugerTec g.USBAMP Biosignal Amplifier. The dry electrode cap utilized the GugerTec
g.SAHARA active dry electrode system, which is comprised of 8-pin golden alloy electrodes.
The gel electrode cap was purchased from Electo-Cap International, Inc. Eight electrodes (Fz,
Cz, P3, Pz, P4, P07, P08, and Oz) were used for data collection and classification. These
electrodes have been demonstrated to provide adequate information for P300 Speller communication [2]. The EEG responses were sampled at a rate of 256 Hz.
Participants were presented with a 9x8 grid of characters, which was flashed based on the
checkerboard paradigm [3]. Each target character was flashed twice in a sequence of 24 flashes.
The flash duration was set at 62.5 ms followed by an inter-stimulus interval of 62.5 ms, with an
inter-target interval of 3.5 s. The number of sequence responses collected per target character
dier between experiments and are detailed in Sections 2.1 and 2.2.
2.1
For the static data collection experiment, both the gel and dry based electrode cap sessions
consisted of three calibration runs and three online test runs. At the start of each calibration
session, the participant was asked to copy-spell three six-character words randomly drawn from
a subset of words from the English language (18 characters total). For each character presented,
EEG responses to five flash sequences, or 120 flashes (5 sequences x 24 flashes/sequence = 120
flashes), were collected. These data were preprocessed and features were extracted according
to a method described by Krusienski et al. [4]. Stepwise linear discriminate analysis (SWLDA)
was used to classify the extracted features. Responses to five flash sequences per target of three
six-character words were collected for the three online test runs.
2.2
The calibration runs for both sessions in the dynamic data collection experiment were gathered
in a similar manner to the static data. However, due to the performance results of the static
data (Section 3), we chose to collect more data to improve the weights of the classifier. Each
participant in this experiment was asked to copy-spell five six-character words randomly drawn
from a subset of words from the English language (30 characters total). For each character
presented, EEG responses to seven flash sequences, or 168 flashes (7 sequences x 24 flashes/sequence = 168 flashes), were collected. Preprocessing techniques, feature extraction, and
classification methods were identical to the static data collection experiment.
Five online test runs were collected in each session using the dynamic stopping criterion
presented in Section 1. Instead of having a pre-set number of sequences for data collection, the
dynamic stopping algorithm automatically determined the necessary amount of data to collect
for each target character. The amount of collected data was controlled by a threshold of 90%
on the probabilities that each character in the matrix was the target character. The character
Article ID 089-2
DOI:10.3217/978-3-85125-378-8-89
probabilities were updated after each response to a flash was collected and data collection
stopped once one of the character probabilities increased above 90%.
Results
The results from the static data collection experiment illustrate the impact of the lower SNR
resulting from the dry electrode caps. In Figure 1, accuracy and bit rate are plotted for both
the dry and gel cap sessions of both experiments. Assuming a binomial distribution on the
probability of correctly selecting a set of characters, chance level accuracy was 11.1% for the
static data collection experiment and 10.0% for the dynamic data collection experiment [5] [6].
Bit rate is a measure of communication systems that incorporates accuracy, speed, and the
number of selectable characters presented [7].
Figure 1: Comparison of dry and gel electrode caps in terms of (a) static data collection
accuracy, (b) static data collection bit rate, (c) dynamic data collection accuracy, and (d)
dynamic data collection bit rate. Participants are sorted by gel cap accuracy.
Figure 1(a) and Figure 1(b) display the accuracy and bit rate performance for the results
of the static data collection experiment. Thirteen out of the seventeen participants performed
worse when wearing the dry electrode cap than when wearing the gel electrode cap. An average
decrease of 30% in accuracy and 4.1 (bits/min) in bit rate was observed from the dry electrode
session of this experiment.
The results of the dynamic data collection experiment are shown in Figure 1(c) and Figure
1(d). These results suggest that the dynamic stopping algorithm may improve the performance
of the dry electrode cap by collecting additional data in the online test runs to help compensate
for the added noise. Although the performance of the dry electrode cap remains lower when
compared to the results of the gel electrode cap, the dierence between gel and dry electrode
accuracy and bit rate decreased with an average decrease of 15% in accuracy and 3.5 (bits/min)
in the bit rate. The increase in data collection for the dry electrodes can be observed in Figure
2. For comparison, the number of flashes that would have been collected for a static data
collection of 5 sequences per target character is included. Dynamic data collection increased
the amount of data collected across all participants for the dry electrode system, indicating
that the data-driven dynamic data collection algorithm can detect and respond to the need for
increased data collection in low SNR scenarios. However, the increased data collection was not
able to fully compensate for the low SNR responses measured using dry electrodes.
Article ID 089-3
DOI:10.3217/978-3-85125-378-8-89
Figure 2: Amount of data collected for each participant in the dynamic data collection experiment. A simulated static data collection was included (5 sequences per target character).
Conclusion
Using a dry electrode cap with the P300 Speller would greatly reduce the complexity and time
it takes to set up the system. However, dry electrodes can reduce the SNR of the recorded
responses, reducing their potential utility for a home-based communication aid. Preliminary
results using a data-driven dynamic stopping algorithm compensates for the additional noise
by collecting more data, but performance is still lost when compared to the gel electrode cap
and the time to complete the task is increased. Additional work is needed to further improve
the performance of dry electrode caps in P300 Speller systems.
Acknowledgments
This research was supported by NIH/NIDCD grant number R33 DC010470-03.
References
[1] C. Throckmorton, K. Colwell, D. Ryan, E. Sellers, and L. Collins. Bayesian approach to dynamically
controlling data collection in p300 spellers. IEEE Trans Neural Syst Rehabil Eng, 21(3):50817,
2013.
[2] D. J. Krusienski, E. W. Sellers, F. Cabestaing, S. Bayoudh, D. J. McFarland, T. M. Vaughan, and
J. R. Wolpaw. A comparison of classification techniques for the p300 speller. Journal of Neural
Engineering, 3:229305, 2006.
[3] G. Townsend, B. K. LaPallo, C. B. Boulay, D. J. Krusienski, G. E. Frye, C. K. Hauser, and
E. W. Sellers. A novel p300-based braincomputer interface stimulus presentation paradigm: moving
beyond rows and columns. Clinical Neurophysiology, 121(7):11011120, 2010.
[4] D. J. Krusienski, E. W. Sellers, D. J. McFarland, T. M. Vaughan, and J. R. Wolpaw. Toward
enhanced p300 speller performance. Journal of neuroscience methods, 167(1):1521, 2008.
[5] G. R. Mller-Putz, R. Scherer, C. Brunner, R. Leeb, and G. Pfurtscheller. Better than random? a
closer look on bci results. nternational Journal of Bioelectromagnetism, 10(1):5255, 2008.
[6] S. Silvoni, C. Volpato, M. Cavinato, M. Marchetti, K. Priftis, A. Merico, and F. Piccione. P300based braincomputer interface communication: evaluation and follow-up in amyotrophic lateral
sclerosis. Frontiers in neuroscience, 3, 2009.
[7] D. J. McFarland, W. A. Sarnacki, and J. R. Wolpaw. Braincomputer interface (bci) operation:
optimizing information transfer rates. Biological psychology, 63(3):237251, 2003.
Article ID 089-4
DOI:10.3217/978-3-85125-378-8-90
Introduction
2
2.1
Methods
Adapted online SSVEP-based BCI speller system
Conceptual design. The BCI is based upon a user-facing tablet with modules for both SSVEP
and oscillatory activity (imagery) based communication. In the current manuscript only the
SSVEP module is described. The user front-end of the system runs on an Android tablet, while
Article ID 090-1
DOI:10.3217/978-3-85125-378-8-90
Figure 1: Participant walking on a treadmill while spelling with the SSVEP BCI system. The
User Interface presented on an Android tablet (ASUS Transformer Pad TF700) is shown on
the right side.
Article ID 090-2
DOI:10.3217/978-3-85125-378-8-90
than the mean of a focused target. A selection is made when the mean of a target reaches a
threshold of 0.35 (determined empirically). Feedback about the progress towards selection is
presented to the user in form of green colored frames around the target (Figure 1). Color black
corresponds to a mean of 0, and color light green corresponds to a mean close to the threshold.
The color is updated whenever the mean for that target changes by a certain step size.
The artifact correction method worked on blocks of one second of EEG data. The BCI
system requires operation on smaller time steps. To ensure smooth operation and allow frequent
updates of the feedback, artifact correction and BCI run as separate processes. Communication
and data exchange was implemented through shared memory. Due to the block-wise cleaning
of EEG, however, the BCI feedback presentation was delayed by 1 second.
2.2
To evaluate the online system and test how the online artifact removal method aects the
performance of the BCI we performed a study with four able-bodied male participants (273
yr). All participants had previous experience with SSVEP BCIs. In the experiment participants
had to spell with the SSVEP BCI system during dierent conditions which were more or less
prone to produce artifacts in the EEG. Participants were: (i) sitting in a shielded, shaded
and sound-proof measurement cabin with the tablet placed about 70 cm in front of them on a
table (condition CAB-SIT), (ii) standing in the measurement cabin holding the tablet in their
hands (condition CAB-STND)), (iii) standing outside the measurement cabin on a treadmill
in a room with daylight and holding the tablet in their hands (condition TRD-STND), and
(iv) walking on a treadmill with slow speed (1.4-1.6 km/h) holding the tablet in their hands
(see Figure 1). Condition (iv) was performed twice, once with the online artifact removal
method (condition TRD-WLK, see Figure 1) and once without artifact correction (condition
TRD-WLK-OFF). All other conditions were performed with artifact correction enabled. Before
performing conditions (iv) participants trained spelling during walking on the treadmill for ten
minutes. Conditions were randomized. Participants were asked to write one word of four letters
(German words: Mund, Zaun, Auto, Haus, Werk, Baum) during each run. Participants had
ten minutes respectively to complete the task.
EEG was recorded from 16 electrodes placed over cortical areas according to the international 10-20 system. Electrode positions included AFz, FC3, FCz, FC4, C3, Cz, C4, CP3, CPz,
CP4, PO3, POz, PO4, O1, Oz, O2. Reference and ground were placed at position Pz and P5,
respectively. The g.GAMMAsys system with g.LADYbird active electrodes (Guger Technolgies,
Graz, Austria) and one g.USBamp biosignal amplifier was used to record EEG at a rate of 512
Hz (Notch 50 Hz, 0.5-100 Hz band pass).
Results
Results are summarized in Table 1. Arithmetic mean values, averaged over all participants for
each condition individually, show that the the number of correct and erroneous selections are in
a comparable range for conditions CAB-SIT, CAB-STND and TRD-STND. Also the selections
per minute are in a comparable range over these conditions. When walking on the treadmill
(TRD-WLK) there is a slight decrease in correct selections and in total 0.9 selections per minute.
Disabling the artifact rejection method during treadmill walking (TRD-WLK-OFF) degrades
the performance to 4.7 correct selections per run and 0.67 selections per minute. The average
number of scan steps needed to correctly select an item was in the range 3.8-4.8 selections.
Article ID 090-3
DOI:10.3217/978-3-85125-378-8-90
Table 1: Number of correct/erroneous selections and the number of selections per minute for
each participant (P1-P4) and condition. Additionally the average number of scan steps needed
to make a correct selection (Steps) for each condition is shown.
P1
P2
P3
P4
Mean
Steps
CAB-SIT
07/4, 1.0
11/2, 1.6
07/1, 0.7
08/0, 1.9
8.2/1.7, 1.30
4.5
CAB-STND
09/1, 1.6
10/1, 1.4
08/2, 0.9
08/0, 1.5
8.7/1.0, 1.35
3.8
TRD-STND
09/2, 1.5
10/2, 1.2
07/0, 1.3
06/1, 0.6
8.0/1.2, 1.15
4.6
TRD-WLK
10/5, 1.6
09/1, 0.9
07/3, 0.9
01/0, 0.3
6.7/2.2, 0.90
4.8
TRD-WLK-OFF
12/3, 1.6
5/1, 0.7
2/1, 0.4
0/0, -.4.7/1.2, 0.67
4.4
Discussion
The preliminary results of our study show that when EEG data is contaminated by artifacts the
performance of the BCI system improves when using the developed online artifact correction
method. Participant P4 had problems to spell outside the box as the light in the room was very
bright and dazzled him. In total for three of the participants (P2-P4) the system performed
better with artifact correction method, and for one participant (P1) the system performed
equally well with and without artifact correction.
The implemented scanning mechanism sequentially accumulates evidence until a decision
can be made. Hence, providing user on-demand access is an intrinsic feature of this strategy.
Robustness of detection can be enhanced by optimizing scanning time and selection threshold
for each individual. Selection time can be enhanced by optimizing the scanning protocol.
One big issue when designing the experimental paradigm was to find a strategy to naturally
elicit artifacts. As a first step, we asked participants to walk on a treadmill and hold the tablet
in their hand. Since the results of the study suggest that the developed methods work at a
fundamental level, in the next step we will explore their performance with CP users and adapt
methods and scanning protocols following user-center design principles.
Acknowledgments
This work was supported by the FP7 Framework EU Research Project ABC (No. 287774).
This paper only reflects the authors views and funding agencies are not liable for any use that
may be made of the information contained herein.
References
[1] I. Daly, R. Scherer, M. Billinger, and G. M
uller-Putz. FORCe: Fully Online and automated artifact
Removal for brain-Computer interfacing. IEEE Trans Neural Systems Rehab Eng, accepted.
[2] P. Horki, C. Neuper, G. Pfurtscheller, and G. M
uller-Putz. Asynchronous steady-state visual evoked
potential based bci control of a 2-dof artificial upper limb. Biomed Tech, 55(6):367374, 2010.
[3] R. Scherer, G. R. M
uller-Putz, and G. Pfurtscheller. Flexibility and practicality: Graz braincomputer interface approach. International Review of Neurobiology, 86:119131, 2009.
[4] G.A.F. Seber. Multivariate Observations. Wiley, 1984.
Article ID 090-4
DOI:10.3217/978-3-85125-378-8-91
Neuroelectrical Imaging and BCI Lab. IRCCS Fondazione Santa Lucia, Rome, Italy;
Department of Computer, Control, and Management Engineering Antonio Ruberti, Sapienza
University of Rome; 3Department of Psychology, "Sapienza", University of Rome
f.schettini@hsantalucia.it
Abstract
This work aims at evaluating the usability of an assistive device for people with
amyotrophic lateral sclerosis, designed to permit a wide range of input modalities to
support communication and interaction with the environment. Nine stakeholders
(assistive technology experts, medical doctors and BCI researchers) were involved in
the study, which included four experimental conditions, each considering a different
input modality: touchscreen, buttons (scan mode), headtracker, and a P300-based brain
computer interface. The latter exhibited a lower effectiveness and efficiency with
respect to the other input devices. However no differences were found among the four
conditions in terms of ease of access, ease of use, ease of understanding, usefulness and
satisfaction.
1 Introduction
The user-centered design (UCD; ISO 9241-210) implies that end-users play an active role in the
device design and development following an iterative process. The UCD approach has been recently
introduced in the Brain Computer Interface (BCI) field of research (Zickler et al. 2011; Kubler et al.
2013). The Brindisys project (www.brindisys.it) recently deployed a prototype assistive device (AD)
providing functionalities which are seamlessly accessible through several conventional/alternative
input channels, among which a P300-based BCI, to enhance/allow basic communication and
environmental interaction of people with Amyotrophic Lateral Sclerosis (ALS). The multimodal
access to this AD prototype provides end-users with an adaptable system coping with the different
stages of the disease. This study aims at evaluating the usability of the proposed prototype AD by
involving three different categories of stakeholders, namely medical doctors working with patients
with ALS, psychologists experienced in the assistive technology (AT) field and BCI researchers with
previous experience with ALS. In fact, stakeholders have a broad knowledge about the different
stages of the disease and about the conventional ATs currently adopted by end-users, making their
opinion extremely valuable for the evaluation of a new AT.
Article ID 091-1
DOI:10.3217/978-3-85125-378-8-91
2 Methods
2.1 Description of the Assistive Device Prototype
The AD prototype can be operated through several input modalities: touchscreen, hardware
buttons, headtracker and a P300-based BCI. To ensure portability and affordability, the prototype was
developed on a 10 tablet and the software written in Java and C++ running on the Windows
operating system (Caruso et al. 2013). The selection of available functionalities has been performed in
agreement with the results of a preliminary user survey (Schettini et al. 2014). In the domain of
interpersonal communication, a Graphical User Interface (GUI) running on the tablet, provided three
main functions: (i) an alarm sound to draw the attention of a caregiver; (ii) a simple text editor, for
both face-to-face and remote (SMS) communication; and (iii) an interface to select predefined
sentences for quick communication. In the domain of environmental control, functionalities available
on the GUI included on/off switching of lights and appliances, TV and music players remote control.
These functionalities required the deployment of a domotic kit, i.e. dedicated hardware packed in
briefcase, which includes: a WiFi router (communication with the tablet), three controllable mains
sockets (appliances), an infrared controller (TV remote), and an UMTS router (internet monitoring).
As for the BCI input, stimulation timing and data acquisition were managed by the BCI2000
framework. A custom software program managed the communication between BCI2000 and the GUI,
and generated the visual stimuli (green grids), necessary to generate evoked potentials, on top of the
aforementioned input-independent GUI.
Article ID 091-2
DOI:10.3217/978-3-85125-378-8-91
3 Results
Figure.1 a) Accuracy achieved during the tasks in all conditions; b) Time for correct selection in all
conditions; c) Efficiency perceived by stakeholders
With regard to effectiveness, (Figure 1.a.), the ANOVA showed significant differences among the
four conditions ( =25.59; p<.01). The P300-BCI exhibited significant lower accuracy (p<.05) with
respect to other conditions. Figure 1.b shows the values of time for correct selection. The ANOVA
pointed out significant differences among the four conditions ( =49.02; p<.01). P300-based BCI
exhibited higher (p<.05) time for correct selection with respect to the other conditions, the touch
screen being the fastest condition (p<.05), as assessed by the post-hoc test.
These differences influenced the efficiency perceived by participants and assessed by means of the
questionnaire. Indeed, Friedman ANOVA showed a significant difference among the four conditions
in the efficiency variable ( =7.9; p<.05). As a result of the post-hoc test, efficiency of the touch
screen condition resulted higher than the two buttons condition (p<.05) and the BCI condition (p<.05).
No significant differences were found by analyzing the other 5 variables (ease of access, ease of use,
ease of understanding, usefulness and satisfaction).
With regard to the comparison between the first and the second run, the ANOVAs point out
significant differences neither in terms of accuracy increment ( =1.74; p=.62) nor in terms of time
for correct selection ( =2.20; p=.53) among the four conditions (Table 1).
The analysis of feedbacks about the overall usability of the system highlighted some usability
issues regarding both the structure of the GUI (e.g. the pause function resulted confusing), and some
weak points with specific input devices (e.g. the alarm call menu was not well designed for the
buttons input). Problems identified in usability were not directly related to the inclusion of the BCI as
control input.
Article ID 091-3
DOI:10.3217/978-3-85125-378-8-91
Table 1. Comparison between the first and the second run. White and gray cells denote an increment
and a decrement of the value from the first to the second run respectively.
Mean
Touch
Screen
0,05%
Accuracy
Head
Buttons
tracker
2,51%
0,37%
3,19%
Touch
Screen
0,29
P300-BCI
Median
0,00%
0,00%
0,00%
2,05%
0,26
1,14
0,11
0,22
I Quartile
0,00%
9,38%
0,00%
3,33%
0,18
0,05
0,01
2,41
III Quartile
0,00%
0,00%
0,00%
5,53%
0,33
2,43
0,39
1,99
Min Value
0,00%
12,06%
6,67%
28,48%
0,12
1,98
0,23
10,69
Max Value
0,42%
10,00%
3,33%
12,50%
0,73
3,40
0,64
4,19
4 Discussion
Despite not conclusive, these results confirm the feasibility of a single AD accessible from a broad
range of input modalities, including a BCI. In fact, no differences in terms of simplicity in getting
acquainted with the prototype were pointed out by stakeholders. The P300-BCI exhibited lower
efficiency and effectiveness with respect to the other access conditions, but this did not affect the ease
of access, ease of use, ease of understanding, usefulness and satisfaction perceived by participants
with BCI. Indeed no differences were found between the four input conditions. This confirms that the
overall system has been perceived by stakeholders as a single multimodal AD system.
Acknowledgments
The work is partly supported by the Italian Agency for Research on ALS ARiSLA, project
Brindisys.
References
Caruso, M. et al., 2013. My-World-in-My-Tablet: An Architecture for People with Physical
Impairment. In M. Kurosu, a c. di Human-Computer Interaction. Interaction Modalities and
Techniques. Berlin, Heidelberg: Springer Berlin Heidelberg, pagg. 637647.
Kbler, A., et al., 2013. Applying the user-centred design to evaluation of Brain-Computer Interface
controlled applications. Biomed Tech (Berl). doi:10.1515/bmt-2013-4438
Schettini, F. et al.; 2014. An assistive device with conventional, alternative and Brain-computer
interface inputs to enhance interaction with the environment for people with Amyotrophic
Lateral Sclerosis: a feasibility and usability study. Archives of Physical Medicine and
Rehabilitation in press.
Zickler, C. et al., 2011. A brain-computer interface as input channel for a standard assistive
technology software. Clinical EEG and neuroscience, 42(4), pag.236244.
Article ID 091-4
DOI:10.3217/978-3-85125-378-8-92
We report the development of an aective BCI based on frontal alpha asymmetry neurofeedback, which has previously been used in clinical experiments. Our results evidence a
pattern of high-performance for some subjects, combined with high illiteracy, with 52.3%
of subjects succeeding in the neurofeedback task. We suggest that individual asymmetry
baseline values may be one of the factors explaining BCI illiteracy in this context.
Frontal alpha EEG asymmetry (henceforth FAA) was identified by Davidson [2] as a correlate
of approach/avoidance, a high-level emotional dimension that plays an important role in many
aspects of human psychology. FAA was later demonstrated to be a marker of clinical depression,
which led to the proposed use of neurofeedback (NF) techniques for depression therapy [6],
demonstrating this marker to be amenable to NF.
We have explored the use of FAA to implement an aective BCI for interactive media
based on NF [5]. In this system, a virtual narrative is generated in the form of a real-time
3D animation, in which the user can intervene to support the central character by expressing
positive thoughts, which in turn will drive the evolution of the narrative towards a happier
ending for the character. A previous feasibility pilot has demonstrated a success rate of 50%
for an all-male sample of 12 subjects. Furthermore, these experiments were carried out under
an EEG/fMRI installation to provide a first level of anatomical validation, which confirmed
our original hypothesis [4]. In this paper we present the results of follow-up work in which
we explore the determinants of successful NF, using a larger number of subjects and analysing
training data.
Following previous work we decided to use the A2 score (F4 F3 )/(F4 + F3 ) (with F4 (R)
and F3 (L) electrodes) as a measure of FAA for NF input. However, because of the intrinsic
fluctuations in A2 and the need to provide a stable signal to support visual feedback (here,
the central characters skin colour saturation) that reduces jitter, we used a 4-point moving
average as a simple low-pass filter (henceforth M A2 ). As a baseline, we used the A2 average
value obtained during a 2min calibration period (which was discussed by Allen et al. [1] as the
minimum reliable calibration time).
Since our NF signal is visual and continuous, we defined a mapping function between each
subjects individual baseline and an empirically determined maximum value for M A2 onto the
0-100% saturation range. The rationale for determining such a maximum was that the activity
in the right hemisphere cannot be reduced infinitely hence the A2 score should be seen as
asymptotic. We conducted a calibration experiment with 16 subjects to empirically determine
Article ID 092-1
DOI:10.3217/978-3-85125-378-8-92
the mapping of M A2 values to saturation, where we used the variation in the subjects M A2
values to determine maximum saturation. In order to avoid a few high scores raising the value
where maximum saturation occurs (thereby decreasing the amount of feedback received for
scores that are likely to occur more frequently) we decided to define the maximum feedback
below the maximum of M A2 values ( 0.7, across all subjects). When applying a mean +
1.64SD filter1 , the average dierence between subjects maximum M A2 value and their M A2
mean (i.e. baseline) was 0.23 (SD = .06). It should be noted that this finding is consistent
with Zotev et al. [8] setting variations to A2 of 0.2 as a success criterion in their own NF
experiments. This led us to determine the M A2 value resulting in maximum saturation for
each subject as min(0.7, baselinesubject + 0.2), making saturation scores subject specific.
The next step was to define a success score for a NF trial. This consists in the ability
to increase M A2 and to sustain this increase over a non-trivial amount of time (so as to
distinguish it from spontaneous fluctuations). In order for the success score to be normalised
across subjects, we used the above saturation scores instead of raw M A2 since they already
incorporate the individuals A2 baseline. We assign a maximum score of 100 to an ideal result
of 100% saturation over the full 30s NF window. This score is meant to account for any
combination of above-threshold amplitude and time, essentially integrating above-threshold
values over time. For instance, a score of 10 would correspond to either 100% saturation over
3s, 50% saturation over 6s, 30% saturation over 10s ... this is meant to cover the very dierent
NF patterns observed across subjects, while appearing a stricter criterion than that of [3] who
imposed above threshold values to be sustained for 500ms as well as Rosenfeld [6] and its
number of hits above threshold metric.
Finally, a trial can only be considered successful if M A2 actually increases during NF over
its previous resting value. Following several authors (including Zotev et al. [8] who have used
as a reference the average asymmetry scores during the preceding 40s), we have defined as a
criterion for success for a NF trial that it reaches a score of 10, such score being also higher
than that of the preceding resting period of 15s in-between trials (scores being normalized for
respective durations of resting and NF periods during trials), as shown in Figure 1.
Methods
We recruited 36 subjects (17 male, 19 female), whose average age was 30.4 years (SD =
9.25, range : [20, 52]). Experiments were approved by our local ethics committee, and subjects
were issued detailed consent forms; all data were anonymised. Subjects were sat in a comfortable
armchair in a quiet room with dimmed lighting, and they were given instructions on how to
relax to minimise muscular artefacts as well as avoiding blinking as much as possible. They
were introduced to the concept of a NF loop in simple terms, with the characters skin colour
saturation introduced as a visual indicator of the magnitude of mental support (as shown
in Figure 2). Instructions were deliberately generic (express positive thoughts towards the
character.), in order to avoid influencing users cognitive strategies towards any implicit or
explicit one.
Each training block consisted of a 30s NF session preceded by a 15s resting period during
which subjects were instructed to relax and remain staring at a black screen (Figure 2a).
Each subject performed 12 successive training trials for a total duration of 9min. All subjects
completed the training session. We defined a subject as successful in training if s/he succeeded
in 6 out of 12 trials. After the training session, each subject participated in one session of the
1 Filtering
Article ID 092-2
DOI:10.3217/978-3-85125-378-8-92
Figure 1: Training results. (a) Number of successful training trials per participant, (b) comparing scores to previous resting periods for each trial for one subject, and (c) for another subject,
rest values are greater than NF for trials 3, 6, 7, 9, 11, 12, which are all rated failed. The
blue line represents success threshold as a score of 10 (see text).
Figure 2: Experiment setup. (a) Initial calibration, followed by 12 training trials pairs
(Rest/NF). (b) BCI-based narrative. During NF, M A2 is mapped to the colour saturation
of the character in need of support by the user (see text).
Article ID 092-3
DOI:10.3217/978-3-85125-378-8-92
Figure 1a plots the number of subjects that performed successfully on a given number of trials
during NF training. The most striking finding is to observe FAA NF as a high-performance,
high-illiteracy BCI: the average in-story score for successful subjects is actually 20, twice the
success threshold we have defined, together with a global success of 52.3%. However, the
proportion of subjects failing the task (although not all failure can be assimilated to BCI
illiteracy) is much higher than previous reports of BCI illiteracy, which were in the region of
20-30% [7]. Since BCI illiteracy is considered to be specific to the BCI technique considered, we
investigated the role of the A2 baseline as a possible determinant of illiteracy by calculating the
biserial correlation between the individual threshold value and training success, rb = .69, p =
.002, which suggested a strong and statistically significant negative relationship between baseline
and training success. On the other hand, the contribution of training, or lack thereof, to
the observed illiteracy failure rate figures, is more difficult to assess without extending our
experiments to the same level of training as the one described for clinical applications (previous
authors having reported hours of training over multiple sessions). However, our ethical approval
was limited to one short session, as these are unlikely to alter subjects trait variables over a
prolonged period. Another possible direction to explore to improve performance consists in
the baseline measure itself. Following previous literature, we have alternated epochs of eyes
open and closed during baseline measures, but we have observed by retrospective analysis3 that
only using eyes open signals (as Davidson reported for experiments with films [2]) would have
resulted in a lower baseline, hence potentially higher success rates.
References
[1] J. J. B. Allen, J. A. Coan, and M. Nazarian. Issues and assumptions on the road from raw signals
to metrics of frontal EEG asymmetry in emotion. Biological Psychology, 67(1):183 218, 2004.
[2] R. J. Davidson, P. Ekman, C. D. Saron, J. A. Senulis, and W. V. Friesen. Approach-withdrawal
and cerebral asymmetry: Emotional expression and brain physiology: I. Journal of personality and
social psychology, 58(2):330, 1990.
M. Dias and A. van Deusen. A new neurofeedback protocol for depression. The Spanish Journal
[3] A.
of Psychology, 14(01):374384, 2011.
[4] S. W. Gilroy, J. Porteous, F. Charles, M. Cavazza, E. Soreq, G. Raz, L. Ikar, A. Or-Borichov,
U. Ben-Arie, I. Klovatch, and T. Hendler. A brain-computer interface to a plan-based narrative.
In Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence, pages
19972005. AAAI Press, 2013.
[5] C. M
uhl, A.-M. Brouwer, N. van Wouwe, E. L. van den Broek, F. Nijboer, and D. K. J. Heylen.
Modality-specific aective responses and their implications for aective BCI. In Proceedings of the
Fifth International Brain-Computer Interface Conference, pages 120123. Verlag der Technischen
Universit
at, 2011.
[6] J. P. Rosenfeld, G. Cha, T. Blair, and I. H. Gotlib. Operant (biofeedback) control of left-right
frontal alpha power dierences: Potential neurotherapy for aective disorders. Biofeedback and
Self-Regulation, 20(3):241258, 1995.
[7] C. Vidaurre and B. Blankertz. Towards a cure for BCI illiteracy. Brain Topography, 23(2):194198,
2010.
[8] V. Zotev, R. Phillips, H. Yuan, M. Misaki, and J. Bodurka. Self-regulation of human brain activity
using simultaneous real-time fMRI and EEG neurofeedback. NeuroImage, 85:985995, 2014.
3 Repeated measures t-test confirmed that eyes-open baseline (M = .25, SD = .13) was significantly lower
than eyes-closed baseline (M = .35, SD = .14), t(35) = 6.61, p < .001, r = .75(large).
Article ID 092-4
DOI:10.3217/978-3-85125-378-8-93
Abstract
Cognitive states can be monitored and exploited in real-time by brain-computer interface (BCI) systems. Comparable to motor rehabilitation after stroke, we propose a simple
BCI-supported paradigm for the cognitive rehabilitation of speech production deficits in
aphasia patients. The paradigm thrives to close the loop between top-down execution and
bottom-up perception, while constraining the execution of a rehabilitation trial to BCIdetectable attention patterns. In an oine study, the basic characteristics of the novel
paradigm were explored with n=9 normal hearing healthy students in a free-field spatial
auditory setup. Event-related potentials (ERPs) of the electroencephalogram were analyzed and the possibility for an online classification of target vs. not-target words was
explored. Despite diering in delay from ERP components evoked by artificial tones, the
ERPs evoked by words can be classified on a comparable level in single trial, encouraging
future tests of the attention-constraint paradigm in closed-loop.
Introduction
The inability to speak certain words (e.g. after stroke) can have dierent causes, ranging from
a lesion within the dual loop model of language [4, 6] to disturbed attention mechanisms [1, 3].
Several cognitive models propose that word production may derive from high and rapid online
interaction between top-down and bottom-up processing at dierent levels (i.e. formulation of
conceptual representation into a linguistic form and an articulation level).
It is intriguing to engage both interaction directions in a rehabilitation approach. Practically this requires to close the loop from (1) the willful attempt of a word production or at least
the concentration onto a word to (2) producing the sound of the intended word and thereby
providing sensory feedback of the (artificially) spoken word. Transferring methods from spatial
auditory BCI paradigms, we make a first attempt towards such an attention-constrained rehabilitation paradigm: a BCI system, detects attention onto a difficult-to-produce target word
among a sequence of several words. Upon detection, a trial ends by playing the target word,
which closes the loop via auditory feedback.
2
2.1
Methods
Participants
Two pilot sessions were performed. Data thereof was used to fine-tune experimental parameters,
but excluded from further analysis. After providing written informed consent, N=9 healthy
students (5 male and 4 female, age 2226 yrs, no history of hearing defects, native German
speakers) participated in the study.
Article ID 093-1
2.2
DOI:10.3217/978-3-85125-378-8-93
Experimental Setup
Subjects were seated in a ring of six speakers (AMUSE paradigm, [5]). A screen provided a
fixation cross, indicated periods for relaxation and active trials. At trial start, subjects received
static visual and auditory information about the current target direction.
Stimuli and Trial Structure. The following one-to-one relation between sentences and endof-sentence words had been learned by subjects during a familiarization phase:
Wir m
ussen putzen,
uberall ist ... Dreck.
Komm, wir gehen raus an die frische ... Luft.
Zum Fr
uhst
uck mache ich mir ein belegtes ... Brot.
Wir machen Picknick auf der Wiese im ... Gras.
Alle Kinder trinken gerne ... Saft.
Er hat schon wieder gewonnen - er hat so viel ... Gl
uck.
Prior to presenting a rapid stimulus sequence consisting of the six end words, one of the
sentences was cued from a loudspeaker. It indicated the target direction of the current trial.
(Target directions were pseudo-randomized between trials.) The last word of the sentence was
missing, and subjects were instructed to focus their attention to complement the missing word.
The subsequent rapid sequence of 90 word stimuli per trial was presented with a stimulus onset
asynchrony (SOA) of 175 ms. Comprising 245 ms each, word stimuli slightly overlapped in
time. During a run, the six words were tied one-to-one to the loudspeakers. This mapping was
changed between runs and balanced over runs.
Structure. Subjects were familiarized with a six-class AMUSE spatial auditory setup by
first slowly. During the familiarization phase only, subjects had to count and indicate by finger
movements when they had perceived target stimuli. Subsequently, three main blocks were
performed with an SOA of 175 ms and one last block with 300 ms. Only data recorded with
175 ms SOA entered the analysis. Each block consisted of six runs, every run of six trials.
A trial contained 90 stimuli (15 iterations of the six stimuli). As each iteration contained
only one target stimulus and five non-target stimuli, this procedure totaled to an amount of
3 6 6 15 1 = 1620 target epochs and 3 6 6 15 5 = 8100 non-target epochs.
Electroencephalogram Recordings. During a single session, electroencephalogram (EEG)
was recorded from 63 passive Ag/AgCl electrodes (EasyCap / BrainAmpDC amplifiers). The
electrodes were placed according to the 10-20-system and referenced against the nose. In addition, one channel was placed below the right eye. Signals were sampled at 1 kHz, band-filtered
between 0.01 Hz and 250 Hz and stored for oine processing.
2.3
Data of one subject had to be removed due to excessive eye blinks. Data of the remaining
eight subjects was bandpass-filtered between 0.5 Hz and 20 Hz. For each stimulus, an epoch
was windowed at [-175 ms , 1200 ms] relative to stimulus onset. Epoch osets were corrected
based on the interval [-175 ms , 0 ms]. Epochs were removed, if the variance of their amplitude
exceeded three standard deviations or if max-min amplitudes exceeded 80 V . On average, 37
target epochs and 182 non-target epochs were dropped per subject. Remaining epochs were
averaged over subjects, but separately for target- and non-target stimuli to reveal the grandaverage ERP structures (time courses and scalp maps) of the word paradigm.
Article ID 093-2
Cz (thick)
FC5 (thin)
0
t
-1
nt
-2
200
400
600
800
0.7
[ms]
1000
[V]
-2
0.65
0.1
[V]
nt
sAUC (t,nt)
-2
0.6
[sAUC]
[V]
DOI:10.3217/978-3-85125-378-8-93
-0.1
150 - 260 [ms]
Gras
Luft
Brot
Figure 1: Left: evoked ERP components (grand average, n=8). TOP ROW: average time
series of EEG channels Cz and FC5 for target (blue) and non-target (green) epochs. A color
bar below the time series indicates the class-discriminative information (signed AUC) values
contained over time in the Cz channel. MIDDLE ROWS: average potential maps evoked by
target stimuli (t) and non-target stimuli (nt) of three earlier intervals and a later one (see
corresponding light and dark gray shadings of top row). BOTTOM ROW: maps of channelwise class-discriminative information content as indicated by signed AUC values of the same
intervals. Right: Classification accuracies estimated for six classifiers trained separately for
each word. Asterisks indicate the eight subject-specific accuracies, grand average results are
indicated by colored circles.
Prior to classification with a shrinkage-regularized linear discriminant analysis (LDA), 15
average potentials (six intervals of 20 ms duration between 170 ms and 290 ms and nine intervals
of 60 ms length between 290 ms and 830 ms) were extracted from each EEG channel, resulting
in 15 63 = 945 dimensions per epoch. Classification values are given relative to a chance level
of 0.5 and were estimated by five-fold chronological cross-validation.
To get an overview over the novel paradigm and to estimate its single-trial classification level,
a binary target vs. non-target classifier was trained for each subject. The average performance
over all subjects is reported.
In addition, six separate binary classifiers were trained per user. For each classifier, only
epochs from one of the six words were included. Though this strategy reduced the available
amount of data by a factor of 1/6 and thereby lead to classifiers of decreased performance, it
allowed us to investigate systematic dierences between words.
Results
The evoked ERP components for target- and non-target words varied strongly between users.
For this reason, the left plot of Figure 1 must be taken with caution. It shows the grand average
ERP response for two channels and four selected time intervals. Over subjects, the word stimuli
elicited an attention-modulated, class-discriminative fronto-central to fronto-bilateral negativity
Article ID 093-3
DOI:10.3217/978-3-85125-378-8-93
around 230 ms post stimulus onset, and a subsequent class-discriminant central positivity ending
at approx. 800 ms with peak amplitudes at more (right-) lateralized electrodes. Compared to the
original AMUSE setup, which also shows these two class-discriminative features, the latencies
on average are longer for the word stimuli. The excessive length of the class-informative late
positivity in fact is caused by averaging over shorter corresponding intervals of single subjects,
which strongly dier in latency and duration.
Classification analysis reveals, that the word stimuli can be classified with accuracies between
65 % and 77 %. The grand average performance over subjects is 71.2 %, which slightly improves
over the brisk AMUSE tone stimuli [5] (68.5 %).
Direction-related performance dierences with word stimuli by and large replicate findings
of the AMUSE reports (not shown). Thus it is interesting to see, if the six words, which have
been randomized over the loudspeaker directions, result in similar classification performances.
The right plot of Figure 1 shows the results of six classifiers trained specifically on epochs of one
word each. Fortunately the spread of binary classification accuracies between words is small
(between 0.625 and 0.655).
The data presented is an indicator for the eectiveness of the proposed attention-constrained
word paradigm to elicit target and non-target responses, such that profound problems are not
expected when transferring the paradigm to an online setup. As next steps, the trade-o
between delayed high-quality feedback (cp. to current auditory BCI routines) and immediate,
but more unreliable feedback must be addressed. Furthermore, the feasibility of the paradigm
with individually chosen word sets needs investigation, as well as the use of longer words. The
far goal is to test the potential eectiveness of the attention-constrained aphasia rehabilitation
training procedures, which of cause can only be verified in randomized, blinded online studies
with the target user group of chronic stroke patients. Tight time budgets in clinics or for homeuse should be accommodated for by reducing the calibration time to a minimum. A combination
of the attention-constrained paradigm with recently proposed transfer learning approaches in
combination with unsupervised adaptive classification [2] may provide a solution.
Acknowledgments The authors appreciate their funding by the German Research Foundation within the Excellence Cluster BrainLinks-BrainTools (grant EXC 1086 ).
References
[1] F. Burgio and A. Basso. Memory and aphasia. Neuropsychologia, 35(6):759766, 1997.
[2] P.-J. Kindermans, M. Tangermann, K.-R. M
uller, and B. Schrauwen. Integrating dynamic stopping,
transfer learning and language models in an adaptive zero-training erp speller. J Neural Eng,
11(3):035005, 2014.
[3] L. L. Murray. Review attention and aphasia: Theory, research and clinical implications. Aphasiology, 13(2):91111, 1999.
[4] J. P. Rauschecker and S. K. Scott. Maps and streams in the auditory cortex: nonhuman primates
illuminate human speech processing. Nature neuroscience, 12(6):718724, 2009.
[5] M. Schreuder, B. Blankertz, and M. Tangermann. A new auditory multi-class brain-computer
interface paradigm: Spatial hearing as an informative cue. PLoS ONE, 5(4):e9813, 2010.
[6] C. Weiller, T. Bormann, D. Saur, M. Musso, and M. Rijntjes. How the ventral pathway got lostand
what its recovery might mean. Brain and language, 118(1):2939, 2011.
Article ID 093-4
DOI:10.3217/978-3-85125-378-8-94
IRCCS Fondazione Santa Lucia, Rome, Italy. 2Dept. of Computer, Control, and Management
Engineering, Univ. of Rome Sapienza, Italy.
manuela.petti@uniroma1.it
Abstract
Recent BCI applications stroke motor rehabilitation have raised important concerns
regarding the type of brain activity which one would train in agreement with an
evidence-based approach in rehabilitation. In this pilot study we proposed an offline
analysis on EEG data acquired during a BCI-assisted motor imagery training performed
by a stroke patient, with the aim of defining an index for the evaluation of the training
achievements across session. The proposed h parameter would be independent from the
selected BCI training setting and would better describe the physiological properties of
the patterns generated during training, allowing a more appropriate evaluation of the
training achievements than the behavioral performance (i.e. percentage of hit target).
1 Introduction
Nowadays, Brain Computer Interface (BCI) represents a promising technology to support motor
and cognitive rehabilitation after stroke. In such rehabilitative context, BCI application aims at
increasing the neuroelectric or metabolic brain responsiveness, which in turn would lead to a better
recovery of function.
The Electroencephalographic (EEG) -based BCI operated by motor imagery (MI) can provide a
valuable approach to support mental motor practice to enhance arm motor recovery after stroke [Mattia
et al., 2012]. However, as stroke cortical lesions may result in a functional reduction/derangement of
neuroelectrical activity generated over the ipsilesional hemisphere there is a need for further
implementation of the procedures for recognition of those EEG patterns which are reinforced during
the BCI-supported training of MI. Furthermore the online classification of trials as successful and failed
also relies on a arbitrary choice of parameters and gains that do not strictly reflect the intrinsic
properties (i.e. the level of desynchronization of SMR) of the patterns of activity recorded during the
training.
The aim of this study was to define an index which would be independent from the settings adopted
for the online control and thus, would describe the properties of neuroelectrical activations across BCI
training sessions more appropriately than the hit rate (behavioral performance). To this purpose, we
performed an offline analysis of EEG data sets acquired from stroke patients who underwent a MIassisted BCI training aiming at promoting functional motor recovery of the paralized upper limb
[Pichiorri et al., 2011]. The estimated index was monitored across training sessions and used to sort
trials according to their intrinsic properties.
Article ID 094-1
DOI:10.3217/978-3-85125-378-8-94
2 Methods
2.1 Experimental design
EEG data were collected from 14 stroke patients (age: 64 8 years; first ever, unilateral stroke
causing hemiparesis/plegia) who underwent a BCI-assisted MI training. The training was preceded by
an EEG screening session. EEG signals were recorded from 61 scalp positions (sampling rate 200 Hz)
and such data were used to extract the features for the online control. Such control features were
spatially selected only over the damaged (stroke) hemisphere (two channels) within an EEG frequency
range relevant for sensorimotor function (10-15 Hz). The training protocol included 4 weeks of MIbased BCI training (3 sessions per week), during which the patient was asked to control the
movements of a virtual representation of his own stroke-affected hand throughout the imagination of
simple hand movements (visual neurofeedback). Each training session included 4 up to 8 runs (20
trials per run). Trials consisted of a baseline period (4 sec) followed by MI (max 10 sec). EEG signals
(sampling rate at 200 Hz) during training were collected from 31 electrode positions (fronto-central,
central, centro-parietal and parietal lines).
3 Results
The results are showed for one exemplary training data set acquired form a stroke patient with left
affected hemisphere whose selected features were C3 and Cp3 electrode position at frequency bin of
10-12 Hz.
Article ID 094-2
DOI:10.3217/978-3-85125-378-8-94
The graphs in Figure 1 show the trend of the h median values relative to each trial (left panel) and
of the correspondent hit rate percentage (behavioral performance) (right panel) as a function of
training sessions. In particular, Figure 1 shows that the median follows the positive outcome of the
training with a decreasing trend along the investigated sessions. The hit rate, instead, after large
oscillations in the first half of training sessions, saturated around 90% for the second half, not showing
a clear trend along the intervention.
Figure 1: Median value of h parameter (left panel) and hit rate percentage (right panel) across training
sessions.
The Figure 2 illustrates the distribution of the h parameter (top panel) and the relative statistical
scalp maps (bottom panel) obtained from the second training session data set, considering the two
groups of trials divided by the median. Note that only the statistical scalp map relative to the trials
distributed to the left of the median (i.e. stronger desynchronization) showed a significant pattern
(blue color in the left side map) as compared to that relative to the trials distributed to the right of
median.
Article ID 094-3
DOI:10.3217/978-3-85125-378-8-94
Figure 2: Distribution of the h parameter (top panel) and statistical scalp maps (bottom panel) obtained from the
second training session data set. Top panel: dashed line indicates the median (percentile 50%). Bottom panel: the
color bar codes for t-value associated with each pixel: gray color = not significant differences between baseline
and task trial PSD values; hot (yellow-red) = significant synchronization; cold (blue) = significant
desynchronization.
References
D. Mattia, F. Pichiorri, M. Molinari, R. Rupp, "Brain computer interface for hand motor function
restoration and rehabilitation", Towards Practical Brain-Computer Interfaces for Hand Motor
Function Restoration and Rehabilitation, 131-153, 2012, Springer.
F. Pichiorri; F. Cincotti; F. de Vico Fallani; I. Pisotta; G. Morone; M. Molinari; D. Mattia;
"Towards a brain computer Interface-based rehabilitation: from bench to bedside", Proceedings of the
5th international braincomputer interface conference, Verlag der Technischen, 268-271, 2011.
Welch, (1967) "The Use of Fast Fourier Transform for the Estimation of Power Spectra: A
Method Based on Time Averaging Over Short, Modified Periodograms", IEEE Transactions on Audio
Electroacoustics, AU-15, 7073.
Y. Benjamini and D. Yekutieli, The Control of the False Discovery Rate in Multiple Testing
under Dependency, The Annals of Statistics, vol. 29, no. 4, pp. 11651188, Aug. 2001.
Article ID 094-4
DOI:10.3217/978-3-85125-378-8-95
A preliminary analysis of the correlation between high resolution EEG and three
dimensional (3D) hand motion trajectories is presented. The study involved assessing
which EEG frequency components and cortical areas show the most significant
correlation with hand motion trajectory towards five targets positioned in different
locations in space. The time shift between time-power pattern of the selected EEG
frequency and related motion trajectory is also analyzed. The results show strong
correlation between EEG and kinetic data in low frequency (0.5-4Hz) range as well as
significant correlations in the 28-36Hz range. The results indicate that these EEG
frequency bands may best be used to develop an EEG-based brain-computer interface
(BCI) for the decoding of 3D hand motion trajectories.
1 Introduction
Motion trajectory prediction based BCIs aim to exploit the relationship or correlates between EEG
signals and limb motion to decode an imaginary limb movement in 3D space. To date only a limited
number of studies have investigated EEG and kinetic data associated with 3D limb movements [1],
[2]. Studies have shown that band-pass filtered low frequency EEG components around 2Hz convey a
lot of information to the decoding task. Although Antelis et al [3] have drawn attention to possible
misinterpretation of the results when using low frequency based motion prediction models, Paek et al.
[4] have recently demonstrated the feasibility of decoding finger kinematics from low frequency scalp
EEG signals. To date a comprehensive assessment of the spatial and spectral EEG correlates of real
and imagined 3D hand motion trajectory has not been conducted. This paper aims to address this with
an analysis of subjects undergoing a high resolution EEG recording whilst performing real and
imagined 3D hand movements towards five targets positioned in different location in space. Subbands in the 0.5-40Hz EEG spectrum which show the highest level of correlation with movement
trajectory are identified. The correlation strength of different cortical areas and different phase shifts
between EEG and kinetic records are also assessed to learn more about the relationship between EEG
signals and kinetic data.
Article ID 095-1
DOI:10.3217/978-3-85125-378-8-95
below and above the shoulder plane, forming 90 between the torso and the shoulder. Real movement
blocks to a particular target were followed by imagined movement blocks to the same target.
Participants could choose a home position that varies between subjects and blocks. Figure 1 illustrates
motion trajectories of Subject 1. The task cue was synchronized with an auditory signal. Movements
were followed by a rest phase. Both fast movements and slow movement blocks were interleaved
where the length of motion and rest phases was 800ms and 500ms for slow and fast movement, e.g.,
Block 1 movement H ! 1 !"#"!"$"!$"%&'()"*+,-.. The duration of the blocks was 48s, with an
approximate inter-block interval (IBI) of 30s. The number of registered blocks was 20 from which
only ten were considered in this study (slow and fast real movements between home and target
positions). The analysis of the imagined movement blocks will be the subject of future study. Datasets
containing parallel registered Electroencephalogram (EEG), Electromyogram (EMG) and kinetics
data were acquired from six healthy right handed male human subjects (age range 25-42 years). EEG
signals were registered in 62 channels + 1 electro-oculogram (EOG) at 1200 Hz. EMG was recorded
from the Biceps with a sample rate of 2000Hz. Kinetic data were recorded from the right dominant
hand, elbow and shoulder at 30 frame per seconds (FPS) using a 3D Microsoft Kinect camera system.
All datasets were acquired at the Hybrid BCI lab at Holon Institute of Technology (HIT), Israel.
3 Preprocessing
EEG and kinetic data were stored in a pointer based structure that enabled variable length of
epochs without re-slicing and reduced memory requirement. Baseline shift was removed and the EEG
was filtered by 0.5-40Hz, eight-order, band-pass Butterworth filter. The Fast Fourier Transformation
(FFT) was applied for calculating power value of EEG frequency components in 2Hz wide bands
(non-overlapped) between 0 and 40 Hz. We used 500ms width FFT windows and 33.3ms time lags
between two windows. This lag has been chosen for ease of synchronization with kinetic data sampled
at 30FPS. Figure 2 illustrates the setup of FFT window for slow tasks i.e., 800ms. Smoothing filter
has been applied on kinetics data for noise reduction as the band-pass filtering causes a distortion in
data at the beginning of motion. The filter calculated a mean value of five adjacent samples.
Time synchronization is crucial in this comparative analysis. We used the sample rate converted
EEG triggers for synchronizing the kinetic data with the EEG data where the first trigger was recorded
simultaneously for both data types. Task compliance validation was performed manually. Tasks were
considered valid only when converted EEG triggers matched the beginning of the motion. Subject 6
was discarded due to inadequate kinetic records.
4 Correlation Analysis
The correlation coefficient between a power pattern (belonging to one of the computed FFT
frequencies) and related kinetic data was calculated for each valid trial.
Article ID 095-2
DOI:10.3217/978-3-85125-378-8-95
The kinetic data consist of an x, y or z Cartesian vector component of hand coordinates in the 3D
space or joint angle at the elbow, hand and shoulder reference points. We analyzed the correlation
between the two descriptors during the movement hence the size of the correlation window matched
the time interval of the related task (slow tasks 800ms, fast tasks 500ms). Pearsons linear correlation
was chosen as we were interested in the correlation strength between two row vectors. The first row
vector contained EEG band power values which were gained from FFT at analyzed time lags, the
second one contained the kinetic data (see Figure 2). The corrcoef() Matlab function was used for
computation of correlation coefficients (R) and p values from Students t-test. The most significant R
values were used for further analysis (p<0.01). As we were only interested in the correlation strength
between the two descriptors, the correlation sign was overlooked.
We also aimed at detecting the time shift between EEG and kinetic record wherein correlation is
maximal. Therefore, we repeated the above calculations for different time shifts between EEG and
kinetic patterns. Figure 3 summarizes the distribution of correlation level between the EEG signal and
related Hand(z) trajectory, where the vertical axis represents the different EEG frequencies and the
horizontal axis represents the time shifts between EEG signal and kinetic record. (e.g., the -200ms
column contains correlation intensity for an EEG compared with kinetic data registered 200ms earlier
which is an already executed motion thus the +200ms column compares EEG and planned motion).
Colour code indicates strength of the correlation where red colour denotes higher correlation levels.
Figure 3: Correlation level distribution between EEG patterns and Hand(z) trajectories for different EEG
frequencies and time shifts between EEG signal and kinetic data. Colour code is an indicator of correlation level.
We have found the highest level of correlation around 2Hz in the low EEG frequency range.
Additionally, the 28-36Hz band also shows high correlation level although it is not as pronounced as
that in the 2Hz range. The distribution of correlation levels was similar for different kinetic
components (hand x, y, z and joint angle at the elbow). Analysis of different velocities (slow and fast)
and movement directions shows some diversity in the correlation level although the variance was not
significant. The correlation was maximal when EEG signals were compared with parallel registered
kinetic data whose results are illustrated at 0ms column in Figure 3.
Figure 4A: Correlation between power pattern of EEG bands and Hand(z) trajectory of subject 1-5 (slow tasks)
Article ID 095-3
DOI:10.3217/978-3-85125-378-8-95
Figure 4B: Comparison of the correlation levels of EEG patterns and kinetic components (0.5-4Hz, slow tasks)
Figure 4A and Figure 4B illustrates distribution of correlation levels in case of parallel recorded
EEG signals and kinetic data (time shift = 0) for different cortical areas. Dark blue (background)
colour indicates areas where we were unable to collect good quality synchronized EEG and kinetic
data in our pilot study for valid statistics. Figure 4A compares results from different subjects and
Figure 4B shows the difference when x, y or z vector component of hand trajectory or joint angle at
the elbow has been used as kinetic data in correlation. The similarity between Hand (z) and joint angle
at the elbow is the highest, compared to x or y. The result indicate the EEG associated with the z
direction vector component of the hand movement is more correlated with the joint angle than x or y.
The increased frontal and occipital correlation in case of 30-40Hz gamma band [5] highlight
importance of these cortical areas in motion trajectory prediction. Our results could not unequivocally
support this theory because there are insufficient data gained form some important cortical areas.
5 Conclusion
We have used a correlation analysis to examine the importance of different EEG frequencies,
cortical areas and time shift between EEG signals and kinetic data in motion trajectory prediction. We
identified strong correlation between EEG and kinetic data in the delta (0.5-4Hz) EEG range which is
consistent with repetitive finger movement study from Paek et al. [4]. Furthermore, the correlation in
28-36Hz band was also more pronounced at pre-movement and at movement onset than in the mu,
alpha and beta bands. Gamma band activity or 40 Hz EEG is strongly related to planning of a specific
movement [6]. We found that the correlation between EEG and kinetic data is maximal when both are
registered in parallel i.e., no lag between EEG and Kinetic data.
References
[1]
[2]
[3]
[4]
[5]
[6]
Article ID 095-4
DOI:10.3217/978-3-85125-378-8-96
Introduction
Previous work has investigated active BCIs [6] based on motor imagery in lab studies (see [1]
for example). As BCIs are intended to be applied in real world scenarios the question arises
whether the methodology developed in recent years is also reliably working in more complex
and less controlled environments. From a human factors perspective it is of interest whether
users can use a BCI while interacting with a technical system in a realistic setting [5]. In
the precursor experiment we are presenting here we addressed these questions by investigating
motor imagery-based BCI control in a flight simulator. Experienced pilots controlled a plane
in the horizontal axis while performing operational flight tasks.
Setup
Six pilots with dierent levels of experience were invited to participate in our experiment. In
each run the participants occupied the left seat in a flight simulator. The simulated outside
world view was projected on a cylindrical 180 screen round the fixed-base cockpit. The instruments provided to the subjects comprised classical (backup) instruments (airspeed indicator,
attitude indicator, altimeter and magnetic compass) as well as a research display, which showed
attitude, airspeed, altitude, vertical speed, flight path angle, heading, turn rate, a brain signal
feedback with a delay of 1 second and task specific elements. Two types of flight control were
investigated. Control type A enabled the pilot to change the rate of turn directly (direct mapping). Control type B provided a turn rate only if the BCI input exceeded a threshold, and
automatically returned to straight flight otherwise.
Article ID 096-1
3
3.1
DOI:10.3217/978-3-85125-378-8-96
Experiment
Calibration
Before online flight control each participant generated calibration data. Based on data from
these trials a BCI classifier was trained for each participant individually. Each trial had a
sequence of fixation, command and relax states. This sequence was communicated by objects
displayed in the center of the screen. Fixation was indicated by a cross displayed for 1 second.
It was then replaced by a letter randomly drawn from the set {L,R,F}, which was displayed
for 4 seconds. Participants were advised to consistently imagine a specific movement with their
left hand (L), their right hand (R) or their foot (F), respectively. Each calibration session had
120 trials (40 trials per class) in total, subdivided by breaks of 60 seconds after 40 trials.
3.2
Application
The resulting BCI was then applied to lateral control in simulator scenarios. Altitude and
throttle were controlled automatically by the simulator. As all pilots were familiar with the
controls and displays of the simulator, tasks were communicated through the displays. Either a
marker on the Horizontal Situation Indicator (HSI), indicating a specific angle to take (heading
bug), or the alignment of the main axis of the plane (lubber line) with a course select pointer
on the HSI (localizer) was used for this purpose. Participants were advised about the tasks by
the experimenter before the application experiment began. The application had three stages.
Participants flew all stages with both controls (as described in section 2). The order of control
types was permuted over participants.
Turns. The task was to follow steps of the heading bug on the HSI, i.e. to acquire and
hold a given heading. The pilot was advised to always choose the shortest turn to acquire the
next heading and, if possible, to turn with standard rate of turn (3 /s). The sequence of steps
has been selected to be random appearing with an equal number of left and right turns.
Tracking. In the tracking task, pilots were instructed to follow the heading bug on the
HSI. This time, however, the heading bug oscillates about the initial heading. The forcing
function of this tracking task was composed of 10 dierent sine waves, so that it was randomly
appearing.
Approach with oset localizer and visual landing. The task was to first intercept the
oset localizer, and then track it [3]. As there were no outside visual references in this stage, it
was not apparent to the pilot that the localizer was oset. When the aircraft descended below
500ft above ground level, the runway became visible. The pilot now ignored his navigation
instruments and continued the approach only by outside visual references. Since the first part
of the approach was oset, the pilot was forced to conduct a sidestep maneuver. After that, he
tracked the runway centerline. The simulation ended just before touchdown.
Analyses
BCILAB [2] was used to calibrate classifiers for all pairwise comparisons of classes. Features
were extracted with standard parameters for Common Spatial Patterns [4] on time windows
between 1 and 3 seconds after task onset. The classifier with best cross-validation estimates
was then used in online application. Flight performance was tracked by diagrams indicating
the deviation of the optimal target heading.
Article ID 096-2
DOI:10.3217/978-3-85125-378-8-96
Figure 1: Examplary flight diagram for control type A (left) and control type B (right) during
turns. The blue line indicates the optimal target heading, the green line the actual flown target
heading. X-Axis is in seconds, Y axis is heading in degree.
Figure 2: Common Spatial Patterns selected for Participant 6. Discriminating between classes
left hand (upper row) and foot (lower row). Pattern 1 focuses on the right motor cortex,
while pattern 2 aims at the left motor cortex inversely weighted and pattern clearly weights the
central motor cortex.
Results
Table 1 shows cross-validation estimates of the chosen classifiers (left column), while figure 2
shows the chosen CSPs for the best performing participant (no. 6). Online flight performance is
shown exemplarily for participant no. 6 in the turns condition for control type A and control
type B (figure 1). Participants were asked whether they felt having control. Participants with
high classification accuracies stated to have control while participants with low accuracies did
not (see table 1, right column).
Article ID 096-3
Participant
Participant
Participant
Participant
Participant
Participant
No.
No.
No.
No.
No.
No.
1
2
3
4
5
6
CV
acc.
98
58
51
64
89
95
DOI:10.3217/978-3-85125-378-8-96
Participant reported
having control?
yes
no
no
no
yes
yes
Table 1: Best classification accuracy from pairwise comparisons of the classes via cross validation.
The results of this precursor study indicate that a one-dimensional control via active BCIs is
applicable in real world applications. Interestingly the group of participants was divided in two
subgroups. One group had almost perfect control while the other had no control at all. In
future studies we will investigate whether this is a stable eect and whether the participants
show a similar result in standardized lab-studies. As the aircrafts flight dynamics have an
impact on control as well, we will also investigate further improvements of the flight controller.
References
[1] Benjamin Blankertz, Ryota Tomioka, Steven Lemm, Motoaki Kawanabe, and K-R Muller. Optimizing spatial filters for robust eeg single-trial analysis. Signal Processing Magazine, IEEE, 25(1):4156,
2008.
[2] Christian A. Kothe and Scott Makeig. Bcilab: a platform for braincomputer interface development.
Journal of neural engineering, 10(5):056014, 2013.
[3] Herman Albert Mooij. Criteria for low-speed longitudinal handling qualities of transport aircraft
with closed-loop flight control systems. 1984.
[4] Herbert Ramoser, Johannes Muller-Gerking, and Gert Pfurtscheller. Optimal spatial filtering of
single trial eeg during imagined hand movement. Rehabilitation Engineering, IEEE Transactions
on, 8(4):441446, 2000.
[5] Thorsten O. Zander. Utilizing Brain-Computer Interfaces for Human-Machine Systems. PhD thesis,
Universit
atsbibliothek der Technischen Universit
at Berlin, 2011.
[6] Thorsten O Zander and Christian Kothe. Towards passive braincomputer interfaces: applying
braincomputer interface technology to humanmachine systems in general. Journal of Neural
Engineering, 8(2):025005, 2011.
Article ID 096-4
DOI:10.3217/978-3-85125-378-8-97
Introduction
Visual evoked response (VEP) is an event related potential that occurs involuntarily in response
to a visual stimulus. It has been shown that the defocusing of the retinal image has a greater
eect on the latency of this potential [8]. An object is defocused when it is located out of the
eyes Depth of Field (DOF), that is defined as the range of distance in which an object appears
sharp at the retinal image [6]. The focusing of a target is performed by an accommodation
mechanism that is achieved when a neural signal is sent to the ciliary muscle changing the
shape of the crystalline lens. It modifies the angle of refraction minimizing automatically
the blurriness in the retinal image [3]. Then, the performance of brain-computer interfaces
(BCI) based on steady-state visual evoked potentials (SSVEP) could be aected by this optical
phenomenon, specifically when the flickering stimuli go out of focus of the subjects eye. As the
amplitude of the VEP pattern can be reduced as the retinal image is increasingly defocused,
an arising hypothesis is: if a BCI user is gazing simultaneously two stimuli flickering with
dierent frequencies and located at dierent distances (enough to get only one into the subjects
DOF), the focused stimulus will be able to elicit distinguishable SSVEP pattern regardless the
non-focused stimulus is also present. Due to the user choose the target stimulus by shifting
the focus instead of gaze movements or attention, this work becomes an alternative method
for presenting SSVEP stimuli either to traditional SSVEP paradigm or covert attention based
SSVEP paradigms. Results showed that clear SSVEP pattern can be elicited when the subjects
are asked to focus either the nearest or the farthest stimulus.
Methods
Three healthy subject without any experience with BCI were considered in this work. The
experiments were undertaken with the understanding and written consent of them. This study
Article ID 097-1
Nearest
stimulus
Top view
DOI:10.3217/978-3-85125-378-8-97
Farthest
stimulus
Side view
2 cm
Eyes reference
28 cm
Zn
Zf
49 cm
Figure 1: (a) Schematic diagram of the experimental setup; (b) photography of the two LED
arrangements when nearest and farthest stimuli were focused on; and (c) experimental setup.
was approved by the research ethics committee of the Federal University of Espirito Santo
(Brazil). Two stimuli, emitted by two 5 7 LED arrangements flickering at 5.6 Hz and 6.4
Hz located at dierent distances were displayed simultaneously. As shown in Figure 1(a), the
nearest (6.4 Hz) and farthest (5.6 Hz) arrangements were placed at 28 and 47 cm far from
the subject, respectively. In this positions non-focused objects appear considerably blurred [9].
Figure 1(b) shows the simulated appearance of the two stimuli in the subjects field of view
when either the nearest or the farthest is focused. Two 10-trial experiments were performed in
which the subjects were asked to focus on the nearest stimulus in the first one and focus on the
farthest stimulus in the second one. During each 22-seconds trial, the subjects seated in front of
a experimental box (see Figure 1(c)) and were asked to focus on the target stimulus for 17 s, then
was asked to close the eyes for 5 s. The electroencephalographic (EEG) signals were recorded
between 5 s and 17 s. EEG signals from 19 electrodes positioned according to the international
10-20 system were acquired at a sampling rate of 200 Hz using the BNT36 device with a cap of
integrated wet electrodes. The grounding electrode was positioned on the user forehead and biauricular reference was adopted. Signals from electrodes O1 and O2 filtered using an (4 - 50 Hz)
elliptic band-pass were used to verify the SSVEP patterns. Other channels were used to perform
common average reference spatial filtering. The traditional Power Spectral Density Analysis
(PSDA) method and the Canonical Correlation Analysis (CCA) method [2] were employed to
compute the classification accuracy. Due to the gaze focus may occur without intention, to
evaluate the robustness of the assessment and the probability of false positive errors, accuracy,
Kappa coefficient, sensitivity, specificity and information transfer rate (ITR) were computed.
Results
Figure 2 shows the normalized power spectrum of the SSVEP pattern at electrode O2 of the
subject 1 computed using the Fourier transformation. To emphasize the elicited SSVEP peaks,
oset component was removed by subtracting the mean value of each frequency. Figures 2(a)
and 2(b) show the spectral responses of all trials (gray curves) together with the average (black
curve) computed over the 12 s of the EEG signals of the entire trial when the subject was
asked to focus on the 6.4 Hz and 5.6 Hz stimulus, respectively. In both cases SSVEP peaks
were elicited at stimulus frequency, at the second (12.8 Hz and 11.2 Hz) and at third harmonic
frequencies (19.2 Hz and 16.8 Hz). The length of the analysis window is an important aspect
to be considered when assuming that the background noise is a random aditive signal and
the potential is deterministic. Figures 2(c) and 2(d) show the normalized amplitude spectra
Article ID 097-2
DOI:10.3217/978-3-85125-378-8-97
Figure 2: (a) and (b) Normalized amplitude spectra corresponding to 10 trials (gray curves) together with the average (black curves). (c) and (d) Normalized amplitude spectra corresponding
to dierent data lengths for one subject.
computed using dierent time intervals (1, 2 , 4 and 8 s). The signal/noise ratio can be improved
by increasing data to estimate the spectra, since the energy of the deterministic signal increases
quadratically with the increasing of the signal window length. Table 1 shows the parameters
that were considered to evaluate the robustness of the assessment for three subjects. Values
in parentheses correspond to the PSDA method applied to peaks detection. Values without
parentheses correspond to the CCA method. In all cases, the performance of the CCA method
was better than PSDA method.
Table 1: Evaluation parameters when the PDSA method (in parentheses) and the CCA method
(without parentheses) were employed to detect SSVEP peaks.
Subject
Subject 1
Subject 2
Subject 3
Average
Sensititivy
0,78
0,90
0,78
0,82
(0,70)
(0,58)
(0,70)
(0,66)
Specificity
Accuracy
1,00
1,00
1,00
1,00
0,85
0,95
0,85
0,88
(0,68)
(0,87)
(0,68)
(0,75)
(0,69)
(0,73)
(0,69)
(0,70)
Kappa
0,69
0,89
0,69
0,76
(0,37)
(0,45)
(0,37)
(0,40)
ITR
3,37
6,96
3,37
4,57
(1,49)
(2,24)
(1,49)
(1,74)
Conclusion
The results here presented conclude that if two flickering stimuli are appropriately located,
then, the focused stimulus is able to elicit distinguishable SSVEP pattern, regardless the nonfocused stimulus also present in the users field of view. Based on the results showed in Figure
2 and Table 1, which are very promissory, a SSVEP-BCI based on DOF can be developed.
Although, the proposed SSVEP-BCI based on DOF uses overt orienting, it is independent of
Article ID 097-3
DOI:10.3217/978-3-85125-378-8-97
gaze movements, because the focusing of a target is performed by an accommodation mechanism that, like a pupil contraction, is an ocular reflex response. Hence, this work becomes
an alternative method for presenting SSVEP stimuli to the traditional SSVEP-BCI paradigm.
The existing traditional SSVEP-BCIs are becoming robust systems and achieving high transfer
rates [4]. However, many designs require reliable control of eye movements because the subject performs the selection by gazing directly at each stimulus location. On the other hand,
this work also becomes a complementary method to the attention based SSVEP paradigm.
SSVEP-BCIs named independent-BCIs, like BCIs based on spatial-visual selective attention [5]
or non-spatial selective attention [10] use selectively covert attention to make the selection of
the stimulus instead of muscular gaze shifting [1]. Nevertheless, covert orienting demands high
attention, and it is associated with difficult tasks that might not become automatic, even with
high levels of practice, and requires intention [7]; while overt orienting demands low attention
and is associated with easy and/or well-practiced tasks.
ACKNOWLEDGMENT
The authors would like to thank CAPES/Brazil and FAPES/CNPq (Process 53666038/2011)
for the financial support. Also, the authors would like to acknowledge the Ophthalmologist
Diogo Sperandio for their advices.
References
[1] Brendan Z. Allison, Dennis J. McFarland, Gerwin Schalk, Shi Dong Zheng, Melody Moore Jackson,
and Jonathan R. Wolpaw. Towards an independent brain-computer interface using steady state
visual evoked potentials. Clinical Neurophysiology, 119(2):399 408, 2008.
[2] Guangyu Bin, Xiaorong Gao, Zheng Yan, Bo Hong, and Shangkai Gao. An online multi-channel
ssvep-based braincomputer interface using a canonical correlation analysis method. Journal of
Neural Engineering, 6(4):046002, 2009.
[3] Sheldon M. Ebenholtz. Oculomotor Systems and Perception. Cambridge University Press, 1
edition, 2001.
[4] Christoph Guger, Brendan Allison, Bernhard Grosswindhager, Robert Pruckl, Christoph Hintermuller, Christoph Kapeller, Markus Bruckner, Gunther Krausz, and Gunter Edlinger. How many
people could use an SSVEP BCI? Frontiers in neuroscience, 6, 2012.
[5] S.P. Kelly, E.C. Lalor, C. Finucane, G. McDarby, and R.B. Reilly. Visual spatial attention control
in an independent brain-computer interface. Biomedical Engineering, IEEE Transactions on,
52(9):15881596, Sept 2005.
[6] Alex Paul Pentland. A new sense for depth of field. Pattern Analysis and Machine Intelligence,
IEEE Transactions on, PAMI-9(4):523531, July 1987.
[7] Michael I. Posner and Steven E. Petersen. The attention system of the human brain. Annual
Review of Neuroscience, 13(1):2542, 1990. PMID: 2183676.
[8] Samuel Sokol and Anne Moskowitz. Eect of retinal blur on the peak latency of the pattern evoked
potential. Vision Research, 21(8):1279 1286, 1981.
[9] Dhanraj Vishwanath and Erik Blaser. Retinal blur and the perception of egocentric distance.
Journal of Vision, 10(10), 2010.
[10] Dan Zhang, Xiaorong Gao, Shangkai Gao, A.K. Engel, and A. Maye. An independent braincomputer interface based on covert shifts of non-spatial visual attention. In EMBC 2009. Annual
International Conference of the IEEE, pages 539542, Sept 2009.
Article ID 097-4
DOI:10.3217/978-3-85125-378-8-98
Abstract
This work presents an alternative method to detect events correlated to eyes opening
and closing, based on electroencephalography (EEG) measured from the occipital lobe.
The goal is to propose a method based on variance and variability analysis of alpha wave
to classify segments of EEG signals that contain activities originated by eyes closing. The
peak values of alpha waves are determined to obtaining the variability series, which are
used to compute the power spectrum through Welch periodogram. The combination of
alpha wave variance and their variability, were used to obtain a coefficient that allows
discriminating between events of eyes opening and closing. This approach can be used to
control the switching of a brain computer interface (BCI).
1 Introduction
Several technologies have been developed to assist and improve the communication of people with
paralysis and severe motor disability. BCI is a communication system that does not depend on the brain's
normal output pathways of peripheral nerves and muscles. At UFES/Brazil we are developing a BCI
system for an autonomous car based on evoked visual stimulus (Castillo et al 2013), which can cause
visual fatigue. A good alternative is switching the BCI through a user command, that can be performed
by eyes closing. This way, electroencephalography signals (EEG) have been employed, which contain
information that allows the eyes closure detection. Eyes opening and eyes closing activities can be
sensed on the occipital lobe through alpha wave analysis in a frequency range of 8 Hz to 13 Hz. A high
energy of the alpha wave corresponds to closed eyes on awake subjects (in 90% of healthy and people
with disabilities) (Alaraj and Fukami 2013). Alpha waves have been applied to operate electrical
devices, however, the automatic recognition associated to eyes opened (EO) and closed (EC) is not a
trivial task, because the bandwidth of alpha wave is affected by natural variation and electrical noise,
and muscle artefacts. Several methods have been developed to automatic detection of alpha wave, such
as: analogue filtering and smoothing (AFS), peak detection and counting, power spectrum analysis,
fractal dimension, KM2O-Langevin and approximated entropy (Kirkup et al 1998, Craig et al 2005,
Sakai et al 2010, Alaraj and Fukami 2013). All aforementioned methods use a threshold value as a
reference that depends on each subject and experiment conditions. The aim of this work is to propose
one automatic method based on the variability information of alpha wave of EEG for recognition of
eyes closing events in awake subjects, in order to activate a BCI.
Article ID 098-1
DOI:10.3217/978-3-85125-378-8-98
EEG signal
windows=4s
Power spectrum
Pre-processing
Activity
estimator
Computation
T of peak
Adaptive
amplitude
threshold
output
Peak detection
Computation
V/t of peak
Figure 1: Representation of the proposed method to detect eyes opening or eyes closing.
The electroencephalographic (EEG) signals are captured and analyzed in sliding windows of 4s,
shifted for 1s. First, EEG signals are pre-processed by a Common Average Reference (CAR) filter and
a bandpass elliptic filter (bandwidth: 8-13 Hz, 5nd order), to reduce the common interference and select
the principal information of the alpha wave, respectively. Second, the variance and first derivative of
alpha wave are computed, and adaptive amplitude thresholds are calculated on the first derivative signal
to detect local peak values. The thresholds are computed on windows of 78 ms with overlapping 90%
using the equations (1) and (2). The local peak values higher than a threshold are taken as reference to
detect the peak values of the alpha wave, which are employed to obtain the temporal series of the
variability. After, these peak values are used to compute the transition velocity (V/t) and power
spectrum (by Welch periodogram, overlapping of 50%, and Blackman window) to determine the peak
and total power of the variability. Finally, the variance and variability information of the alpha wave
are employed to compute a coefficient to detect the events (eyes opening or eyes closing). This
coefficient can be calculated by equation (3).
(1)
(2)
,
where RMS is the root mean square of the window of 4 s, y is the first derivative of alpha wave, N is the
total samples, is the standard deviation of the first derivative, AT is the amplitude threshold, 0 < k <W1, and W is the total windows.
,
(3)
where Pmax and Ptotal are the peak and total value of power spectrum of the variability series, fmax is the
frequency that correspond to Pmax, 2 and Vt are the variance and transition velocity between peak values
(from minimum to maximum value) of the alpha wave, respectively.
Article ID 098-2
DOI:10.3217/978-3-85125-378-8-98
were acquired from Emotiv EPOC neuroheadset using fourteen electrodes on the scalp, placed according
to the international 10/20 system: AF3, AF4, F3, F4, F7, F8, FC5, FC6, P7, P8, T7, T8, O1, O2. However,
only the data from occipital alpha waves (8-13Hz) on O1 and O2 sites were used. The occipital locations
O1 and O2 were chosen for two reasons: (1) alpha activity is larger in the occipital region as it is directly
linked to the visual perception; and (2) there is less artefact in this region, such as ocular muscle activity,
compared to the frontal scalp regions. EEG channels were sampled with 128 Hz at 1.95V, the least
significant bit voltage resolution.
Eyes
opening
Eyes
closing
p-value
d effect
Transition
velocity (u.a/s)
161.07 73.42
114.81 40.38
257.62 64.08
109.94 50.17
127.37 38.56
172.23 51.75
133.76 41.35
60.65 17.59
536.56 171.53
453.54 145.30
539.61 190.13
364.95 90.74
186.69 64.81
269.64 106.80
276.07 78.28
114.24 37.43
0.0078
1.68
Variance
14.72 17.95
5.72 6.41
26.59 13.31
5.83 6.96
6.38 7.85
11.73 7.98
8.73 4.55
1.63 1.46
101.33 52.16
70.63 33.87
110.66 66.42
46.94 19.14
12.98 9.59
27.76 22.58
32.15 15.20
4.93 2.99
0.0078
1.43
Total power
(s2)
20.66 8.40
16.36 8.71
16.07 7.29
17.28 10.63
17.45 6.93
17.59 12.69
19.62 8.80
21.13 8.39
4.50 2.31
4.80 3.16
6.26 3.88
3.98 2.69
9.40 5.15
8.25 5.01
9.92 4.91
8.49 4.53
0.0078
5.25
Peak power
(PP) (s2)
305.64 117.77
237.05 110.6
258.40 116.4
261.05 121.84
265.23 108.82
259.76 128.14
302.89 128.07
316.54 141.63
64.23 30.72
71.42 45.04
88.48 51.88
60.52 41.72
129.27 60.59
117.60 75.87
141.21 68.00
116.40 62.74
0.0078
5.91
Frequency
of PP (Hz)
1.59 1.25
1.59 1.20
1.64 1.08
1.29 0.98
1.81 1.14
1.64 1.18
1.75 1.25
1.68 0.98
1.37 1.09
1.35 0.96
1.55 0.92
1.71 1.08
1.64 1.38
1.55 1.14
1.68 1.10
1.73 1.16
0.2343
0.34
Table 1: Performance of the variables (mean standard deviation) for the proposed method to detect eyes
opening and eyes closing.
Wilcoxon Signed-Ranks test for paired samples and effect size methods were employed to evaluate
the performance of the proposed method (Haidous et al 2013). For each event of eyes (eyes opening or
eyes closing), it can be observed that there are statistical significance (p-value<0.05) and high effect
size (d effect>0.8) in the variables that contain information of the variability (transition velocity,
variance, total power, peak power), as well as the variance. It is worth to connect that the last subject
presented a low difference in the variance. Figure 2 shows the coefficient computed throughout the
signal. In these figures it can be seen that artefacts present during segments of open eyes do not affect
the coefficient value.
Article ID 098-3
DOI:10.3217/978-3-85125-378-8-98
Figure 2: Representation of the coefficient computed to detect eyes opening or eyes closing.
4 Conclusion
The preliminary results suggest that the combination of the variance and information variability can
be used to detect events of eyes opening and eyes closing through EEG signals measured on the occipital
lobe. Several works have reported that power spectral analysis of alpha wave is not a good option to
detect events of eyes closing and eyes opening. Thus, this work presents the possibility of using the
power spectral technique to analyze the variability information of the alpha wave. As future works, the
proposed method should be improved to detect the peak values detection on the alpha wave. This
approach here proposed can be also used to control the switching of BCI systems.
References
Kirkup L, Searle A, Craig A, McIsaac P and Moses P. (1997) EEG based system for rapid on-of
switching without prior learning, Med. Biol. Eng. Comput., pp. 5049.
Kirkup L, Searle A, Craig A, McIsaac P and Larsen G. (1998). Three methods compared for
detecting the onset of alpha wave synchronization following eye closure, Physiol. Meas., vol. 19,
213224.
Craig A, Tran Y, Craig D and Thuraisingham R. (2005). Improving correct switching rates in a
hands free environmental control system, Journal of Neural Engineering, pp. L9-L13.
Sakai M, Wei D, Kong W, Dai G, and Hu H. (2010). Detection of change in alpha wave following
eyes closure based on KM2O-Langevin equation, International Journal of Bioelectromagnetism,
12(2), 8993.
Alaraj M and Fukami T (2013) Quantification of subject wakefulness state during routine EEG
examination, International Journal of Innovative Computing, Information and Control, 9 (8), 3211
3223.
Castillo J, Muller S, Caicedo E, Ferreira A and Bastos T (2014) Proposal of a Brain Computer
Interface to Command an Autonomous Car, In: 5th IEEE Biosignals and Biorobotics Conference
(BRC 2014), Salvador. Proc. of the 5th IEEE Biosignals and Biorobotics Conference, pp. 6.
Haidous, N (2013) Robustness and Power of the Kornbrot Rank Difference, Signed Ranks, and
Dependent Samples T-test. American Journal of Applied Mathematics and Statistics 1(5), 99102
Article ID 098-4
DOI:10.3217/978-3-85125-378-8-99
1 Introduction
The term Brain-Computer Interface or BCI has been in use since the 1970s (Vidal, 1973).
Numerous definitions of the term have been presented in the published literature and elsewhere. In
addition, many other terms have been used to describe similar systems, such as Direct Brain
Interface, Brain-Machine Interface, and Brain Interface. As BCI research gains attention in
academic, commercial, medical, and other sectors, it seems increasingly important that we agree what
a BCI is. A standardized definition is an obvious prerequisite for a mature field, and should not be
especially daunting for a field dominated by scientists, engineers, and clinicians.
Other BCI researchers have agreed that a standardized definition is important. At the Fourth
International BCI Meeting at the Asilomar Conference Grounds in Pacific Grove, California in 2010,
over 65% of survey respondents felt that a standard definition of a BCI is needed within two years,
and 79% felt that one is needed within five years (Nijboer et al., 2011). At the Fifth International BCI
Meeting in the same location in June 2013 (organized by the Program Organization Committee for the
Fifth International BCI Meeting), attendees unanimously voted to establish a BCI Society. This
Society has actively sought to explore a BCI definition through online discussions, workshops at the
BNCI Horizon 2020 retreat in Hallstadt, Austria in March 2014 (organized by the Graz University of
Technology through the BNCI Horizon 2020 project), and other mechanisms. EU-funded projects,
including two Coordination and Support actions called Future BNCI (2010-2011) and BNCI Horizon
2020 (2013-2015) have also been charged with developing, publicizing, and entrenching a standard
BCI definition. This paper presents some results from a survey we conducted at the Fifth International
BCI Meeting focused on two terminological issues the definitions of BCI and BCI Illiteracy.
Article ID 099-1
DOI:10.3217/978-3-85125-378-8-99
# of respondents
% of respondents
80
61.07
76
58.02
76
58.02
43
32.82
101
77.10
These results reflect strong disagreement over the critical features of a BCI. These results echo the
2010 survey, which also found that respondents had widely divergent views. Other efforts to
summarize different views of BCI components have also been presented (See Fig. 1). This figure was
developed through discussion among the team that proposed and later executed the Future BNCI
project, and reflects the efforts of several BCI experts to define critical features to foster discussion.
This figure is consistent with the newer results presented above. Most respondents felt that a BCI
must directly detect brain activity, but generally disagreed with the intentional component below.
Figure 1: The four components of a BCI, as defined in the proposal for the Future BNCI
project, submitted in 2009. The project was active from Jan 2010 to Dec 2012 and extensively
discussed the definition of a BCI and how to encourage an official definition within the field.
Article ID 099-2
DOI:10.3217/978-3-85125-378-8-99
3 BCI Illiteracy
One of the most consistent challenges in BCI research has been dubbed BCI Illiteracy (Kbler
and Mller, 2007). That is, a minority of users cannot use any particular BCI system, and may be as
high as 20% (among healthy users) with some BCI approaches (Allison and Neuper, 2010). This
problem has persisted since the earliest days of BCI research, a across a wide variety of different types
of sensors, mental activities and corresponding changes in brain activity, signal processing
parameters, and other BCI feature. This problem can be especially daunting for home use with less
sophisticated systems. Several publications have argued that reliability is one of the most serious
problems in the BCI community (e.g., Huggins et al., 2011, Wolpaw and Wolpaw, 2012; Allison et
al., 2013).
At many BCI conferences and other discussion forums, many people have expressed concern
about the term BCI Illiteracy, arguing that it is vague or implies that the problem should be blamed
on the user. We asked: Sometimes, a BCI system does not work for a particular user. Different terms
have been used in the literature to describe this problem. What term do you prefer? Of the 107
responses, 32 chose BCI Illiteracy, 33 chose BCI Inefficiency, 13 chose BCI apraxia and 29
chose Poor BCI Proficiency. These results indicate considerable disagreement over the best term to
describe this phenomenon. Many respondents commented on this question, emphasizing that (as
several people replied), It is not the users fault! We authors strongly agree and hope we can agree
on a term that is descriptive and avoids blaming the user.
Article ID 099-3
DOI:10.3217/978-3-85125-378-8-99
each example is a BCI. Such efforts might encourage more focused surveys, discussions in person and
online, and other efforts to find consensus. The nascent BCI Society has also been discussing the BCI
definition, and may consider actions such as forming a panel with an appropriate range of different
people to develop official terms and definitions for the Society. We hope to work with our colleagues
to reach consensus acceptable to most (if not all) of us in the BCI community.
References
Allison BZ, Neuper C. (2010). Could anyone use a BCI? In: Tan DS, Nijholt A, editors. Applying
our minds to humancomputer interaction. Springer, p. 3554.
Allison BZ, Dunne S, Leeb R, Millan J, and Nijholt A. (2013). Towards Practical BrainComputer Interfaces: Bridging the Gap from Research to Real-World Applications. Springer
Verlag, Berlin Heidelberg.
Huggins J, Wren P, Gruis K. (2011) What would brain-computer interface users want? Opinions
and priorities of potential users with amyotrophic lateral sclerosis. Amyotrophic Lateral Sclerosis
(0):18.
Kbler A, Mller K-R (2007). Toward brain-computer interfacing. In: An Introduction to BrainComputer Interfacing. MIT Press, Boston, pp 125.
Nijboer F, Clausen J, Allison BZ, and Haselager P. (2011). The Asilomar survey: researchers
opinions on ethical issues related to brain-computer interfacing. Neuroethics, 1-38.
Ramsey, N., Allison, B.Z., Donchin, E., Sellers, E., Huggins, J., Milln, J Del R., Guger, C., Gao,
S., and Wolpaw, J. (2014). The Emerging Brain-Computer Interface Society. Poster accepted for
presentation at the Sixth International BCI Conference, Graz, Austria, Sep. 2014.
Vidal JJ. (1973). Towards direct braincomputer communication. Annu Rev Biophys Bioeng
2:157180.
Wolpaw JR, Wolpaw EW. (2012). Brain Computer Interfaces Principles and Practice. Oxford:
Oxford University Press.
Article ID 099-4
DOI:10.3217/978-3-85125-378-8-100
brownke@etsu.edu, ryand1@golmail.etsu.edu,
sellers@etsu.edu
Abstract
BCI technology can provide a non-muscular method of communication. In the
P300-based BCI, users focus their attention on a specific item within an array of
flashing items. After the BCI selects an item, it is presented on a computer monitor as
feedback to the subject. In the BCI literature, only a few studies have examined
motivation. The current study examined the effect of an intrinsic motivation induction
on BCI performance in a non-disabled population. Participants were randomly sorted
into two groups. Before participating in the experiment, one group (i.e., Motivation)
was read a vignette describing the significance of BCI research for individuals with
amyotrophic lateral sclerosis (ALS); the other group (i.e., non-Motivation) was not. The
Motivation group performed at 89.7% accuracy in a copy-spelling task and the nonMotivation performed at 80.6% accuracy. Further, data showed that BCI use causes a
mild increase in sleepiness. Future studies should look into the role of motivation and
sleepiness on BCI performance within the ALS population.
Introduction
A brain-computer interface (BCI) using the P300 event-related potential offers an alternative
method of communication. Originally introduced by Farwell and Donchin (1988), users are shown a
matrix that randomly flashes alphanumeric characters. The participants task is to focus attention on
the item they wish to select. When the desired item flashes, a discriminable ERP is produced. Several
paradigm and stimulus presentation modifications have improved speed and accuracy of the P300 BCI
(e.g., Sellers, Arbel, and Donchin, 2013). However, few studies have examined psychological and
situational factors that may affect BCI use. The current study examines the construct of motivation.
Motivation as a psychological construct is commonly divided into intrinsic and extrinsic
categories. Intrinsic motivation implies that an individual pursues a goal for the enjoyment of doing
so, while extrinsically motivated individuals pursue a goal in order to receive some sort of outside
compensation for completing a task (Kruglanski et al, 1975; Gillet et al, 2011; Reiss; 2012).
Goldstein et al. (2006) used a monetary reward to show that P300 amplitude increase was
positively correlated with the amount of money that could be earned. Kleih et al (2010) reported
similar findings using a BCI task; P300 amplitude was higher for participants that reported higher a
higher level of motivation. Monetary rewards can be problematic because they provide an external
reward separate from simply succeeding at the task. Moreover, payment has been shown to decrease
intrinsic motivation (Anderson et al, 1976).
In a clinical population, extrinsic motivation may not be as relevant. It is reasonable to propose
that clinical use of the BCI in certain populations may be more intrinsic in nature. Kleih and Kluber
Article ID 100-1
DOI:10.3217/978-3-85125-378-8-100
(2013) examined the effects of intrinsic motivation using verbal and visual prompts to explain the
importance of BCI research for a clinical population. They found a significant effect of grouping,
participants in the motivation condition reported higher levels of motivation; however, the groups did
not differ in copy spelling accuracy.
In the current study, we focused on intrinsic motivation. It is likely that individuals who need or
desire assistive communication will be more intrinsically motivated. Thus, we focused on task-based
motivation, operationally defined as willingness to participate in a task (Appel & Gilabert, 2002). In
order to manipulate task-based motivation, we designed an information vignette in order to elicit a
reaction from the participants that may increase their intrinsic desire to perform well. This is novel in
that it does not provide extrinsic motivation or mention a performance standard; rather, the
manipulation is in the form of a personal choice. Our hypothesis is that copy-spelling performance
will be higher in the Motivation group (Mot) than in the non-Motivational (nMot) vignette group.
Further, we hypothesize that those in the Mot group would have higher P300 amplitude than those in
the nMot group.
Methods
Thirty-six non-disabled students (11 male, 25
female) from East Tennessee State University
participated in the study. Students were enrolled in
psychology courses and received extra course credit
for participating. The study was approved by the East
Tennessee State University Institutional Review
Board.
EEG was recorded with a 32-channel tin
electrode cap (Electro-Cap International, Inc.). All
Figure 1. Pre and Post Stanford Sleepiness
channels were referenced to the right mastoid and
Scale means by group (range 1 7). Significant
grounded to the left mastoid. Impedance was reduced
differences were observed between pre and post
to below 10.0 kOhm before recording. Two Guger
sleepiness scores. The Motivation group
Technologies g.USBamps were used to record EEG
reported being less sleepy before the task than
data, which were digitized at 256 Hz, and bandpass
the Non-motivation group. Error bars represent
filtered from 0.5 to 30 Hz. Stepwise linear
Std. Error of teh Mean.
discriminant analysis was used to classify ERP
responses. Only electrodes Fz, Cz, P3, Pz, P4, PO7, PO8, and Oz were used for online BCI operation
(Krusienski et al., 2006).
Upon arrival, students were randomly assigned to one of two categories: Mot (n=18) or nMot
(n=18). The Mot participants were told that it was essential that their full attention and effort could be
applied to the task. They were given the option of leaving without penalty (i.e., they would be
awarded full extra credit if they felt as though they could not perform at top capacity). It was reasoned
that those who agreed to participate even after being informed that they could leave without penalty
would be more motivated to perform the task than those performing the task without prompting. The
Mot participants were then read a vignette describing the importance of BCI research for severely
disabled people who may eventually depend on BCI technology. The nMot group was not given the
opportunity to leave upon arrival nor where they read the vignette.
Prior to completing a copy-spelling BCI task, all participants completed the Online Motivation
Questionnaire (OMQ) (Boekaerts, 2002) and the Stanford Sleepiness Scale (SSS; Hoddes et al, 1973).
The OMQ measures task-based motivation in a learning environment. The BCI task was novel to all
of the participants; thus, we assumed it would be a good measure of motivation. The SSS was selected
Article ID 100-2
DOI:10.3217/978-3-85125-378-8-100
because BCI use may increase sleepiness, and there may be an interaction between motivation and
sleepiness. After the participants completed the OMQ and the SSS, they were provided experimental
instructions and completed the BCI task. Each participant copy spelled three randomly selected sixletter words. These data were used for calibration (SWLDA was used to derive the classification
coefficients). After calibration, they copy spelled four randomly selected six-letter words and were
provided with online feedback as to the accuracy of the BCI classification. When the BCI task was
complete, each participant completed the SSS for a second time (i.e., post task).
Results
A 2x2 mixed model ANOVA (group (Mot vs. nMot) x SSS (pre vs. post score)) was used to
examine whether sleepiness scores varied across group or time. Only the main effect of SSS score was
significant (F(1, 33)= 7.25, p=.004; see Figure 1), indicating that both groups reported being more
sleepy after the BCI
session.
However,
mean scores were
generally high; mean
ratings were awake
but relaxed after the
session. Independent
samples t-tests were
used
to
examine
motivation
and
accuracy. Motivation
was
statistically
Figure 2. (A) P300 waveforms for the Motivation (red) and Non-Motivation
similar for the nMot
(blue) conditions. The motivation condition resulted in higher early positivity and
and Mot groups (t<1,
late negativity in the motivation condition. (B) Topography for the P300 for
ns.) Accuracy was also
Motivation and Non-motivation conditions. The higher amplitude in the
motivation waveform is observed for the positive and negative peaks. The data
statistically
similar
are averaged over the positive window of 145 - 182 ms (Figure 2A red box) and
across the nMot and
the negative window of 359 - 547 (Figure 2A blue box).
Mot groups (t(27;
adjusted)=1.84,
p=.072); however, accuracy in the Mot condition (Mean=89.7, SD=11.2) was higher than the nMot
condition (Mean=80.6 SD=17.2).
The waveforms for the Mot condition showed higher early positivity and later negativity than the
nMot condition. Although not statistically significantly different, the differences were in the
hypothesized direction. Figure 2A shows Pz waveforms for the Motivation and non-Motivation
groups. Figure 1B shows topographies that correspond to the shades boxes in panel 2A.
Discussion
This study examined the effect of an intrinsically based motivation induction on BCI performance
and waveform morphology. The study revealed several noteworthy findings. Most importantly, the
group that was exposed to the motivation induction was 89.7% accurate, whereas the group that was
not exposed to the motivation induction was only 80.6% accurate. Although the difference in accuracy
was not statistically significant, we suggest that the current findings have high practical significance.
For example, based on a Monte Carlo simulation conducted by Sellers et. al. (2006), to produce an
Article ID 100-3
DOI:10.3217/978-3-85125-378-8-100
error corrected message containing 100 characters would require 130 selections at 90% accuracy and
170 selections at 80% accuracy. To put this finding into context, participants in the motivation
condition would complete their message 12.45 minutes more quickly than participants in the nonmotivation condition would complete their message. This is in spite of the fact that the motivation
questionnaire did not reveal statistical significance. A possible explanation for this finding is that our
motivation induction was not effective. However, this is unlikely because the ratings for both groups
were near ceiling. Thus, the most likely explanation of this finding is that the questionnaire was not
sensitive enough to detect existing differences in motivation. The waveform data support the ceiling
effect explanation; higher early positivity and higher late negativity was observed for the motivation
group.
The results also suggest that participants are sleepier after BCI use. Despite being sleepier, preexperiment mean ratings were reported as: Functioning at high levels, but not at peak; able to
concentrate, and mean post-experiment ratings were: awake but relaxed; responsive but not fully
alert. It is recommended that future studies examine the effects of motivation and sleepiness in an
ALS population. Presumably, people with ALS would have more intrinsic motivation to perform well.
At the same time, people with ALS may become tired more quickly than non-ALS participants, which
could cause frustration when sleepiness begins to affect accuracy.
References
Anderson, R., Manoogian, S.T., & Reznick, J. (1976). The undermining and enhancing of intrinisic
motivation in preschool children. J Pers Soc Psychol, 34(5), 915-922.
Appel, C., & Gilabert, R. (2002). Motivation and task performance in a task-based web-based tandem
project. Recall: Journal Of Eurocall, 14(1), 16-31.
Boekaerts, M. (2002). The on-line motivation questionnaire: A self-report instrument to assess
students context sensitivity. In P. R. Pintrich & M. L. Maehr (Eds.) Advances in Motivation and
Achievement, Volume 12: New Directions in Measures and Methods. (pp. 77-120). New York: JAI /
Elsevier Science.
Farwell, L. A., & Donchin, E. (1988). Talking off the top of your head: Toward a mental prosthesis
utilizing event-related brain potentials. Electroen Clin Neuro, 70(6), 510-523.
Gillet, N., Vallerand, R. J., & Lafrenire, M. K. (2012). Intrinsic and extrinsic school motivation as a
function of age: The mediating role of autonomy support. Social Psychology Of Education, 15(1),
77-95.
Goldstein, R. Z., Cottone, L. A., Jia, Z., Maloney, T., Volkow, N. D., & Squires, N. K. (2006). The
effect of graded monetary reward on cognitive event-related potentials and behavior in young
healthy adults. Int J Psychophysiol, 62(2), 272-279.
Hoddes, E. (1973). Quantification of sleepiness: A new approach. Psychophysiology, 10(4), 431-436.
Kleih SC and Kbler A (2013) Empathy, motivation, and P300-BCI performance. Front. Hum.
Neurosci. 7:642.
Kleih, S. C., Nijboer, F. F., Halder, S. S., & Kbler, A. A. (2010). Motivation modulates the P300
amplitude during braincomputer interface use. Clinical Neurophysiol, 121(7), 1023-1031.
Kruglanski, A. W., Riter, A., Arazi, D., Agassi, R., Montegio, J., Peri, I., & Peretz, M. (1975). Effect
of task-intrinsic rewards upon extrinsic and intrinsic motivation. J Pers Soc Psychol, 31(4), 699.
Reiss, S. (2012). Intrinsic and extrinsic motivation. Teaching of Psychology,39(2), 152-156.
Sellers, E.W., Arbel, Y., Donchin, E. (2012). P300 event-related potentials and related activity in the
EEG. In J.R. Wolpaw and E.W. Winter Wolpaw (Eds.). Oxford University Press Inc. NY. (pp. 215226).
Article ID 100-4
DOI:10.3217/978-3-85125-378-8-101
Introduction
Noninvasive brain-computer interfaces (BCIs) use electroencephalogram (EEG) to provide nonmuscular communication. A paucity of BCI research has considered how psychosocial factors affect
task performance. Recent research suggests a need for studies that focus on specific qualities of BCI
users, such as motivation and depression [Kleih et al., 2010; Nijboer, et al., 2010]. Examination of
these factors may increase BCI performance by learning more about individual users, treating them as
active participants, and addressing their specific needs. In the current studies, participants completed
two tasks: copy- and free-spelling. After each task their motivation to perform the task was assessed
with a visual analogue scale and one of two Likert scale measures. It is hypothesized that free spelling
will lead to higher motivation and accuracy. This hypothesis is driven by the finding that autonomy in
a task (such as free-spelling) is related to an increase in intrinsic motivation (Ryan & Deci, 2000).
Learning about how and why a particular person may be motivated or unmotivated can help determine
what tasks may lead to higher BCI performance by fully engaging participants in the task.
Experiment 1
Participants. Non-disabled participants (n=21) were recruited from the East Tennessee State
University (ETSU) subject pool. All participants provided informed consent and the study was
approved by the ETSU Institutional Review Board.
Article ID 101-1
DOI:10.3217/978-3-85125-378-8-101
Materials. Guger Technologies g.USBamps were used to record EEG data, which were digitized
at 256 Hz, and bandpass filtered from 0.5 to 30 Hz. Stimulus presentation, EEG data collection, and
online processing were controlled via BCI2000 [Schalk, et al., 2004]. Stepwise-linear discriminant
analysis (SWLDA) was used for classification. Eight electrode locations were used: Fz, Cz, Pz, Oz,
PO7, PO8, P3, P4.
Methods. Participants were instructed to attend to a target item, by mentally saying or counting
the item, when it flashed. Stimuli were presented in an 8x9 (72-item) matrix. Stimulus duration was
62.5ms with an SOA of 125ms (ISI=62.5ms). Copy- and free-spelling tasks were counter balanced for
each subject. Before the copy- and free-spelling tasks, each participant copy spelled 18-characters
without feedback. These data were then used for calibration. The resulting classifiers were used for
online copy- and free-spelling tasks. The words used for calibration and in the online copy-spelling
task were randomly selected from a database of 6,000 words.
The copy-spelling task consisted of three
six-letter words presented as one string with a
space in between each word (20 total
selections). In the free-spelling task,
participants constructed their own sentence.
The sentence ranged from 20-24 characters.
Before the free spelling task, each participant
wrote his or her sentence on a sheet of paper
(to calculate accuracy). They were also
instructed to correct selection mistakes using
the BACKSPACE character (denoted Bs). If
the participant had not completed the sentence Figure 1: Average performance accuracy for the copy-,
copy and
before the limit of 24 character selections the free-spelling tasks, post 1, and post 2 motivation scores
task was terminated by the experimenter in organized by task order.
order to keep the number of character selections on the free-spelling task between 20 and 24
characters in length.
In addition to the BCI tasks, participants also completed two surveys: the Stanford Sleepiness
Scale (SSS), which was used to measure fatigue, [Hoddes et al., 1973] and the On-Line Motivation
Questionnaire (OLMQ), which was used to measure motivation [Boekaerts, 2002]. The SSS and
OLMQ were administered three times throughout the session. The OLMQ consists of pre- and posttask components. The pre-task OLMQ was completed before the first BCI task, the post OLMQ was
completed after each of the two BCI tasks. The SSS was completed before the first BCI task and after
each of the two BCI tasks.
Results
Performance accuracy for the copy-spelling condition was calculated by dividing the number of
correct selections by the number of total selections. Performance accuracy for the free-spelling
condition was calculated by comparing the message completed through the BCI to the intended
message. Mean accuracy in the copy- and free-spelling conditions did not show a significant
difference (91.19% and 88.71%, (t<1)). No significant differences in OLMQ scores were observed at
any of the three time points (pre, post 1, and post 2). Similarly, SSS scores were not significantly
different across the three time points. For BCI accuracy there was an order effect. The participants
who completed free-spelling first had significantly higher accuracy in copy-spelling task than the
participants in the copy-free order (F(1, 19)=5.617, p=.029). Figure 1 shows the relationship between
motivation and accuracy for the copy- and free-spelling tasks and post 1 and post 2 motivation.
Article ID 101-2
DOI:10.3217/978-3-85125-378-8-101
Experiment 2
Participants. Participants with ALS (n=6) were recruited based on participation in previous
studies. All participants provided informed consent and the study was approved by the ETSU
Institutional Review Board. ALSFRS-r scores were recorded for all participants indicating level of
functionality ranging from 0 (completely locked-in) to 48 (fully functional). ALSFRS-r scores were as
follows: 1, 5, 21, 30, 33, 33.
Materials. The materials were identical to Experiment 1 except that only one 16-channel amplifier
was used and the cap contained 16 instead of 32 electrodes.
Methods. Participants wore a 16-channel electrode EEG cap and attend to a screen that presented
a 6x6 item matrix. In contrast to Experiment 1, stimulus duration was 187.5ms and ISI was 62.5ms. In
general, we have used longer stimulus durations when testing people with ALS [Townsend et al.,
2010]. All other methods were the same as those used with the non-disabled participants.
Participants with ALS also completed two survey instruments: visual analogue scales were
used to measure motivation and mood and the Questionnaire for Current Motivation for BCI2000
(QCMBCI2000) was used to measure motivation [Kleih, Nijboer, Halder, & Kbler, 2010]. The
researchers found the QCMBCI2000 after the first experiment was competed and decided to use it as
a measure of motivation for the second experiment since it has been previously used to measure
motivation to perform a BCI task in people with ALS. The visual analogue scales for motivation and
mood and the QCMBCI2000 were given to participants with ALS twice; once at the beginning of the
session and once at the end of the session. At the conclusion of the session, participants were asked a
qualitative question describing their motivation for participating in the study (i.e., What was your
motivation for participating in this study?). This served as additional data to confirm participants
responses to the surveys. The majority of participants described their main motivation as helping
others to communicate.
Results
Mean accuracy in the copy- and freespelling conditions did not show a
significant difference (73.33% and
83.33%, respectively (t<1)). The visual
analog scale for motivation was not
statistically different at pretest (9.60) and
posttest (9.19; p=.52). Similarly, the
visual analog scale for mood did not
show a significant difference at pretest
(8.75) and posttest (8.47; p=..64).Figure 2
shows the relationship between accuracy
Figure 2: Average performance accuracy for the copy-, and freeand motivation organized by the order in
spelling tasks, pre, and post motivation scores organized by task
which the tasks were performed (i.e., copy
order.
first/free second or free first/copy second). No
significant differences were observed in the four subscales of the QCMBCI2000.
Discussion
This study examined the hypothesis that participants will be more motivated in a free-spelling
paradigm than in a copy-spelling paradigm, which would provide higher accuracy in a free-spelling
Article ID 101-3
DOI:10.3217/978-3-85125-378-8-101
Acknowledgements
This research is supported by NIH/NIDCD (R33 DC010470-03) and NIH/NIBIB & NINDS
(EB00856).
References
Boekaerts, M. (2002). The on-line motivation questionnaire: A self-report instrument to assess
students context sensitivity. In P. R. Pintrich & M. L. Maehr (Eds.) Advances in Motivation and
Achievement, Volume 12: New Directions in Measures and Methods. (pp. 77-120). Elsevier
Science New York.
Hoddes E, Zarcone V, Smythe H, Phillips R, Dement WC. Quantification of sleepiness: a new
approach. Psychophysiology,10: 431-6, 1973.
Kleih SC, Nijboer F, Halder S, Kubler A. Motivation modulates the P300 amplitude during braincomputer interface use. Clin Neurophysiol, 121: 1023-31, 2010.
Nijboer, F., Birbaumer, N., & Kbler, A. (2010). The influence of psychological state and
motivation on brain-computer interface performance in patients with amyotrophic lateral sclerosis
a longitudinal study. Frontiers in Neuroscience, 4(55), 1-13. doi:10.3389/fnins.2010.00055
Ryan, R. M., & Deci, E. L. (2000). Self-determination theory and the facilitation of intrinsic
motivation, social development, and well-being. American Psychologist, 55, 68-78.
doi:10.1037//0003-066X.55.1.68
Schalk, G., McFarland, D. J., Hinterberger, T., Birbaumer, N., & Wolpaw, J. R. (2004). BCI2000:
a general-purpose brain-computer interface (BCI) system. IEEE Trans Biomed Eng, 51, 10341043.
Townsend, G., LaPallo, B. K., Boulay, C. B., Krusienski, D. J., Frye, G. E., Hauser, C. K., Sellers,
E. W. (2010). A novel P300-based brain-computer interface stimulus presentation paradigm:
Moving beyond rows and columns. Clinical Neurophysiology, 121, 1109-1120.
doi:10.1016/j.clinph.2010.01.030
Article ID 101-4