Wang Et Al 2024
Wang Et Al 2024
Ya Wang1 a
, Peiman Alipour Sarvari2 b
and Djamel Khadraoui2 c
1 Faculty of Science, Technology and Medicine, University of Luxembourg, Esch sur Alzette, Luxembourg
2 IT for Innovative Services, Luxembourg University of Science and Technology, Esch sur Alzette, Luxembourg
Keywords: Wearable Fall Detection, Feature Extraction, Threshold, Machine Learning, Inertial Sensors.
Abstract: In the healthcare sector, specifically for elderly care, accurate and efficient fall detection is crucial. We present
an advanced fall detection methodology tailored for wearable systems. Our approach blends threshold-based
screening with machine learning models like Support Vector Machine, K-Nearest Neighbors, Decision Tree,
Random Forest, and XGBoost. Utilizing 65 features extracted from the gyroscope and accelerometer data from
Inertial Measurement Units, our method addresses the class imbalance often found between Activities of Daily
Living and actual fall events. Threshold-based pre-screening serves to mitigate the class imbalance of the fall
dataset, making the subsequent machine-learning classification more effective. Validation on two open-source
IMU datasets, Sisfall and FallAllD, achieving high accuracy rates of 99.55%, 99.68% (wrist), 99.76% (waist),
and 99.52% (neck), shows our model surpassing existing solutions in detection accuracy. Furthermore, our
strategic feature extraction not only enhances the model’s performance but also allows for a fourfold reduction
by using the 15 most important features in data transmission without sacrificing accuracy. These findings
underscore the efficiency and potential of our methodology, indicating that wearables can indeed be powerful
tools for high-precision fall detection with minimal data overhead.
573
Wang, Y., Sarvari, P. and Khadraoui, D.
Fusion of Machine Learning and Threshold-Based Approaches for Fall Detection in Healthcare Using Inertial Sensors.
DOI: 10.5220/0012250500003657
Paper published under CC license (CC BY-NC-ND 4.0)
In Proceedings of the 17th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2024) - Volume 1, pages 573-582
ISBN: 978-989-758-688-0; ISSN: 2184-4305
Proceedings Copyright © 2024 by SCITEPRESS – Science and Technology Publications, Lda.
BIOSIGNALS 2024 - 17th International Conference on Bio-inspired Systems and Signal Processing
574
Fusion of Machine Learning and Threshold-Based Approaches for Fall Detection in Healthcare Using Inertial Sensors
Low Pass
A x ,A y ,A z
Raw Data ? x,? y,? z filter 10Hz
575
BIOSIGNALS 2024 - 17th International Conference on Bio-inspired Systems and Signal Processing
i
tSMV max = arg max{SMVi (t) : SMVi (t) ∈ ith fall }
t
Data Split
(2)
2. ADL Events: To diversify the ADLs dataset, a Training data Testing data
576
Fusion of Machine Learning and Threshold-Based Approaches for Fall Detection in Healthcare Using Inertial Sensors
Within the predefined time window tW , if both changes in the forward and sideward attitudes of in-
SMVmax and SMAmax exceed their respective thresh- dividuals during a fall.
olds, the data is forwarded to the second tier; other- |Ax |
wise, it is disregarded. Roll = arccos q (8)
Ax 2 + Ay 2
3.3 Machine Learning-Based Approach The attitude angle change during the fall is defined
as:
The second tier capitalizes on machine learning algo-
NAng = Pitch + Roll (9)
rithms to meticulously classify the dataset that’s been
pre-screened by the initial tier. This dual-stage ap- The selected statistic features are analytically de-
proach amplifies the algorithm’s precision in detect- fined as follows:
ing falls, while also boosting computational speed.
Maximum (Peak) of Feature S. This feature rep-
3.3.1 Feature Extraction resents the peak or maximum value of a specific
data feature (S) during the fall window. It serves as
Before applying machine learning classifiers, we fo- a meaningful descriptor of the force of the impact
cus on feature extraction to accurately represent the against the ground. Mathematically, it can be defined
underlying data patterns. We compute a set of eight as:
statistical features that encapsulate information from
accelerometer and gyroscope readings. These fea- Smax = max{S(t) : t ∈ tW } (10)
tures include metrics such as angular velocity, Signal
Magnitude Vector (SMV), and Signal Magnitude Vec-
Minimum of Feature S. This feature indicates the
tor of Angular Velocity (SMA). For a detailed sum-
minimum value achieved by the data feature (S) dur-
mary, refer to Table 1. These statistical features are
ing the fall window. It is a key element in describing
universally accepted in the domains of Human Ac-
the fall.
tivity Recognition (HAR) and Fall Detection Systems
Smin = min{S(t) : t ∈ tW } (11)
(FDS) (Sucerquia et al., 2017; Giuffrida et al., 2019;
Martinez-Villaseñor and Ponce, 2020; Casilari and
Silva, 2022). Mean of Feature S. The mean provides informa-
We denote the human acclivity feature derived tion about the average body motion intensity during
from the raw data of the IMU sensor by S, which is the fall. It is computed as the average of the feature
defined as values over the observation window (tW ) containing
NW feature samples.
S = [Ax , Ay , Az , ωx , ωy , ωz , SMV, SMA, NAng ], (6)
1
µS = ∑S (12)
where Ax , Ay , Az represent the accelerometer read- NW t∈tW
ings along the X, Y, and Z axes, respectively, and
ωx , ωy , ωz represent the angular velocity readings where NW is defined as:
along the X, Y, and Z axes, respectively, SMV and T
SMA denote the Signal Magnitude Vector and Signal Nw = 2[ fs ] + 1 (13)
2
Magnitude Vector of Angular Velocity, respectively,
NAng represent the attitude change during the fall. Standard Deviation of Feature S. This feature de-
The attitude angle change during the fall NAng is scribes the variability of the feature (S) during the ob-
defined as follows: servation window. It is calculated as the square root
of the average squared deviation from the mean.
Pitch Angle. The pitch angle represents the forward s
angle of the sensor during a fall. 1
σS = ∑ (S − µS )2 S
NW t∈t
(14)
W
|Az |
Pitch = arccos q (7)
Az 2 + Ay 2 Skewness of Feature S. Skewness characterizes the
symmetry of the distribution of feature values. It in-
Roll Angle. The roll angle represents the sideward dicates whether the distribution is skewed to the left
angle of the sensor during a fall. By incorporat- or right.
1
ing these pitch and roll angles, we can capture the γS = 3 ∑ (S − µS )3
σS NW t∈t
(15)
W
577
BIOSIGNALS 2024 - 17th International Conference on Bio-inspired Systems and Signal Processing
Kurtosis of Feature S. Kurtosis measures the based on their ability to accurately distinguish be-
tailedness (frequency of outliers) of the distribution tween fall and non-fall events.
of feature values. Our dual-layer approach, incorporating these se-
(S − µS ) 4 lected features, is rigorously tested on the Sisfall and
Kurt S = E[( ) ] (16) FallAllD datasets. By applying the chosen statisti-
σS
cal features to both categories, we aim to critically
assess and validate the performance of our machine
Valley-to-Peak Range (Rs ). This feature represents learning-based fall detection model.
the value of the interval between the minimum (Sm in)
and maximum (Sm ax) of the feature S. It is calculated
as:
RS = Smax − Smin (17) 4 RESULTS AND DISCUSSION
Valley-to-Peak Time (Ts ). This feature indicates 4.1 Performance Evaluation Metrics
the duration of the interval between the minimum
Various methods have been developed to evaluate the
(Smin ) and maximum (Smax ) of the feature S.
performance of different classifiers. These methods
TS = tSmax − tSmin (18) rely on the outcomes obtained from the classifiers,
which are represented in the form of a confusion ma-
3.3.2 Model Implementation trix (Figure 10). The confusion matrix provides a vi-
sual representation of the classifier’s performance, in-
With the features extracted, we proceed to apply cluding true positives, true negatives, false positives,
machine learning models for classification. In this and false negatives.
study, we explore multiple classifiers including Sup- • True positive (TP): The ADL events have been
port Vector Machines (SVM), K-Nearest Neighbors correctly classified.
(KNN), Decision Trees, Random Forests, and XG-
Boost. These classifiers are trained and evaluated • True negative (TN): The fall events have been cor-
rectly detected.
578
Fusion of Machine Learning and Threshold-Based Approaches for Fall Detection in Healthcare Using Inertial Sensors
• False positive (FP): Fall events that have not been FallAllD
𝜇𝜔𝑧
detected.
𝑇𝑆𝑀𝑉
• False negative (FN): A false alarm situation oc- 𝜎𝑆𝑀𝑉
curs. 𝜎𝐴𝑧
𝜇𝜔𝑥
One commonly used method to assess classifier 𝛾𝑆𝑀𝑉
performance is accuracy, which calculates the pro- 𝑅𝑆𝑀𝑉
portion of correctly classified samples overall. How- 𝛾𝑁𝐴𝑛𝑔
ever, accuracy has certain limitations, such as being 𝑅𝑆𝑀𝐴
susceptible to the influence of large abnormal data 𝑆𝑀𝑉𝑚𝑎𝑥
𝜇𝑁𝐴𝑛𝑔
and potentially misleading results in class-imbalanced
𝜇𝜔𝑦
training data scenarios. To address these limitations,
𝐴𝑍 𝑚𝑎𝑥
alternative evaluation methods are selected to evalu-
𝑅𝐴𝑧
ate class-imbalanced classifiers effectively. For fall 𝐾𝑢𝑟𝑡𝑆𝑀𝑉
detection tasks with imbalanced classes, sensitivity,
specificity, F-score, and receiver operating character-
istic (ROC) are commonly utilized to assess the clas- (a) Shap values for FallAllD.
sifier’s ability to differentiate falls from a large num- SisFall
ber of activities of daily living (ADL) events. 𝑇𝑆𝑀𝐴
𝑅𝑆𝑀𝑉
TP+TN
Accurancy = (19) 𝑆𝑀𝐴𝑚𝑖𝑛
T P + T N + FP + FN 𝑇𝑆𝑀𝑉
Sensitivity measures the proportion of correctly 𝑅𝐴𝑧
identified positive samples. Specificity, on the other 𝑅𝐴𝑥
𝑁𝐴𝑛𝑔
hand, measures the proportion of correctly identified 𝑚𝑖𝑛
𝐾𝑢𝑟𝑡𝑆𝑀𝑉
negative samples. 𝑅𝜔𝑥
TP 𝐾𝑢𝑟𝑡𝐴𝑧
Sensitivity = (20) 𝐾𝑢𝑟𝑡𝑆𝑀𝐴
T P + FN
𝐴𝑧 𝑚𝑖𝑛
TN 𝐾𝑢𝑟𝑡𝜔𝑥
Speci f icity = (21)
T N + FP 𝜔𝑦
𝑚𝑎𝑥
The F1 is a robust evaluation metric that balances 𝛾𝑆𝑀𝑉
sensitivity and specificity. In large-scale datasets, sen-
sitivity and specificity often have a trade-off relation-
ship, and the F-score provides a comprehensive mea- (b) Shap values for SisFall.
sure of their discrimination. The parameter β in the Figure 5: The Shap values of the top 15 features of the
F-score equation allows adjusting the weight between dataset (a) FallAllD; (b) SisFall.
sensitivity and specificity. Setting β to 0.5 assigns a
higher weight to specificity, which is crucial in fall consistently ranked among the top ten most important
detection as it reflects the detection of all fall signals features when both datasets were integrated, empha-
in the data. sizing their critical role in accurately detecting falls
Sensitivity × Speci f icity and Activities of Daily Living (ADLs). These find-
F1 = (1 + β2 ) 2 (22) ings are corroborated by Table 2, which offers an
β × (Sensitivity + Speci f icity) exhaustive evaluation of different machine learning
In summary, sensitivity, specificity, and F-score classifiers based on the feature sets. In which, fea-
provide comprehensive measures for evaluating the ture (1) utilizes all 65 extracted features and features
performance of fall detection classifiers, particu- (2) focuses on the 15 most important features as deter-
larly in scenarios with imbalanced class distributions. mined by feature integration. The model’s robustness
These metrics address the limitations of accuracy and was evident from its high accuracy rates across vary-
offer a more nuanced assessment of the classifier’s ing types of falls and ADLs. The results show that
ability to distinguish falls from ADL events. the model is robust to the input of different types of
falls/ADLs and achieved superior performance. The
4.2 Results and Discussion FallAllD dataset is collated from 3 different locations
with limited data size, it has been shown that the po-
Our analyses underscored the varying importance of sition of the sensors also plays an important role in
features across the two datasets—Sisfall and Fal- the fall detection models. And we can also see that
lAllD. Notably, features like TSMV , TSMA , and RSMV
579
BIOSIGNALS 2024 - 17th International Conference on Bio-inspired Systems and Signal Processing
Table 2: Fall detection results for Sisfall and FallAllD: the Table 3: Comparison of results between the proposed and
unit is %, feature (1) utilizes all 65 extracted features, and previous research models, the unit is %.
feature (2) focuses on the 15 most important features. Algorithem Sensitivity Specificity Accuracy
Dataset Model Sensitivity Specificity Accuracy F1 Yu et al. ResNet10 97.91 72.89 96.22
SisFall1 SVM 99.47 99.35 99.45 98.22 (2020) SMOTE 99.17 89.98 97.54
KNN 98.06 98.91 98.96 94.27 Sisfall CDL-Fall 99.33 91.86 97.52
DT 98.88 99.38 99.28 98.45 Santoyo et al.
RF 99.44 98.75 99.54 99.03 (2022) CNN 85.97 96.79 NAN
XGB 99.62 98.81 99.55 99.14 FallAllD waist
SisFall2 SVM 98.67 99.03 98.62 96.43 Jeong et al.
KNN 97.61 98.63 98.05 86.44 (2023) LightGBM 91.04 96.38 94.86
DT 97.29 99.23 98.70 97.98 FallAllD wrist
RF 99.01 97.87 99.26 98.88 Proposed Fusion
99.62 98.81 99.55
XGB 98.79 98.33 99.16 98.92 Sisfall method
FallAllD1 SVM 99.36 1 99.84 99.68 Proposed Fusion
99.62 99.47 99.68
(wrist) KNN 98.71 99.58 99.37 98.71 FallAllD wrist method
DT 98.87 99.61 99.55 98.82 Proposed Fusion
99.34 98.75 99.76
FallAllD waist method
RF 98.89 99.88 98.95 99.24
Proposed Fusion
XGB 99.62 99.47 99.68 99.47 98.24 99.47 99.52
FallAllD neck method
FallAllD2 SVM 98.27 99.40 99.37 97.55
(Wrist) KNN 97.67 97.44 97.49 94.02 Note: In this comparison, while the same open-source datasets are
DT 98.88 99.38 99.18 98.9 used, the training and testing datasets for the fall detection model
RF 96.44 98.75 99.66 99.4 differ due to variations in data processing methods, like filter fre-
XGB 98.62 98.51 99.55 98.7 quency and sample window size. Hence, the results should be
FallAllD1 SVM 99.77 99.03 99.34 99.27 viewed as indicative rather than conclusive.
(waist) KNN 98.06 98.91 98.96 98.69
chine learning approach benchmarked against Sis-
DT 97.88 99.02 99.18 99.22
RF 99.34 98.75 99.76 99.07
fall and FallAllD datasets. Employing a suite of
XGB 99.32 98.31 99.35 99.23 65 rigorously selected statistical features (as shown
FallAllD2 SVM 98.63 97.51 96.74 98.58 in table1) extracted from inertial sensors, the study
(waist) KNN 96.85 95.81 97.39 97.41 leveraged tree-based ensemble models to achieve un-
DT 96.86 99.84 97.68 97.88 precedented accuracy rates: 99.55%, 99.68% (wrist),
RF 98.62 98.80 99.44 98.47 99.76% (waist), and 99.52% (neck) across the exam-
XGB 97.82 98.82 98.84 98.64
ined datasets. This level of performance substantially
FallAllD1 SVM 96.53 98.94 99.15 91.25
(neck) KNN 89.06 98.91 95.96 87.10
outperforms existing benchmarks documented in the
DT 98.28 99.38 99.12 98.32 scholarly literature.
RF 98.24 99.47 99.52 99.40 SHAP value analysis was instrumental in distill-
XGB 98.88 99.46 99.47 99.51 ing the feature set down to the top 15 most influen-
FallAllD2 SVM 95.81 99.03 99.35 89.55 tial features. Comparative analysis indicated that the
(neck) KNN 79.66 91.34 86.90 86.79 reduced feature set incurred a statistically insignifi-
DT 97.48 99.38 98.18 97.44
cant diminution in performance metrics—less than a
RF 97.44 98.75 98.66 97.96
XGB 98.42 98.51 98.55 98.73
1% deviation relative to the exhaustive feature set.
The hybrid model architecture, ingeniously combin-
the tree-based models show superior and robust per- ing threshold-based and machine learning algorithms,
formance in different datasets. facilitates minimal data transference from the wear-
In Table 3, we present a comparative analysis able device to the computational node while sustain-
of fall detection capabilities between our proposed ing high fidelity in fall detection outcomes.
model and existing models, all evaluated using the While the current study’s accomplishments are
same dataset. Our model’s robustness and enhanced manifold, it is imperative to acknowledge its limita-
performance are evident; it consistently identifies a tions. The absence of real fall data in the utilized
range of falls and Activities of Daily Living (ADLs) datasets denotes an opportunity for future work to fur-
with remarkable accuracy. ther validate the model’s performance under invalid
conditions. In light of the latter, future research en-
deavors will be directed toward the integration of this
5 CONCLUSIONS validated model architecture into wearable technol-
ogy platforms, emphasizing the necessity of feature
In this study, we proposed a wearable fall detec- selection optimization for real-time fall detection.
tion model that combines the threshold and ma-
580
Fusion of Machine Learning and Threshold-Based Approaches for Fall Detection in Healthcare Using Inertial Sensors
581
BIOSIGNALS 2024 - 17th International Conference on Bio-inspired Systems and Signal Processing
APPENDIX
𝑺𝑴𝑽𝒎𝒂𝒙 of SisFall dataset ADLs FALLs 𝑺𝑴𝑨𝒎𝒂𝒙 of SisFall dataset ADLs FALLs
(a) Boxplot of SMVmax for SisFall dataset (b) Boxplot of SMAmax for SisFall dataset
(c) Boxplot of SMVmax for FallAllD Waist dataset (d) Boxplot of SMAmax for FallAllD Waist dataset
𝑺𝑴𝑽𝒎𝒂𝒙 of FallAllD Neck dataset ADLs FALLs
(e) Boxplot of SMVmax for FallAllD Neck dataset (f) Boxplot of SMAmax for FallAllD Neck dataset
𝑺𝑴𝑽𝒎𝒂𝒙 of FallAllD Wrist dataset ADLs FALLs
𝑺𝑴𝑨𝒎𝒂𝒙 of FallAllD Wrist dataset ADLs FALLs
(g) Boxplot of SMVmax for FallAllD Wrist dataset (h) Boxplot of SMAmax for FallAllD Wrist dataset
Figure 6: The boxplots of statistical features of the dataset provide a visual representation of their distribution and the thresh-
olds.
582