IJETAUTISMPAPER
IJETAUTISMPAPER
Abstract
Machine Learning based behavioural analytics emphasis the need to develop accurate prediction models for detecting the risk of autism
faster than the traditional diagnostic methods. Quality of prediction rely on the accuracy of the supplied dataset and the machine learning
model.To improve accuracy of prediction, dimensionality reduction with feature selection is applied to eliminate noisy features from a
dataset. In this work an ASD diagnosis dataset with 21 features obtained from UCI machine learning repository is experimented with
swarm intelligence based binay firefly feature selection wrapper. The alternative hypothesis of the experiment claims that it is possible
for a machine learning model to achieve a better classification accuracy with minimum feature subsets.Using Swarm intelligence based
single-objective binary firefly feature selection wrapper it is found that 10 features among 21 features of ASD dataset are sufficient to
distinguish between ASD and non-ASD patients.The results obtained with our approach justifies the hypothesis by producing an average
accuracy in the range of 92.12%-97.95% with optimum feature subsets which is approximately equal to the average accuracy produced
by entire ASD diagnosis dataset.
Keywords: Autism Spectrum Disorder, Behavioural Analytics, Machine Learning, Feature Selection
1. Introduction are also followed for behaviour classification. ADI-R and ADOS
are some common behaviour tests carried out by pediatricians for
detection of childhood autism symptoms. These clinical
Autism is a childhood disorder which has become more prevalent
experiments are practiced by certified professionals in laboratory
among younger generations in the recent decade. According to the
conditions. The assessments can last for 60 minutes of duration
centre for disease control and prevention, there is a sustainable
based on the patient’s responsiveness. The certified professional
growth in the number of children diagnosed with Autism disorder.
awards a binary score based on the quality of response.
According to them, 1 among 68 Children under the age of 8 in the
Consolidated scores decide the severity of autism in the patients.
United States of America is diagnosed with autism [1]. Autism
In [3], an ASD diagnostic dataset comprised of 21
diagnosis is a clinical examination procedure conducted according
behavioural attributes is taken for classification task of ASD
to the DSM-V standards for disorder classification [2]. These
patients from non-ASD. This work has adapted a mobile
standards are coined by the US Mental health professionals based
application based ASD screening approach obeying the DSM-V
on their successful diagnostic experiences and contributions.
fulfillment for Autism detection. The behaviour dataset has
These procedures are widely incorporated in behavioral analytics
collected 292 samples of children Autism screening episodes. In ,
for classification of ASD from non-ASD. In addition to DSM-V
the researcher suggests feature selection as a measure for
standards, interview and questionnaire based clinical examinations
improving prediction accuracy of machine learning models. This
Copyright © 2016 Authors. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted
use, distribution, and reproduction in any medium, provided the original work is properly cited.
2 International Journal of Engineering & Technology
Distance Euclidean
Number of Particles 30
No of Iterations 100
Fig. 2: Flowchart on the working of Binary Firefly algorithm for feature MLP Back propagation neural network based
selection on ASD diagnosis dataset classification algorithm [20]
Feature Selection Firefly wrapper ROC Area Area under the curve calculated by integrated
Algorithm start and end points of a graph
B A B A B A B A
5. Interpretations
Binary Firefly algorithm for feature selection opted is a fast [4] M. Duda, R. Ma, N. Haber, and D. P. Wall, “Use of
explorer than existing swarm intelligence search algorithms. machine learning for behavioral distinction of autism and
Comparison of results of machine learning models before and
ADHD,” Transl. Psychiatry, vol. 6, no. 2, p. e732, 2017.
after feature selection showed that 3/5 machine learning models
have considerable performance improvement with the optimum [5] J. A. Kosmicki, V. Sochat, M. Duda, and D. P. Wall,
behaviour sets. “Searching for a minimal set of behaviors for autism detection
Presence of 15% missing values in the selected Relation attribute through feature selection-based machine learning,” Transl.
might have caused deterioration of quality of models in the Psychiatry, vol. 5, no. 2, p. e514, 2015.
functional classifiers such as SVM and MLP. However the
[6] F. Thabtah, “Machine learning in autistic spectrum
performance of functional models built with optimum behaviour
set is better than the other classification models. disorder behavioral research: A review and ways forward,”
Due to lesser amount of instances in the dataset, there exist of Informatics Heal. Soc. Care, vol. 0, no. 0, pp. 1–20, 2018.
chance of model overfitting on the dataset. [7] R. C. Team and others, “R: A language and environment
From the above interpretations it is clear that the optimum for statistical computing,” 2013.
behaviour set has improved the prediction performance of
[8] G. Holmes, A. Donkin, and I. H. Witten, “Weka: A
machine learning models in 3/5 cases and in 2/5 cases the
behaviour set has exhibited a decent performance with minimum machine learning workbench,” in Intelligent Information Systems,
features. These observations validate the alternative hypothesis: 1994. Proceedings of the 1994 Second Australian and New
Minimum behaviour sets can retain the structure of the entire Zealand Conference on, 1994, pp. 357–361.
dataset in machine learning.
[9] W. Siedlecki and J. Sklansky, “On automatic feature
selection,” Int. J. Pattern Recognit. Artif. Intell., vol. 2, no. 02, pp.
6. Conclusion
This paper aimed to design an automated ASD prediction model 197–220, 1988.
with minimum behaviour sets selected from ASD diagnosis [10] L. Tolosi and T. Lengauer, “Classification with
dataset with Binary Firefly algorithm for feature selection. The correlated features: unreliability of feature ranking and solutions,”
hypothesis of this paper is to find whether machine learning Bioinformatics, vol. 27, no. 14, pp. 1986–1994, 2011.
models trained with minimum behaviour sets are capable of better
[11] X. Wang, J. Yang, X. Teng, W. Xia, and R. Jensen,
performance or not. In order to select features a swarm
intelligence based wrapper is considered as a better alternative to “Feature selection based on rough sets and particle swarm
Ranking based feature elimination algorithms. From the above optimization,” Pattern Recognit. Lett., vol. 28, no. 4, pp. 459–471,
results and discussions the hypothesis is validated. 2007.
[12] A. Unler, A. Murat, and R. B. Chinnam, “mr2PSO: A
7. Future work
maximum relevance minimum redundancy feature selection
UCI repository indicates the presence of missing instances in the
ASD child dataset which is not handled in the present work. method based on swarm intelligence for support vector machine
Rather it is assumed that the dataset is complete and evaluation is classification,” Inf. Sci. (Ny)., vol. 181, no. 20, pp. 4625–4641,
done. This assumption could have impacted on the performance of 2011.
feature selection and machine learning. In future, a suitable [13] H. Banati and M. Bajaj, “Fire fly based feature selection
missing data imputation framework should be designed to check
approach,” IJCSI Int. J. Comput. Sci. Issues, vol. 8, no. 4, 2011.
the presence of missing data in the dataset. Even Though swarm
intelligence wrappers are better explorers than traditional feature [14] G. H. John and P. Langley, “Estimating continuous
selection , there exist their own disadvantages in terms of risk of distributions in Bayesian classifiers,” in Proceedings of the
overfitting, time complexity and search complexity. These factors Eleventh conference on Uncertainty in artificial intelligence,
should be addressed in the future work.
1995, pp. 338–345.
[15] J. R. Quinlan, “C4. 5: Programming for machine
Acknowledgement
The authors thank VIT University for providing ‘VIT SEED learning,” Morgan Kauffmann, vol. 38, p. 48, 1993.
GRANT’ for carrying out this research. [16] S. S. Keerthi, S. K. Shevade, C. Bhattacharyya, and K.
R. K. Murthy, “Improvements to Platt’s SMO algorithm for SVM
References classifier design,” Neural Comput., vol. 13, no. 3, pp. 637–649,
2001.
[1] J. Baio, “Prevalence of Autism Spectrum Disorder
[17] J. C. Platt, “12 fast training of support vector machines
Among Children Aged 8 Years—Autism and Developmental
using sequential minimal optimization,” Adv. kernel methods, pp.
Disabilities Monitoring Network, 11 Sites, United States, 2014,”
185–208, 1999.
MMWR. Surveill. Summ., vol. 67, 2018.
[18] T. Hastie and R. Tibshirani, “Classification by pairwise
[2] A. P. Association and others, Diagnostic and statistical
coupling,” in Advances in neural information processing systems,
manual of mental disorders (DSM-5®). American Psychiatric
1998, pp. 507–513.
Pub, 2013.
[19] D. W. Aha, D. Kibler, and M. K. Albert,
[3] F. Thabtah, “Autism Spectrum Disorder Screening:
“Instance-based learning algorithms,” Mach. Learn., vol. 6, no. 1,
Machine Learning Adaptation and DSM-5 Fulfillment,” in
pp. 37–66, 1991.
Proceedings of the 1st International Conference on Medical and
[20] S. K. Pal and S. Mitra, “Multilayer perceptron, fuzzy
Health Informatics 2017, 2017, pp. 1–6.
sets, and classification,” IEEE Trans. neural networks, vol. 3, no.
5, pp. 683–697, 1992.
6 International Journal of Engineering & Technology