An Overview of Data Mining in Medical Field: Reema Arora, Sandeep Jaglan
An Overview of Data Mining in Medical Field: Reema Arora, Sandeep Jaglan
An Overview of Data Mining in Medical Field: Reema Arora, Sandeep Jaglan
198 www.erpublication.org
An Overview of Data Mining In Medical Field
is a collection of data objects, similar data are taking in the deeply a tree can grow. The C4.5 algorithm is capable of
same cluster, dissimilar data are taking in different clusters. managing continuous attributes, which are vital in case of
medical data (e.g. blood pressure, temperature, etc.). Other
Association: very familiar aspect missing values was also taken into
consideration in C4.5. Moreover the algorithm handles
Association analysis is the unearthing of association rules.
attributes with differing costs.
It depends on the occurrence of transactional data occur
The utility of C4.5 algorithm was widely proven in medicine .
together in database, also depends on a threshold called
This algorithm suits medical data because it copes with
support, and identifies the frequent item sets. Association data
missing values. What is more the algorithm handles
mining designed to find association between attributes,
continuous data which are common among medical
generate rules from data sets. The association rule mining role
symptoms. The efficiency of C4.5 was shown e.g. in breast
is to arrive at all rules having support minsup (minimum
cancer and prostate cancer classification to generate a
support) threshold and confidence minconf (minimum
decision tree and rules which may be helpful in medical
confidence) threshold.
diagnosing process.
Sequential Patterns
A Nave Bayes
Sequential patterns analysis is another data mining A nave bayes classifier assumes that the occurrence (or
technique that seeks to discover or classify similar patterns, absence) of a particular feature of a class is unrelated to the
regular events or trends in transaction data over a business presence (or absence) of any other feature. For illustration, a
period. fruit may be assumed to be a tomato if it is red in colour, round
in shape, and about 2.5" in diameter. This classifier takes all
Decision trees: these description to contribute separately to the probability
that this fruit is a tomato, whether or not they're in fact related
Decision tree is one of the most used data mining techniques to each other or to the existence of the other features. The
because its model is easy to comprehend by users. In decision Bayes theorem is as follows: Let X={x1, x2,.....,xn} be a set
tree technique, the root of the decision tree is a simple of n attributes. In Bayesian, X is assumed as proof and H be
question or condition that has numerous answers. Each some hypothesis means the data of X belongs to precise class
answer then leads to a set of questions or conditions that help C. To determine P (H|X), the probability that the hypothesis H
us determine the data so that we can make the final decision holds specified facts i.e. data sample X. According to Bayes
based on it. theorem the P (H|X) is expressed as P (H|X) = P (X| H) P (H)
/ P (X) As Nave Bayes classifiers depends on the precise
nature of the probability model , so it can be trained very
efficiently in a supervised learning setting. Here independent
variables are considered for the principle of prediction or
occurrence of the event. It has been shown that Nave Bayes
classifiers often work much better in many complex real
world situations. An assistance of the Nave Bayes classifier is
that it requires a small amount of training data to estimate the
parameters (means and variances of the variables) necessary
for classification.
V. NEURAL NETWORKS
199 www.erpublication.org
International Journal of Engineering and Technical Research (IJETR)
ISSN: 2321-0869 (O) 2454-4698 (P), Volume-3, Issue-7, July 2015
their high performance. The drawback of this method is its Knowledge-Based MDSS Most MDSS are divided into three
complexity and difficulty in understanding the predictions. parts, the knowledge base, inference engine and mechanism to
communicate. The knowledge base possessed the IF-THEN
rules. The inference engine gather the rules from the
knowledge base with the patients data. The communication
mechanism will permit the system to show the results to the
user as well as have input into the system. Features of a
non-Knowledge-Based MDSS Two types of
non-knowledge-based systems are neural networks and
genetic algorithm. Neural networks use nodes and weighted
connections between them to analyze the patterns found in the
patient data to develop the associations between the
symptoms and a diagnosis. Genetic Algorithms are based on
basic evolutionary processes using directed selection to
Multilayer perceptions (MLPs) achieve optimal MDSS results. The MDSS features
associated with success include the following:
Multilayer perceptions are feedforward neural networks It is incorporated into the health care workflow rather than
skilled with the paradigm backpropagation algorithm. They as a separate log-in or screen.
are called as supervised networks because they require a
desire reply to be skilled. They gain knowledge of how to It is electronic unlike paper-based templates.
transform input data into a desired response,, therefore they
It gives decision support at the time and location of care
are widely used for pattern classification. With single or two
rather than prior to or after the patient encounter.
hidden layers, they can fairly accurate virtually any
input-output map. They have been revealed to approximate It gives(active voice) recommendations for care, not just
the performance of optimal statistical classifiers in difficult assessments.
problems. Most neural network applications involve MLPs.
VI. MEDICAL DECISION SUPPORT SYSTEM IN DATA MINING VII. CHARACTERISTICS OF MEDICAL DECISION SUPPORT
Data mining applications are presently being applied to two SYSTEMS
main branches in health care and medicine: Medical decision The Medical DSSs are the type of computer programs that
support system, and policy planning/decision making. A. help out physicians and medical staff in Medical decision
Medical decision support system MDSS is an interactive making tasks. Most of the Medical decision support systems
Decision support system (DSS) Computer Software, which is (MDSSs) are equipped with diagnostic assistance module,
intended to lend a hand to physicians and other health therapy critiquing and planning module, medications
professionals with judgment making tasks, such as prescribing module, information retrieval subsystem (for
determining diagnosis of patient data. . The main purpose of instance formulating accurate clinical questions) and image
modern MDSS is to help clinicians at the point of care. It recognition and interpretation section (X-rays, CT, MRI
means, a clinician would interact with a MDSS to help scans) Interesting examples of MDSSs are machine learning
determine diagnosis, analysis, etc. of patient data. It is a systems which are able to create new healthcare knowledge.
decision-support system program that offers employees in By analyzing healthcare cases a Medical Decision Support
detail, purpose, custom-made, and current information on all System can produce a detailed description of input features
healthcare conditions. Employees receive the information, with a unique characteristic of healthcare conditions. It
implements and support they need from incorporated web, supports may be priceless in looking for changes in patients
phone, and print based materials. This helps employees health condition. These systems may improve patients safety
formulate more informed healthcare decisions while working by reducing errors in diagnosing. They may also get enhanced
with their own physician. An example of how a MDSS might medications and test ordering. Furthermore, the quality of
be helpful in medicinal gather from the subset of Medical care gets better due to the lengthening of the time clinicians
Decision Support System and Diagnosis Decision Support spend with a patient. It may be an effect of application of
Systems. A DDSS would obtain the patients data and proper guidelines, up-to date healthcare evidence and
recommend a set of correct diagnoses. The doctor then takes improved documentation. Moreover, the efficiency of the
the output of the DDSS and point out which are relevant and health care delivery is improved by reducing costs through
which are not. Another important classification of a MDSS is faster order processing or eliminated duplication of tests.
based on the timing of its use. Doctors apply these systems at Examples of Medical Decision Support Systems
point of care to aid them as they are handling a patient, with There exist several Medical Decision Support Systems
the time of use as either pre-diagnoses, during diagnoses, or (MDSSs). They help in early detection of diseases. In this
post diagnoses. Pre-diagnoses MDSS systems are used to help survey a few of the most significant systems are accessible.
the physician prepare the diagnoses. MDSS helpful during They are utilized in hospitals. To provide you the idea of
diagnoses in reviewing and filtering the physicians Medical Decision Support Systems three sample ones are
preliminary diagnostic choices to improve their final results. described: HELP, DX plain and ERA.
And post-diagnoses MDSS systems are used to mine data to HELP
derive connections between patients and their past medical One of the most accepted and advanced Medical Decision
history and to predict future events. Features of a Support System is called HELP. It helps the clinicians in
200 www.erpublication.org
An Overview of Data Mining In Medical Field
201 www.erpublication.org