0% found this document useful (0 votes)
92 views

Heart Disease Prediction Using Machine Learning

Contact us for project abstract, enquiry, explanation, code, execution, documentation. Phone/Whatsap : 9573388833 Email : info@datapro.in Website : https://dcs.datapro.in/contact-us-2 Tags: btech, mtech, final year project, datapro, machine learning, cyber security, cloud computing, blockchain,

Uploaded by

dataprodcs
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
0% found this document useful (0 votes)
92 views

Heart Disease Prediction Using Machine Learning

Contact us for project abstract, enquiry, explanation, code, execution, documentation. Phone/Whatsap : 9573388833 Email : info@datapro.in Website : https://dcs.datapro.in/contact-us-2 Tags: btech, mtech, final year project, datapro, machine learning, cyber security, cloud computing, blockchain,

Uploaded by

dataprodcs
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
You are on page 1/ 11

ABSTRACT

Over the years we have seen that the spread of disease is a global problem. Accurate
and timely analysis of any health-related problems is essential to treat and treat them
early but with this growing number it becomes more difficult. Machine learning strategies
have been accelerating the field of health with more research and methodologies.

Machine learning strategies can contribute significantly to the process of predicting and
diagnosing diseases early, becoming a field of research and proving to be a viable option
in the health sector.

The research paper focuses on the prediction of diseases caused in Heart, predicting the
likelihood of a person having a specific disease that can lead to heart attack based on
various medical factors and related parameters.

We set up 4 different predictive systems using data set to predict whether a patient might
be diagnosed or not. This diagnosis is a difficult task because it must be done accurately
and effectively. We undertake various machine learning classifications such as SVM and
all to predict and differentiate an infected patient and used a temperature map to analyze
the disease.

The most useful method was used to control how the model could be used to improve the
accuracy of the disease prediction in any individual. The power of the proposed model is
relatively satisfactory and able to predict disease and show good accuracy compared to
the previously used category such as non-abrasive bays etc. So, the silent amount of
pressure has been removed using the given model in identifying opportunities for the
divider to be able to diagnose diseases correctly and efficiently.

Keywords: Customer segmentation, Clustering, K-means clustering, Mini Batch


Kmeans clustering, hierarchical clustering

v
TABLE OF CONTENTS

Chapter No. TITLE Page No.


v
ABSTRACT
LIST OF FIGURES viii
LIST OF TABLES ix
LIST OF ABBREVIATIONS x
1 INTRODUCTION 1
1.1. OVERVIEW 2
1.2. MACHINE LEARNING 3
1.2.1 HISTORY & RELATIONSHIPS
1.3 MACHINE LEARNING STRATEGIES 4
1.3.1. SUPERVISED LEARNING 5
1.3.2. UNSUPERVISED LEARNING 6

2 LITERATURE SURVEY 11
2.1. RELATED WORK 11

3 METHODOLOGY 14
3.1. EXISTING SYSTEM 14
3.2. PROPOSED SYSTEM 14
3.3. OBJECTIVE 14
3.4. SOFTWARE AND HARDWARE REQUIREMENTS 15
3.4.1. SOFTWARE REQUIREMENTS 15
3.4.2. HARDWARE REQUIREMENTS 15
3.4.3. LIBRARIES 15
3.5. PROGRAMMING LANGUAGES 16
3.5.1 PYTHON 16
3.5.2. DOMAIN 18

[Type here]
3.6. SYSTEM ARCHITECTURE 21
3.7. ALGORITHMS USED 23
3.7.1. LOGISTIC REGRESSION 23
3.7.2. K NEAREST NEIGHBOR 23
3.7.3. SUPPORT VECTOR MACHINE
23
3.7.4. RANDOM FOREST
24
3.8. MODULES
25
3.8.1. DATASET COLLECTION
25
3.8.2. TRAIN AND TEST THE MODELS
3.8.3. HYPERPARAMETER TUNNING 25

3.9. WEB APPLICATION 26


RESULT AND DISCUSSION 29
4.1. PERFORMANCE ANALYSIS 29
CONCLUSION
30
5.1. CONCLUSION
30
REFERENCES
31
APPENDICES
A. SOURCE CODE 32

B. SCREENSHOTS 38

2
LIST OF FIGURES

Figure No. Figure Name Page No.


1.1. MACHINE LEARNING CLASSIFICATION 7

3.1. WORKFLOW DIAGRAM 21

4.1. ACCURACIES OBTAINED 29

B.1. DATASET 38

B.2. CORRELATION MATRIX 38

B.3. HISTOGRAM 39

B.4. BOX PLOT 40

B.5 SEABORN VISUALIZATION 41

B.6 BAR PLOT 42

B.7. GRAPH PLOT 44

B.8. LR CLASSIFICATION REPORT 46

B.9. KNN CLASSIFICATION REPORT 46

B.10. SVM CLASSIFICATION REPORT 46

B.11. RF CLASSIFICATION REPORT 47

B.12. ACCURACY TABLE 47

B.13. WEB APPLICATION OF PREDICTION 48

C. RESEARCH PAPER PUBLICATION 49

D. PLAGARISM REPORT 51

3
LIST OF ABBREVIATIONS

ABBREVIATIONS EXPANSION

ML Machine Learning

AI Artificial Intelligence

RF Random Forest

UI User Interface

SVM Support Vector Machine

KNN K Nearest neighbor

LR Logistic Regression

4
CHAPTER 1

INTRODUCTION

1.1 INTRODUCTION OF THE PROJECT

Health care data is generally large in volume and complexity in structure. Records
of a large set of medical data created by medical professionals are available to
analyze and extract important information from it. Machine learning strategies are
able to manage large contents of data and helps to obtain needful information.
This type of Machine learning framework can encourage physicians to take
immediate action so that more patients can receive medication in a shorter period
of time, thus saving a significant number of lives.
This paper covers the predictions of different types of diseases caused in Heart
that causes death using different ML strategies. Heart disease, also known as
cardiovascular disease, encompasses a wide range of cardiovascular conditions
and has been a major cause of death worldwide for the past few decades. It
encompasses a wide range of risk factors as well as the need for time to find
accurate, reliable, and logical ways to make an early diagnosis.
Heart is a vital organ of the human frame. It pumps blood to every part of our
anatomy. If it fails to feature efficiently, then the brain and diverse different organs
will stop operating, and inside few minutes, the man or woman will die. exchange
in lifestyle, paintings related pressure and awful food behavior contribute to the
growth in the price of numerous heart-associated sicknesses. coronary heart
diseases have emerged as one of the most distinguished reasons of dying all
around the global. in step with global fitness employer, heart related sicknesses
are responsible for taking 17.7 million lives every year, 31% of all international
deaths. In India too, coronary heart-associated diseases have emerged as the
main reason of mortality coronary heart sicknesses have killed 1.7 million Indians
in 2016, in accordance to the 2016 global Burden of ailment file, released on
September 15,2017. Heart-associated diseases increase the spending on fitness
care and also lessen the productiveness of a man or woman. Estimates made by
the sector fitness employer (WHO), advocate that India has lost up to $237 billion,

1
from 2005-2015, because of heart-related or cardiovascular diseases. accordingly,
possible and accurate prediction of heart-related sicknesses could be very vital.
medical organizations, all over the international, gather facts on various fitness-
associated troubles. This information may be exploited using numerous machine
mastering strategies to benefit beneficial insights. but the data accrued is very
huge and, often, these statistics may be very noisy. These datasets, which might
be too overwhelming for human minds to realize, may be easily explored using
various device getting to know techniques. hence, those algorithms have end up
very useful, in recent instances, to expect the presence or absence of coronary
heart-related diseases as it should be the usage of records technology inside the
fitness care enterprise is increasing day by day to resource docs in choice- making
activities. It facilitates docs and physicians in ailment management, medicines,
and discovery of styles and relationships amongst analysis information. cutting-
edge procedures to predict cardiovascular hazard fail to discover many folks that
would advantage from preventive treatment, while others acquire pointless
intervention. system-gaining knowledge of offers an opportunity to improve
accuracy by way of exploiting complicated interactions among risk elements. We
assessed whether system-getting to know can enhance cardiovascular danger
prediction. Using a variety of techniques, it was found that the SVM technique
provided the best accuracy among others.

The main purpose of this work is to determine whether patients are diagnosed with
the disease or not by using not only one algorithm in prior. Using database for
finding the disease and using different algorithms to have a better understanding
about the algorithms and analysing the data for the most effective result. It did
provide excellent accuracies for each disease.

1.2 MACHINE LEARNING

Machine learning (ML) is the study of computer algorithms that can improve
automatically through experience and by the use of data. It is seen as a part
of artificial intelligence. Machine learning algorithms build a model based on sample
data, known as training data, in order to make predictions or decisions without being
explicitly programmed to do so. Machine learning algorithms are used in a wide
2
variety of applications, such as in medicine, email filtering, speech recognition,
and computer vision, where it is difficult or unfeasible to develop conventional
algorithms to perform the needed tasks.

Despite the fact that the reasons mentioned are valid, we have added a dimension
in the last decade where data is being utilized for predicting what could potentially
happen in the future. Then comes Machine Learning which play a significant role
in doing so. Machine learning is a subset/subfield of Artificial Intelligence.
Generally, the main aim of Machine learning is to understand the structure of data
and apply the best possible models that can be utilized or identify a hidden
pattern. Developing a machine learning model is one of the key factors in
predicting a future problem which again requires machine learning algorithms.
There are numerous machine learning algorithms that have been developed and
mature enough to solve various real-world business problems.

Although machine learning is a field within computer science, it differs from


traditional computational approaches. In traditional computing, algorithms are sets
of explicitly programmed instructions used by computers to calculate or problem
solve. Machine learning algorithms instead allow for computers to train on data
inputs and use statistical analysis in order to output values that fall within a specific
range. Because of this, machine learning facilitates computers in building models
from sample data in order to automate decision-making processes based on data
inputs.

Using Machine learning, information is being turned into knowledge. In the last 5-6
decades, enormous data has been recorded or collected which will be of no use if
we don’t utilize or analyze to find hidden patterns. In order to find useful and
significant patterns with complex data, we have several Machine Learning
techniques available to ease our struggle for discovery. Subsequently, those
identified hidden patterns and knowledge of the problem can be helpful to perform
complex decision making and predict future occurrence.

1.2.1 History and relationships to other fields

The term machine learning was coined in 1959 by Arthur Samuel, an

3
American IBMer and pioneer in the field of computer gaming and artificial
intelligence. Also, the synonym self-teaching computers was used in this time
period. A representative book of machine learning research during the 1960s
was the Nilsson's book on Learning Machines, dealing mostly with machine
learning for pattern classification. Interest related to pattern recognition
continued into the 1970s, as described by Duda and Hart in 1973.In 1981 a
report was given on using teaching strategies so that a neural network learns
to recognize 40 characters (26 letters, 10 digits, and 4 special symbols) from
a computer terminal.

Modern day machine learning has two objectives, one is to classify data
based on models which have been developed, the other purpose is to make
predictions for future outcomes based on these models. A hypothetical
algorithm specific to classifying data may use computer vision of moles
coupled with supervised learning in order to train it to classify the cancerous
moles. Whereas, a machine learning algorithm for stock trading may inform
the trader of future potential predictions.

1.3 MACHINE LEARNING APPROACHES

In machine learning, tasks square measure is typically classified into broad classes.
These classes square measure supported however learning is received or however,
feedback on the education is given to the system developed. Two of the foremost
wide adopted machine learning strategies are square measure supervised learning
that trains algorithms supported example input and output information that's tagged
by humans, and unattended learning that provides the algorithmic program with no
tagged information to permit it to search out structure at intervals its computer file.

Machine learning approaches are traditionally divided into three broad categories,
depending on the nature of the "signal" or "feedback" available to the learning
system:

Supervised learning: The computer is presented with example inputs and their
4
desired outputs, given by a "teacher", and the goal is to learn a general rule that
maps inputs to outputs.

Unsupervised learning: No labels are given to the learning algorithm, leaving it on


its own to find structure in its input. Unsupervised learning can be a goal in itself
(discovering hidden patterns in data) or a means towards an end (feature learning).

Reinforcement learning: A computer program interacts with a dynamic


environment in which it must perform a certain goal (such as driving a vehicle or
playing a game against an opponent). As it navigates its problem space, the
program is provided feedback that's analogous to rewards, which it tries to
maximize.

1.3.1 Supervised Learning

In supervised learning, the pc is given example inputs that square measure


labeled with their desired outputs. The aim of this technique is for the algorithmic
program to be ready to “learn” by comparing its actual output with the “taught”
outputs to search out errors, and modify the model consequently. Supervised
learning thus uses patterns to predict label values on extra unlabeled information.
For example, with supervised learning, an algorithm may be fed data with images
of sharks labeled as fish and images of oceans labeled as water. By being trained
on this data, the supervised learning algorithm should be able to later identify
unlabeled shark images as fish and unlabeled ocean images as water.

A common use case of supervised learning is to use historical information to


predict statistically probably future events. It's going to use historical stock
exchange info to anticipate approaching fluctuations or be used to filter spam
emails. In supervised learning, labeled photos of dogs are often used as input files
to classify unlabeled photos of dogs.

Types of supervised learning algorithms include active learning, classification and


regression. Classification algorithms are used when the outputs are restricted to
5
a limited set of values, and regression algorithms are used when the outputs may
have any numerical value within a range. As an example, for a classification
algorithm that filters emails, the input would be an incoming email, and the output
would be the name of the folder in which to file the email.

Similarity learning is an area of supervised machine learning closely related to


regression and classification, but the goal is to learn from examples using a
similarity function that measures how similar or related two objects are. It has
applications in ranking, recommendation systems, visual identity tracking, face
verification, and speaker verification.

1.3.2 Unsupervised Learning

In unsupervised learning, information is unlabeled, and the learning rule is left to


seek out commonalities among its input file. The goal of unattended learning is
also as easy as discovering hidden patterns at intervals in a dataset, however, it
should even have a goal of feature learning, that permits the procedure machine
to mechanically discover the representations that square measure required to
classify data.

Unsupervised learning is usually used for transactional information. You will have
an oversized dataset of consumers and their purchases, however, as a person,
you'll probably not be able to add up what similar attributes will be drawn from
client profiles and their styles of purchases.

With this information fed into the Associate in Nursing unattended learning rule, it
should be determined that ladies of a definite age vary UN agency obtain
unscented soaps square measure probably to be pregnant, and so a promoting
campaign associated with physiological condition and baby will be merchandised

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy