0% found this document useful (0 votes)

13 views

SPPUML5

Machine learning lab assignment 5

Uploaded by

kanaseaditya800

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views

SPPUML5

Machine learning lab assignment 5

Uploaded by

kanaseaditya800

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

14/08/2024, 15:56 ML-5 - Jupyter Notebook

Name : Kanase Aditya Madhukar

Roll No : 2441059

Batch : D

Assignment No.05 : Implement K-Nearest Neighbors algorithm on diabetes.csv dataset. Compute confusion matrix,
accuracy, error rate, precision and recall on the given dataset. Dataset link :
https://www.kaggle.com/datasets/abdallamahgoub/diabetes
(https://www.kaggle.com/datasets/abdallamahgoub/diabetes)

In [1]: import pandas as pd

import numpy as np
from sklearn import metrics
from sklearn.svm import SVC
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import confusion_matrix, accuracy_score, precision_score, recall_sco

In [2]: df = pd.read_csv('diabetes.csv')

In [3]: df.head()

Out[3]:
Pregnancies Glucose BloodPressure SkinThickness Insulin BMI Pedigree Age Outcome

0 6 148 72 35 0 33.6 0.627 50 1

1 1 85 66 29 0 26.6 0.351 31 0

2 8 183 64 0 0 23.3 0.672 32 1

3 1 89 66 23 94 28.1 0.167 21 0

4 0 137 40 35 168 43.1 2.288 33 1

In [4]: df.tail()

Out[4]:
Pregnancies Glucose BloodPressure SkinThickness Insulin BMI Pedigree Age Outcome

763 10 101 76 48 180 32.9 0.171 63 0

764 2 122 70 27 0 36.8 0.340 27 0

765 5 121 72 23 112 26.2 0.245 30 0

766 1 126 60 0 0 30.1 0.349 47 1

767 1 93 70 31 0 30.4 0.315 23 0

In [5]: df.isnull().sum()

Out[5]: Pregnancies 0
Glucose 0
BloodPressure 0
SkinThickness 0
Insulin 0
BMI 0
Pedigree 0
Age 0
Outcome 0
dtype: int64

localhost:8888/notebooks/ML/ML-5.ipynb 1/4
14/08/2024, 15:56 ML-5 - Jupyter Notebook

In [6]: X = df.drop("Outcome", axis=1)

y = df["Outcome"]

In [7]: X

Out[7]:
Pregnancies Glucose BloodPressure SkinThickness Insulin BMI Pedigree Age

0 6 148 72 35 0 33.6 0.627 50

1 1 85 66 29 0 26.6 0.351 31

2 8 183 64 0 0 23.3 0.672 32

3 1 89 66 23 94 28.1 0.167 21

4 0 137 40 35 168 43.1 2.288 33

... ... ... ... ... ... ... ... ...

763 10 101 76 48 180 32.9 0.171 63

764 2 122 70 27 0 36.8 0.340 27

765 5 121 72 23 112 26.2 0.245 30

766 1 126 60 0 0 30.1 0.349 47

767 1 93 70 31 0 30.4 0.315 23

768 rows × 8 columns

In [8]: X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42

In [9]: scaler = StandardScaler()

X_train = scaler.fit_transform(X_train)
X_test = scaler.transform(X_test)

In [10]: X_train

Out[10]: array([[-0.52639686, -1.15139792, -3.75268255, ..., -4.13525578,

-0.49073479, -1.03594038],
[ 1.58804586, -0.27664283, 0.68034485, ..., -0.48916881,
2.41502991, 1.48710085],
[-0.82846011, 0.56687102, -1.2658623 , ..., -0.42452187,
0.54916055, -0.94893896],
...,
[ 1.8901091 , -0.62029661, 0.89659009, ..., 1.76054443,
1.981245 , 0.44308379],
[-1.13052335, 0.62935353, -3.75268255, ..., 1.34680407,
-0.78487662, -0.33992901],
[-1.13052335, 0.12949347, 1.43720319, ..., -1.22614383,
-0.61552223, -1.03594038]])

In [11]: k = 3
knn = KNeighborsClassifier(n_neighbors=k)
knn.fit(X_train, y_train)

Out[11]: KNeighborsClassifier(n_neighbors=3)

localhost:8888/notebooks/ML/ML-5.ipynb 2/4
14/08/2024, 15:56 ML-5 - Jupyter Notebook

In [12]: y_pred = knn.predict(X_test)

y_pred

/home/comp/anaconda3/lib/python3.9/site-packages/sklearn/neighbors/_classification.py:2
28: FutureWarning: Unlike other reduction functions (e.g. `skew`, `kurtosis`), the defa
ult behavior of `mode` typically preserves the axis it acts along. In SciPy 1.11.0, thi
s behavior will change: the default value of `keepdims` will become False, the `axis` o
ver which the statistic is taken will be eliminated, and the value None will no longer
be accepted. Set `keepdims` to True or False to avoid this warning.
mode, _ = stats.mode(_y[neigh_ind, k], axis=1)

Out[12]: array([0, 0, 0, 1, 1, 0, 0, 1, 1, 0, 1, 0, 0, 1, 0, 0, 0, 0, 1, 0, 1, 0,
0, 0, 1, 1, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 1, 0, 0, 1, 0, 0, 1, 0,
0, 0, 0, 0, 0, 1, 0, 1, 1, 0, 0, 0, 0, 0, 1, 0, 0, 0, 1, 1, 0, 1,
0, 1, 0, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 0,
0, 0, 0, 0, 0, 0, 0, 1, 1, 0, 0, 0, 0, 0, 0, 0, 1, 0, 0, 1, 0, 1,
0, 0, 0, 1, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1,
0, 0, 1, 0, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0])

In [13]: conf_matrix = confusion_matrix(y_test, y_pred)

accuracy = accuracy_score(y_test, y_pred)

error_rate = 1 - accuracy
precision = precision_score(y_test, y_pred)
recall = recall_score(y_test, y_pred)

print("Confusion Matrix:")
print(conf_matrix)
print("Accuracy:", accuracy)
print("Error Rate:", error_rate)
print("Precision:", precision)
print("Recall:", recall)

Confusion Matrix:
[[81 18]
[27 28]]
Accuracy: 0.7077922077922078
Error Rate: 0.29220779220779225
Precision: 0.6086956521739131
Recall: 0.509090909090909

In [14]: model = SVC(C = 1)

model.fit(X_train, y_train)

y_pred = model.predict(X_test)

In [15]: kc = metrics.confusion_matrix(y_test, y_pred)

print("SVM accuracy: ", kc)

SVM accuracy: [[82 17]

[24 31]]

In [16]: sc = metrics.accuracy_score(y_test,y_pred)
print("SVM accuracy: ", sc)

SVM accuracy: 0.7337662337662337

In [19]: lr = LogisticRegression()

lr.fit(X_train, y_train)

y_pred = lr.predict(X_test)

In [20]: acc = metrics.accuracy_score(y_test,y_pred)

print("Logistic Regression accuracy: ", acc)

Logistic Regression accuracy: 0.7532467532467533

localhost:8888/notebooks/ML/ML-5.ipynb 3/4
14/08/2024, 15:56 ML-5 - Jupyter Notebook

In [ ]:

localhost:8888/notebooks/ML/ML-5.ipynb 4/4

Patellofemoral Instability Part I Evaluation And.8
No ratings yet
Patellofemoral Instability Part I Evaluation And.8
12 pages
Roger G. Barry - Synoptic and Dynamic Climatology (2001)
No ratings yet
Roger G. Barry - Synoptic and Dynamic Climatology (2001)
637 pages
List of Materials Properties
No ratings yet
List of Materials Properties
5 pages
Textbook Solutions Expert Q&A Practice: Find Solutions For Your Homework
No ratings yet
Textbook Solutions Expert Q&A Practice: Find Solutions For Your Homework
4 pages
I Avaliação Parcial - 25.0 PTS - Gabarito
No ratings yet
I Avaliação Parcial - 25.0 PTS - Gabarito
9 pages
210596_ML_Labtask5.ipynb_k - Colab
No ratings yet
210596_ML_Labtask5.ipynb_k - Colab
8 pages
KNN - Jupyter Notebook
No ratings yet
KNN - Jupyter Notebook
5 pages
ML 4
No ratings yet
ML 4
2 pages
Assignment 5 - SourceCode - Ipynb - Colab
No ratings yet
Assignment 5 - SourceCode - Ipynb - Colab
4 pages
AIML Report (1) 11
No ratings yet
AIML Report (1) 11
13 pages
PCA Codebase
No ratings yet
PCA Codebase
6 pages
ML Practical 3D
No ratings yet
ML Practical 3D
4 pages
KnnClassifier - Jupyter Notebook
No ratings yet
KnnClassifier - Jupyter Notebook
2 pages
AIML Report.
No ratings yet
AIML Report.
12 pages
Name: Mussab Bin Shahid Sap-Id: 2024 Assignment: Machine-Learning
No ratings yet
Name: Mussab Bin Shahid Sap-Id: 2024 Assignment: Machine-Learning
5 pages
mnbnmnbnnmbbhhuyrgh
No ratings yet
mnbnmnbnnmbbhhuyrgh
3 pages
Lab 8
No ratings yet
Lab 8
7 pages
LAB-4 Report
No ratings yet
LAB-4 Report
21 pages
KNN Rainfall
No ratings yet
KNN Rainfall
9 pages
Knn
No ratings yet
Knn
7 pages
Final
No ratings yet
Final
13 pages
omml
No ratings yet
omml
1 page
Loading The Dataset: 'Diabetes - CSV'
No ratings yet
Loading The Dataset: 'Diabetes - CSV'
4 pages
Experiment 7 Ids
No ratings yet
Experiment 7 Ids
12 pages
Experiment 4
No ratings yet
Experiment 4
8 pages
Openlab1
No ratings yet
Openlab1
17 pages
Knn
No ratings yet
Knn
4 pages
Knn
No ratings yet
Knn
3 pages
BTVN4_code
No ratings yet
BTVN4_code
3 pages
ML Lab Manual
No ratings yet
ML Lab Manual
12 pages
C5
No ratings yet
C5
3 pages
CP4252 Machine Learning Lab Manual
No ratings yet
CP4252 Machine Learning Lab Manual
33 pages
Machine Learning Assignment (1)
No ratings yet
Machine Learning Assignment (1)
8 pages
1 KNN - Jupyter Notebook
No ratings yet
1 KNN - Jupyter Notebook
3 pages
Unit2 ML Programs
No ratings yet
Unit2 ML Programs
7 pages
Assignment #1: K Nearest Neighbor Classifier: Name: Srikanth Mujjiga (Roll No: 2015-50-831
No ratings yet
Assignment #1: K Nearest Neighbor Classifier: Name: Srikanth Mujjiga (Roll No: 2015-50-831
8 pages
ML Lab
No ratings yet
ML Lab
7 pages
machine-learning-assignment (1)
No ratings yet
machine-learning-assignment (1)
7 pages
Practical 4
No ratings yet
Practical 4
2 pages
ML-journal
No ratings yet
ML-journal
45 pages
ML
No ratings yet
ML
11 pages
Python for Data Science IA 1 Programs
No ratings yet
Python for Data Science IA 1 Programs
14 pages
KNN (1)
No ratings yet
KNN (1)
2 pages
Lab 5
No ratings yet
Lab 5
2 pages
Python for Data Science IA 1 Programs
No ratings yet
Python for Data Science IA 1 Programs
14 pages
Experiment 4
No ratings yet
Experiment 4
5 pages
Vertopal.com Experiment01 Baseline Models Accuracy
No ratings yet
Vertopal.com Experiment01 Baseline Models Accuracy
35 pages
M.E MACHINE LEARNING -CP4252 LAB MANUAL4716718074353656238
No ratings yet
M.E MACHINE LEARNING -CP4252 LAB MANUAL4716718074353656238
26 pages
LabProgram 8 K-Nearest Neighbour Classifier
No ratings yet
LabProgram 8 K-Nearest Neighbour Classifier
3 pages
Implementing KNN Algorithm on the Iris Dataset
No ratings yet
Implementing KNN Algorithm on the Iris Dataset
7 pages
Machine Learning With Python - Machine Learning Algorithms - KNN
No ratings yet
Machine Learning With Python - Machine Learning Algorithms - KNN
15 pages
16BCB0126 VL2018195002535 Pe003
No ratings yet
16BCB0126 VL2018195002535 Pe003
40 pages
DATA SCI ex12 KNN-correct and wrong predictions
No ratings yet
DATA SCI ex12 KNN-correct and wrong predictions
1 page
Assignment 1
No ratings yet
Assignment 1
17 pages
ML practical Kiranjot 6-10
No ratings yet
ML practical Kiranjot 6-10
10 pages
Aiml Ex 4-7
No ratings yet
Aiml Ex 4-7
8 pages
Is Lab 7
No ratings yet
Is Lab 7
7 pages
ML Lab
No ratings yet
ML Lab
7 pages
St. John College of Engineering and Management, Palghar - Maharashtra
No ratings yet
St. John College of Engineering and Management, Palghar - Maharashtra
11 pages
Exp 5
No ratings yet
Exp 5
7 pages
20-SE-66 ML Assign 2
No ratings yet
20-SE-66 ML Assign 2
4 pages
22104057_Prakhar_Week 5
No ratings yet
22104057_Prakhar_Week 5
8 pages
A Book of Numbers
From Everand
A Book of Numbers
Maria Morisot
No ratings yet
A List of Factorial Math Constants
From Everand
A List of Factorial Math Constants
StreetLib
No ratings yet
Av Med Visual Illusion & Superstall
No ratings yet
Av Med Visual Illusion & Superstall
11 pages
Questions On Set Theory (79432)
No ratings yet
Questions On Set Theory (79432)
10 pages
Stucke 2003
100% (1)
Stucke 2003
14 pages
Anthropic_Claude_Code_Best_Practices_1745281865
No ratings yet
Anthropic_Claude_Code_Best_Practices_1745281865
30 pages
02 Dasar Machine Learning 02 - Supervised Vs Unsupervised
100% (1)
02 Dasar Machine Learning 02 - Supervised Vs Unsupervised
25 pages
MARK SCHEME For The June 2005 Question Paper 5090 BIOLOGY
No ratings yet
MARK SCHEME For The June 2005 Question Paper 5090 BIOLOGY
2 pages
1 s2.0 S1877705812029979 Main
No ratings yet
1 s2.0 S1877705812029979 Main
10 pages
Instruction Manual: Universal Vibration Monitor
No ratings yet
Instruction Manual: Universal Vibration Monitor
39 pages
Search Function Manual: Motoman XRC Controller
No ratings yet
Search Function Manual: Motoman XRC Controller
38 pages
SAP BW - Virtual Characteristic (Multiprovider & Infoset) - RSR - OLAP - BADI
No ratings yet
SAP BW - Virtual Characteristic (Multiprovider & Infoset) - RSR - OLAP - BADI
21 pages
New Check Sheet Press Line
No ratings yet
New Check Sheet Press Line
6 pages
Comprehensive Book On Glaciology, A (Helm, Nakita)
No ratings yet
Comprehensive Book On Glaciology, A (Helm, Nakita)
111 pages
File Information: Drive Information: Torque/Force Foldback Information
No ratings yet
File Information: Drive Information: Torque/Force Foldback Information
2 pages
Dynamic Analysis of G + 20 Multi Storied Building by Using Shear Walls
No ratings yet
Dynamic Analysis of G + 20 Multi Storied Building by Using Shear Walls
6 pages
2024 - Unit Outline - Linear Algebra
No ratings yet
2024 - Unit Outline - Linear Algebra
2 pages
Perfiles
No ratings yet
Perfiles
4 pages
Class 6 Maths
No ratings yet
Class 6 Maths
2 pages
Introduction To Vars and Structural Vars:: Estimation & Tests Using Stata
100% (1)
Introduction To Vars and Structural Vars:: Estimation & Tests Using Stata
69 pages
Nitrogen Compounds - Optical Isomerism: AS Organic Chemistry: Alkenes
No ratings yet
Nitrogen Compounds - Optical Isomerism: AS Organic Chemistry: Alkenes
3 pages
Theory of Machine Practicals
100% (2)
Theory of Machine Practicals
24 pages
Powerwave Antenna Guide PDF
No ratings yet
Powerwave Antenna Guide PDF
236 pages
Pengaruh Pembelajaran Project Based Learning Terhadap Keterampilan Psikomotorik Dan Hasil Belajar Praktek Proyek Work
No ratings yet
Pengaruh Pembelajaran Project Based Learning Terhadap Keterampilan Psikomotorik Dan Hasil Belajar Praktek Proyek Work
10 pages
Lecture 11 - COGENERATION
No ratings yet
Lecture 11 - COGENERATION
30 pages
Ce742 A2 PDF
No ratings yet
Ce742 A2 PDF
2 pages
Sample Cooler Application Note
No ratings yet
Sample Cooler Application Note
2 pages
C1681 Curva Perkins T0 DT130
No ratings yet
C1681 Curva Perkins T0 DT130
2 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

SPPUML5

Uploaded by

SPPUML5

Uploaded by

14/08/2024, 15:56 ML-5 - Jupyter Notebook

Name : Kanase Aditya Madhukar

In [1]: import pandas as pd

0 6 148 72 35 0 33.6 0.627 50 1

2 8 183 64 0 0 23.3 0.672 32 1

4 0 137 40 35 168 43.1 2.288 33 1

763 10 101 76 48 180 32.9 0.171 63 0

764 2 122 70 27 0 36.8 0.340 27 0

765 5 121 72 23 112 26.2 0.245 30 0

766 1 126 60 0 0 30.1 0.349 47 1

767 1 93 70 31 0 30.4 0.315 23 0

In [6]: X = df.drop("Outcome", axis=1)

0 6 148 72 35 0 33.6 0.627 50

2 8 183 64 0 0 23.3 0.672 32

4 0 137 40 35 168 43.1 2.288 33

... ... ... ... ... ... ... ... ...

763 10 101 76 48 180 32.9 0.171 63

764 2 122 70 27 0 36.8 0.340 27

765 5 121 72 23 112 26.2 0.245 30

766 1 126 60 0 0 30.1 0.349 47

767 1 93 70 31 0 30.4 0.315 23

768 rows × 8 columns

In [8]: X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42

In [9]: scaler = StandardScaler()

Out[10]: array([[-0.52639686, -1.15139792, -3.75268255, ..., -4.13525578,

In [12]: y_pred = knn.predict(X_test)

In [13]: conf_matrix = confusion_matrix(y_test, y_pred)

accuracy = accuracy_score(y_test, y_pred)

In [14]: model = SVC(C = 1)

In [15]: kc = metrics.confusion_matrix(y_test, y_pred)

SVM accuracy: [[82 17]

SVM accuracy: 0.7337662337662337

In [20]: acc = metrics.accuracy_score(y_test,y_pred)

Logistic Regression accuracy: 0.7532467532467533

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.