KNN For Classification

Uploaded by

snehalkotar1153


Name : Snehal Kotkar Div : A Roll No. : 46

Practical No. : 2

Problem Statement : Build a machine learning model using the k-Nearest
Neighbors algorithm to predict whether the patients in the "Pima Indians Diabetes Dataset"
have diabetes or not.
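As background for the practical, the idea behind k-Nearest Neighbors can be sketched in a few lines: a query point takes the majority label of its k closest training points. The function `knn_predict` and the toy arrays below are hypothetical and only illustrate the idea; the notebook itself uses scikit-learn's `KNeighborsClassifier`.

```python
import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x_query, k=3):
    """Classify one query point by majority vote among its k nearest neighbors."""
    # Euclidean distance from the query to every training point
    distances = np.linalg.norm(X_train - x_query, axis=1)
    # Indices of the k smallest distances
    nearest = np.argsort(distances)[:k]
    # Most common label among those neighbors
    votes = Counter(y_train[nearest].tolist())
    return votes.most_common(1)[0][0]

# Toy data (hypothetical): two well-separated clusters
X_toy = np.array([[0.0, 0.0], [0.1, 0.2], [0.2, 0.1],
                  [5.0, 5.0], [5.1, 4.9], [4.9, 5.2]])
y_toy = np.array([0, 0, 0, 1, 1, 1])

pred_near_origin = knn_predict(X_toy, y_toy, np.array([0.3, 0.3]), k=3)
pred_near_five = knn_predict(X_toy, y_toy, np.array([4.8, 5.0]), k=3)
print(pred_near_origin, pred_near_five)
```

Because the vote depends on raw distances, features with large numeric ranges can dominate the result, which is one reason scaling is often considered before fitting k-NN.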

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
plt.style.use('ggplot')

from google.colab import drive

drive.mount('/content/drive')

Drive already mounted at /content/drive; to attempt to forcibly remount, call drive.mount("/content/drive", force_remount=True).

df = pd.read_csv('/content/drive/MyDrive/ML /diabetes.csv')
df.head()

Column summary of df (768 rows), as reported in the Colab dataframe metadata:

Column                    dtype   std                 min    max    unique  samples
Pregnancies               number  3                   0      17     17      6, 1, 3
Glucose                   number  31                  0      199    136     151, 101, 112
BloodPressure             number  19                  0      122    47      86, 46, 85
SkinThickness             number  15                  0      99     51      7, 12, 48
Insulin                   number  115                 0      846    186     52, 41, 183
BMI                       number  7.884160320375446   0.0    67.1   248     19.9, 31.0, 38.1
DiabetesPedigreeFunction  number  0.3313285950127749  0.078  2.42   517     1.731, 0.426, 0.138
Age                       number  11                  21     81     52      60, 47, 72
Outcome                   number  0                   0      1      2       0, 1

df.shape

(768, 9)

df.isnull().sum()

Pregnancies 0
Glucose 0
BloodPressure 0
SkinThickness 0
Insulin 0
BMI 0
DiabetesPedigreeFunction 0
Age 0
Outcome 0
dtype: int64
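The null check reports no missing values, yet the column summary shows a minimum of 0 for Glucose, BloodPressure, SkinThickness, Insulin and BMI, which is not physiologically plausible. A minimal sketch of how such zeros could be counted and marked as missing follows. The frame `demo` is a hypothetical stand-in with made-up values, and treating 0 as "not recorded" is an assumption, not something the notebook does.

```python
import pandas as pd

# Hypothetical miniature frame standing in for df; values are illustrative only
demo = pd.DataFrame({
    "Glucose": [148, 0, 183, 89],
    "BloodPressure": [72, 66, 0, 0],
    "BMI": [33.6, 26.6, 0.0, 28.1],
    "Outcome": [1, 0, 1, 0],
})

# Columns where 0 is assumed to mean "not recorded"
suspect_cols = ["Glucose", "BloodPressure", "BMI"]

# Count zeros per suspect column
zero_counts = (demo[suspect_cols] == 0).sum()

# Mask zeros as NaN so that isnull() reports them
cleaned = demo.copy()
cleaned[suspect_cols] = cleaned[suspect_cols].mask(cleaned[suspect_cols] == 0)
null_counts = cleaned.isnull().sum()

print(zero_counts)
print(null_counts)
```

The Outcome column is deliberately left out of `suspect_cols`, since 0 is a valid class label there.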

X = df.drop('Outcome',axis=1).values
y = df['Outcome'].values

from sklearn.model_selection import train_test_split

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=42, stratify=y)
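The `stratify=y` argument asks `train_test_split` to keep the class proportions roughly equal in both splits. The sketch below checks this on synthetic labels; `X_demo` and `y_demo` are hypothetical and do not come from the diabetes data.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Synthetic labels (hypothetical): 120 of class 0 and 80 of class 1
y_demo = np.array([0] * 120 + [1] * 80)
X_demo = np.arange(len(y_demo)).reshape(-1, 1)

X_tr, X_te, y_tr, y_te = train_test_split(
    X_demo, y_demo, test_size=0.25, random_state=42, stratify=y_demo
)

# Share of class 1 in each split; both should sit close to the overall share
overall_pos_rate = y_demo.mean()
train_pos_rate = y_tr.mean()
test_pos_rate = y_te.mean()
print(overall_pos_rate, train_pos_rate, test_pos_rate)
```

Without stratification a small test set can end up with noticeably more or fewer positive cases than the training set, which makes the accuracy figures harder to compare.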

#import KNeighborsClassifier
from sklearn.neighbors import KNeighborsClassifier

#Setup arrays to store training and test accuracies
neighbors = np.arange(1,15)
train_accuracy = np.empty(len(neighbors))
test_accuracy = np.empty(len(neighbors))

for i,k in enumerate(neighbors):
    #Setup a knn classifier with k neighbors
    knn = KNeighborsClassifier(n_neighbors=k)

    #Fit the model
    knn.fit(X_train, y_train)

    #Compute accuracy on the training set
    train_accuracy[i] = knn.score(X_train, y_train)

    #Compute accuracy on the test set
    test_accuracy[i] = knn.score(X_test, y_test)

#Generate plot
plt.title('k-NN Varying number of neighbors')
plt.plot(neighbors, test_accuracy, label='Testing Accuracy')
plt.plot(neighbors, train_accuracy, label='Training accuracy')
plt.legend()
plt.xlabel('Number of neighbors')
plt.ylabel('Accuracy')
plt.show()
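Reading the best k off the test-accuracy curve uses the test set for model selection, so the reported test accuracy may be optimistic. One common alternative is to pick k by cross-validation on the training split only. The sketch below assumes 5-fold cross-validation and uses synthetic data from `make_classification` as a hypothetical stand-in for the diabetes features; it is not the notebook's own procedure.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score, train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Synthetic stand-in for the diabetes features (hypothetical data)
X_syn, y_syn = make_classification(n_samples=300, n_features=8, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(
    X_syn, y_syn, test_size=0.25, random_state=42, stratify=y_syn
)

k_values = np.arange(1, 15)
cv_means = np.empty(len(k_values))
for i, k in enumerate(k_values):
    # Mean 5-fold accuracy, computed on the training split only
    scores = cross_val_score(KNeighborsClassifier(n_neighbors=k), X_tr, y_tr, cv=5)
    cv_means[i] = scores.mean()

# k with the highest mean cross-validated accuracy
best_k = int(k_values[np.argmax(cv_means)])

# The held-out test set is used once, after k is fixed
final_model = KNeighborsClassifier(n_neighbors=best_k).fit(X_tr, y_tr)
test_acc = final_model.score(X_te, y_te)
print(best_k, test_acc)
```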
#Setup a knn classifier with k neighbors
knn = KNeighborsClassifier(n_neighbors=4)

#Fit the model

knn.fit(X_train,y_train)

KNeighborsClassifier(n_neighbors=4)

#Get accuracy. Note: In case of classification algorithms, the score method represents accuracy.
knn.score(X_test,y_test)

0.7291666666666666
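With a 25% split of 768 rows the test set has 192 samples, so this score corresponds to 140 correct predictions out of 192. The sketch below shows, on a tiny hypothetical dataset, that `score` for a classifier matches `accuracy_score` and the plain fraction of correct predictions. The arrays `X_small`, `y_small`, `X_new` and `y_new` are made up for illustration.

```python
import numpy as np
from sklearn.metrics import accuracy_score
from sklearn.neighbors import KNeighborsClassifier

# Tiny hypothetical dataset, used only to compare the three calculations
X_small = np.array([[0.0], [1.0], [2.0], [3.0], [10.0], [11.0], [12.0], [13.0]])
y_small = np.array([0, 0, 0, 0, 1, 1, 1, 1])
X_new = np.array([[1.5], [2.5], [11.5], [4.0]])
y_new = np.array([0, 0, 1, 1])  # last label deliberately disagrees with its neighbors

model = KNeighborsClassifier(n_neighbors=3).fit(X_small, y_small)

score_value = model.score(X_new, y_new)                     # built-in score
manual_value = accuracy_score(y_new, model.predict(X_new))  # metrics helper
fraction_correct = np.mean(model.predict(X_new) == y_new)   # by hand
print(score_value, manual_value, fraction_correct)
```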

#let us get the predictions using the classifier we had fit above
y_pred = knn.predict(X_test)

y_pred

array([0, 0, 0, 0, 1, 0, 0, 0, 1, 0, 0, 1, 0, 0, 0, 1, 1, 0, 0, 0, 0,
0,
0, 1, 1, 0, 0, 0, 0, 1, 1, 0, 0, 0, 1, 0, 0, 0, 0, 0, 1, 0, 0,
0,
1, 0, 0, 0, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 1, 1, 0, 0, 0, 0, 1, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 1, 0, 0, 0, 0, 0, 1, 0, 0, 1, 1,
0,
1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 1, 0, 0, 1, 0, 0, 0,
1,
0, 0, 0, 1, 0, 0, 0, 1, 0, 1, 1, 0, 0, 0, 0, 0])
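A raw prediction array is hard to judge by eye, and accuracy alone does not show how many diabetic patients were missed. A possible next step is a confusion matrix and per-class report. The labels below are hypothetical and are not the notebook's `y_test` or `y_pred`; in the notebook the same calls would take those two arrays instead.

```python
import numpy as np
from sklearn.metrics import classification_report, confusion_matrix

# Hypothetical labels and predictions; not the notebook's actual values
y_true_demo = np.array([0, 0, 0, 0, 1, 1, 1, 0, 1, 0])
y_pred_demo = np.array([0, 0, 1, 0, 1, 0, 1, 0, 0, 0])

# Rows are true classes, columns are predicted classes
cm = confusion_matrix(y_true_demo, y_pred_demo)
tn, fp, fn, tp = cm.ravel()

# Recall for the positive class: share of true positives that were found
recall_positive = tp / (tp + fn)

print(cm)
print(classification_report(y_true_demo, y_pred_demo))
```

For a screening task like diabetes prediction, the false negatives (`fn`) are often the more costly errors, so recall for class 1 is worth reporting next to accuracy.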
