Chapter II - Lecture 2 - KNN

This document provides an overview of the k-nearest neighbors (k-NN) algorithm. It explains that k-NN is a simple supervised learning algorithm that stores all training examples and classifies new examples based on their similarity to existing examples. The document outlines the k-NN classification process, discusses how to choose the k value for k nearest neighbors, provides examples of k-NN classification, and reviews the strengths and weaknesses of the k-NN algorithm.


Chapter II:

Supervised Learning
Lecture 2.2
Road Map
◼ Introduction
◼ Generalization, Overfitting, and Underfitting
◼ Some Sample Datasets
◼ Supervised Machine Learning Algorithms
◼ k-Nearest Neighbors
◼ Linear Models
◼ Naive Bayes Classifiers
◼ Decision Trees
◼ Support Vector Machines

What is k-NN?
◼ A powerful classification algorithm used in pattern recognition.
◼ k-NN stores all available cases and classifies new cases based on a similarity measure (e.g., a distance function).
◼ One of the most widely used data mining algorithms today.
◼ A non-parametric, lazy learning algorithm (an instance-based learning method).

k-Nearest Neighbor
◼ The k-NN algorithm is arguably the simplest machine learning algorithm.
◼ To make a prediction for a new data point, the algorithm finds the closest data points in the training dataset, its “nearest neighbors.”
◼ Given an input, the k-NN algorithm chooses the most common class among the k nearest data points to that input.

Simple Analogy!

k-Nearest Neighbor Classification (kNN)
◼ Unlike most other learning methods, kNN does not build a model from the training data.
◼ To classify a test instance d, define the k-neighborhood P as the k nearest neighbors of d.
◼ Count the number nj of training instances in P that belong to class cj, and assign d to the class with the largest count (i.e., estimate Pr(cj | d) ≈ nj / k).
◼ No training is needed. Classification time is linear in the training set size for each test case.
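As a concrete illustration of this decision rule (not from the original slides), here is a minimal from-scratch sketch assuming numeric feature vectors, Euclidean distance, and a simple majority vote:

```python
import numpy as np
from collections import Counter

def knn_classify(X_train, y_train, d, k=3):
    """Classify a single test instance d by a majority vote over its k nearest neighbors."""
    # Euclidean distance from d to every training instance
    distances = np.linalg.norm(X_train - d, axis=1)
    # Indices of the k closest training instances (the k-neighborhood P)
    neighbors = np.argsort(distances)[:k]
    # Most common class label among those neighbors
    return Counter(y_train[neighbors]).most_common(1)[0][0]

# Tiny illustrative dataset (hypothetical values)
X_train = np.array([[1.0, 1.0], [1.2, 0.8], [5.0, 5.0], [5.2, 4.8]])
y_train = np.array([0, 0, 1, 1])
print(knn_classify(X_train, y_train, np.array([1.1, 0.9]), k=3))  # -> 0
```

Note that nothing is learned up front: all the work happens at classification time, which is what makes kNN a lazy, instance-based method.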

k-NN Classification Process

kNN Algorithm

◼ k is usually chosen empirically via a validation set or cross-validation, by trying a range of k values.
◼ The distance function is crucial, but depends on the application.
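A hedged sketch of this model-selection step, assuming scikit-learn and training data already stored in X_train and y_train:

```python
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

# Try a range of k values and keep the one with the best cross-validated accuracy
best_k, best_score = 1, 0.0
for k in range(1, 16):
    scores = cross_val_score(KNeighborsClassifier(n_neighbors=k), X_train, y_train, cv=5)
    if scores.mean() > best_score:
        best_k, best_score = k, scores.mean()
print("best k:", best_k, "cross-validated accuracy:", round(best_score, 3))
```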
Example: k = 6 (6-NN)
(Figure: documents from three classes (Government, Science, and Arts) plotted as points; for a new point, Pr(science | new point) is estimated from its six nearest neighbors.)

Cont..

◼ The k nearest neighbors of a record x are the data points that have the k smallest distances to x.

How to choose K?
◼ If K is too small, the classifier is sensitive to noise points.
◼ A larger K works well, but if K is too large, the neighborhood may include many points from other classes.
◼ A rule of thumb is K < sqrt(n), where n is the number of training points (e.g., with n = 100 training points, use K < 10).

k-NN Example!
◼ In its simplest version, the k-NN algorithm considers exactly one nearest neighbor: the closest training data point to the point we want to make a prediction for.
◼ The prediction is then simply the known output for this training point. The figure below illustrates this for classification on the forge dataset:

Input:
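The code screenshot on the original slide did not survive extraction. A plausible reconstruction, assuming the mglearn helper package (which provides the forge dataset and this plotting utility), is:

```python
import mglearn
import matplotlib.pyplot as plt

# Plot the forge dataset and the 1-nearest-neighbor prediction for three new points
mglearn.plots.plot_knn_classification(n_neighbors=1)
plt.show()
```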

Cont..
◼ Here, we added three new data points, shown as stars. For each of them, we marked the closest point in the training set. The prediction of the one-nearest-neighbor algorithm is the label of that point (shown by the color of the cross).
◼ Instead of considering only the closest neighbor, we can also consider an arbitrary number, k, of neighbors. This is where the name of the k-nearest neighbors algorithm comes from. The following example uses the three closest neighbors:
Input:
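Again the screenshot is missing; under the same assumption about the mglearn helper, the three-neighbor version would be:

```python
# Same plot, but predictions now use the three closest neighbors
mglearn.plots.plot_knn_classification(n_neighbors=3)
plt.show()
```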

Cont..
◼ Again, the prediction is shown as the color of the cross. You can see that the prediction for the new data point at the top left is not the same as the prediction when we used only one neighbor.
◼ Now let’s look at how we can apply the k-nearest neighbors algorithm using scikit-learn.
◼ First, we split our data into a training and a test set so we can evaluate generalization performance, as discussed earlier in this chapter.

Cont..
◼ Input:
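The first screenshot most likely generates the forge data and splits it; a hedged reconstruction assuming mglearn and scikit-learn:

```python
from sklearn.model_selection import train_test_split
import mglearn

# Generate the forge dataset and split it into training and test sets
X, y = mglearn.datasets.make_forge()
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
```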

◼ Input:
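The second screenshot most likely imports and instantiates the classifier; a minimal sketch:

```python
from sklearn.neighbors import KNeighborsClassifier

# Instantiate the estimator, setting the number of neighbors to 3
clf = KNeighborsClassifier(n_neighbors=3)
```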

Cont..
◼ The train_test_split() function is used to split the dataset into train and test sets. By default, the function shuffles the data (with shuffle=True) before splitting.
◼ The random_state parameter of the train_test_split() function controls the shuffling process.
◼ With random_state=None, we get different train and test sets across different executions, and the shuffling is not reproducible.
◼ With random_state=0, we get the same train and test sets across different executions.
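A small sketch of this reproducibility behavior, assuming the X and y arrays from the earlier split:

```python
from sklearn.model_selection import train_test_split

# Same random_state -> identical, reproducible splits
a_train, a_test, _, _ = train_test_split(X, y, random_state=0)
b_train, b_test, _, _ = train_test_split(X, y, random_state=0)
assert (a_train == b_train).all() and (a_test == b_test).all()

# random_state=None (the default) may shuffle differently on every call
c_train, _, _, _ = train_test_split(X, y, random_state=None)
```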
Cont..
◼ Input:

◼ Input:
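The two missing screenshots on this slide most likely fit the classifier on the training set and then predict on the test set; a hedged reconstruction (the printed output would be the array of predicted class labels, whose exact values depend on the data):

```python
# Fit the classifier on the training set (for k-NN this just stores the data)
clf.fit(X_train, y_train)

# Predict class labels for the test set
print("Test set predictions:", clf.predict(X_test))
```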

◼ Output:

Cont..
◼ Input:
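The missing screenshot here most likely evaluates how well the model generalizes; a minimal sketch using the score method, which reports test-set accuracy:

```python
# Fraction of test examples for which the correct class was predicted
print("Test set accuracy: {:.2f}".format(clf.score(X_test, y_test)))
```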

◼ Output:

Strengths and Weaknesses of KNN
◼ Strengths of KNN
- Very simple and intuitive.
- Can be applied to data from any distribution.
- Gives good classification if the number of samples is large enough.
◼ Weaknesses of KNN
- Takes more time to classify new examples: the distance from each new example to all training examples must be calculated and compared.
- Choosing K may be tricky.
- Needs a large number of samples for good accuracy.

Discussions
◼ kNN can deal with complex and arbitrary decision boundaries.
◼ Despite its simplicity, researchers have shown that the classification accuracy of kNN can be quite strong and, in many cases, as accurate as that of more elaborate methods.
◼ kNN is slow at classification time.
◼ kNN does not produce an understandable model.

Thank You!

