Instance-based learning involves storing all training examples and classifying new examples based on their similarity to stored examples. The k-nearest neighbors algorithm is a common instance-based learning method where the class is determined by the majority class of the k closest training examples. Key considerations for k-NN include how to measure similarity, choosing an appropriate value for k, and addressing the curse of dimensionality from irrelevant features. Distance weighting and feature selection help k-NN perform well even with many irrelevant features.


Foundations of Machine Learning
Module 3: Instance Based Learning and Feature Selection
Part D: Instance Based Learning

Sudeshna Sarkar
IIT Kharagpur
Instance-Based Learning
• One way of approximating discrete- or real-valued target functions
• Training examples: (x_n, f(x_n)), n = 1..N
• Key idea:
  – just store the training examples
  – when a test example is given, find the closest matches
Inductive Assumption

• Similar inputs map to similar outputs
  – If not true => learning is impossible
  – If true => learning reduces to defining “similar”

• Not all similarities are created equal
  – predicting a person’s weight may depend on different attributes than predicting their IQ
Basic k-nearest neighbor classification

• Training method:
– Save the training examples
• At prediction time:
– Find the k training examples (x1,y1),…(xk,yk) that
are closest to the test example x
– Predict the most frequent class among those yi’s.

• Example:
http://cgm.cs.mcgill.ca/~soss/cs644/projects/simard/

What is the decision boundary?
• For 1-NN, the training points induce a Voronoi diagram: each training point owns the region of input space closer to it than to any other training point, and the decision boundary follows the Voronoi edges between points of different classes.
[Figure: Voronoi diagram of the training set]
Basic k-nearest neighbor classification

• Training method:
– Save the training examples
• At prediction time:
– Find the k training examples (x1,y1),…(xk,yk) that are closest
to the test example x
– Predict the most frequent class among those yi’s.

• Improvements:
– Weighting examples from the neighborhood
– Measuring “closeness”
– Finding “close” examples in a large training set quickly

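As a concrete illustration of the store-then-vote procedure above, here is a minimal Python/NumPy sketch. The names (`knn_classify`, `train_X`, `train_y`) and the toy data are illustrative, not from the slides.

```python
import numpy as np
from collections import Counter

def knn_classify(train_X, train_y, x, k=3):
    """Basic k-NN: keep the training examples, then predict the most
    frequent class among the k stored examples closest to x."""
    dists = np.sqrt(((train_X - x) ** 2).sum(axis=1))  # Euclidean distance to every stored example
    nearest = np.argsort(dists)[:k]                    # indices of the k closest examples
    return Counter(train_y[i] for i in nearest).most_common(1)[0][0]  # majority vote

# Tiny illustration: two classes in 2-D
train_X = np.array([[0.0, 0.0], [0.1, 0.2], [1.0, 1.0], [0.9, 1.1]])
train_y = np.array(["o", "o", "+", "+"])
print(knn_classify(train_X, train_y, np.array([0.8, 0.9]), k=3))  # -> "+"
```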
k-Nearest Neighbor

$$\mathrm{Dist}(c_1, c_2) = \sqrt{\sum_{i=1}^{N} \bigl(\mathrm{attr}_i(c_1) - \mathrm{attr}_i(c_2)\bigr)^2}$$

$$k\text{-NearestNeighbors} = \text{the } k \text{ examples } c_i \text{ with minimum } \mathrm{Dist}(c_i, c_{\mathit{test}})$$

$$\mathrm{prediction}_{\mathit{test}} = \frac{1}{k}\sum_{i=1}^{k} \mathrm{class}_i \quad\left(\text{or } \frac{1}{k}\sum_{i=1}^{k} \mathrm{value}_i\right)$$

• The average of k points is more reliable when:
  – there is noise in the attributes
  – there is noise in the class labels
  – the classes partially overlap
[Figure: two partially overlapping classes (+ and o) in the attribute_1 / attribute_2 plane]
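The same machinery gives a regression-style prediction by averaging the neighbors' values, matching the (1/k)·Σ value_i form above. A minimal sketch, with illustrative names and toy data:

```python
import numpy as np

def knn_regress(train_X, train_y, x, k=3):
    """Predict the mean value of the k stored examples nearest to x."""
    dists = np.sqrt(((train_X - x) ** 2).sum(axis=1))  # Dist(c_i, c_test) over all N attributes
    nearest = np.argsort(dists)[:k]                    # the k nearest neighbors
    return train_y[nearest].mean()                     # (1/k) * sum of value_i

# Noisy samples of y ~ 2x
train_X = np.array([[0.0], [1.0], [2.0], [3.0], [4.0]])
train_y = np.array([0.1, 2.2, 3.9, 6.1, 8.0])
print(knn_regress(train_X, train_y, np.array([2.5]), k=2))  # ~5.0
```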
How to choose “k”

• Large k:
– less sensitive to noise (particularly class noise)
– better probability estimates for discrete classes
– larger training sets allow larger values of k
• Small k:
– captures fine structure of problem space better
– may be necessary with small training sets
• Balance must be struck between large and small k
• As the training set approaches infinity and k also grows (while remaining a vanishing fraction of the training-set size), kNN becomes Bayes optimal
[Figures: from Hastie, Tibshirani & Friedman (2001), pp. 418–419]
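One practical way to strike this balance (not prescribed on the slide, but a common choice) is to pick k by leave-one-out evaluation on the training set. A rough sketch, reusing a basic classifier like the one sketched earlier; all names are illustrative:

```python
import numpy as np
from collections import Counter

def knn_classify(train_X, train_y, x, k):
    dists = np.sqrt(((train_X - x) ** 2).sum(axis=1))
    nearest = np.argsort(dists)[:k]
    return Counter(train_y[i] for i in nearest).most_common(1)[0][0]

def choose_k(train_X, train_y, candidate_ks=(1, 3, 5, 7, 9)):
    """Return the candidate k with the highest leave-one-out accuracy."""
    best_k, best_acc = None, -1.0
    n = len(train_X)
    for k in candidate_ks:
        correct = 0
        for i in range(n):
            mask = np.arange(n) != i                   # hold out example i
            pred = knn_classify(train_X[mask], train_y[mask], train_X[i], k)
            correct += int(pred == train_y[i])
        acc = correct / n
        if acc > best_acc:
            best_k, best_acc = k, acc
    return best_k
```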
Distance-Weighted kNN

• The tradeoff between small and large k can be difficult
  – use a large k, but put more emphasis on nearer neighbors?

$$\mathrm{prediction}_{\mathit{test}} = \frac{\sum_{i=1}^{k} w_i \cdot \mathrm{class}_i}{\sum_{i=1}^{k} w_i} \quad\left(\text{or } \frac{\sum_{i=1}^{k} w_i \cdot \mathrm{value}_i}{\sum_{i=1}^{k} w_i}\right)$$

$$w_k = \frac{1}{\mathrm{Dist}(c_k, c_{\mathit{test}})}$$
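A sketch of the weighted vote above with w_i = 1/Dist(c_i, c_test); the epsilon guard against zero distances and the names are my own additions:

```python
import numpy as np
from collections import defaultdict

def distance_weighted_knn(train_X, train_y, x, k=5, eps=1e-12):
    """Each of the k nearest neighbors votes for its class with weight 1/Dist."""
    dists = np.sqrt(((train_X - x) ** 2).sum(axis=1))
    votes = defaultdict(float)
    for i in np.argsort(dists)[:k]:
        votes[train_y[i]] += 1.0 / (dists[i] + eps)    # w_i = 1 / Dist(c_i, c_test)
    # The normalization by sum(w_i) cancels when taking the argmax over classes.
    return max(votes, key=votes.get)
```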
Locally Weighted Averaging

• Let k = number of training points
• Let the weight fall off rapidly with distance

$$\mathrm{prediction}_{\mathit{test}} = \frac{\sum_{i=1}^{k} w_i \cdot \mathrm{class}_i}{\sum_{i=1}^{k} w_i} \quad\left(\text{or } \frac{\sum_{i=1}^{k} w_i \cdot \mathrm{value}_i}{\sum_{i=1}^{k} w_i}\right)$$

$$w_k = \frac{1}{e^{\,\mathrm{KernelWidth}\,\cdot\,\mathrm{Dist}(c_k,\, c_{\mathit{test}})}}$$

• KernelWidth controls the size of the neighborhood that has a large effect on the value (analogous to k)
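A sketch of the weighted average over all training points with w_i = e^{-KernelWidth·Dist(c_i, c_test)}; the `kernel_width` argument name is illustrative:

```python
import numpy as np

def locally_weighted_average(train_X, train_y, x, kernel_width=1.0):
    """Weighted average of ALL training values; weights decay exponentially
    with distance, so kernel_width plays the role that k plays in kNN."""
    dists = np.sqrt(((train_X - x) ** 2).sum(axis=1))
    w = np.exp(-kernel_width * dists)          # w_i = 1 / e^(KernelWidth * Dist)
    return (w * train_y).sum() / w.sum()       # sum(w_i * value_i) / sum(w_i)
```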
Locally Weighted Regression
• All algorithms so far are strict averagers: they can interpolate, but cannot extrapolate
• Instead, do a weighted regression centered at the test point, with the weights controlled by distance and KernelWidth
• The local regressor can be linear, quadratic, an n-th degree polynomial, a neural net, …
• Yields a piecewise approximation to the surface that is typically more complex than the local regressor
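A sketch of one such local regressor: a weighted least-squares linear fit centered at the test point, using the same exponential kernel weights. The tiny ridge term for numerical stability and all names are my own additions:

```python
import numpy as np

def lwr_predict(train_X, train_y, x, kernel_width=1.0):
    """Fit a linear model by weighted least squares, with weights centered
    on the query x, then evaluate the fitted line at x."""
    dists = np.sqrt(((train_X - x) ** 2).sum(axis=1))
    w = np.exp(-kernel_width * dists)                        # kernel weights
    A = np.hstack([np.ones((len(train_X), 1)), train_X])     # prepend an intercept column
    W = np.diag(w)
    # Solve (A^T W A) beta = A^T W y  (small ridge term keeps it well-conditioned)
    beta = np.linalg.solve(A.T @ W @ A + 1e-8 * np.eye(A.shape[1]),
                           A.T @ W @ train_y)
    return float(np.concatenate(([1.0], x)) @ beta)
```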
Euclidean Distance

$$D(c_1, c_2) = \sqrt{\sum_{i=1}^{N} \bigl(\mathrm{attr}_i(c_1) - \mathrm{attr}_i(c_2)\bigr)^2}$$

• Gives all attributes equal weight?
  – only if the scales of the attributes and of their differences are similar
  – scale attributes to equal range or equal variance
• Assumes spherical classes
[Figure: two roughly spherical classes (o and +) in the attribute_1 / attribute_2 plane]
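A common way to give attributes comparable scales before applying Euclidean distance is to standardize each attribute to zero mean and unit variance using training-set statistics. A short sketch (names are illustrative):

```python
import numpy as np

def standardize(train_X, test_X):
    """Rescale every attribute to zero mean and unit variance,
    estimating the statistics on the training set only."""
    mean = train_X.mean(axis=0)
    std = train_X.std(axis=0)
    std[std == 0] = 1.0                 # leave constant attributes unscaled
    return (train_X - mean) / std, (test_X - mean) / std
```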
Euclidean Distance?
[Figures: two example class layouts in the attribute_1 / attribute_2 plane for which plain Euclidean distance is a poor fit]

• What if the classes are not spherical?
• What if some attributes are more or less important than other attributes?
• What if some attributes have more or less noise in them than other attributes?
Weighted Euclidean Distance

$$D(c_1, c_2) = \sqrt{\sum_{i=1}^{N} w_i \cdot \bigl(\mathrm{attr}_i(c_1) - \mathrm{attr}_i(c_2)\bigr)^2}$$

• Large weight => attribute is more important
• Small weight => attribute is less important
• Zero weight => attribute doesn’t matter

• Weights allow kNN to be effective with axis-parallel elliptical classes
• Where do the weights come from?
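A direct sketch of the weighted distance above; where the weights `w` come from (information gain, validation-set tuning, feature selection) is left open, as on the slide, and the example values are arbitrary:

```python
import numpy as np

def weighted_euclidean(c1, c2, w):
    """D(c1, c2) = sqrt( sum_i w_i * (attr_i(c1) - attr_i(c2))^2 ).
    A zero weight removes attribute i; a large weight makes it dominate."""
    return float(np.sqrt((w * (c1 - c2) ** 2).sum()))

# Example: the second attribute is judged twice as important as the first
print(weighted_euclidean(np.array([1.0, 2.0]), np.array([2.0, 4.0]), np.array([1.0, 2.0])))
```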
Curse of Dimensionality

• as number of dimensions increases, distance between points becomes larger and more uniform
• if number of relevant attributes is fixed, increasing the number of less relevant attributes may swamp
distance

• when more irrelevant than relevant dimensions, distance becomes less reliable
• solutions: larger k or KernelWidth, feature selection, feature weights, more complex distance functions

$$D(c_1, c_2) = \sqrt{\sum_{i=1}^{\mathrm{relevant}} \bigl(\mathrm{attr}_i(c_1) - \mathrm{attr}_i(c_2)\bigr)^2 \;+\; \sum_{j=1}^{\mathrm{irrelevant}} \bigl(\mathrm{attr}_j(c_1) - \mathrm{attr}_j(c_2)\bigr)^2}$$
K-NN and irrelevant features
[Figure: + and o training examples and a query point (?) laid out along a single relevant feature]

K-NN and irrelevant features
[Figure: the same examples scattered across a second, irrelevant feature]

K-NN and irrelevant features
[Figure: another scattering of the examples across the irrelevant feature]
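The effect illustrated in the last few slides is easy to reproduce numerically: appending purely random attributes to a single relevant one makes the query's nearest neighbors increasingly arbitrary. A small simulation sketch (all sizes, seeds, and names are arbitrary choices of mine):

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200
relevant = rng.uniform(0, 1, size=(n, 1))            # one informative attribute
labels = (relevant[:, 0] > 0.5).astype(int)          # the class depends only on it
query_rel = np.array([0.9])                          # a query clearly in class 1

for n_irrelevant in (0, 2, 10, 50):
    X = np.hstack([relevant, rng.uniform(0, 1, size=(n, n_irrelevant))])
    q = np.concatenate([query_rel, rng.uniform(0, 1, size=n_irrelevant)])
    nearest = np.argsort(np.sqrt(((X - q) ** 2).sum(axis=1)))[:7]
    agree = labels[nearest].mean()                   # fraction of 7-NN with the true class
    print(f"{n_irrelevant:3d} irrelevant attributes -> {agree:.2f} of the 7-NN are class 1")
```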
Ways of rescaling for KNN
• Normalized L1 distance
• Scale attributes by information gain (IG)
• Modified value difference metric (MVDM)
Ways of rescaling for KNN
• Dot product: $x \cdot x' = \sum_i x_i\, x'_i$
• Cosine distance: $1 - \dfrac{x \cdot x'}{\lVert x\rVert\,\lVert x'\rVert}$
• TF-IDF weights for text: for doc $j$ and term $i$, $x_i = \mathit{tf}_{i,j} \cdot \mathit{idf}_i$, where $\mathit{tf}_{i,j}$ is the number of occurrences of term $i$ in doc $j$ and $\mathit{idf}_i = \log\!\left(\dfrac{\#\text{docs in corpus}}{\#\text{docs in corpus that contain term } i}\right)$
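Putting the text rescalings together: represent each document by TF-IDF weights and compare documents by cosine similarity. A minimal from-scratch sketch; in practice a library implementation such as scikit-learn's `TfidfVectorizer` would usually be used, and the function names and toy documents below are illustrative:

```python
import math
from collections import Counter

def tfidf_vectors(docs):
    """docs: list of token lists. Returns one {term: tf * idf} dict per doc."""
    n_docs = len(docs)
    df = Counter(term for doc in docs for term in set(doc))   # document frequency of each term
    idf = {t: math.log(n_docs / df[t]) for t in df}           # idf_i = log(#docs / #docs containing i)
    return [{t: tf * idf[t] for t, tf in Counter(doc).items()} for doc in docs]

def cosine_similarity(u, v):
    dot = sum(u[t] * v[t] for t in u if t in v)
    nu = math.sqrt(sum(x * x for x in u.values()))
    nv = math.sqrt(sum(x * x for x in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0

docs = [["nearest", "neighbor", "distance"],
        ["neural", "net", "layer"],
        ["neighbor", "distance", "weighting"]]
vecs = tfidf_vectors(docs)
print(cosine_similarity(vecs[0], vecs[2]))   # > 0: docs 0 and 2 share terms
```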
Combining distances to neighbors

Standard kNN:
$$\hat{y} = \arg\max_{y}\; C(y, \mathrm{Neighbors}(x)), \qquad C(y, D') = \bigl|\{(x', y') \in D' : y' = y\}\bigr|$$

Distance-weighted kNN:
$$\hat{y} = \arg\max_{y}\; C(y, \mathrm{Neighbors}(x))$$
$$C(y, D') = \sum_{\{(x', y') \in D' :\; y' = y\}} \mathrm{SIM}(x, x')
\qquad \text{or} \qquad
C(y, D') = 1 - \prod_{\{(x', y') \in D' :\; y' = y\}} \bigl(1 - \mathrm{SIM}(x, x')\bigr)$$
$$\mathrm{SIM}(x, x') = 1 - \Delta(x, x')$$
(where $\Delta$ is a distance scaled into $[0, 1]$)
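A sketch of the two distance-weighted counting rules above, assuming the similarities have already been scaled into [0, 1]; the names and the toy neighbor list are illustrative:

```python
from collections import defaultdict

def combine_votes(neighbors, rule="sum"):
    """neighbors: list of (SIM(x, x'), y') pairs for the selected neighbors.
    rule='sum':      C(y) = sum of SIM over neighbors labeled y
    rule='noisy_or': C(y) = 1 - prod(1 - SIM) over neighbors labeled y"""
    if rule == "sum":
        scores = defaultdict(float)
        for sim, y in neighbors:
            scores[y] += sim
    else:
        prod = defaultdict(lambda: 1.0)
        for sim, y in neighbors:
            prod[y] *= (1.0 - sim)
        scores = {y: 1.0 - p for y, p in prod.items()}
    return max(scores, key=scores.get)

neighbors = [(0.9, "+"), (0.4, "o"), (0.3, "o")]
print(combine_votes(neighbors, rule="sum"))        # "+" : 0.9 vs "o" : 0.7
print(combine_votes(neighbors, rule="noisy_or"))   # "+" : 0.90 vs "o" : 0.58
```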
Advantages of Memory-Based Methods
• Lazy learning: don’t do any work until you know what you
want to predict (and from what variables!)
– never need to learn a global model
– many simple local models taken together can represent a more
complex global model
– better focussed learning
– handles missing values, time varying distributions, ...
• Very efficient cross-validation
• Intelligible learning method to many users
• Nearest neighbors support explanation and training
• Can use any distance metric: string-edit distance, …
Weaknesses of Memory-Based Methods

• Curse of Dimensionality:
– often works best with 25 or fewer dimensions
• Run-time cost scales with training set size
• Large training sets will not fit in memory
• Many MBL methods are strict averagers
• Sometimes doesn’t seem to perform as well as other methods
such as neural nets
• Predicted values for regression not continuous
