Informatics 2B: Learning and Data, Note 10
Discriminant functions
Hiroshi Shimodaira

4 March 2015
In the previous chapter we saw how we can combine a Gaussian probability density function with class prior probabilities using Bayes’ theorem to estimate class-conditional posterior probabilities. For each point in the input space we can estimate the posterior probability of each class, assigning that point to the class with the maximum posterior probability. We can view this process as dividing the input space into decision regions, separated by decision boundaries. In the next section we investigate whether the maximum posterior probability rule is indeed the best decision rule (in terms of minimising the number of errors). In the following sections we introduce discriminant functions, which define the decision boundaries, and investigate the form of the decision functions induced by Gaussian pdfs with different constraints on the covariance matrix.

Figure 1: Decision regions for the three-class two-dimensional problem from the previous chapter. Class A (red), class B (blue), class C (cyan).
Consider a two-class problem in which points falling in region R_1 are classified as class c_1 and points falling in region R_2 are classified as class c_2. A classification error occurs whenever a point belonging to class c_1 falls in R_2, or a point belonging to class c_2 falls in R_1. Thus the probability of the total error may be written as:

P(\text{error}) = P(x \in R_2, c_1) + P(x \in R_1, c_2)
               = \int_{R_2} p(x \mid c_1) P(c_1) \, dx + \int_{R_1} p(x \mid c_2) P(c_2) \, dx .

This error probability is minimised by assigning each point x to the class for which p(x \mid c) P(c), and hence the posterior probability P(c \mid x), is largest.
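As a quick numerical illustration (not part of the original note), the following Python sketch estimates this total error probability for two made-up one-dimensional Gaussian classes by summing the misclassified probability mass on a dense grid; the class means, variances, priors and grid limits are all illustrative assumptions.

import numpy as np

def gauss_pdf(x, mu, sigma):
    """One-dimensional Gaussian probability density."""
    return np.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * np.sqrt(2.0 * np.pi))

# Illustrative (made-up) class-conditional densities and priors.
mu1, sigma1, P1 = -1.0, 1.0, 0.6   # class c1
mu2, sigma2, P2 = 2.0, 1.5, 0.4    # class c2

x = np.linspace(-12.0, 14.0, 20001)          # dense grid covering both pdfs
dx = x[1] - x[0]
joint1 = gauss_pdf(x, mu1, sigma1) * P1      # p(x | c1) P(c1)
joint2 = gauss_pdf(x, mu2, sigma2) * P2      # p(x | c2) P(c2)

# Maximum posterior rule: R1 is where x is assigned to c1, R2 is the rest.
R1 = joint1 > joint2

# P(error) = mass of c2 falling in R1 plus mass of c1 falling in R2.
p_error = np.sum(np.where(R1, joint2, joint1)) * dx
print("estimated probability of total error:", round(p_error, 4))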
2 Discriminant functions

If we have a set of K classes then we may define a set of K discriminant functions y_k(x), one for each class. Data point x is assigned to class c if

y_c(x) > y_k(x)   for all k \neq c .

In other words: assign x to the class c whose discriminant function y_c(x) is biggest.

This is precisely what we did in the previous chapter when classifying based on the values of the log posterior probability. Thus the log posterior probability of class c given a data point x is a possible discriminant function:

y_c(x) = \ln P(c \mid x) = \ln p(x \mid c) + \ln P(c) + \text{const.}
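This decision rule is easy to express directly in code. The following minimal Python sketch (ours, not from the note) defines the log-posterior discriminant for a few made-up Gaussian class models, using scipy.stats.multivariate_normal for the class-conditional densities, and assigns a point to the class whose discriminant is biggest.

import numpy as np
from scipy.stats import multivariate_normal

# Illustrative (made-up) class models: mean, covariance and prior for each class.
classes = {
    "A": {"mean": np.array([0.0, 0.0]),  "cov": np.eye(2), "prior": 0.5},
    "B": {"mean": np.array([3.0, 1.0]),  "cov": np.eye(2), "prior": 0.3},
    "C": {"mean": np.array([-2.0, 3.0]), "cov": np.eye(2), "prior": 0.2},
}

def discriminant(x, model):
    """Log-posterior discriminant y_c(x) = ln p(x | c) + ln P(c) (up to a constant)."""
    return multivariate_normal.logpdf(x, mean=model["mean"], cov=model["cov"]) + np.log(model["prior"])

def classify(x):
    """Assign x to the class whose discriminant function is biggest."""
    return max(classes, key=lambda c: discriminant(x, classes[c]))

print(classify(np.array([2.5, 0.5])))   # expected to land in class B's region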
The posterior probability could also be used as a discriminant function, with the same results: choosing the class with the largest posterior probability is an identical decision rule to choosing the class with the largest log posterior probability.

As discussed above, classifying a point as the class with the largest (log) posterior probability corresponds to the decision rule which minimises the probability of misclassification. In that sense, it forms an optimal discriminant function. A decision boundary occurs at points in the input space where discriminant functions are equal. If the region of input space classified as class c_k (R_k) and the region classified as class c_ℓ (R_ℓ) are contiguous, then the decision boundary separating them is given by:

y_k(x) = y_\ell(x) .

Let’s consider the case in which the Gaussian pdfs for each class all share the same covariance matrix. That is, for all classes c, Σ_c = Σ. In this case Σ is class-independent (since it is equal for all classes), therefore the term -\frac{1}{2} \ln |\Sigma| may also be dropped from the discriminant function and we have:

y_c(x) = -\frac{1}{2} (x - \mu_c)^T \Sigma^{-1} (x - \mu_c) + \ln P(c) .

If we explicitly expand the quadratic matrix-vector expression we obtain the following:

y_c(x) = -\frac{1}{2} \left( x^T \Sigma^{-1} x - x^T \Sigma^{-1} \mu_c - \mu_c^T \Sigma^{-1} x + \mu_c^T \Sigma^{-1} \mu_c \right) + \ln P(c) .   (4)

The mean µ_c depends on class c, but (as stated before) the covariance matrix is class-independent. Therefore, terms that do not include the mean or the prior probabilities are class-independent, and may be dropped. Thus we may drop x^T \Sigma^{-1} x from the discriminant.
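A small sketch (ours, with made-up means, covariance and priors) can confirm this numerically: for a shared covariance matrix, dropping the class-independent x^T \Sigma^{-1} x term changes the value of each discriminant but never which class attains the maximum.

import numpy as np

# Illustrative shared covariance and two made-up class means and priors.
Sigma = np.array([[2.0, 0.5],
                  [0.5, 1.0]])
Sigma_inv = np.linalg.inv(Sigma)
means  = {"A": np.array([0.0, 0.0]), "B": np.array([3.0, 1.0])}
priors = {"A": 0.6, "B": 0.4}

def y_full(x, c):
    """Quadratic discriminant: -1/2 (x - mu_c)^T Sigma^{-1} (x - mu_c) + ln P(c)."""
    d = x - means[c]
    return -0.5 * d @ Sigma_inv @ d + np.log(priors[c])

def y_dropped(x, c):
    """Same discriminant with the class-independent x^T Sigma^{-1} x term dropped."""
    mu = means[c]
    return mu @ Sigma_inv @ x - 0.5 * mu @ Sigma_inv @ mu + np.log(priors[c])

rng = np.random.default_rng(0)
for x in rng.normal(size=(5, 2)) * 3:
    full = max(means, key=lambda c: y_full(x, c))
    dropped = max(means, key=lambda c: y_dropped(x, c))
    assert full == dropped   # dropping class-independent terms never changes the decision
    print(x.round(2), "->", full)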
Figure 3: Discriminant function for equal covariance Gaussians. Two classes, C1 and C2, in the (x1, x2) plane, separated by the decision boundary y1(x) = y2(x).

We can simplify this discriminant function further. It is a fact that for a symmetric matrix M and vectors a and b:

a^T M b = b^T M a .

Now since the covariance matrix Σ is symmetric, it follows that Σ^{-1} is also symmetric (it also follows that x^T \Sigma^{-1} x \ge 0 for any x). Therefore:

x^T \Sigma^{-1} \mu_c = \mu_c^T \Sigma^{-1} x .

We can thus simplify (4) as:

y_c(x) = \mu_c^T \Sigma^{-1} x - \frac{1}{2} \mu_c^T \Sigma^{-1} \mu_c + \ln P(c) .   (5)

This equation has three terms on the right hand side, but only the first depends on x. We can define two new variables w_c (a d-dimensional vector) and w_{c0}, which are derived from µ_c, P(c), and Σ:

w_c^T = \mu_c^T \Sigma^{-1}   (6)
w_{c0} = -\frac{1}{2} \mu_c^T \Sigma^{-1} \mu_c + \ln P(c) = -\frac{1}{2} w_c^T \mu_c + \ln P(c) .   (7)

Substituting (6) and (7) into (5) we obtain:

y_c(x) = w_c^T x + w_{c0} .   (8)

This is a linear equation in d dimensions. We refer to w_c as the weight vector and w_{c0} as the bias for class c.

We have thus shown that the discriminant function for a Gaussian which shares the same covariance matrix with the Gaussian pdfs of all the other classes may be written as (8). We call such discriminant functions linear discriminants: they are linear functions of x. If x is two-dimensional, the decision boundaries will be straight lines, as illustrated in Figure 3. In three dimensions the decision boundaries will be planes. In d dimensions the decision boundaries are called hyperplanes.
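As an illustrative sketch (made-up means, covariance and priors, not taken from the note), the following Python code computes the weight vectors and biases of equations (6) and (7) and checks that the linear form (8) reproduces the discriminant (5).

import numpy as np

# Illustrative shared covariance, class means and priors (made-up values).
Sigma = np.array([[2.0, 0.5],
                  [0.5, 1.0]])
Sigma_inv = np.linalg.inv(Sigma)
means  = {"A": np.array([0.0, 0.0]), "B": np.array([3.0, 1.0])}
priors = {"A": 0.6, "B": 0.4}

# Weight vector and bias per equations (6) and (7); since Sigma is symmetric, w_c = Sigma^{-1} mu_c.
w  = {c: Sigma_inv @ means[c] for c in means}
w0 = {c: -0.5 * w[c] @ means[c] + np.log(priors[c]) for c in means}

def y_linear(x, c):
    """Linear discriminant (8): y_c(x) = w_c^T x + w_c0."""
    return w[c] @ x + w0[c]

def y_eq5(x, c):
    """Discriminant as written in (5)."""
    mu = means[c]
    return mu @ Sigma_inv @ x - 0.5 * mu @ Sigma_inv @ mu + np.log(priors[c])

x = np.array([1.5, -0.5])
for c in means:
    assert np.isclose(y_linear(x, c), y_eq5(x, c))   # (8) and (5) agree
print({c: round(y_linear(x, c), 3) for c in means})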
5 Spherical Gaussians with equal covariance

Let’s look at an even more constrained case, where not only do all the classes share a covariance matrix, but that covariance matrix is spherical: the off-diagonal terms (covariances) are all zero, and the diagonal terms (variances) are equal for all components. In this case the matrix may be defined by a single number, σ², the value of the variances:

\Sigma = \sigma^2 I
\Sigma^{-1} = \frac{1}{\sigma^2} I ,

where I is the identity matrix.

Since this is a special case of Gaussians with equal covariance, the discriminant functions are linear, and may be written as (8). However, we can get another view of the discriminant functions if we write them as:

y_c(x) = -\frac{\lVert x - \mu_c \rVert^2}{2\sigma^2} + \ln P(c) .   (9)

If the prior probabilities are equal for all classes, the decision rule simply assigns an unseen vector to the nearest class mean (using the Euclidean distance). In this case the class means may be regarded as class templates or prototypes.

Exercise: Show that (9) is indeed reduced to a linear discriminant.
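The nearest-mean interpretation is easy to check numerically. In this illustrative Python sketch (made-up class means, a spherical covariance σ²I and equal priors, all our own assumptions), the class that maximises (9) is always the class whose mean is closest in Euclidean distance.

import numpy as np

# Made-up class means; spherical covariance sigma^2 I and equal priors are assumed.
means = np.array([[0.0, 0.0],
                  [3.0, 1.0],
                  [-2.0, 3.0]])
sigma2 = 1.5
log_prior = np.log(1.0 / len(means))   # equal priors

def y_spherical(x):
    """Discriminant (9) for every class: -||x - mu_c||^2 / (2 sigma^2) + ln P(c)."""
    return -np.sum((x - means) ** 2, axis=1) / (2 * sigma2) + log_prior

x = np.array([2.2, 0.3])
by_discriminant = np.argmax(y_spherical(x))
by_nearest_mean = np.argmin(np.linalg.norm(x - means, axis=1))
assert by_discriminant == by_nearest_mean
print("assigned to class", by_discriminant)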
6 Two-class linear discriminants

To get some more insight into linear discriminants, we can look at another special case: two-class problems. Two-class problems occur quite often in practice, and they are more straightforward to think about because we are considering a single decision boundary between the two classes.

In the two-class case it is possible to use a single discriminant function: for example, one which takes value zero at the decision boundary, negative values for one class and positive values for the other. A suitable discriminant function in this case is the log odds (log ratio of posterior probabilities):

y(x) = \ln \frac{P(c_1 \mid x)}{P(c_2 \mid x)} = \ln \frac{p(x \mid c_1)}{p(x \mid c_2)} + \ln \frac{P(c_1)}{P(c_2)}
     = \ln p(x \mid c_1) - \ln p(x \mid c_2) + \ln P(c_1) - \ln P(c_2) .   (10)

Feature vector x is assigned to class c_1 when y(x) > 0; x is assigned to class c_2 when y(x) < 0. The decision boundary is defined by y(x) = 0.

If the pdf for each class is a Gaussian, and the covariance matrix is shared, then the discriminant function is linear:

y(x) = w^T x + w_0 ,

where w is a function of the class-dependent means and the class-independent covariance matrix, and w_0 is a function of the means, the covariance matrix and the prior probabilities.

The decision boundary for the two-class linear discriminant corresponds to a (d − 1)-dimensional hyperplane in the input space. Let x_a and x_b be two points on the decision boundary. Then:

y(x_a) = 0 = y(x_b) .

And since y(x) is a linear discriminant:

w^T x_a + w_0 = 0 = w^T x_b + w_0 .

And a little rearranging gives us:

w^T (x_a - x_b) = 0 .   (11)
In three dimensions (11) is the equation of a plane, with w being the vector normal to the plane. In higher dimensions, this equation describes a hyperplane, and w is normal to any vector lying on the hyperplane. The hyperplane is the decision boundary in this two-class problem.

If x is a point on the hyperplane, then the normal distance from the hyperplane to the origin is given by:

\ell = \frac{w^T x}{\lVert w \rVert} = -\frac{w_0}{\lVert w \rVert}   (using y(x) = 0),

which is illustrated in Figure 4.

Figure 4: The two-class decision boundary y(x) = 0 in the (x1, x2) plane, with weight vector w normal to the boundary and normal distance −w0/||w|| from the origin to the boundary.
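To tie the pieces together, here is an illustrative Python sketch (a made-up two-class Gaussian model with a shared covariance, all parameters our own) that forms the two-class linear discriminant y(x) = w^T x + w_0 as the difference of the per-class discriminants from (6) and (7), uses its sign to classify a point, computes the normal distance -w_0/||w|| of the boundary from the origin, and verifies (11) for two points on the boundary.

import numpy as np

# Made-up two-class model with a shared covariance matrix.
Sigma = np.array([[2.0, 0.5],
                  [0.5, 1.0]])
Sigma_inv = np.linalg.inv(Sigma)
mu1, mu2 = np.array([3.0, 1.0]), np.array([0.0, 0.0])
P1, P2 = 0.4, 0.6

# Two-class discriminant y(x) = w^T x + w0, obtained as y_1(x) - y_2(x) using (6) and (7).
w = Sigma_inv @ (mu1 - mu2)
w0 = -0.5 * mu1 @ Sigma_inv @ mu1 + 0.5 * mu2 @ Sigma_inv @ mu2 + np.log(P1 / P2)

def y(x):
    return w @ x + w0

print("assign to c1" if y(np.array([2.0, 1.0])) > 0 else "assign to c2")

# Normal distance from the origin to the decision boundary y(x) = 0: -w0 / ||w||.
print("distance from origin to boundary:", -w0 / np.linalg.norm(w))

# Any two points on the boundary satisfy w^T (xa - xb) = 0.
x0 = -w0 * w / (w @ w)            # the boundary point closest to the origin
v = np.array([-w[1], w[0]])       # direction perpendicular to w (lies in the boundary)
xa, xb = x0 + 1.0 * v, x0 - 2.0 * v
assert np.isclose(y(xa), 0) and np.isclose(y(xb), 0) and np.isclose(w @ (xa - xb), 0)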
Learning Objectives:
• Understand the concept and purpose of Linear Discriminant Analysis (LDA)
• Learn how LDA performs dimensionality reduction and classification
• Grasp the mathematical principles behind Fisher’s Linear Discriminant
• Explore LDA implementation and applications using Python
What is Linear Discriminant Analysis?
• Linear Discriminant Analysis (LDA) is a statistical technique for categorizing data into groups. It identifies patterns in features to distinguish between different classes. For instance, it may analyze characteristics like size and color to classify fruits as apples or oranges. LDA aims to find a straight line or plane that best separates these groups while minimizing overlap within each class. By maximizing the separation between classes, it enables accurate classification of new data points. In simpler terms, LDA helps make sense of data by finding the most efficient way to separate different categories. Consequently, this aids in tasks like pattern recognition and classification (a short Python example follows below).
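As a brief, non-prescriptive illustration of LDA in Python, the sketch below uses scikit-learn's LinearDiscriminantAnalysis on the Iris dataset (the choice of library and dataset is ours) both to reduce the four features to two discriminant directions and to classify held-out samples.

import numpy as np
from sklearn.datasets import load_iris
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.model_selection import train_test_split

# Load the Iris data and hold out a test set.
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)

# Fit LDA as a classifier and as a projection onto at most (number of classes - 1) = 2 directions.
lda = LinearDiscriminantAnalysis(n_components=2)
X_train_proj = lda.fit_transform(X_train, y_train)   # dimensionality reduction: 4 -> 2
accuracy = lda.score(X_test, y_test)                 # classification on held-out data

print("projected training data shape:", X_train_proj.shape)
print("test accuracy:", round(accuracy, 3))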
Linear Discriminant Analysis in Machine Learning is a generalized form of Fisher’s Linear Discriminant (FLD). In his original paper, Fisher used a discriminant function to classify between two plant species, Iris Setosa and Iris Versicolor.
The basic idea of FLD is to project data points onto a line so as to maximize the between-class scatter and minimize the within-class scatter, enhancing the separation between the classes along a single linear dimension.
This might sound cryptic, but it is quite straightforward. Before we delve into the derivation, we need to familiarize ourselves with some terms and notation.
• Suppose we have d-dimensional data points x1, …, xn belonging to 2 classes C1 and C2, with N1 and N2 samples respectively.
• Consider W as a unit vector onto which we project the data points. Since we are only concerned with the direction, a unit vector suffices.
• Total number of samples: N = N1 + N2.
• If x(n) are the samples in the feature space, then W^T x(n) are the data points after projection.
• Means of classes before projection: m_i.
• Means of classes after projection: M_i = W^T m_i (these quantities are illustrated in the sketch below).
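The following illustrative Python sketch (made-up two-class data and an arbitrary unit direction W, chosen only for illustration) computes the quantities defined above: the class means m_i before projection, the projected samples W^T x(n), and the projected means M_i = W^T m_i.

import numpy as np

rng = np.random.default_rng(1)

# Made-up 2-D data for two classes (N1 and N2 samples).
X1 = rng.normal(loc=[0.0, 0.0], scale=1.0, size=(40, 2))   # class C1
X2 = rng.normal(loc=[3.0, 2.0], scale=1.0, size=(60, 2))   # class C2

# Unit projection vector W (an arbitrary direction; only the direction matters).
W = np.array([1.0, 1.0])
W = W / np.linalg.norm(W)

# Class means before projection (m_i) and after projection (M_i = W^T m_i).
m1, m2 = X1.mean(axis=0), X2.mean(axis=0)
M1, M2 = W @ m1, W @ m2

# Projected samples W^T x(n) for each class.
p1, p2 = X1 @ W, X2 @ W

print("m1 =", m1.round(3), " m2 =", m2.round(3))
print("M1 =", round(M1, 3), " M2 =", round(M2, 3))
print("separation of projected means:", round(abs(M1 - M2), 3))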
Scatter matrix: used to make estimates of the covariance matrix. It is a d × d positive semi-definite matrix, given by the sample covariance multiplied by the number of samples.
Note: scatter and variance measure the same thing on different scales, so the two words are sometimes used interchangeably.
Two Types of Scatter Matrices
Here we will deal with two types of scatter matrices:
• Between-class scatter, Sb: measures the separation between the class means.
• Within-class scatter, Sw: measures the spread of each class around its own mean (both are computed in the sketch below).
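As a hedged sketch (the same made-up two-class data as in the previous example), the code below computes the within-class scatter Sw by summing each class's scatter around its own mean, and the between-class scatter Sb using the common two-class form (m1 − m2)(m1 − m2)^T.

import numpy as np

rng = np.random.default_rng(1)

# Made-up two-class data (same setup as the earlier projection sketch).
X1 = rng.normal(loc=[0.0, 0.0], scale=1.0, size=(40, 2))
X2 = rng.normal(loc=[3.0, 2.0], scale=1.0, size=(60, 2))
m1, m2 = X1.mean(axis=0), X2.mean(axis=0)

# Within-class scatter S_w: spread of each class around its own mean, summed over classes.
def scatter(X, m):
    D = X - m
    return D.T @ D                      # sum_n (x_n - m)(x_n - m)^T

Sw = scatter(X1, m1) + scatter(X2, m2)

# Between-class scatter S_b for the two-class case: separation between the class means.
d = (m1 - m2).reshape(-1, 1)
Sb = d @ d.T

print("S_w =\n", Sw.round(2))
print("S_b =\n", Sb.round(2))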