2B Naive Bayes

Bayesian Learning

• Bayes Theorem
• Concept Learning
• Bayes Optimal Classifier
• Naïve Bayes Classifier
• Bayesian Belief Network
• EM Algorithm
Concept Learning: the task of acquiring a potential hypothesis (solution) that best fits the given training examples.

In concept learning, Bayes Theorem can be used to calculate the probability of each possible hypothesis and to output the most probable one. For a noise-free concept-learning task, the probability of the data D given a hypothesis h is 1 if D is consistent with h, and 0 otherwise.
Bayes Theorem:
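For a hypothesis h and observed data D, the theorem (stated here in its standard form, since the slide's equation image is not reproduced) is

$$P(h \mid D) = \frac{P(D \mid h)\,P(h)}{P(D)}$$

where P(h) is the prior probability of h, P(D | h) is the likelihood of the data under h, and P(h | D) is the posterior probability of h given the data.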
Now the question arises: how do we find the conditional probabilities of the attributes with respect to the class labels?

The Naïve Bayes classifier answers this by assuming that the attributes are independent given the class label. (It is because of this assumption that the word "naïve" appears in the name.)
Naïve Bayes Classifier Example
Bayes Theorem:

This is the general form of Bayes Theorem. To solve real-world problems, where a dataset D has multiple attributes, we rewrite it as follows.
Naïve Bayes Classifier Algorithm

$$v_{NB} = \arg\max_{v_j \in V} P(v_j) \prod_{i} P(a_i \mid v_j)$$

where v_NB stands for the Naïve Bayes classifier's target value, V is the set of possible target values, and a_1, ..., a_n are the attribute values of the new instance.
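As an illustration, here is a minimal Python sketch of this decision rule for categorical attributes. It is a sketch under our own naming (train_naive_bayes, classify), not code from the slides, and it uses Laplace smoothing to avoid zero probabilities.

from collections import Counter, defaultdict

def train_naive_bayes(examples, labels):
    """Estimate the counts needed for P(v) and P(a_i | v) from categorical data."""
    priors = {v: c / len(labels) for v, c in Counter(labels).items()}
    label_counts = Counter(labels)
    cond = defaultdict(int)   # cond[(i, a, v)]: times attribute i had value a under label v
    values_per_attr = [set() for _ in examples[0]]
    for x, v in zip(examples, labels):
        for i, a in enumerate(x):
            cond[(i, a, v)] += 1
            values_per_attr[i].add(a)
    return priors, cond, label_counts, values_per_attr

def classify(x, priors, cond, label_counts, values_per_attr):
    """Return v_NB = argmax_v P(v) * prod_i P(a_i | v), with Laplace smoothing."""
    best_v, best_score = None, -1.0
    for v, prior in priors.items():
        score = prior
        for i, a in enumerate(x):
            # add-one (Laplace) smoothing so unseen attribute/label pairs do not zero the product
            score *= (cond[(i, a, v)] + 1) / (label_counts[v] + len(values_per_attr[i]))
        if score > best_score:
            best_v, best_score = v, score
    return best_v

For example, with examples = [('sunny', 'hot'), ('rain', 'mild')] and labels = ['no', 'yes'], classify(('rain', 'hot'), ...) returns the label with the highest smoothed score.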
Naïve Bayesian Classifier
• Advantages
• Easy to implement
• Gives good results in most cases
• Disadvantages
• Assumption: class conditional independence, therefore loss of accuracy
• Practically, dependencies exist among variables
• E.g., in a hospital, a patient record combines a profile (age, family history, etc.), symptoms (fever, cough, etc.), and a disease (lung cancer, diabetes, etc.)
• Dependencies among these attributes cannot be modeled by the Naïve Bayesian classifier
• How to deal with these dependencies?
• Bayesian Belief Networks
Bayesian Belief Networks
• A Bayesian network (or belief network) is a probabilistic graphical model that represents a set of variables and their probabilistic dependencies.
• A Bayesian belief network allows a subset of the variables to be conditionally independent.

• A Bayesian belief network is defined by two components:

a) a directed acyclic graph b) a set of conditional probability tables

• A graphical model of causal relationships


• Represents dependency among the variables
• Gives a specification of joint probability distribution

❑ Nodes: random variables


❑ Links: dependency
❑ X & Y are the parents of Z, & Y is the parent of P and Z
❑ No dependency between Z and P
❑ Has no loops or cycles
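The "specification of the joint probability distribution" above is the standard factorization over the graph (a known property, not shown on the slides): for variables X_1, ..., X_n,

$$P(X_1, \ldots, X_n) = \prod_{i=1}^{n} P\big(X_i \mid \mathrm{Parents}(X_i)\big)$$

so each node needs only a conditional probability table over its parents.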
Types of Probabilistic Relationships
Expectation–Maximization (EM) algorithm
The EM algorithm is an iterative method, built on a latent variable model, for finding local maximum likelihood parameters of a statistical model; it was proposed by Arthur Dempster, Nan Laird, and Donald Rubin in 1977.

The EM (Expectation-Maximization) algorithm is one of the most commonly used techniques in machine learning for obtaining maximum likelihood estimates in models with variables that are sometimes observable and sometimes not; such unobserved variables are also called latent. It has various real-world applications in statistics, including obtaining the mode of the posterior marginal distribution of parameters in machine learning and data mining applications.
Expectation–Maximization (EM) algorithm
• In statistics, an expectation–maximization (EM) algorithm is
an iterative method to find (local) maximum likelihood or maximum a
posteriori (MAP) estimates of parameters in statistical models, where
the model depends on unobserved latent variables. The EM iteration
alternates between performing an expectation (E) step, which creates
a function for the expectation of the log-likelihood evaluated using
the current estimate for the parameters, and a maximization (M)
step, which computes parameters maximizing the expected log-likelihood found in the E step. These parameter estimates are then used to determine the distribution of the latent variables in the next E step.
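In symbols (the standard formulation, with notation of our choosing): given observed data X, latent variables Z, and parameters θ, iteration t computes

$$Q(\theta \mid \theta^{(t)}) = \mathbb{E}_{Z \mid X,\, \theta^{(t)}}\big[\log L(\theta; X, Z)\big] \quad \text{(E-step)}$$

$$\theta^{(t+1)} = \arg\max_{\theta}\, Q(\theta \mid \theta^{(t)}) \quad \text{(M-step)}$$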
Expectation–Maximization (EM) algorithm
• The Expectation-Maximization (EM) algorithm is a building block of various unsupervised machine learning algorithms and is used to determine local maximum likelihood estimates (MLE) or maximum a posteriori (MAP) estimates for models with unobservable variables. It is a technique for finding maximum likelihood estimates when latent variables are present, and the class of models it applies to is referred to as latent variable models.
• A latent variable model consists of both observable and unobservable variables: the observable variables can be measured directly, while the unobserved ones are inferred from the observed variables. These unobservable variables are known as latent variables.
Expectation–Maximization (EM) Algorithm
• Expectation step (E-step): estimate (guess) all the missing values in the dataset, given the observed data and the current parameter estimates, so that after completing this step there is no missing value.
• Maximization step (M-step): use the data completed in the E-step to update the parameters.
• Repeat the E-step and M-step until the values converge (a minimal worked sketch follows).
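As a concrete sketch, here is a minimal EM loop in Python for a two-component 1-D Gaussian mixture; the model choice, initialization, and all names (em_gmm_1d) are our own illustrative assumptions, not from the slides.

import numpy as np

def em_gmm_1d(x, iters=100, tol=1e-6):
    """EM for a two-component 1-D Gaussian mixture (illustrative sketch)."""
    mu = np.array([x.min(), x.max()], dtype=float)   # crude initialization from the data
    var = np.array([x.var(), x.var()], dtype=float)
    pi = np.array([0.5, 0.5])                        # mixture weights
    prev_ll = -np.inf
    for _ in range(iters):
        # E-step: responsibility r[i, k] = P(component k | x_i, current parameters)
        dens = np.exp(-(x[:, None] - mu) ** 2 / (2 * var)) / np.sqrt(2 * np.pi * var)
        weighted = pi * dens
        ll = np.log(weighted.sum(axis=1)).sum()      # current log-likelihood
        r = weighted / weighted.sum(axis=1, keepdims=True)
        # M-step: re-estimate parameters from the responsibilities
        nk = r.sum(axis=0)
        mu = (r * x[:, None]).sum(axis=0) / nk
        var = (r * (x[:, None] - mu) ** 2).sum(axis=0) / nk
        pi = nk / len(x)
        # Convergence: stop when the log-likelihood change is tiny
        if abs(ll - prev_ll) < tol:
            break
        prev_ll = ll
    return mu, var, pi

Running it on data such as np.concatenate([np.random.normal(0, 1, 200), np.random.normal(5, 1, 200)]) should recover means near 0 and 5.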
Convergence in the EM algorithm?
• Intuitively, convergence is reached when successive estimates stop changing: e.g., if two random variables differ only very slightly in their probability, they are said to have converged. In other words, whenever the values of the given variables match from one iteration to the next, we call it convergence.
Steps in EM Algorithm
SVM (Support Vector Machine)
• Introduction
• Types of Support Vector Kernels (Linear, Polynomial, Gaussian)
• Hyperplane (Decision Surface)
• Properties of SVM
• Issues in SVM
Support Vector Machine (SVM)
• Support Vector Machine or SVM is one of the most popular Supervised Learning
algorithms, which is used for Classification as well as Regression problems.
However, primarily, it is used for Classification problems in Machine Learning.

• The goal of the SVM algorithm is to create the best line or decision boundary that
can segregate n-dimensional space into classes so that we can easily put the new
data point in the correct category in the future. This best decision boundary is
called a hyperplane.

• SVM chooses the extreme points/vectors that help in creating the hyperplane. These extreme cases are called support vectors, and hence the algorithm is termed a Support Vector Machine. In the usual diagram (not reproduced here), two different categories are classified using a decision boundary, or hyperplane.
Hyperplane and Support Vectors in the SVM algorithm:
• Hyperplanes are decision boundaries that help classify the data
points. Data points falling on either side of the hyperplane can be
attributed to different classes. It is a subspace whose dimension is
one less than that of its ambient space. If a space is 3-dimensional
then its hyperplanes are the 2-dimensional planes, while if the
space is 2-dimensional, its hyperplanes are the 1-dimensional lines.
There can be multiple lines/decision boundaries to segregate the
classes in n-dimensional space, but we need to find out the best
decision boundary that helps to classify the data points. This best
boundary is known as the hyperplane of SVM.
Hyperplane and Support Vectors in the SVM algorithm:
The dimension of the hyperplane depends on the number of features in the dataset: with 2 features (as in the usual illustration), the hyperplane is a straight line; with 3 features, it is a 2-dimensional plane, and so on.
We always create the hyperplane that has maximum margin, i.e., the maximum distance to the nearest data points. So the key idea behind SVM is to maximize the margin.

Support Vectors:
The data points or vectors that are closest to the hyperplane and that affect its position are termed support vectors. Since these vectors "support" the hyperplane, they are called support vectors (this is formalized below).
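In symbols (the standard textbook formulation, not reproduced from the slides): a linear SVM's hyperplane is the set of points x with

$$w \cdot x + b = 0,$$

and for linearly separable data the maximum-margin (hard-margin) problem is

$$\min_{w, b}\ \tfrac{1}{2}\lVert w \rVert^{2} \quad \text{subject to} \quad y_i\,(w \cdot x_i + b) \ge 1 \ \text{for all } i.$$

The margin width is 2 / ||w||, so minimizing ||w|| maximizes the margin, and the support vectors are exactly the points for which y_i (w · x_i + b) = 1.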
Issues in SVM
The SVM algorithm is not suitable for very large data sets.
SVM does not perform well when the data set is noisy, i.e., when the target classes overlap.
In cases where the number of features for each data point exceeds the number of training samples, SVM will underperform.

Support Vector Machine for Multi-Class Problems


To apply SVM to multi-class problems, we can create a binary classifier for each class of the data (as sketched below). Each classifier gives one of two results: the data point belongs to that class, OR the data point does not belong to that class.
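A minimal sketch of this one-vs-rest scheme, assuming scikit-learn is available (the dataset and kernel choice are ours):

from sklearn.datasets import load_iris
from sklearn.multiclass import OneVsRestClassifier
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)           # 3 classes, 4 features
# One binary SVM per class: "this class" vs. "all other classes"
clf = OneVsRestClassifier(SVC(kernel="linear")).fit(X, y)
print(clf.predict(X[:5]))                   # predicted class labels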
Advantages of SVM
Disadvantages of SVM
Areas where SVM can be applied:
Types of SVM
• Linear SVM: used for linearly separable data. If a dataset can be classified into two classes using a single straight line, the data is termed linearly separable, and the classifier used is called a Linear SVM classifier.
• Non-linear SVM: used for non-linearly separable data. If a dataset cannot be classified using a straight line, the data is termed non-linear, and the classifier used is called a Non-linear SVM classifier.
SVM for Complex (Non-Linearly Separable) Data
• SVM works very well without any modifications for linearly separable
data. Linearly Separable Data is any data that can be plotted in a
graph and can be separated into classes using a straight line
• We use kernelized SVM for non-linearly separable data. Say we have some non-linearly separable data in one dimension. We can transform this data into two dimensions, where it becomes linearly separable, by mapping each 1-D data point to a corresponding 2-D ordered pair (see the sketch below). So for non-linearly separable data in any dimension, we can map the data to a higher dimension and then make it linearly separable. This is a very powerful and general transformation.
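For instance, here is a minimal Python sketch of the 1-D to 2-D idea; the mapping x → (x, x²) and the threshold are our illustrative choices, not from the slides.

import numpy as np

x = np.array([-3.0, -2.0, -1.0, 0.0, 1.0, 2.0, 3.0])
y = np.array([1, 1, 0, 0, 0, 1, 1])   # class 1 at the extremes: not separable on a line

# Map each 1-D point to the 2-D ordered pair (x, x^2)
X2 = np.column_stack([x, x ** 2])
# In 2-D the classes are separated by the horizontal line x2 = 2.5,
# i.e., points with x^2 > 2.5 belong to class 1
print((X2[:, 1] > 2.5).astype(int))   # matches y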
Kernel Functions in Support Vector Machine (SVM)
• A kernel function is a method that takes data as input and transforms it into the required form for processing.
• The term "kernel" refers to a set of mathematical functions used in the Support Vector Machine that provide a window through which to manipulate the data.
• So a kernel function generally transforms the training data so that a non-linear decision surface corresponds to a linear equation in a higher-dimensional space.
Types of Support Vector Kernels (standard formulas are given after the list)
• Linear Kernel: used when the data is linearly separable.
• Gaussian Kernel: used to perform the transformation when there is no prior knowledge about the data.
• Gaussian Radial Basis Function (RBF) Kernel: the same as the Gaussian kernel, using the radial basis method to improve the transformation.
• Sigmoid Kernel: equivalent to a two-layer perceptron model of a neural network, which is used as an activation function for artificial neurons.
• Polynomial Kernel: represents the similarity of vectors in the training set in a feature space over polynomials of the original variables used in the kernel.
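For reference, the standard forms of these kernels (not given on the slides) are:

$$\text{Linear: } K(x, y) = x^{\top} y$$
$$\text{Polynomial: } K(x, y) = (x^{\top} y + c)^{d}$$
$$\text{Gaussian / RBF: } K(x, y) = \exp\!\big(-\gamma \lVert x - y \rVert^{2}\big)$$
$$\text{Sigmoid: } K(x, y) = \tanh\!\big(\kappa\, x^{\top} y + c\big)$$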
Differentiate between Support Vector Machine
and Logistic Regression
• SVM tries to maximize the margin between the closest support vectors, whereas logistic regression maximizes the posterior class probability.

• SVM is deterministic (though we can use Platt scaling for a probability score; see the sketch after this list), while logistic regression is probabilistic.

• In kernel space, SVM is faster.

• Problems that can be solved using SVM include image classification, handwriting recognition, and cancer detection.

• Problems where the logistic regression algorithm applies include:


1. Cancer detection: predict whether a patient has cancer (1) or not (0).
2. Test score: predict whether a student passed (1) or not (0).
3. Marketing: predict whether a customer will purchase a product (1) or not (0).
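As a minimal sketch of the deterministic-vs-probabilistic contrast above, assuming scikit-learn (the dataset and parameters are our choices):

from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.svm import SVC

X, y = load_breast_cancer(return_X_y=True)

# probability=True enables Platt scaling on top of the SVM's deterministic scores
svm = SVC(kernel="linear", probability=True).fit(X, y)
lr = LogisticRegression(max_iter=5000).fit(X, y)

print(svm.decision_function(X[:3]))  # signed distances to the hyperplane
print(svm.predict_proba(X[:3]))      # probabilities via Platt scaling
print(lr.predict_proba(X[:3]))       # probabilities are native to logistic regression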
