Module 3: Bayesian Classifier

Bayesian classification is a statistical method that uses Bayes' Theorem to predict class membership probabilities. The naïve Bayesian classifier simplifies the process by assuming conditional independence among attributes, allowing for efficient computation and often yielding comparable performance to other classifiers. However, it may suffer from accuracy loss due to its independence assumption and requires careful handling of zero-probability issues.


Bayesian Classification

Bayes' Theorem: Thomas Bayes
Bayesian Classification
 A statistical classifier: performs probabilistic prediction, i.e., predicts class membership probabilities
 Foundation: based on Bayes' Theorem
 Performance: a simple Bayesian classifier, the naïve Bayesian classifier, has performance comparable to decision tree and selected neural network classifiers
 Incremental: each training example can incrementally increase or decrease the probability that a hypothesis is correct; prior knowledge can be combined with observed data
 Standard: even when Bayesian methods are computationally intractable, they can provide a standard of optimal decision making against which other methods can be measured
Bayes’ Theorem: Basics
 Total Probability Theorem: P(B) = Σ_{i=1..M} P(B|Ai) P(Ai)
 Bayes' Theorem: P(H|X) = P(X|H) P(H) / P(X)
 Let X be a data sample (“evidence”): the class label is unknown
 Let H be a hypothesis that X belongs to class C
 Classification is to determine P(H|X) (i.e., the posterior probability): the probability that the hypothesis holds given the observed data sample X
 P(H) (prior probability): the initial probability
 E.g., X will buy a computer, regardless of age, income, …
 P(X) (evidence): the probability that the sample data is observed
 P(X|H) (likelihood): the probability of observing the sample X, given that the hypothesis holds
 E.g., given that X will buy a computer, the probability that X is 31…40 with medium income
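To make the theorem concrete, here is a minimal Python sketch; the prior, likelihood, and evidence values below are made-up numbers for illustration, not taken from the slides:

```python
# Hypothetical example: H = "customer buys a computer", X = observed attributes.
p_h = 0.6          # P(H): prior probability of buying a computer
p_x_given_h = 0.3  # P(X|H): likelihood of observing X among buyers
p_x = 0.25         # P(X): evidence, probability of observing X at all

# Bayes' theorem: P(H|X) = P(X|H) * P(H) / P(X)
p_h_given_x = p_x_given_h * p_h / p_x
print(f"P(H|X) = {p_h_given_x:.2f}")  # -> P(H|X) = 0.72
```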


Prediction Based on Bayes’ Theorem

 Given training data X, the posterior probability of a hypothesis H, P(H|X), follows Bayes' theorem:
P(H|X) = P(X|H) P(H) / P(X)
 Informally, this can be viewed as
posterior = likelihood × prior / evidence
 Predicts that X belongs to Ci iff the probability P(Ci|X) is the highest among all the P(Ck|X) for the k classes
 Practical difficulty: it requires initial knowledge of many probabilities, involving significant computational cost
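This decision rule is easy to state in code. A minimal Python sketch, using the class priors and the P(X|Ci) values from the AllElectronics example later in the deck; since P(X) is the same for every class, it can be dropped from the comparison:

```python
# P(Ci): class priors, and P(X|Ci): likelihoods (assumed already estimated).
priors = {"yes": 0.643, "no": 0.357}
likelihoods = {"yes": 0.044, "no": 0.019}

# Score each class by P(X|Ci) * P(Ci) and predict the argmax.
scores = {c: likelihoods[c] * priors[c] for c in priors}
prediction = max(scores, key=scores.get)
print(scores, "->", prediction)  # -> predicts "yes"
```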
Naïve Bayes Classification
 Let D be a training set of tuples and their associated class labels, where each tuple is represented by an n-D attribute vector X = (x1, x2, …, xn)
 Suppose there are m classes C1, C2, …, Cm
 Given a tuple X, the classifier will predict that X belongs to the class having the highest posterior probability, conditioned on X. That is, the naïve Bayesian classifier predicts that tuple X belongs to the class Ci if and only if
P(Ci|X) > P(Cj|X) for 1 ≤ j ≤ m, j ≠ i
Classification Is to Derive the Maximum Posteriori
 Classification is to derive the maximum posteriori, i.e., the maximal P(Ci|X)
 This can be derived from Bayes' theorem:
P(Ci|X) = P(X|Ci) P(Ci) / P(X)
 Since P(X) is constant for all classes, only P(X|Ci) P(Ci) needs to be maximized:
P(Ci|X) ∝ P(X|Ci) P(Ci)
Naïve Bayes Classifier
 A simplified assumption: attributes are conditionally independent (i.e., no dependence relation between attributes):
P(X|Ci) = Π_{k=1..n} P(xk|Ci) = P(x1|Ci) × P(x2|Ci) × … × P(xn|Ci)
 This greatly reduces the computation cost: only the class distribution needs to be counted
 If Ak is categorical, P(xk|Ci) is the number of tuples in Ci having value xk for Ak, divided by |Ci,D| (the number of tuples of Ci in D)
 If Ak is continuous-valued, P(xk|Ci) is usually computed based on a Gaussian distribution with mean μ and standard deviation σ:
g(x, μ, σ) = (1 / (√(2π) σ)) e^(−(x−μ)² / (2σ²))
and P(xk|Ci) = g(xk, μ_Ci, σ_Ci)
where μ_Ci and σ_Ci are the mean and standard deviation of attribute Ak over the tuples of class Ci
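A minimal Python sketch of these two estimates, assuming a hypothetical class_stats layout that stores per-class value counts for categorical attributes and a class-conditional mean and standard deviation for continuous ones:

```python
import math

def gaussian(x: float, mu: float, sigma: float) -> float:
    """g(x, mu, sigma): Gaussian density, used for continuous attributes."""
    return math.exp(-((x - mu) ** 2) / (2 * sigma ** 2)) / (math.sqrt(2 * math.pi) * sigma)

def naive_likelihood(x: dict, class_stats: dict) -> float:
    """P(X|Ci) as a product of per-attribute estimates (independence assumed)."""
    p = 1.0
    for attr, value in x.items():
        stat = class_stats[attr]  # hypothetical per-class statistics for this attribute
        if stat["type"] == "categorical":
            # count of tuples in Ci with this value, divided by |Ci,D|
            p *= stat["counts"].get(value, 0) / stat["total"]
        else:
            # continuous: Gaussian with the class-conditional mean/std
            p *= gaussian(value, stat["mu"], stat["sigma"])
    return p
```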
How to Predict a Class Label Using Naïve Bayesian Classification?
 Given class-labeled training tuples from the AllElectronics customer database.
 The data tuples are described by the attributes age, income, student, and credit_rating.
Naïve Bayes Classifier: Training Dataset

Class: C1: buys_computer = ‘yes’ and C2: buys_computer = ‘no’

 P(Ci): P(buys_computer = “yes”) = 9/14 = 0.643
P(buys_computer = “no”) = 5/14 = 0.357

age      income   student   credit_rating   buys_computer
<=30     high     no        fair            no
<=30     high     no        excellent       no
31…40    high     no        fair            yes
>40      medium   no        fair            yes
>40      low      yes       fair            yes
>40      low      yes       excellent       no
31…40    low      yes       excellent       yes
<=30     medium   no        fair            no
<=30     low      yes       fair            yes
>40      medium   yes       fair            yes
<=30     medium   yes       excellent       yes
31…40    medium   no        excellent       yes
31…40    high     yes       fair            yes
>40      medium   no        excellent       no

 The tuple we wish to classify is
X = (age = youth, income = medium, student = yes, credit_rating = fair)

Naïve Bayes Classifier: An Example

 Compute P(X|Ci) for each class

P(age = “youth” | buys_computer = “yes”) = 2/9 = 0.222
P(age = “youth” | buys_computer = “no”) = 3/5 = 0.6
P(income = “medium” | buys_computer = “yes”) = 4/9 = 0.444
P(income = “medium” | buys_computer = “no”) = 2/5 = 0.4
P(student = “yes” | buys_computer = “yes”) = 6/9 = 0.667
P(student = “yes” | buys_computer = “no”) = 1/5 = 0.2
P(credit_rating = “fair” | buys_computer = “yes”) = 6/9 = 0.667
P(credit_rating = “fair” | buys_computer = “no”) = 2/5 = 0.4

 X = (age = youth, income = medium, student = yes, credit_rating = fair)

P(X | buys_computer = “yes”) = 0.222 × 0.444 × 0.667 × 0.667 = 0.044
P(X | buys_computer = “no”) = 0.6 × 0.4 × 0.2 × 0.4 = 0.019

P(X | buys_computer = “yes”) P(buys_computer = “yes”) = 0.044 × 0.643 = 0.028
P(X | buys_computer = “no”) P(buys_computer = “no”) = 0.019 × 0.357 = 0.007

 Since 0.028 > 0.007, the condition P(C1|X) > P(C2|X) is satisfied; therefore the naïve Bayesian classifier predicts buys_computer = yes for tuple X
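The whole computation can be reproduced with a short Python sketch over the 14 training tuples above, using only counting-based estimates (no smoothing, no libraries):

```python
# The AllElectronics training tuples: (age, income, student, credit_rating, class)
data = [
    ("<=30", "high", "no", "fair", "no"),
    ("<=30", "high", "no", "excellent", "no"),
    ("31…40", "high", "no", "fair", "yes"),
    (">40", "medium", "no", "fair", "yes"),
    (">40", "low", "yes", "fair", "yes"),
    (">40", "low", "yes", "excellent", "no"),
    ("31…40", "low", "yes", "excellent", "yes"),
    ("<=30", "medium", "no", "fair", "no"),
    ("<=30", "low", "yes", "fair", "yes"),
    (">40", "medium", "yes", "fair", "yes"),
    ("<=30", "medium", "yes", "excellent", "yes"),
    ("31…40", "medium", "no", "excellent", "yes"),
    ("31…40", "high", "yes", "fair", "yes"),
    (">40", "medium", "no", "excellent", "no"),
]
# The tuple to classify: age = youth (i.e., <=30), medium income, student, fair credit.
x = ("<=30", "medium", "yes", "fair")

for label in ("yes", "no"):
    rows = [r for r in data if r[-1] == label]
    prior = len(rows) / len(data)  # P(Ci)
    likelihood = 1.0
    for i, value in enumerate(x):  # P(X|Ci) = product of the P(xk|Ci)
        likelihood *= sum(r[i] == value for r in rows) / len(rows)
    print(label, round(likelihood, 3), round(likelihood * prior, 3))
# yes 0.044 0.028 / no 0.019 0.007 -> predict buys_computer = "yes"
```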
Avoiding the Zero-Probability Problem
 Naïve Bayesian prediction requires each conditional probability to be non-zero; otherwise, the predicted probability will be zero:
P(X|Ci) = Π_{k=1..n} P(xk|Ci)
 Ex. Suppose a dataset with 1000 tuples: income = low (0), income = medium (990), and income = high (10)
 Use the Laplacian correction (or Laplacian estimator)
 Adding 1 to each case:
Prob(income = low) = 1/1003
Prob(income = medium) = 991/1003
Prob(income = high) = 11/1003
 The “corrected” probability estimates are close to their “uncorrected” counterparts
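A minimal sketch of this correction for the income example above; adding 1 to each count also adds the number of distinct values (here 3) to the denominator, which is where 1003 comes from:

```python
# Raw counts of income values within some class Ci (1000 tuples total).
counts = {"low": 0, "medium": 990, "high": 10}

k = len(counts)               # number of distinct attribute values (3)
total = sum(counts.values())  # 1000
smoothed = {v: (c + 1) / (total + k) for v, c in counts.items()}
print(smoothed)  # low: 1/1003, medium: 991/1003, high: 11/1003
```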
Naïve Bayes Classifier: Comments
 Advantages
 Easy to implement
 Good results obtained in most of the cases
 Disadvantages
 Assumption of class-conditional independence, hence a loss of accuracy
 Practically, dependencies exist among variables
 E.g., hospital patients: profile (age, family history, etc.), symptoms (fever, cough, etc.), disease (lung cancer, diabetes, etc.)
 Dependencies among these cannot be modeled by a naïve Bayes classifier
Thank you….
