07 Naive Bayes
1. This algorithm is called "Naive" because it makes the naive assumption that each feature is independent
of the other features, which is rarely true in real life.
2. As for the "Bayes" part, it refers to the statistician and philosopher Thomas Bayes and the theorem
named after him, Bayes' theorem, which is the basis of the Naive Bayes algorithm.
Bayes' Theorem

P(A|B) = P(B|A) * P(A) / P(B)

Where,
P(A|B) is the probability of hypothesis A given the data B. This is called the posterior probability.
P(B|A) is the probability of the data B given that hypothesis A is true. This is called the likelihood.
P(A) is the probability of hypothesis A being true (regardless of the data). This is called the prior
probability of A.
P(B) is the probability of the data (regardless of the hypothesis). This is called the evidence.
P(A|B) and P(B|A) are conditional probabilities; in general, P(B|A) = P(A and B) / P(A).
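As a quick check of the formula, here is a minimal Python sketch with purely illustrative numbers (in the spirit of the classic weather/'Play' example referred to further below; none of these values come from the notebook itself):

# Illustrative numbers only: prior, likelihood and evidence for "will they play, given it is sunny?"
p_play = 9 / 14              # prior P(A): P(Play = yes)
p_sunny_given_play = 3 / 9   # likelihood P(B|A): P(Sunny | Play = yes)
p_sunny = 5 / 14             # evidence P(B): P(Sunny)

# Bayes' theorem: posterior P(A|B) = P(B|A) * P(A) / P(B)
p_play_given_sunny = p_sunny_given_play * p_play / p_sunny
print(p_play_given_sunny)    # approx. 0.60 -> playing is more likely when it is sunny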
There are three commonly used types of Naive Bayes classifier (a short scikit-learn sketch of all three follows this list):
a. Multinomial Naive Bayes: This is mostly used for document classification problems, i.e. whether a
document belongs to the category of sports, politics, technology, etc. The features/predictors used by the
classifier are the frequencies of the words present in the document.
b. Bernoulli Naive Bayes: This is similar to Multinomial Naive Bayes, but the predictors are boolean
variables. The parameters that we use to predict the class variable take on only the values yes or no, for
example whether a word occurs in the text or not.
c. Gaussian Naive Bayes: When the predictors take on continuous values and are not discrete, we
assume that these values are sampled from a Gaussian distribution.
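As a rough illustration (not part of the original notebook), here is a minimal scikit-learn sketch of the three variants on a tiny, made-up feature matrix; the data and class labels are invented purely to show the API:

import numpy as np
from sklearn.naive_bayes import MultinomialNB, BernoulliNB, GaussianNB

# Toy data for illustration only: 4 "documents" x 3 features, two classes
X_counts = np.array([[2, 1, 0], [3, 0, 1], [0, 2, 3], [1, 0, 4]])  # word counts
y_toy = np.array([0, 0, 1, 1])

MultinomialNB().fit(X_counts, y_toy)                    # word frequencies as features
BernoulliNB().fit((X_counts > 0).astype(int), y_toy)    # word presence/absence as features
GaussianNB().fit(X_counts.astype(float), y_toy)         # continuous-valued features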
Consider a training data set of weather conditions with a corresponding target variable 'Play' (suggesting whether players will play): Naive Bayes classifies whether players will play or not based on the weather condition, exactly as in the hand-worked posterior above. The implementation below applies the same idea with scikit-learn to the Social_Network_Ads dataset, using two feature columns (indices 2 and 3) to predict the binary target in the last column. Let's follow the steps below to perform it.
import numpy as np
import matplotlib.pyplot as plt
import pandas as pd

# Load the dataset: two feature columns (indices 2 and 3) and the target (last column)
dataset = pd.read_csv('Social_Network_Ads.csv')
X = dataset.iloc[:, [2, 3]].values
y = dataset.iloc[:, -1].values
Splitting the dataset into the Training set and Test set
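The original split cell is not preserved in this export; a minimal sketch using scikit-learn's train_test_split (the test_size and random_state values are assumptions, chosen so that the held-out set has the 100 samples summarized in the confusion matrix further below) would look like:

from sklearn.model_selection import train_test_split

# Hold out a test set; test_size=0.25 and random_state=0 are assumed values
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=0)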
Feature Scaling
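The scaling cell itself is also missing; a typical sketch, assuming StandardScaler (the usual choice for this step), is:

from sklearn.preprocessing import StandardScaler

# Standardize both feature columns so they are on comparable scales
sc = StandardScaler()
X_train = sc.fit_transform(X_train)
X_test = sc.transform(X_test)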
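The cell that trains the model is missing from this export as well; a minimal sketch, assuming scikit-learn's GaussianNB with default parameters (consistent with the fitted-estimator output echoed below), is:

from sklearn.naive_bayes import GaussianNB

# Train Gaussian Naive Bayes on the scaled training data
classifier = GaussianNB()
classifier.fit(X_train, y_train)

In a notebook, fitting the classifier echoes its representation: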
GaussianNB(priors=None, var_smoothing=1e-09)
# Predict the class of each test-set sample
y_pred = classifier.predict(X_test)
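The cell that evaluated the predictions is not shown either; a sketch using sklearn.metrics.confusion_matrix, which would produce the output below (rows are true classes, columns are predicted classes), is:

from sklearn.metrics import confusion_matrix

# Compare true test labels with the predictions
cm = confusion_matrix(y_test, y_pred)
print(cm)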
[[65 3]
[ 7 25]]
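Reading the matrix, 65 + 25 = 90 of the 100 test samples are classified correctly and 3 + 7 = 10 are misclassified, i.e. 90% test accuracy. The plotting cells that visualised the decision boundary did not survive this export (only their repeated matplotlib warnings about the 'c' argument did); a sketch of the usual contour-plus-scatter visualisation, with illustrative colours and title, is:

from matplotlib.colors import ListedColormap

# Plot the decision regions of the fitted classifier together with the test points
X_set, y_set = X_test, y_test
X1, X2 = np.meshgrid(np.arange(X_set[:, 0].min() - 1, X_set[:, 0].max() + 1, 0.01),
                     np.arange(X_set[:, 1].min() - 1, X_set[:, 1].max() + 1, 0.01))
plt.contourf(X1, X2,
             classifier.predict(np.array([X1.ravel(), X2.ravel()]).T).reshape(X1.shape),
             alpha=0.75, cmap=ListedColormap(('red', 'green')))
for i, j in enumerate(np.unique(y_set)):
    # Passing an explicit colour (rather than c=) avoids the warnings the original cells emitted
    plt.scatter(X_set[y_set == j, 0], X_set[y_set == j, 1],
                color=ListedColormap(('red', 'green'))(i), label=j)
plt.title('Naive Bayes (Test set)')
plt.legend()
plt.show()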