
MACHINE LEARNING

Ensemble Learning and Random Forests


BTech III Year – II Semester
Computer Science & Engineering

UNIT-III
By

Dr. Satyabrata Dash
Professor
Department of Computer Science & Engineering
Ramachandra College of Engineering, Eluru

SYLLABUS
UNIT-III
MACHINE LEARNING
Ensemble Learning and Random Forests:
• Introduction,
• Voting Classifiers,
• Bagging and Pasting,
• Random Forests,
• Boosting,
• Stacking.
Support Vector Machine:
• Linear SVM Classification,
• Nonlinear SVM Classification
• SVM Regression,
• Naïve Bayes Classifiers.
Support Vector Machines (Linear SVM Classification)

1. The Support Vector Machine (SVM) is a supervised machine learning algorithm used for both
classification and regression (a minimal classification sketch follows this list).
2. The objective of the SVM algorithm is to find a hyperplane in an N-dimensional space
(N = the number of features) that distinctly classifies the data points.
3. The dimension of the hyperplane depends upon the number of features.
4. SVM chooses the extreme points/vectors that help in creating the hyperplane. These extreme
cases are called support vectors, and hence the algorithm is termed a Support Vector Machine.
5. If the number of input features is two, then the hyperplane is just a line. If the number of input
features is three, then the hyperplane becomes a two-dimensional plane. It becomes difficult to
imagine when the number of features exceeds three.
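
A minimal linear SVM classification sketch, assuming scikit-learn is available; the toy dataset and the C value are illustrative choices, not part of the original slides.

```python
# Minimal linear SVM classification sketch (assumes scikit-learn is installed).
from sklearn.datasets import make_blobs
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

# Toy two-class, two-feature dataset (illustrative only).
X, y = make_blobs(n_samples=200, centers=2, n_features=2, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# Linear kernel: the learned hyperplane is a straight line in 2-D feature space.
clf = SVC(kernel="linear", C=1.0)
clf.fit(X_train, y_train)

print("Number of support vectors:", clf.support_vectors_.shape[0])
print("Test accuracy:", clf.score(X_test, y_test))
```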
Support Vector Machines

1. Hyperplane: There can be multiple lines/decision boundaries that segregate the classes in
n-dimensional space, but we need to find the best decision boundary that helps classify the
data points. This best boundary is known as the hyperplane of the SVM.
2. Support Vectors:
The data points or vectors that are closest to the hyperplane and affect its position are termed
support vectors. Because these vectors support the hyperplane, they are called support vectors.
Support Vector Machines: Linear Separators
• Binary classification can be viewed as the task of separating classes in feature space:

f(x) = sign(wᵀx + b)

The separating hyperplane is defined by wᵀx + b = 0; points with wᵀx + b > 0 are assigned to one
class, and points with wᵀx + b < 0 to the other.
Support Vector Machines: Linear Separators

• Which of the linear separators is optimal?

Support Vector Machines: Classification Margin

• The distance from an example xᵢ to the separator is

r = |wᵀxᵢ + b| / ‖w‖
• Examples closest to the hyperplane are support vectors.
• The margin ρ of the separator is the distance between the support vectors of the two classes.

Support Vector Machines: Maximum Margin Classification

• Maximizing the margin is good according to intuition and PAC theory.


• This implies that only the support vectors matter; the other training examples can be ignored.
Support Vector Machines: Linear SVM Mathematically

• Let the training set {(xᵢ, yᵢ)}, i = 1..n, with xᵢ ∈ ℝᵈ and yᵢ ∈ {−1, 1}, be separated by a
hyperplane with margin ρ. Then for each training example (xᵢ, yᵢ):

wᵀxᵢ + b ≤ −ρ/2  if yᵢ = −1
wᵀxᵢ + b ≥  ρ/2  if yᵢ = +1

which can be written compactly as yᵢ(wᵀxᵢ + b) ≥ ρ/2.

• For every support vector xₛ the above inequality is an equality. After rescaling w and b by ρ/2
in the equality, we obtain that the distance between each xₛ and the hyperplane is

r = yₛ(wᵀxₛ + b) / ‖w‖ = 1 / ‖w‖

• Then the margin can be expressed through the (rescaled) w and b as:

ρ = 2r = 2 / ‖w‖
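
A small sketch, assuming scikit-learn and a roughly separable toy dataset, showing how the margin ρ = 2/‖w‖ derived above can be read off a fitted linear SVM; the dataset and the large C value (used to approximate a hard margin) are illustrative choices.

```python
# Margin width of a fitted linear SVM: rho = 2 / ||w|| (illustrative sketch).
import numpy as np
from sklearn.datasets import make_blobs
from sklearn.svm import SVC

X, y = make_blobs(n_samples=200, centers=2, random_state=42)
clf = SVC(kernel="linear", C=1000.0).fit(X, y)  # large C approximates a hard margin

w = clf.coef_[0]                  # weight vector of the separating hyperplane
margin = 2.0 / np.linalg.norm(w)  # rho = 2 / ||w|| from the derivation above
print("||w|| =", np.linalg.norm(w), " margin rho =", margin)
```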
Support Vector Machines: Advantages

Advantages of SVM:

1. Effective in high-dimensional cases.
2. Memory efficient, since the decision function uses only a subset of the training points,
called support vectors.
3. Different kernel functions can be specified for the decision function, and it is possible
to specify custom kernels.
Support Vector Machines (Nonlinear SVM Classification)

Non-linear SVM: A non-linear SVM is used for non-linearly separable data. If a dataset cannot be
classified using a straight line, it is termed non-linear data, and the classifier used for it is
called a non-linear SVM classifier.
Support Vector Machines (Nonlinear SVM Classification)

1. Non-linearly separable data can be handled by projecting the dataset into a higher dimension in
which it becomes linearly separable (a minimal kernel-SVM sketch follows this list).
2. In that higher-dimensional space, a linear classifier can then separate the originally
non-linearly separable dataset.
3. When the data is not linearly separable, we use the non-linear SVM classifier to separate
the data points.
4. It hypothetically takes the data points to a higher dimension, so that they become linearly
separable in that dimension, and then the algorithm classifies them.

What is a Kernel Function?
5. In machine learning, a kernel refers to a method that allows us to apply linear classifiers to
non-linear problems by mapping non-linear data into a higher-dimensional space, without the need
to explicitly visit or compute coordinates in that higher-dimensional space.
6. This function transforms the n-dimensional input space into an m-dimensional space, where m > n,
so that the required calculations for the higher dimension can still be done efficiently.
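
A rough illustration of the kernel idea, assuming scikit-learn; the make_moons dataset and the gamma/C values are illustrative choices. An RBF kernel separates data that no straight line can.

```python
# Non-linear SVM sketch: an RBF kernel handles data a straight line cannot separate
# (assumes scikit-learn; dataset and hyperparameters are illustrative).
from sklearn.datasets import make_moons
from sklearn.svm import SVC

X, y = make_moons(n_samples=300, noise=0.15, random_state=42)

linear_clf = SVC(kernel="linear").fit(X, y)
rbf_clf = SVC(kernel="rbf", gamma=2.0, C=1.0).fit(X, y)

print("Linear kernel accuracy:", linear_clf.score(X, y))  # limited by the straight-line boundary
print("RBF kernel accuracy:", rbf_clf.score(X, y))        # kernel trick captures the curvature
```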
Support Vector Machines (Nonlinear SVM Classification)
Support Vector Machines (Nonlinear SVM Classification: Types of Kernel Functions)
Support Vector Regression (SVR)
1. Support Vector Regression is a supervised learning algorithm that is used to predict continuous
values.
2. Support Vector Regression (SVR) uses the same principle as SVM, but for regression problems.
3. The basic idea behind SVR is to find the best-fit line.
4. In SVR, the best-fit line is the hyperplane that contains the maximum number of points within
the margin of tolerance.
Support Vector Regression (SVR)
1. Unlike other regression models, which try to minimize the error between the real and predicted
values, SVR tries to fit the best line within a threshold value. The threshold value is the
distance between the hyperplane and the boundary line.

Support Vector Regression (SVR)
In the case of regression, a margin of tolerance (epsilon) is set as an approximation to the SVM
margin. The main idea is always the same: minimize the error by finding the hyperplane that
maximizes the margin, keeping in mind that part of the error is tolerated.
Support Vector Regression (SVR)

Linear SVR

Non-linear SVR
The kernel functions transform the data into a higher-dimensional feature space, making linear
separation possible (a short SVR sketch follows below).
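
A minimal SVR sketch, assuming scikit-learn; the synthetic sine data and the C, epsilon, and gamma values are illustrative only.

```python
# Support Vector Regression sketch with linear and RBF kernels (assumes scikit-learn).
import numpy as np
from sklearn.svm import SVR

rng = np.random.default_rng(0)
X = np.sort(5 * rng.random((100, 1)), axis=0)          # inputs in [0, 5)
y = np.sin(X).ravel() + 0.1 * rng.standard_normal(100)  # noisy sine targets

# epsilon sets the width of the tolerance tube around the fitted function.
linear_svr = SVR(kernel="linear", C=1.0, epsilon=0.1).fit(X, y)
rbf_svr = SVR(kernel="rbf", C=1.0, epsilon=0.1, gamma="scale").fit(X, y)

print("Linear SVR R^2:", linear_svr.score(X, y))
print("RBF SVR R^2:", rbf_svr.score(X, y))
```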
Machine Learning Basic Methods: Naive Bayes Methods

1. The Naïve Bayes algorithm is a supervised learning algorithm, based on Bayes' theorem, used for
solving classification problems.
2. It is mainly used in text classification, which typically involves a high-dimensional training
dataset.
3. It is a probabilistic classifier, which means it predicts on the basis of the probability of an
object.
4. In simple terms, a Naive Bayes classifier assumes that the presence of a particular feature in a
class is unrelated to the presence of any other feature.
5. Some popular applications of the Naïve Bayes algorithm are spam filtering, sentiment analysis,
and classifying articles (a short text-classification sketch follows this list).
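
A rough illustration of the text-classification use case, assuming scikit-learn; the tiny corpus and labels are made up for the sketch.

```python
# Naive Bayes for text classification sketch (assumes scikit-learn; corpus is illustrative).
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

texts = ["win a free prize now", "meeting at 10 am tomorrow",
         "free lottery ticket claim now", "project report attached"]
labels = ["spam", "ham", "spam", "ham"]

# Bag-of-words features + multinomial Naive Bayes, the usual pairing for text.
model = make_pipeline(CountVectorizer(), MultinomialNB())
model.fit(texts, labels)

print(model.predict(["claim your free prize"]))  # likely ['spam']
```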
Machine Learning Basic Methods: Naive Bayes Methods
Example:

A fruit may be considered an apple if it is red, round, and about 3 inches in diameter. Even if
these features depend on each other or on the existence of the other features, all of these
properties independently contribute to the probability that this fruit is an apple, and that is
why it is known as 'Naïve'.

The Naïve Bayes algorithm comprises two words, Naïve and Bayes, which can be described as:

1. Naïve: It is called Naïve because it assumes that the occurrence of a certain feature is
independent of the occurrence of other features. For example, if a fruit is identified on the
basis of color, shape, and taste, then a red, spherical, and sweet fruit is recognized as an
apple; each feature individually contributes to identifying it as an apple, without depending on
the others.
2. Bayes: It is called Bayes because it depends on the principle of Bayes' theorem.
Machine Learning Basic Methods: Naive Bayes Methods
1. The Naive Bayes model is easy to build and particularly useful for very large data sets.
2. Along with simplicity, Naive Bayes is known to outperform even highly sophisticated
classification methods.
3. Bayes' theorem provides a way of calculating the posterior probability P(c|x) from P(c), P(x)
and P(x|c):

P(c|x) = P(x|c) · P(c) / P(x)

Where
• P(c|x) is the posterior probability of class c (target) given predictor x (attributes).
• P(c) is the prior probability of the class.
• P(x|c) is the likelihood, i.e. the probability of the predictor given the class.
• P(x) is the prior probability of the predictor.
Machine Learning Basic Methods: Naive Bayes algorithm
1. Convert the given dataset into frequency tables.
2. Generate a likelihood table by finding the probabilities of the given features.
3. Now, use Bayes' theorem to calculate the posterior probability.
Example:

Consider a training data set of weather conditions and the corresponding target variable 'Play'
(indicating whether play is possible). We need to classify whether players will play or not
based on the weather conditions.
Machine Learning Basic Methods: Naive Bayes Example

Problem: Players will play if the weather is sunny. Is this statement correct?

We can solve it using the method of posterior probability discussed above:

P(Yes | Sunny) = P(Sunny | Yes) · P(Yes) / P(Sunny)

Here we have:
P(Sunny | Yes) = 3/9 = 0.33, P(Sunny) = 5/14 = 0.36, P(Yes) = 9/14 = 0.64

Now, P(Yes | Sunny) = 0.33 · 0.64 / 0.36 = 0.60, which is the higher probability, so players are
likely to play on a sunny day (a small sketch verifying this calculation follows below).
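
A small plain-Python check of this calculation, assuming the standard 14-row weather/play counts used in the example above.

```python
# Plain-Python check of the worked example (counts assumed from the usual
# 14-row weather/play dataset referenced in the slides).
n_total = 14
n_yes = 9                # P(Yes) = 9/14
n_sunny = 5              # P(Sunny) = 5/14
n_sunny_given_yes = 3    # P(Sunny | Yes) = 3/9

p_yes = n_yes / n_total
p_sunny = n_sunny / n_total
p_sunny_given_yes = n_sunny_given_yes / n_yes

# Bayes' theorem: P(Yes | Sunny) = P(Sunny | Yes) * P(Yes) / P(Sunny)
p_yes_given_sunny = p_sunny_given_yes * p_yes / p_sunny
print(round(p_yes_given_sunny, 2))  # 0.6 -> players are more likely to play
```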
Machine Learning Basic Methods: Naive Bayes Method

Advantages

1. It is easy and fast to predict the class of a test data set. It also performs well in
multi-class prediction.
2. When the assumption of independence holds, a Naive Bayes classifier performs better compared to
other models like logistic regression, and it needs less training data.
3. It performs well with categorical input variables compared to numerical variables. For
numerical variables, a normal distribution is assumed (a bell curve, which is a strong assumption).

Dis-Advantages

1. If a categorical variable has a category in the test data set that was not observed in the
training data set, the model will assign it a zero probability and will be unable to make a
prediction. This is often known as the "zero frequency" problem. To solve it, we can use a
smoothing technique; one of the simplest is Laplace estimation (a minimal sketch follows this list).
2. On the other hand, Naive Bayes is also known to be a poor estimator, so the probability outputs
from predict_proba should not be taken too seriously.
3. Another limitation of Naive Bayes is the assumption of independent predictors. In real life, it
is almost impossible to get a set of predictors that are completely independent.
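
A minimal Laplace-smoothing sketch, assuming scikit-learn's CategoricalNB; the integer encoding and the tiny dataset are illustrative only.

```python
# Laplace (add-one) smoothing sketch: alpha=1 keeps category probabilities away from zero
# (assumes scikit-learn; the tiny encoded dataset is illustrative).
import numpy as np
from sklearn.naive_bayes import CategoricalNB

# Feature 0: weather encoded as 0=Sunny, 1=Overcast, 2=Rainy (hypothetical encoding).
X_train = np.array([[0], [0], [1], [2], [2], [1]])
y_train = np.array([0, 0, 1, 1, 0, 1])  # 0 = No, 1 = Yes

clf = CategoricalNB(alpha=1.0)  # alpha=1.0 is Laplace estimation
clf.fit(X_train, y_train)
print(clf.predict_proba([[2]]))  # smoothed probabilities, never exactly zero
```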
Thank You

