SVM Unit 2
What is an SVM?
Support vector machines are a set of supervised learning methods used for
classification, regression, and outlier detection. All of these are common tasks in
machine learning.
You can use them to detect cancerous cells based on millions of images or you can
use them to predict future driving routes with a well-fitted regression model.
There are specific types of SVMs you can use for particular machine learning
problems, like support vector regression (SVR) which is an extension of support
vector classification (SVC).
The main thing to keep in mind here is that these are just math equations tuned to
give you the most accurate answer possible as quickly as possible.
SVMs are different from other classification algorithms because of the way they
choose the decision boundary that maximizes the distance from the nearest data
points of all the classes. The decision boundary created by SVMs is called the
maximum margin classifier or the maximum margin hyperplane.
What makes the linear SVM algorithm better than some of the other algorithms,
like k-nearest neighbors, is that it chooses the best line to classify your data points.
It chooses the line that separates the data and is as far away from the closest
data points as possible.
A 2-D example helps to make sense of all the machine learning jargon. Basically
you have some data points on a grid. You're trying to separate these data points by
the category they should fit in, but you don't want to have any data in the wrong
category. That means you're trying to find the line between the two closest points
that keeps the other data points separated.
So the two closest data points give you the support vectors you'll use to find that
line. That line is called the decision boundary.
Types of SVMs
There are two different types of SVMs, each used for different things:
Simple SVM: Typically used for linear regression and classification problems.
Kernel SVM: Has more flexibility for non-linear data because the kernel implicitly
maps the data into a higher-dimensional space where a separating hyperplane can
be fit.
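In scikit-learn, both types are handled by the same estimator class, with the
kernel argument selecting between them; a minimal sketch (the variable names are
ours):

from sklearn import svm

# a simple (linear) SVM and a kernel SVM differ only in the kernel argument
linear_clf = svm.SVC(kernel='linear')  # straight-line decision boundary
kernel_clf = svm.SVC(kernel='rbf')     # non-linear decision boundary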
Why SVMs are used in machine learning
SVMs are used in applications like handwriting recognition, intrusion detection,
face detection, email classification, gene classification, and web page
classification. This versatility is one of the reasons we use SVMs in machine
learning: they can handle both classification and regression on linear and
non-linear data.
Another reason we use SVMs is because they can find complex relationships
between your data without you needing to do a lot of transformations on your own.
It's a great option when you are working with smaller datasets that have tens to
hundreds of thousands of features. They typically find more accurate results when
compared to other algorithms because of their ability to handle small, complex
datasets.
Here are some of the pros and cons for using SVMs.
Pros
Effective on datasets with multiple features, like financial or medical data.
Effective in cases where the number of features is greater than the number of
data points.
Uses a subset of training points in the decision function called support vectors,
which makes it memory efficient.
Different kernel functions can be specified for the decision function. You can use
common kernels, but it's also possible to specify custom kernels.
Cons
If the number of features is a lot bigger than the number of data points, avoiding
over-fitting when choosing kernel functions and the regularization term is crucial.
SVMs don't directly provide probability estimates. Those are calculated using an
expensive five-fold cross-validation (see the sketch after this list).
SVMs work best on small sample sets because of their high training time.
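In scikit-learn, those probability estimates come from setting probability=True,
which triggers the internal cross-validation mentioned above; a minimal sketch
with made-up data:

from sklearn import svm

# hypothetical data: ten points in two well-separated classes
demo_X = [[0, 0], [0, 1], [1, 0], [1, 1], [1, 2],
          [4, 4], [4, 5], [5, 4], [5, 5], [5, 6]]
demo_y = [0, 0, 0, 0, 0, 1, 1, 1, 1, 1]

# probability=True is what pays the cross-validation cost
clf = svm.SVC(kernel='linear', probability=True)
clf.fit(demo_X, demo_y)
print(clf.predict_proba([[2, 2]]))  # class probabilities for a new point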
Since SVMs can use any number of kernels, it's important that you know about a
few of them.
Kernel functions
Linear
These are commonly recommended for text classification because most of these
types of classification problems are linearly separable.
The linear kernel works really well when there are a lot of features, and text
classification problems have a lot of features. Linear kernel functions are faster
than most of the others and you have fewer parameters to optimize.
f(X) = w^T * X + b
In this equation, w is the weight vector that you want to minimize, X is the data
that you're trying to classify, and b is the linear coefficient estimated from the
training data. This equation defines the decision boundary that the SVM returns.
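As a quick check of how that equation classifies a point, here is a tiny sketch
with made-up values for w, b, and the input:

import numpy as np

w = np.array([0.5, -0.25])    # weight vector (made up for illustration)
b = -1.0                      # linear coefficient (made up)
X_new = np.array([4.0, 2.0])  # point to classify (made up)

f = np.dot(w, X_new) + b      # f(X) = w^T * X + b
label = 1 if f >= 0 else 0    # the sign of f(X) picks the side of the boundary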
Polynomial
The polynomial kernel isn't used in practice very often because it isn't as
computationally efficient as other kernels and its predictions aren't as accurate.
The polynomial kernel has the form f(X1, X2) = (alpha * X1^T * X2 + C)^d. In this
function, alpha is a scaling coefficient, C is an offset value to account for some
mis-classification of data that can happen, and d is the degree of the polynomial.
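Written as code, the kernel is a one-liner; this sketch uses illustrative
parameter values:

import numpy as np

def polynomial_kernel(x1, x2, alpha=1.0, C=1.0, d=2):
    # (alpha * x1^T * x2 + C)^d
    return (alpha * np.dot(x1, x2) + C) ** d

# similarity between two made-up points under a degree-2 polynomial kernel
print(polynomial_kernel(np.array([1.0, 2.0]), np.array([0.5, 1.5])))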
Others
There are plenty of other kernels you can use for your project. This might be a
decision to make when you need to meet certain error constraints, you want to try
to speed up the training time, or you want to fine-tune parameters.
Some other kernels include: ANOVA radial basis, hyperbolic tangent, and Laplace
RBF.
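scikit-learn's SVC also accepts a Python callable as the kernel, so you can plug
one of these in yourself. A sketch of a Laplace RBF kernel with made-up data:

import numpy as np
from sklearn import svm

def laplace_rbf(A, B, gamma=1.0):
    # exp(-gamma * ||a - b||_1) for every pair of rows in A and B
    dists = np.abs(A[:, None, :] - B[None, :, :]).sum(axis=2)
    return np.exp(-gamma * dists)

# a callable kernel must return the kernel matrix between two sets of samples
clf = svm.SVC(kernel=laplace_rbf)
clf.fit(np.array([[0.0, 0.0], [1.0, 1.0], [3.0, 3.0], [4.0, 4.0]]), [0, 0, 1, 1])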
Now that you know a bit about how the kernels work under the hood, let's go
through a couple of examples.
# imports used throughout the examples
import numpy as np
import matplotlib.pyplot as plt
from sklearn import svm, datasets

# linear data
X = np.array([1, 5, 1.5, 8, 1, 9, 7, 8.7, 2.3, 5.5, 7.7, 6.1])
y = np.array([2, 8, 1.8, 8, 0.6, 11, 10, 9.4, 4, 3, 8.8, 7.5])
We're working with NumPy arrays because they make the matrix operations faster
and use less memory than Python lists. You can also take advantage of typing the
contents of the arrays. Now let's take a look at what the data look like in a plot:
# show unclassified data
plt.scatter(X, y)
plt.show()
Once you see what the data look like, you can take a better guess at which
algorithm will work best for you. Keep in mind that this is a really simple dataset,
so most of the time you'll need to do some work on your data to get it to a usable
state.
We'll do a bit of pre-processing on this already structured data. This will put the
raw data into a format that we can use to train the SVM model.
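That pre-processing and training code isn't shown here, so the following is a
minimal sketch of what it could look like; the labels in training_y are made up
for illustration:

# stack the two coordinate arrays into (x, y) pairs for training
training_X = np.vstack((X, y)).T

# hypothetical class labels, one per data point
training_y = [0, 1, 0, 1, 0, 1, 1, 1, 0, 0, 1, 1]

# define and train a linear SVM classifier
clf = svm.SVC(kernel='linear', C=1.0)
clf.fit(training_X, training_y)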
With your model trained, you can make predictions on how a new data point will
be classified and you can make a plot of the decision boundary. Let's plot the
decision boundary.
# get the weight values for the linear equation from the trained SVM model
w = clf.coef_[0]
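Only that first line of the plotting code survives here; a sketch of the rest,
using the hypothetical training arrays from the sketch above:

# slope of the boundary line, from w[0]*x + w[1]*y + b = 0
a = -w[0] / w[1]
XX = np.linspace(0, 13)
yy_line = a * XX - clf.intercept_[0] / w[1]

# draw the boundary over the labeled training points
plt.plot(XX, yy_line, 'k-')
plt.scatter(training_X[:, 0], training_X[:, 1], c=training_y)
plt.show()

That covers the linear example. The second example uses data that a straight line
can't separate: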
# non-linear data
circle_X, circle_y = datasets.make_circles(n_samples=300, noise=0.05)
The next step is to take a look at what this raw data looks like with a plot.
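The plotting and model-training code is missing here as well; a sketch, where
nonlinear_clf is an illustrative name:

# show the raw non-linear data
plt.scatter(circle_X[:, 0], circle_X[:, 1], c=circle_y, marker='.')
plt.show()

# an RBF kernel can separate the concentric circles a straight line cannot
nonlinear_clf = svm.SVC(kernel='rbf', C=1.0)
nonlinear_clf.fit(circle_X, circle_y)

With the model trained, we can plot its decision boundary. The fragment below
starts by plotting the data again so that the axis limits reflect it:

# plot the data and grab the current axes
plt.scatter(circle_X[:, 0], circle_X[:, 1], c=circle_y, marker='.')
ax = plt.gca()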
xlim = ax.get_xlim()
ylim = ax.get_ylim()
# build a grid of points covering the plot area
xx = np.linspace(xlim[0], xlim[1], 30)
yy = np.linspace(ylim[0], ylim[1], 30)
XX, YY = np.meshgrid(xx, yy)
xy = np.vstack([XX.ravel(), YY.ravel()]).T
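Evaluating the trained model over that grid gives us the boundary; a sketch:

# evaluate the decision function at every grid point
Z = nonlinear_clf.decision_function(xy).reshape(XX.shape)

# the zero level set of the decision function is the decision boundary
ax.contour(XX, YY, Z, colors='k', levels=[0], alpha=0.5, linestyles=['-'])
plt.show()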
When you have your data and you know the problem you're trying to solve, it
really can be this simple.
You can change your training model completely, you can choose different
algorithms and features to work with, and you can fine-tune your results based on
multiple parameters. There are libraries and packages for all of this now so there's
not a lot of math you have to deal with.
There are a few things you should watch out for with SVMs in particular:
Make sure that your data are in numeric form instead of categorical form. SVMs
expect numbers instead of other kinds of labels.
Avoid copying data as much as possible. Some Python libraries will make
duplicates of your data if they aren't in a specific format. Copying data will also
slow down your training time and skew the way your model assigns the weights to
a specific feature.
Watch your kernel cache size because it uses your RAM. If you have a really large
dataset, this could cause problems for your system.
Scale your data because SVM algorithms aren't scale invariant. That means you
should convert all of your features to the range [0, 1] or [-1, 1], as in the
sketch below.
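A minimal sketch of scaling with scikit-learn's MinMaxScaler (the data here are
made up):

from sklearn.preprocessing import MinMaxScaler

# rescale every feature into [0, 1] before training
scaler = MinMaxScaler(feature_range=(0, 1))
scaled = scaler.fit_transform([[10.0, 200.0], [20.0, 400.0], [15.0, 300.0]])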