Introduction to ML Unit-1 PPT

Uploaded by

sikkotech


BORANA UNIVERSITY

COLLEGE OF NATURAL AND

COMPUTATIONAL SCIENCE
DEPARTMENT OF COMPUTER SCIENCE
Course Title: Introduction to Machine Learning (ML)
Course instructor: Guyita G.
Target Group: 4th Year second semester
CHAPTER ONE
INTRODUCTION

Definition of machine learning

History and relationships to other fields
Essential math and statistics for machine learning
Applications of machine learning
Types of machine learning techniques
machine learning
Deep learning
The figure below shows the relationship between artificial intelligence,
machine learning, and deep learning, although this view is not universally accepted.
Relationship between artificial intelligence, machine learning, and deep learning

Artificial Intelligence:
Algorithms and systems that exhibit human-like intelligence.
Machine Learning:
Subset of AI that can learn to perform a task with extracted data and/or models.
Deep Learning:
Subset of machine learning that loosely imitates the functioning of the human brain to
solve problems.
Definition of machine learning
First, what does learning mean?
Learning is the ability to improve one’s behavior with experience.
Machine learning explores algorithms that learn from data and build models
from data; these models can then be used for different tasks.
For example, a model can be used for prediction, decision making, or solving
tasks.
Machine learning is about extracting knowledge from data.
Definition of machine learning

Machine learning is a subfield of computer science that is concerned with building

algorithms which, to be useful, rely on a collection of examples of some
phenomenon.

Machine learning can also be defined as the process of solving a practical problem
by
1) gathering a dataset, and
2) algorithmically building a statistical model based on that dataset.
That statistical model is assumed to be used somehow to solve the practical
problem.
ML Vs Classical/Traditional Algorithms

Traditional Programming
Traditional programming is a manual process meaning a programmer creates the program.
i.e. input + program = output

Machine Learning
Unlike traditional programming, machine learning is an automated process.
ML algorithm automatically formulates the rules from the data.
i.e. Input (Features) + Output (Class Label) = Program (Rules)

ML algorithms do not depend on rules defined by human experts.

Instead, they process data in raw form like text, emails, documents, social media content,
images, voice and video.
ML Vs Classical/Traditional Algorithms
An ML system is truly a learning system: it is not programmed to perform
a task, but programmed to learn to perform a task.
Many ML models are hard to interpret, and for this reason they are usually
unsuitable when the purpose is to understand relationships.
They mostly work well where one only needs predictions.
One of the key differences is that classical approaches rely on a more rigorous
mathematical formulation, while machine learning algorithms are more data-intensive.
RELATIONS TO OTHER FIELDS
• Machine learning shares common threads with the mathematical fields of statistics,
information theory, game theory, and optimization.
• It is naturally a subfield of computer science, as our goal is to program machines so that they
will learn.
• In a sense, machine learning can be viewed as a branch of AI, since, after all, the ability to
turn experience into expertise or to detect meaningful patterns in complex sensory data is a
cornerstone of human (and animal) intelligence.
• However, one should note that, in contrast with traditional AI, machine learning is not trying
to build automated imitation of intelligent behavior, but rather to use the strengths and special
abilities of computers to complement human intelligence, often performing tasks that fall way
beyond human capabilities.
• For example, the ability to scan and process huge databases allows machine learning
programs to detect patterns that are outside the scope of human perception.
Essential math and statistics for machine learning
Mathematics for machine learning course covers essential topics such as calculus
for optimization, linear algebra for data transformation, probability for making
predictions under uncertainty, descriptive statistics for data summarization, and
inferential statistics for making data-driven decisions.

Descriptive statistics (frequency, mean) and inferential statistics (correlation and regression
analysis) are used to analyze empirical data.
Essential math and statistics for machine learning
The Essential Maths & Statistics for Machine Learning course is designed to provide a
foundation in the mathematical and statistical concepts that underpin machine learning
algorithms and models.
Learners will gain insights into why mathematics is fundamental for creating effective
machine learning applications, delve into statistics for data analysis and model evaluation,
and understand basic algebra for handling equations and functions.
Through various modules, the course equips participants with the tools needed for model
selection, interpretation, and the creation of compelling data visualizations.
It's designed to help learners build a strong mathematical foundation, enabling them to
implement and innovate with machine learning algorithms effectively
Mean or Expectation Value
The mean or expectation value is a measure of central tendency.
Let X be a random variable with N observations x_1, ..., x_N; then the mean value of X is given
by
$\mu = \frac{1}{N}\sum_{i=1}^{N} x_i$

Variance and Standard Deviation
Let X be a random variable with N observations; then the variance of X is given by:
$\sigma^2 = \frac{1}{N}\sum_{i=1}^{N} (x_i - \mu)^2$
(the sample variance divides by N - 1 instead of N).

The standard deviation $\sigma$ is the square root of the variance and is a measure of
uncertainty or volatility.
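As a rough illustration of these three quantities, the sketch below computes them for a small made-up list of observations. It assumes the population form of the variance (dividing by the number of observations); the function names and the data are illustrative only.

```python
import math

def mean(xs):
    """Arithmetic mean of a list of observations."""
    return sum(xs) / len(xs)

def variance(xs):
    """Population variance: average squared deviation from the mean."""
    mu = mean(xs)
    return sum((x - mu) ** 2 for x in xs) / len(xs)

def std_dev(xs):
    """Standard deviation: square root of the variance."""
    return math.sqrt(variance(xs))

# Hypothetical observations, chosen only for illustration.
observations = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]
print(mean(observations), variance(observations), std_dev(observations))
```

For a sample estimate, the divisor in `variance` would be `len(xs) - 1` instead.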
Types of Machine Learning
Supervised (inductive) learning
– Training data includes desired / correct outputs or labels
Unsupervised learning
– Training data does not include desired outputs
Semi-supervised learning (various forms)
– Training data includes a few desired outputs
– Training data has desired outputs, but for a different (related) task
Reinforcement learning
– Rewards from sequence of actions
Types of Machine Learning
Supervised Vs Unsupervised Vs Reinforcement
[Table: comparison of supervised, unsupervised, and reinforcement learning by data types, training, aim, approach, output feedback, popular algorithms, and applications]
Recap
Supervised Learning
The machine learns from training data that is labeled.
Supervised learning is the method in which we teach the machine by using labeled data.
In supervised learning, a model learns to predict with the help of a labeled dataset.
Supervised learning often requires human effort to build the training set, but afterward it
automates and often speeds up an otherwise laborious or infeasible task.
Supervised learning is used whenever we want to predict a certain outcome from a given input,
and we have examples of input/output pairs.
Supervised machine learning algorithms apply what has been learned from labeled examples
to new data in order to predict future events.

Supervised learning is classified into two categories of algorithms:

Classification: A classification problem is when the output variable is a category, such as red or blue, or
disease or no disease.
Regression: A regression problem is when the output variable is a real value, such as price or weight or
height.
How Does Supervised Learning Work?
In supervised learning, models are trained using a labelled dataset, where the
model learns about each type of data.
Once the training process is completed, the model is tested on
test data (a held-out portion of the labelled data that was not used for training), and then it predicts the output.
Steps Involved in Supervised Learning

• First, determine the type of training dataset.

• Collect/gather the labelled training data.
• Split the dataset into a training dataset, a test dataset, and a validation
dataset.
• Determine the input features of the training dataset, which should carry
enough information for the model to accurately predict the output.
• Determine a suitable algorithm for the model, such as support vector
machine, decision tree, etc.
• Execute the algorithm on the training dataset. Sometimes we need a validation
set, held out from the training data, to tune the control parameters.
• Evaluate the accuracy of the model on the test set. If the model
predicts the correct outputs, our model is accurate.
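The splitting step in the list above can be sketched as follows. This is a minimal illustration using only the standard library; the 60/20/20 proportions, the function name `train_val_test_split`, and the toy data are assumptions, not something the slides prescribe.

```python
import random

def train_val_test_split(examples, val_frac=0.2, test_frac=0.2, seed=0):
    """Shuffle the labelled examples and cut them into three disjoint parts."""
    rng = random.Random(seed)          # fixed seed so the split is reproducible
    shuffled = list(examples)
    rng.shuffle(shuffled)
    n_test = int(len(shuffled) * test_frac)
    n_val = int(len(shuffled) * val_frac)
    test = shuffled[:n_test]
    val = shuffled[n_test:n_test + n_val]
    train = shuffled[n_test + n_val:]
    return train, val, test

# Hypothetical labelled data: (input, label) pairs.
data = [(x, x % 2) for x in range(10)]
train, val, test = train_val_test_split(data)
print(len(train), len(val), len(test))
```

The model is fitted on `train`, tuned on `val`, and evaluated once on `test`, so that the test examples never influence training.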
Some Terminology of Supervised Learning
Linear model
A model is said to be linear when it is linear in parameters.
• In order to formulate a learning problem mathematically, we need to define two things:
a model and a loss function.
The model, or architecture, defines the set of allowable hypotheses, or functions that
compute predictions from the inputs.
In the case of linear regression, the model simply consists of linear functions.
Recall that a linear function of D inputs is parameterized in terms of D coefficients,
which we’ll call the weights, and an intercept term, which we’ll call the bias.
Mathematically, this is written as:
$y = \sum_{j=1}^{D} w_j x_j + b$
Linear Model
Loss function is a function L(y,t) which says how far off the prediction y is from the target
t.
In linear regression, we use squared error, defined as $L(y,t) = \frac{1}{2}(y - t)^2$.
This is small when y and t are close together, and large when they are far apart.
In general, the value y - t is known as the residual, and we’d like the residuals to be close to
zero
When we combine our model and loss function, we get an optimization problem, where we
are trying to minimize a cost function with respect to the model parameters (i.e. the weights
and bias).
The cost function is simply the loss, averaged over all the training examples.
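To make the pieces concrete, here is a small sketch of a linear model with weights and a bias, the squared-error loss, and the cost as the average loss over the training examples. The function names and the toy inputs and targets are made up for illustration.

```python
def predict(weights, bias, x):
    """Linear model: weighted sum of the inputs plus the bias."""
    return sum(w * xj for w, xj in zip(weights, x)) + bias

def squared_error(y, t):
    """Loss for one example: half the squared residual (y - t)."""
    return 0.5 * (y - t) ** 2

def cost(weights, bias, inputs, targets):
    """Cost: the loss averaged over all training examples."""
    losses = [squared_error(predict(weights, bias, x), t)
              for x, t in zip(inputs, targets)]
    return sum(losses) / len(losses)

# Hypothetical training set generated by y = 1*x1 + 2*x2 with zero bias.
inputs = [[1.0, 2.0], [0.0, 1.0], [3.0, 0.0]]
targets = [5.0, 2.0, 3.0]

print(cost([1.0, 2.0], 0.0, inputs, targets))  # parameters that fit exactly
print(cost([1.0, 2.0], 1.0, inputs, targets))  # every residual equals 1
```

Training a linear regression model amounts to searching for the weights and bias that make `cost` as small as possible.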
What is regression?
First, what is a function?
“A function is a set of ordered pairs of numbers (x,y) such that to
each value of the first variable (x) there corresponds a unique
value of the second variable (y)” .
Regression analysis consists of a set of machine learning methods that allow us
to predict a continuous outcome variable (y) based on the value of one or
multiple predictor variables (x).
Types of regression
Linear Regression
In linear regression, the data are modeled to fit a straight line.
In the model y = wx + b, a random variable y (called a response variable) can be modeled as a
linear function of another random variable x (called a predictor variable),
where the variance of y is assumed to be constant.
In the context of data mining, x and y are numeric database attributes.
The coefficients w and b (called regression coefficients) specify the slope
of the line and the y-intercept, respectively.
The model is learned from examples (X, Y) and then used to predict Y for a new X,
where Y is continuous.
E.g., the simplest type of function is a linear function.
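One common way to obtain the slope and intercept of the line is ordinary least squares, which for a single predictor has a closed-form solution. The sketch below assumes that method (the slides do not say which fitting procedure is intended) and uses made-up data.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = w*x + b for one predictor variable."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    # Slope: covariance of x and y divided by the variance of x.
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    w = sxy / sxx
    # Intercept: the fitted line passes through the point of means.
    b = y_mean - w * x_mean
    return w, b

# Hypothetical examples lying exactly on y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]
w, b = fit_line(xs, ys)
print(w, b)
print(w * 10.0 + b)  # prediction for a new X
```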
Polynomial regression
A model is said to be linear when it is linear in parameters.
So the models
$y = \beta_0 + \beta_1 x + \beta_2 x^2 + \varepsilon$ and
$y = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \beta_{11} x_1^2 + \beta_{22} x_2^2 + \beta_{12} x_1 x_2 + \varepsilon$
are also linear models.
In fact, they are the second-order polynomials in one and two
variables, respectively.
Polynomial models can be used in situations where the relationship between the
study and explanatory variables is curvilinear.
Sometimes a nonlinear relationship over a small range of the explanatory variable can also
be modelled by polynomials.
For example, $y = \beta_0 + \beta_1 x + \beta_2 x^2 + \varepsilon$ or $E(y) = \beta_0 + \beta_1 x + \beta_2 x^2$
is a polynomial regression model in one variable and is called a second-order model
or quadratic model.
The coefficients $\beta_1$ and $\beta_2$ are called the linear effect parameter and quadratic effect
parameter, respectively.
Regularization
If more features are obtained by extending the model with nonlinear
transformations or if the number of inputs p is large and the number of data
points n is small, one may experience overfitting.
The term overfitting indicates that the model is fitted not only to the ‘signal’
but also to the ‘noise’.
A useful approach to handle overfitting is regularization.
Regularization can be motivated by ‘keeping the parameters β small unless the
data really convinces us otherwise’, or
alternatively ‘if a model with small values of the parameters β fits the data
almost as well as a model with larger parameter values, the one with small
parameter values should be preferred’.
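A minimal sketch of this idea, assuming an L2 (ridge) penalty and a one-parameter model y = βx with no intercept. The slides do not name a specific penalty, so both choices are assumptions made to keep the example small. The penalty weight `lam` and the data are hypothetical.

```python
def ridge_slope(xs, ys, lam):
    """Minimize sum((y - beta*x)^2) + lam * beta^2 for a single parameter beta.

    Setting the derivative to zero gives beta = sum(x*y) / (sum(x^2) + lam).
    """
    sxy = sum(x * y for x, y in zip(xs, ys))
    sxx = sum(x * x for x in xs)
    return sxy / (sxx + lam)

# Hypothetical data lying exactly on y = 2x.
xs = [1.0, 2.0, 3.0]
ys = [2.0, 4.0, 6.0]

for lam in (0.0, 1.0, 14.0, 100.0):
    print(lam, ridge_slope(xs, ys, lam))
```

With `lam = 0` the unregularized least-squares slope is recovered; larger values of `lam` pull the parameter toward zero, which is the "keep β small unless the data convinces us otherwise" behaviour described above.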
How to evaluate a model?
• Regression
– Some measure of how close are predicted values (by a model) to the actual values
• Classification
– Whether predicted classes match the actual classes
Understand the metrics used to evaluate regression

Evaluation metrics for Regression

• Mean Squared Error (MSE)
– For every data point, compute error (distance between predicted value and actual value)
– Sum squares of these errors, and take average
– More popular variant: RMSE (square root of MSE)
• R2 or R-squared
– A naïve Simple Average Model (SAM): for every point, predict the average of all points
– R2: 1 – (error of model / error of SAM)
– Best possible R2 is 1; can be negative for a really bad model
Cont.…
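R² or R-squared
• Dataset has N instances <x_i, y_i>, i = 1..N
• Predicted values: f_i, i = 1..N
• Mean of actual values: $\bar{y} = \frac{1}{N}\sum_{i=1}^{N} y_i$
• $R^2 = 1 - \frac{\sum_{i=1}^{N} (y_i - f_i)^2}{\sum_{i=1}^{N} (y_i - \bar{y})^2}$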
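The regression metrics described above can be sketched in a few lines. The function names and the toy values are illustrative; R² is computed as one minus the squared error of the model divided by the squared error of the simple average model.

```python
import math

def mse(actual, predicted):
    """Mean squared error: average of squared prediction errors."""
    return sum((y - f) ** 2 for y, f in zip(actual, predicted)) / len(actual)

def rmse(actual, predicted):
    """Root mean squared error: square root of the MSE."""
    return math.sqrt(mse(actual, predicted))

def r_squared(actual, predicted):
    """1 - (squared error of model / squared error of predicting the mean)."""
    y_mean = sum(actual) / len(actual)
    ss_model = sum((y - f) ** 2 for y, f in zip(actual, predicted))
    ss_sam = sum((y - y_mean) ** 2 for y in actual)
    return 1.0 - ss_model / ss_sam

# Hypothetical actual values and two sets of predictions.
actual = [3.0, 5.0, 7.0, 9.0]
good_model = [2.0, 5.0, 8.0, 9.0]
bad_model = [9.0, 7.0, 5.0, 3.0]

print(mse(actual, good_model), rmse(actual, good_model), r_squared(actual, good_model))
print(r_squared(actual, bad_model))  # negative: worse than predicting the mean
```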
Evaluation metrics for classification
• Let y = actual class, h = predicted class for an example
• Accuracy: Out of all examples, for what fraction is h = y?
• But accuracy is often not sufficient to indicate performance in practice
Skewed classes
• Often the class of interest is a rare class (y=1)
– Spam emails / social network accounts
– Cancerous cells
– Fraudulent credit card transactions
• Precision: Out of all examples for which model
predicted h=1, for what fraction is y=1?
• Recall: Of all examples for which y=1, for what
fraction did model correctly predict h=1?
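The following sketch shows why accuracy alone can mislead on skewed classes. The data are made up (5 positives out of 100 examples), and returning 0.0 when a ratio would be 0/0 is a convention assumed here, not something the slides specify.

```python
def confusion_counts(actual, predicted):
    """Count true/false positives and negatives for the positive class 1."""
    tp = sum(1 for y, h in zip(actual, predicted) if y == 1 and h == 1)
    fp = sum(1 for y, h in zip(actual, predicted) if y == 0 and h == 1)
    fn = sum(1 for y, h in zip(actual, predicted) if y == 1 and h == 0)
    tn = sum(1 for y, h in zip(actual, predicted) if y == 0 and h == 0)
    return tp, fp, fn, tn

def accuracy(actual, predicted):
    tp, fp, fn, tn = confusion_counts(actual, predicted)
    return (tp + tn) / len(actual)

def precision(actual, predicted):
    tp, fp, _, _ = confusion_counts(actual, predicted)
    return tp / (tp + fp) if (tp + fp) else 0.0  # assumed convention for 0/0

def recall(actual, predicted):
    tp, _, fn, _ = confusion_counts(actual, predicted)
    return tp / (tp + fn) if (tp + fn) else 0.0  # assumed convention for 0/0

# Hypothetical skewed dataset: 5 rare positives, 95 negatives.
actual = [1] * 5 + [0] * 95
always_negative = [0] * 100                   # never predicts the rare class
detector = [1, 1, 1, 1, 0] + [1, 1] + [0] * 93  # finds 4 of 5, 2 false alarms

print(accuracy(actual, always_negative), recall(actual, always_negative))
print(accuracy(actual, detector), precision(actual, detector), recall(actual, detector))
```

The model that never predicts the rare class still reaches 95% accuracy, but its recall of zero reveals that it is useless for the class of interest.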
Overfitting

If we have too many features, the learned hypothesis may fit the training set
very well but fail to generalize to new examples.
Sources of noise and error
• While learning a target function using a training set
• Two sources of noise
– Some training points may not come exactly from the target function:
stochastic noise
– The target function may be too complex to capture using the chosen
hypothesis set: deterministic noise
• Generalization error: Model tries to fit the noise in the training data, which
gets extrapolated to the test set
Ways to handle noise

• Validation
– Check performance on data other than training data, and tune model accordingly
• Regularization
– Constrain the model so that the noise cannot be learnt too well
Validation
• Divide given data into train set and test set
– E.g., 80% train and 20% test
– Better to select randomly
• Learn parameters using training set
• Check performance (validate the model) on test set, using measures such as
accuracy, misclassification rate, etc.
• Trade-off: more data for training vs. validation
Popular methods of evaluating a classifier

• Holdout method
– Split data into train and test set (usually 2/3 for train and 1/3 for test). Learn model using
train set and measure performance over test set
– Usually used when there is sufficiently large data, since both train and test data will be a
part
• Repeated Holdout method
– Repeat the Holdout method multiple times with different subsets used for train/test
– In each iteration, a certain portion of data is randomly selected for training, rest for testing
– The error rates on the different iterations are averaged to yield an overall error rate
– More reliable than simple Holdout
Popular methods of evaluating a classifier

• k-fold cross-validation
– First step: data is split into k subsets of equal size;
– Second step: each subset in turn is used for testing and the remainder for
training
– Performance measures averaged over all folds
• Popular choice for k: 10 or 5
• Advantage: all available data points are used for both training and testing the
model
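A minimal sketch of how the k folds can be formed. It works on example indices only and does not shuffle, which is a simplification; in practice the data are usually shuffled first. The function name `k_fold_indices` is illustrative.

```python
def k_fold_indices(n, k):
    """Return k (train_indices, test_indices) pairs covering indices 0..n-1.

    Fold sizes differ by at most one when n is not divisible by k.
    """
    fold_sizes = [n // k + (1 if i < n % k else 0) for i in range(k)]
    indices = list(range(n))
    folds = []
    start = 0
    for size in fold_sizes:
        test_idx = indices[start:start + size]                # this fold is held out
        train_idx = indices[:start] + indices[start + size:]  # the rest is for training
        folds.append((train_idx, test_idx))
        start += size
    return folds

folds = k_fold_indices(10, 5)
for train_idx, test_idx in folds:
    print(test_idx, train_idx)
```

A model would be trained and evaluated once per pair, and the k performance measures averaged.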
k-Nearest Neighbors
K-nearest neighbors is a machine learning algorithm used for classification and regression tasks.
The k-NN algorithm is arguably the simplest machine learning algorithm. Building the model
consists only of storing the training dataset.
To make a prediction for a new data point, the algorithm finds the closest data points in the
training dataset, its “nearest neighbors.”
In the K-NN algorithm, the "K" refers to the number of nearest neighbors that are considered
when making a prediction or classification for a new data point.
The algorithm identifies the K closest data points in the training set based on a distance metric
(such as Euclidean distance) and assigns the most common class label (for classification) or
calculates the average value (for regression) of those K neighbors to make a prediction
K-nearest neighbors is a relatively simple and interpretable algorithm, but it can
be computationally expensive, especially for large datasets, as it requires
comparing the new observation to all training examples.
k-Nearest Neighbors
At the training phase, the KNN algorithm just stores the dataset; when it gets new data,
it classifies that data into the category that is most similar to the new data.
Example: Suppose we have an image of a creature that looks similar to both a cat and a dog,
and we want to know whether it is a cat or a dog. For this identification, we can use the
KNN algorithm, as it works on a similarity measure.
Our KNN model will compare the features of the new image with those of the cat and dog
images and, based on the most similar features, put it in either the cat or the dog category.
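A small sketch of k-NN classification with Euclidean distance and majority voting. The two-dimensional feature vectors standing in for "cat" and "dog" images are made up; real images would first have to be turned into numeric features.

```python
import math
from collections import Counter

def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))

def knn_predict(train_points, train_labels, query, k=3):
    """Classify `query` by majority vote among its k nearest training points."""
    by_distance = sorted(zip(train_points, train_labels),
                         key=lambda pair: euclidean(pair[0], query))
    votes = Counter(label for _, label in by_distance[:k])
    return votes.most_common(1)[0][0]

# Hypothetical feature vectors; "training" is just storing them.
points = [(1.0, 1.0), (1.5, 2.0), (2.0, 1.0), (6.0, 6.0), (7.0, 7.0), (6.5, 5.5)]
labels = ["cat", "cat", "cat", "dog", "dog", "dog"]

print(knn_predict(points, labels, (1.2, 1.4)))
print(knn_predict(points, labels, (6.2, 6.1)))
```

Every prediction scans all stored points, which is why the method becomes expensive for large training sets.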
Basic k-nearest neighbor classification
Naive Bayes (NB)
NB models are efficient.
The reason is that they learn parameters by looking at each feature individually
and collect simple per-class statistics from each feature.
The NB classifier is a classical demonstration of how generative assumptions
and parameter estimations simplify the learning process.
Consider the problem of predicting a label y ∈ {0,1} on the basis of a vector of
features x = (x1,...,xd), where we assume that each xi is in {0,1}.
Recall that the Bayes optimal classifier is
$h_{Bayes}(x) = \arg\max_{y \in \{0,1\}} P[Y = y \mid X = x]$
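As a hedged sketch of how per-class, per-feature statistics lead to a classifier, the code below fits a Naive Bayes model for binary features and binary labels. It assumes the features are independent given the label (the "naive" assumption) and adds Laplace smoothing, which the slides do not mention. It also assumes both classes appear in the training data.

```python
import math

def fit_bernoulli_nb(X, y):
    """Estimate P[Y = label] and P[x_i = 1 | Y = label] for label in {0, 1}."""
    model = {}
    d = len(X[0])
    for label in (0, 1):
        rows = [x for x, yi in zip(X, y) if yi == label]
        prior = len(rows) / len(y)
        # Laplace smoothing (+1 / +2) avoids zero probabilities; an added assumption.
        probs = [(sum(r[i] for r in rows) + 1) / (len(rows) + 2) for i in range(d)]
        model[label] = (prior, probs)
    return model

def predict_nb(model, x):
    """Pick the label with the larger log P[Y = y] + sum_i log P[x_i | Y = y]."""
    def log_score(label):
        prior, probs = model[label]
        score = math.log(prior)
        for xi, p in zip(x, probs):
            score += math.log(p if xi == 1 else 1.0 - p)
        return score
    return max((0, 1), key=log_score)

# Hypothetical data in which the first feature determines the label.
X = [[1, 0], [1, 1], [0, 1], [0, 0]]
y = [1, 1, 0, 0]
model = fit_bernoulli_nb(X, y)
print(predict_nb(model, [1, 0]), predict_nb(model, [0, 1]))
```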
Logistic Regression
First, what does regression mean?
Regression analysis is a predictive modeling technique.
It estimates the relationship between a dependent (target) and an independent
variable (predictor).
Logistic Regression
Logistic regression produces results in a binary format, which is used to
predict the outcome of a categorical dependent variable.
So the outcome should be discrete/categorical, such as Yes or No, 0 or 1, True or False.
Logistic Regression
• Logistic regression is one of the most popular Machine Learning algorithms,
which comes under the Supervised Learning technique.
• It is used for predicting the categorical dependent variable using a given set of
independent variables.
• Therefore the outcome must be a categorical or discrete value. It can be either
Yes or No, 0 or 1, True or False, etc., but instead of giving the exact value as 0
or 1, it gives probabilistic values which lie between 0 and 1.
• In Logistic regression, instead of fitting a regression line, we fit an "S"-shaped
logistic function, whose output is bounded between 0 and 1.
• Logistic Regression is a significant machine learning algorithm because it has
the ability to provide probabilities and classify new data using continuous and
discrete datasets.
Logistic Regression
The below image shows the logistic function:
Logistic regression uses the same predictive-modeling concept as regression,
which is why it is called logistic regression, but it is used to classify samples;
therefore, it falls under the classification algorithms.
Logistic Regression Equation
• The Logistic Regression equation can be obtained from the Linear Regression
equation. The mathematical steps to get the Logistic Regression equation are given
below:
• We know the equation of the straight line can be written as:
$y = b_0 + b_1 x_1 + b_2 x_2 + \dots + b_n x_n$
• In Logistic Regression y can be between 0 and 1 only, so let's divide y
by (1 - y):
$\frac{y}{1-y}$, which is 0 for y = 0 and tends to infinity as y approaches 1
• But we need a range between -[infinity] and +[infinity], so we take the logarithm, and the
equation becomes:
$\log\left[\frac{y}{1-y}\right] = b_0 + b_1 x_1 + b_2 x_2 + \dots + b_n x_n$
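The relationship between the linear expression, the log-odds, and the probability can be sketched as below. The logistic (sigmoid) function is the inverse of the log-odds transform; the coefficient values and the 0.5 decision threshold are hypothetical choices for illustration.

```python
import math

def sigmoid(z):
    """Logistic function: maps any real z to a value strictly between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-z))

def logit(y):
    """Log-odds of a probability y in (0, 1); the inverse of sigmoid."""
    return math.log(y / (1.0 - y))

def predict_proba(coeffs, intercept, x):
    """Probability of the positive class for feature vector x."""
    z = intercept + sum(c * xi for c, xi in zip(coeffs, x))
    return sigmoid(z)

def predict_class(coeffs, intercept, x, threshold=0.5):
    """Turn the probability into a 0/1 decision (threshold is an assumption)."""
    return 1 if predict_proba(coeffs, intercept, x) >= threshold else 0

# Hypothetical fitted parameters for two features.
coeffs, intercept = [1.5, -2.0], 0.5

print(predict_proba(coeffs, intercept, [2.0, 0.5]))
print(predict_class(coeffs, intercept, [2.0, 0.5]))
print(logit(sigmoid(2.0)))  # recovers the linear score, up to rounding
```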
Linear VS Logistic Regression
Support Vector Machine Algorithm
SVM is one of the most popular Supervised Learning algorithms, which is
used for Classification as well as Regression problems.
However, primarily, it is used for Classification problems in ML.
The goal of the SVM algorithm is to create the best line or decision boundary
that can segregate n-dimensional space into classes so that we can easily put
the new data point in the correct category in the future.
This best decision boundary is called a hyperplane.
SVM chooses the extreme points/vectors that help in creating the hyperplane.
These extreme cases are called support vectors, and hence the algorithm is
termed Support Vector Machine.
Support Vector Machine Algorithm
Consider the below diagram in which there are two different categories that
are classified using a decision boundary or hyperplane:
Support Vector Machine Algorithm
• Example: SVM can be understood with the example that we used for the KNN classifier.
Suppose we see a strange cat that also has some features of dogs, and we want a model that
can accurately identify whether it is a cat or a dog. Such a model can be created by using the
SVM algorithm.
• We first train our model with many images of cats and dogs so that it can learn their
different features, and then we test it with this strange creature. The SVM creates a
decision boundary between the two classes (cat and dog) using the extreme
cases (support vectors) of each class. On the basis of the support
vectors, it will classify the creature as a cat. Consider the below diagram:
Hyperplane and Support Vectors in the SVM algorithm:

Hyperplane:
• There can be multiple lines/decision boundaries to segregate the classes in n-dimensional
space, but we need to find out the best decision boundary that helps to classify the data
points. This best boundary is known as the hyperplane of SVM.
• The dimension of the hyperplane depends on the number of features in the dataset:
if there are 2 features (as shown in the image), the hyperplane is a straight line,
and if there are 3 features, the hyperplane is a 2-dimensional plane.
• We always create the hyperplane that has the maximum margin, which means the maximum
distance between the hyperplane and the nearest data points of each class.
Support Vectors:
• The data points or vectors that are closest to the hyperplane and that affect its position
are termed support vectors. Since these vectors support the hyperplane,
they are called support vectors.
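The notions of margin and support vectors can be illustrated for a hyperplane that is already given. The sketch below does not train an SVM; it only measures distances to a hypothetical hyperplane w·x + b = 0 and reports the closest points. The points and the hyperplane are made up.

```python
import math

def signed_distance(w, b, x):
    """Signed distance from point x to the hyperplane w.x + b = 0."""
    norm = math.sqrt(sum(wi * wi for wi in w))
    return (sum(wi * xi for wi, xi in zip(w, x)) + b) / norm

def margin_and_support_vectors(w, b, points, tol=1e-9):
    """Margin = distance of the closest point; support vectors = points at that distance."""
    distances = [abs(signed_distance(w, b, p)) for p in points]
    margin = min(distances)
    support = [p for p, d in zip(points, distances) if d - margin <= tol]
    return margin, support

# Hypothetical two-class data and the separating line x1 + x2 - 4 = 0.
points = [(1.0, 1.0), (0.0, 0.0), (4.0, 4.0), (3.0, 3.0)]
w, b = (1.0, 1.0), -4.0

margin, support = margin_and_support_vectors(w, b, points)
print(margin, support)
print([signed_distance(w, b, p) > 0 for p in points])  # side of the hyperplane
```

An actual SVM solver searches over w and b for the hyperplane whose margin, computed this way, is as large as possible.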
How does SVM work?

• Linear SVM:
• The working of the SVM algorithm can be understood by using an example.
Suppose we have a dataset that has two tags (green and blue) and
two features, x1 and x2. We want a classifier that can classify a pair (x1, x2) of
coordinates as either green or blue. Consider the below image:
How does SVM work?
• Non-Linear SVM:
• If data is linearly separable, then we can separate it by using a straight line, but for
non-linear data, we cannot draw a single straight line. Consider the below image:
Decision Tree algorithm
• It is a supervised learning technique that can be used for both classification and
regression problems, but it is mostly preferred for solving classification problems.
• It is a graphical representation of all possible solutions to a decision.
• It is a tree-structured classifier, where internal nodes represent the features of a
dataset, branches represent the decision rules, and each leaf node represents the
outcome.
• In a decision tree, there are two types of nodes: the Decision Node and the Leaf Node.
• Decision nodes are used to make a decision and have multiple branches, whereas
leaf nodes are the outputs of those decisions and do not contain any further branches.
• The decisions or tests are performed on the basis of the features of the given dataset.
• Decisions are based on some conditions.
• The decision made can be easily explained.
Decision Tree algorithm
• It is a graphical representation for getting all the possible solutions to a
problem/decision based on given conditions.
• It is called a decision tree because, similar to a tree, it starts with the root
node, which expands on further branches and constructs a tree-like structure.
• In order to build a tree, we use the CART algorithm, which stands for
Classification and Regression Tree algorithm.
• A decision tree simply asks a question, and based on the answer (Yes/No), it
further splits the tree into subtrees.
Decision Tree algorithm
• Below diagram explains the general structure of a decision tree:
Decision Tree
A decision tree is a graphical representation of all the possible solutions to a
decision based on certain conditions.
Why Use Decision Tree
• There are various algorithms in Machine learning, so choosing the best
algorithm for the given dataset and problem is the main point to remember
while creating a machine learning model.
• Below are two reasons for using the decision tree:
• Decision trees usually mimic human thinking while making a decision,
so they are easy to understand.
• The logic behind the decision tree can be easily understood because it has a
tree-like structure.
Decision Tree Terminologies
• Root Node: Root node is from where the decision tree starts. It represents the
entire dataset, which further gets divided into two or more homogeneous sets.
• Leaf Node: Leaf nodes are the final output node, and the tree cannot be
segregated further after getting a leaf node.
• Splitting: Splitting is the process of dividing the decision node/root node into
sub-nodes according to the given conditions.
• Branch/Sub Tree: A tree formed by splitting the tree.
• Pruning: Pruning is the process of removing the unwanted branches from the
tree.
• Parent/Child node: A node that is split into sub-nodes is called the parent node of
those sub-nodes, and the sub-nodes are called its child nodes; the root node is the parent of the nodes directly below it.
How does the Decision Tree algorithm Work?
• In a decision tree, to predict the class of a given record, the algorithm
starts from the root node of the tree.
• The algorithm compares the value of the root attribute with the record's (real
dataset) attribute and, based on the comparison, follows the branch and jumps to
the next node.
• At the next node, the algorithm again compares the attribute value with those of
the sub-nodes and moves further.
• It continues the process until it reaches a leaf node of the tree.
How does the Decision Tree algorithm Work?
The complete process can be better understood using the below algorithm:
• Step-1: Begin the tree with the root node, say S, which contains the complete
dataset.
• Step-2: Find the best attribute in the dataset using an Attribute Selection Measure
(ASM).
• Step-3: Divide S into subsets that contain the possible values of the best
attribute.
• Step-4: Generate the decision tree node, which contains the best attribute.
• Step-5: Recursively make new decision trees using the subsets of the dataset
created in Step-3. Continue this process until a stage is reached where you
cannot further classify the nodes; call the final node a leaf node.
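Step-2 above can be sketched with one common attribute selection measure, Gini impurity, which is the measure used by CART. The records, attribute names, and labels below are hypothetical, and only the choice of the best attribute for a single split is shown, not the full recursive tree construction.

```python
from collections import Counter

def gini(labels):
    """Gini impurity: 0 for a pure set, larger when classes are mixed."""
    n = len(labels)
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())

def weighted_gini(rows, labels, attribute):
    """Impurity after splitting on `attribute`, weighted by subset size."""
    groups = {}
    for row, label in zip(rows, labels):
        groups.setdefault(row[attribute], []).append(label)
    n = len(rows)
    return sum(len(group) / n * gini(group) for group in groups.values())

def best_attribute(rows, labels, attributes):
    """Attribute whose split leaves the least impurity."""
    return min(attributes, key=lambda a: weighted_gini(rows, labels, a))

# Hypothetical records: the label depends only on the salary attribute.
rows = [
    {"salary": "high", "distance": "near"},
    {"salary": "high", "distance": "far"},
    {"salary": "low", "distance": "near"},
    {"salary": "low", "distance": "far"},
]
labels = ["accept", "accept", "decline", "decline"]

print(gini(labels))
print(weighted_gini(rows, labels, "salary"), weighted_gini(rows, labels, "distance"))
print(best_attribute(rows, labels, ["salary", "distance"]))
```

Information gain based on entropy is another widely used measure and could replace `gini` without changing the rest of the sketch.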
How does the Decision Tree algorithm Work?
• Example: Suppose there is a candidate who has a job offer and wants to
decide whether he should accept the offer or Not.
• So, to solve this problem, the decision tree starts with the root node (Salary
attribute by ASM).
• The root node splits further into the next decision node (distance from the
office) and one leaf node based on the corresponding labels.
• The next decision node further gets split into one decision node (Cab facility)
and one leaf node.
• Finally, the decision node splits into two leaf nodes (Accepted offer and
Declined offer).
How does the Decision Tree algorithm Work?
• Consider the below diagram:
Random Forest Algorithm
• Random Forest is a popular machine learning algorithm that belongs to the
supervised learning technique. It can be used for both Classification and
Regression problems in ML.
• It is based on the concept of ensemble learning, which is a process of
combining multiple classifiers to solve a complex problem and to improve
the performance of the model.
• As the name suggests, "Random Forest is a classifier that contains a
number of decision trees on various subsets of the given dataset and takes
the average to improve the predictive accuracy of that dataset."
• Instead of relying on one decision tree, the random forest takes the
prediction from each tree and predicts the final output based on the
majority vote of those predictions.
Random Forest Algorithm
• A greater number of trees in the forest generally leads to higher accuracy and reduces
the risk of overfitting compared with a single decision tree.
• The below diagram explains the working of the Random Forest algorithm:
Why Use Random Forest Algorithm?
• Below are some points that explain why we should use the Random Forest
algorithm:
• It often takes less training time than many other algorithms.
• It predicts output with high accuracy and runs efficiently even for
large datasets.
• It can also maintain reasonable accuracy when a large proportion of data is missing.
How Does Random Forest Algorithm Work
• Random Forest works in two phases: the first is to create the random forest by
combining N decision trees, and the second is to make predictions with each tree
created in the first phase.
• The working process can be explained in the below steps and diagram:
• Step-1: Select K random data points from the training set.

• Step-2: Build the decision tree associated with the selected data points (subset).

• Step-3: Choose the number N of decision trees that you want to build.

• Step-4: Repeat Steps 1 and 2 until N trees have been built.

• Step-5: For a new data point, find the prediction of each decision tree, and assign the
new data point to the category that wins the majority of votes.
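The steps above can be sketched with very small trees. To keep the example short, each "tree" here is a one-split decision stump on a single numeric feature, and the K data points are drawn with replacement (a bootstrap sample). Both are simplifying assumptions, and full random forests also sample features at each split, which is omitted. The data are made up.

```python
import random
from collections import Counter

def train_stump(sample):
    """One-split tree on (x, label) pairs: choose the threshold with fewest errors."""
    best = None
    for threshold, _ in sample:
        left = [lab for x, lab in sample if x <= threshold]
        right = [lab for x, lab in sample if x > threshold]
        left_label = Counter(left).most_common(1)[0][0]
        right_label = Counter(right).most_common(1)[0][0] if right else left_label
        errors = (sum(lab != left_label for lab in left)
                  + sum(lab != right_label for lab in right))
        if best is None or errors < best[0]:
            best = (errors, threshold, left_label, right_label)
    _, threshold, left_label, right_label = best
    return lambda x: left_label if x <= threshold else right_label

def train_forest(data, n_trees=7, k=None, seed=0):
    """Steps 1-4: draw K random points (with replacement) and build one tree, N times."""
    rng = random.Random(seed)
    k = k or len(data)
    return [train_stump([rng.choice(data) for _ in range(k)]) for _ in range(n_trees)]

def forest_predict(forest, x):
    """Step 5: every tree votes, and the majority category wins."""
    votes = Counter(tree(x) for tree in forest)
    return votes.most_common(1)[0][0]

# Hypothetical training set: one numeric feature (e.g. weight) and a fruit label.
data = [(1.0, "apple"), (1.5, "apple"), (2.0, "apple"),
        (8.0, "melon"), (9.0, "melon"), (9.5, "melon")]
forest = train_forest(data)
print(forest_predict(forest, 0.5), forest_predict(forest, 9.8))
```

Because each tree sees a different random sample, individual trees may disagree; the vote in `forest_predict` is what smooths out their individual errors.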
How Does Random Forest Algorithm Work
• Example: Suppose there is a dataset that contains multiple fruit images, and
this dataset is given to the Random Forest classifier. The dataset is divided into
subsets, and each subset is given to a decision tree. During the training phase, each
decision tree produces a prediction result, and when a new data point occurs,
the Random Forest classifier predicts the final decision based on the majority
of the results. Consider the below image:
The End of the chapter
Thanks for your Attention!!!

Question ?
Query



