The document provides an overview of machine learning, defining it as the ability of computers to learn from experience without explicit programming. It discusses various types of machine learning, including supervised, unsupervised, and reinforcement learning, as well as key concepts such as bias, variance, underfitting, and overfitting. Additionally, it outlines the machine learning process and techniques to mitigate underfitting and overfitting in models.
Chapter 1-ML
Chapter 1
Definition of Machine Learning
• Machine learning means making computers modify or adapt their actions (whether those actions are making predictions or controlling a robot) so that the actions become more accurate, where accuracy is measured by how well the chosen actions reflect the correct ones.
• Arthur Samuel (1959): Machine Learning is the field of study that gives computers the ability to learn without being explicitly programmed.
Machine Learning tasks
• Examples:
– Database mining: large datasets arising from the growth of automation and the web, e.g., web click data, medical records, biology, engineering.
– Applications that cannot be programmed by hand, e.g., autonomous helicopters, handwriting recognition, most of Natural Language Processing (NLP), Computer Vision.
Learning
• "A computer program is said to learn from experience E with respect to some task T and some performance measure P, if its performance on T, as measured by P, improves with experience E." (Tom Mitchell, 1997)
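The E/T/P definition can be made concrete with a small sketch. The code below is a hypothetical toy, not a real spam filter: the messages, the `learn_spam_words` rule, and all names are invented purely to show how performance P on task T improves with experience E.

```python
# Toy illustration of the experience/task/performance framing.
# T: classify messages as spam or not spam
# E: a set of labeled example messages
# P: fraction of messages classified correctly

# Hypothetical labeled experience E: (message, is_spam)
experience = [
    ("win money now", True),
    ("cheap money offer", True),
    ("meeting at noon", False),
    ("lunch tomorrow", False),
]

def learn_spam_words(examples):
    """Learn from E: keep words that appear only in spam messages."""
    spam_words, ham_words = set(), set()
    for text, is_spam in examples:
        (spam_words if is_spam else ham_words).update(text.split())
    return spam_words - ham_words

def classify(text, spam_words):
    """T: predict spam if the message contains any learned spam word."""
    return any(word in spam_words for word in text.split())

def accuracy(examples, spam_words):
    """P: fraction of examples classified correctly."""
    correct = sum(classify(t, spam_words) == y for t, y in examples)
    return correct / len(examples)

# With no experience, the learner knows no spam words ...
print(accuracy(experience, learn_spam_words([])))          # 0.5
# ... and its performance improves after seeing the labeled examples E.
print(accuracy(experience, learn_spam_words(experience)))  # 1.0
```

The point is only the framing: the program's measured performance P on task T improves as experience E grows.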
Suppose your email program watches which emails you do or do not mark as spam, and based on that learns how to better filter spam. What is the task T in this setting?
• Classifying emails as spam or not spam.
• Watching you label emails as spam or not spam.
• The number (or fraction) of emails correctly classified as spam/not spam.
• None of the above: this is not a machine learning problem.
Learning
• We will loosely define learning as getting better at some task through practice.
Types of machine learning
• Machine learning algorithms:
– Supervised learning
– Unsupervised learning
– Reinforcement learning
– Evolutionary learning
• Examples: housing price prediction; breast cancer diagnosis (malignant vs. benign).
Problem
• You are running a company, and you want to develop learning algorithms to address each of two problems.
• Problem 1: You have a large inventory of identical items. You want to predict how many of these items will sell over the next 3 months.
• Problem 2: You would like software to examine individual customer accounts, and for each account decide whether it has been hacked/compromised.
• Should you treat these as classification or as regression problems?
Solution
• Treat both as classification problems.
• Treat problem 1 as a classification problem, problem 2 as a regression problem.
• Treat problem 1 as a regression problem, problem 2 as a classification problem.
• Treat both as regression problems.
Which of the following examples would you address using an unsupervised learning algorithm? (Check all that apply.)
• Given a database of customer data, automatically discover market segments and group customers into different market segments.
• Given email labeled as spam/not spam, learn a spam filter.
• Given a set of news articles found on the web, group them into sets of articles about the same story.
• Given a dataset of patients diagnosed as either having diabetes or not, learn to classify new patients as having diabetes or not.
Reinforcement Learning or Not
• Which of these are reinforcement learning problems?
– Chess game
– Electric vehicle
– Detection of a disease
– Robotics for industrial automation
– Business strategy planning
The machine learning process
• Data collection and preparation
• Feature selection
• Algorithm choice
• Parameter and model selection
• Training
• Evaluation
Bias
• Bias is the difference between the values predicted by the ML model and the correct values.
• High bias produces a large error on both training and testing data. An algorithm should be low-bias to avoid the problem of underfitting.
• With high bias, the predictions follow a straight line and do not fit the data set accurately. Such fitting is known as underfitting, and it happens when the hypothesis is too simple or linear in nature.
Variance
• Variance is the variability of the model's predictions for a given data point; it tells us the spread of the predictions.
• A model with high variance fits the training data with a very complex curve and is therefore unable to fit accurately on data it has not seen before. As a result, such models perform very well on training data but have high error rates on test data.
• When a model has high variance, it is said to overfit the data. Overfitting means fitting the training set accurately via a complex curve and a high-order hypothesis, but it is not the solution because the error on unseen data is high. While training a model, variance should be kept low.
Bias-Variance Trade-off
• If the algorithm is too simple (a hypothesis with a linear equation), it may end up with high bias and low variance, and is thus error-prone (underfitting).
• If the algorithm is too complex (a hypothesis with a high-degree equation), it may end up with high variance and low bias; in this condition the model will not perform well on new entries.
• Between these two conditions lies the Bias-Variance Trade-off: an algorithm cannot be more complex and less complex at the same time, so reducing one source of error tends to increase the other.
Underfitting
• A statistical model or machine learning algorithm is said to underfit when it cannot capture the underlying trend of the data; it performs poorly on the training data as well as on the testing data.
• Underfitting destroys the accuracy of our machine learning model. Its occurrence simply means that the model does not fit the data well enough.
• It usually happens when we have too little data to build an accurate model, or when we try to fit a linear model to non-linear data.
• In such cases the rules of the model are too simple to capture the data, and the model will probably make a lot of wrong predictions.
• Underfitting can be avoided by using more data and by increasing the number of features through feature engineering.
Reasons for Underfitting
• High bias and low variance.
• The size of the training dataset is not large enough.
• The model is too simple.
• The training data is not cleaned and contains noise.
Techniques to reduce underfitting
• Increase model complexity.
• Increase the number of features, e.g., by performing feature engineering.
• Remove noise from the data.
• Increase the number of epochs or the duration of training to get better results.
Overfitting
• A statistical model is said to be overfitted when it fits the training data so closely that it does not make accurate predictions on testing data.
• When a model is trained on noisy data, it starts learning from the noise and the inaccurate entries in the data set, and testing on unseen data then shows high variance.
• The model then fails to categorize the data correctly because of too many details and noise. Non-parametric and non-linear methods are common causes of overfitting, because these algorithms have more freedom in building the model from the dataset and can therefore build unrealistic models.
• A solution to avoid overfitting is to use a linear algorithm if we have linear data, or to use parameters such as the maximal depth if we are using decision trees.
Reasons for Overfitting
• High variance and low bias.
• The model is too complex.
• The size of the training data is too small for the model's complexity.
Techniques to reduce overfitting
• Increase the amount of training data.
• Reduce model complexity.
• Use early stopping during the training phase: keep an eye on the loss over the training period, and stop training as soon as the validation loss begins to increase.
• Use dropout for neural networks to tackle overfitting.
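The overfitting story above can be seen in a minimal sketch, assuming NumPy is available. The data, the degrees, and the split below are invented for illustration: a degree-9 polynomial can pass through all 10 noisy training points, so it "wins" on training error, but the simple degree-1 fit generalises better to unseen points.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical data: a linear trend plus noise.
x_train = np.arange(10, dtype=float)                    # 10 training points
y_train = 2.0 * x_train + rng.normal(0.0, 1.0, size=10)

x_test = np.array([2.5, 5.5, 10.5])                     # unseen points
y_test = 2.0 * x_test                                   # true (noise-free) trend

def mse(coeffs, x, y):
    """Mean squared error of a polynomial fit on (x, y)."""
    return float(np.mean((np.polyval(coeffs, x) - y) ** 2))

simple = np.polyfit(x_train, y_train, deg=1)    # low variance: matches the trend
complex_ = np.polyfit(x_train, y_train, deg=9)  # interpolates every noisy point

# The degree-9 model has lower training error (it chases the noise) ...
print(mse(simple, x_train, y_train), mse(complex_, x_train, y_train))
# ... but the degree-1 model has far lower error on unseen data.
print(mse(simple, x_test, y_test), mse(complex_, x_test, y_test))
```

This is the trade-off in one picture: the complex hypothesis has low bias and high variance (great on training data, poor on test data), while the simple hypothesis keeps variance low and, because the underlying trend really is linear here, generalises well.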