
Predictive Analytics

Regression and Classification


Module 7

Sourish Das

Chennai Mathematical Institute


Linear Regression
mpg = β₀ + β₁ wt + ε

[Figure: scatter plot of mpg versus wt]
Linear Regression

▶ mpg = β₀ + β₁ wt + ε
▶ We write the model in matrix form as a linear model:

y = Xβ + ε

where y = (mpg₁, mpg₂, …, mpgₙ)ᵀ;

X = \begin{pmatrix} 1 & \mathrm{wt}_1 \\ 1 & \mathrm{wt}_2 \\ \vdots & \vdots \\ 1 & \mathrm{wt}_n \end{pmatrix},

β = (β₀, β₁)ᵀ and ε = (ε₁, ε₂, …, εₙ)ᵀ


Linear Regression

▶ Normal equations:

β̂ = (β̂₀, β̂₁)ᵀ = (XᵀX)⁻¹Xᵀy

= \begin{pmatrix} n & \sum_{i=1}^{n} \mathrm{wt}_i \\ \sum_{i=1}^{n} \mathrm{wt}_i & \sum_{i=1}^{n} \mathrm{wt}_i^2 \end{pmatrix}^{-1} \begin{pmatrix} \sum_{i=1}^{n} \mathrm{mpg}_i \\ \sum_{i=1}^{n} \mathrm{wt}_i \, \mathrm{mpg}_i \end{pmatrix}
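As a quick numerical check, here is a minimal numpy sketch of solving these normal equations; the five wt/mpg pairs are illustrative stand-ins (in the spirit of R's mtcars data), not values taken from these slides.

```python
import numpy as np

# Illustrative wt (weight) and mpg values -- stand-ins, not slide data.
wt  = np.array([2.62, 2.88, 2.32, 3.21, 3.44])
mpg = np.array([21.0, 21.0, 22.8, 21.4, 18.7])

# Design matrix X = [1, wt]; solve the normal equations (X^T X) beta = X^T y.
X = np.column_stack([np.ones_like(wt), wt])
beta_hat = np.linalg.solve(X.T @ X, X.T @ mpg)
print(beta_hat)  # [beta0_hat, beta1_hat]
```

Using np.linalg.solve avoids forming (XᵀX)⁻¹ explicitly, which is cheaper and numerically safer than a literal matrix inverse.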
Regression Plane
mpg = β₀ + β₁ wt + β₂ disp + ε

[Figure: 3-D scatter plot of mpg against wt and disp]
Regression Plane

▶ mpg = β₀ + β₁ wt + β₂ disp + ε
▶ We write the model in matrix form as a linear model:

y = Xβ + ε

where y = (mpg₁, mpg₂, …, mpgₙ)ᵀ;

X = \begin{pmatrix} 1 & \mathrm{wt}_1 & \mathrm{disp}_1 \\ 1 & \mathrm{wt}_2 & \mathrm{disp}_2 \\ \vdots & \vdots & \vdots \\ 1 & \mathrm{wt}_n & \mathrm{disp}_n \end{pmatrix},

β = (β₀, β₁, β₂)ᵀ and ε = (ε₁, ε₂, …, εₙ)ᵀ


Linear Plane

▶ mpg = β₀ + β₁ wt + β₂ disp + ε
▶ Normal equations:

β̂ = (β̂₀, β̂₁, β̂₂)ᵀ = (XᵀX)⁻¹Xᵀy

▶ Ask yourself: what do XᵀX and Xᵀy look like for this model?
Linear Plane

▶ mpg = β₀ + β₁ wt + β₂ disp + ε
▶ Normal equations:

β̂ = (β̂₀, β̂₁, β̂₂)ᵀ = (XᵀX)⁻¹Xᵀy

= \begin{pmatrix} n & \sum_{i=1}^{n} \mathrm{wt}_i & \sum_{i=1}^{n} \mathrm{disp}_i \\ \sum_{i=1}^{n} \mathrm{wt}_i & \sum_{i=1}^{n} \mathrm{wt}_i^2 & \sum_{i=1}^{n} \mathrm{wt}_i \, \mathrm{disp}_i \\ \sum_{i=1}^{n} \mathrm{disp}_i & \sum_{i=1}^{n} \mathrm{wt}_i \, \mathrm{disp}_i & \sum_{i=1}^{n} \mathrm{disp}_i^2 \end{pmatrix}^{-1} \begin{pmatrix} \sum_{i=1}^{n} \mathrm{mpg}_i \\ \sum_{i=1}^{n} \mathrm{wt}_i \, \mathrm{mpg}_i \\ \sum_{i=1}^{n} \mathrm{disp}_i \, \mathrm{mpg}_i \end{pmatrix}
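The earlier sketch extends directly to the two-predictor plane; again, the numbers are illustrative stand-ins rather than slide data.

```python
import numpy as np

# Illustrative wt, disp and mpg values -- stand-ins, not slide data.
wt   = np.array([2.62, 2.88, 2.32, 3.21, 3.44])
disp = np.array([160.0, 160.0, 108.0, 258.0, 360.0])
mpg  = np.array([21.0, 21.0, 22.8, 21.4, 18.7])

# Design matrix X = [1, wt, disp]; the 3x3 normal equations give beta_hat.
X = np.column_stack([np.ones_like(wt), wt, disp])
beta_hat = np.linalg.solve(X.T @ X, X.T @ mpg)
print(beta_hat)  # [beta0_hat, beta1_hat, beta2_hat]
```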
Quadratic Regression
mpg = β₀ + β₁ hp + β₂ hp² + ε

[Figure: scatter plot of mpg versus hp]
Feature Engineering
mpg = β₀ + β₁ hp + β₂ hp² + ε

[Figure: 3-D scatter plot of mpg against hp and hp²]
Quadratic Regression

▶ mpg = β₀ + β₁ hp + β₂ hp² + ε
▶ We write the model in matrix form as a linear model:

y = Xβ + ε

where y = (mpg₁, mpg₂, …, mpgₙ)ᵀ;

X = \begin{pmatrix} 1 & \mathrm{hp}_1 & \mathrm{hp}_1^2 \\ 1 & \mathrm{hp}_2 & \mathrm{hp}_2^2 \\ \vdots & \vdots & \vdots \\ 1 & \mathrm{hp}_n & \mathrm{hp}_n^2 \end{pmatrix},

β = (β₀, β₁, β₂)ᵀ and ε = (ε₁, ε₂, …, εₙ)ᵀ

▶ The model is linear in the parameters, even though it is quadratic in hp.
Quadratic Regression

▶ Normal equations:

β̂ = (β̂₀, β̂₁, β̂₂)ᵀ = (XᵀX)⁻¹Xᵀy

= \begin{pmatrix} n & \sum_{i=1}^{n} \mathrm{hp}_i & \sum_{i=1}^{n} \mathrm{hp}_i^2 \\ \sum_{i=1}^{n} \mathrm{hp}_i & \sum_{i=1}^{n} \mathrm{hp}_i^2 & \sum_{i=1}^{n} \mathrm{hp}_i^3 \\ \sum_{i=1}^{n} \mathrm{hp}_i^2 & \sum_{i=1}^{n} \mathrm{hp}_i^3 & \sum_{i=1}^{n} \mathrm{hp}_i^4 \end{pmatrix}^{-1} \begin{pmatrix} \sum_{i=1}^{n} \mathrm{mpg}_i \\ \sum_{i=1}^{n} \mathrm{hp}_i \, \mathrm{mpg}_i \\ \sum_{i=1}^{n} \mathrm{hp}_i^2 \, \mathrm{mpg}_i \end{pmatrix}
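A sketch of the quadratic fit follows; the hp/mpg values are illustrative assumptions. Because XᵀX now contains hp⁴-scale sums, it can be badly conditioned, so the sketch uses a least-squares solver on X directly rather than inverting XᵀX.

```python
import numpy as np

# Illustrative hp and mpg values -- stand-ins, not slide data.
hp  = np.array([110.0, 110.0, 93.0, 175.0, 245.0])
mpg = np.array([21.0, 21.0, 22.8, 19.2, 14.3])

# Engineered feature hp^2 turns the quadratic model into a linear one.
X = np.column_stack([np.ones_like(hp), hp, hp**2])

# Least squares on X is numerically safer than forming (X^T X)^{-1}.
beta_hat, *_ = np.linalg.lstsq(X, mpg, rcond=None)
print(beta_hat)  # [beta0_hat, beta1_hat, beta2_hat]
```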
Feature Engineering
mpg = β₀ + β₁ hp + β₂ hp² + ε

[Figure: 3-D scatter plot of mpg against hp and hp²]
Feature Engineering / Variable Transformation

▶ We map the original data into a higher-dimensional space, and
▶ hope to find a good linear hyperplane fit in that higher dimension,
▶ which then explains the non-linear relationship between the feature space and the target variable.
Non-linear Regression: Basis Functions

▶ Consider the i-th record:

yᵢ = f(xᵢ) + εᵢ,  i = 1, 2, ⋯, n

We represent f(x) as

f(x) = \sum_{j=1}^{K} β_j φ_j(x) = φβ,

and we say φ is a basis system for f(x).
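To make this concrete, here is a minimal sketch of building the design matrix Φ from a user-chosen basis system and estimating β; the particular basis list, the helper name design_matrix, and the toy data are all assumptions for illustration.

```python
import numpy as np

# A hypothetical basis system phi = {phi_1, ..., phi_K}, here with K = 3.
basis = [lambda x: np.ones_like(x), lambda x: x, lambda x: x**2]

def design_matrix(x, basis):
    """Stack basis functions column-wise: Phi[i, j] = phi_j(x_i)."""
    return np.column_stack([phi(x) for phi in basis])

# Toy data generated from a quadratic f(x) plus noise (illustrative only).
rng = np.random.default_rng(0)
x = np.linspace(0.0, 5.0, 20)
y = 1.0 + 0.5 * x - 0.2 * x**2 + 0.1 * rng.standard_normal(20)

Phi = design_matrix(x, basis)
beta_hat, *_ = np.linalg.lstsq(Phi, y, rcond=None)
print(beta_hat)  # estimates of (beta_1, beta_2, beta_3)
```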


Representing Functions with Basis Functions

▶ mpg = β₀ + β₁ hp + β₂ hp² + ε
▶ The generic form for curvature in linear regression,

y = β₁ + β₂x + β₃x² + ⋯ + εᵢ,

implies

f(x) = β₁ + β₂x + β₃x² + ⋯

▶ In ML, φ is sometimes known as the 'engineered features', and the process is known as 'feature engineering'.
Fourier Basis
▶ Sine and cosine functions of increasing frequencies:

y = β₁ + β₂ sin(ωx) + β₃ cos(ωx) + β₄ sin(2ωx) + β₅ cos(2ωx) + ⋯ + εᵢ

▶ The constant ω = 2π/P defines the period P of oscillation of the first sine/cosine pair; P is known.
▶ φ = {1, sin(ωx), cos(ωx), sin(2ωx), cos(2ωx), …}
▶ βᵀ = (β₁, β₂, β₃, ⋯)

y = φβ + ε

▶ Again, in ML φ is known as the 'engineered features'.
▶ mpg = β₀ + β₁ sin(ω hp) + ε
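A minimal sketch of a Fourier design matrix, assuming the period P is known; the value P = 12, the helper name fourier_design, and the toy seasonal data are illustrative assumptions, not slide content.

```python
import numpy as np

P = 12.0                  # known period (illustrative value)
omega = 2.0 * np.pi / P   # omega = 2*pi / P

def fourier_design(x, n_pairs):
    """Columns: 1, sin(omega x), cos(omega x), sin(2 omega x), cos(2 omega x), ..."""
    cols = [np.ones_like(x)]
    for k in range(1, n_pairs + 1):
        cols.append(np.sin(k * omega * x))
        cols.append(np.cos(k * omega * x))
    return np.column_stack(cols)

# Toy seasonal data (illustrative only).
rng = np.random.default_rng(1)
x = np.arange(48, dtype=float)
y = 2.0 + np.sin(omega * x) + 0.3 * rng.standard_normal(48)

Phi = fourier_design(x, n_pairs=2)
beta_hat, *_ = np.linalg.lstsq(Phi, y, rcond=None)
```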
Functional Estimation/Learning

▶ We write the function through its basis expansion:

y = φβ + ε

▶ Let's assume the basis (or engineered features) φ is fully known.
▶ The problem is that β is unknown; hence we estimate β.


Functional Estimation/Learning

▶ We write the function through its basis expansion:

y = φβ + ε

▶ Let's assume the basis (or engineered features) φ is fully known.
▶ OLS estimator:

β̂ = (φᵀφ)⁻¹φᵀy
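In code, the OLS estimator is a single linear solve. A small sketch follows (the function names are illustrative); the predict helper anticipates the prediction ŷ = φ(x₀)β̂ discussed on the next slide.

```python
import numpy as np

def ols(Phi, y):
    """OLS estimator beta_hat = (Phi^T Phi)^{-1} Phi^T y, via a linear solve."""
    return np.linalg.solve(Phi.T @ Phi, Phi.T @ y)

def predict(phi_x0, beta_hat):
    """Prediction y_hat = phi(x0) beta_hat at a test point x0."""
    return phi_x0 @ beta_hat

# Tiny usage example with phi(x) = (1, x) -- illustrative data only.
Phi = np.column_stack([np.ones(5), np.arange(5.0)])
y = np.array([1.0, 1.6, 2.1, 2.9, 3.4])
beta_hat = ols(Phi, y)
y_hat = predict(np.array([1.0, 2.5]), beta_hat)  # phi(x0) at x0 = 2.5
```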
Uncertainty associated with the OLS estimator

▶ How do we estimate the uncertainty (i.e., the margin of error) associated with the OLS estimator β̂?
▶ If x₀ is a test point, then

ŷ = φ(x₀)β̂

is the predicted value of the true but unknown y₀.
▶ What is the margin of error of ŷ?


Next ...

▶ We will discuss sampling distributions and inference for regression coefficients!
