
Translating the S-Plus/R Least Angle Regression package to Mata

Adrian Mander
MRC-Human Nutrition Research Unit, Cambridge

SUG London 2007


Outline

The LARS package
  Lasso (the constrained OLS)
  Forward Stagewise regression
  Least Angle Regression
Translating Hastie & Efron's code from R to Mata
The lars Stata command


Lasso

Let y be the dependent variable and $x_j$, $j = 1, \ldots, m$, be the covariates.
The usual linear predictor is
$\hat{\mu} = \sum_{j=1}^{m} x_j \hat{\beta}_j$
We want to minimise the sum of squared differences
$S(\hat{\beta}) = \sum_{i=1}^{n} (y_i - \hat{\mu}_i)^2$
subject to the constraint
$T(\hat{\beta}) = \sum_{j=1}^{m} |\hat{\beta}_j| \le t$
A large t gives the OLS solution.
N.B. Ridge regression instead places the constraint on the L2 norm.
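As a concrete illustration, the two quantities can be written as small Mata functions; this is my own sketch with hypothetical names lasso_S and lasso_T, not code from the package.

mata:
// Sketch: evaluate the lasso objective S(b) and the L1 constraint T(b)
// for a response y, covariate matrix X, and coefficient vector b.
real scalar lasso_S(real colvector y, real matrix X, real colvector b)
{
    real colvector r
    r = y - X*b          // residuals y_i - mu_i
    return(r'*r)         // sum of squared differences
}

real scalar lasso_T(real colvector b)
{
    return(sum(abs(b)))  // L1 norm, constrained to be <= t
}
end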
Lasso graphically

[Figure: the L1 constraint region in the (b1, b2) plane]
One property of this constraint is that there will be coefficients equal to 0 for a subset of variables.


Ridge Regression

[Figure: the L2 constraint region in the (b1, b2) plane]
The coefficients are shrunk, but ridge regression does not have the property of parsimony.
Forward Stagewise

Works with the constrained problem through many small steps.
The vector of current correlations is $c(\hat{\mu}) = X'(y - \hat{\mu})$.
Move the mean $\hat{\mu}$ a small step ε in the direction of the covariate with the greatest current correlation (a sketch is given below).
FORWARD STEPWISE, by contrast, is greedy: it selects the best covariate and fits it fully at each step.
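A minimal Mata sketch of this idea; it is my own illustration under simple assumptions (standardised covariates, a fixed number of steps), not the translated package code.

mata:
// Sketch of Forward Stagewise: repeatedly nudge the coefficient of the
// covariate most correlated with the current residual by a small step eps.
real colvector stagewise(real colvector y, real matrix X,
                         real scalar eps, real scalar nsteps)
{
    real colvector b, c
    real scalar j, s

    b = J(cols(X), 1, 0)                       // start with all coefficients 0
    for (s = 1; s <= nsteps; s++) {
        c = X'*(y - X*b)                       // current correlations c = X'(y - mu)
        j = select(1::rows(c), abs(c):==max(abs(c)))[1]
        b[j] = b[j] + eps*sign(c[j])           // small step toward greatest correlation
    }
    return(b)
}
end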
Least Angle Regression

LARS (the S suggesting LaSso and Stagewise) starts like classic Forward Selection:
Find the predictor x_j1 most correlated with the current residual.
Take a step large enough that another predictor x_j2 has as much correlation with the current residual.
Then step in the direction equiangular between the two predictors until a third, x_j3, earns its way into the "correlated set", and so on (a sketch of the equiangular direction follows below).
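The key computation at each step is the equiangular direction. Here is a Mata sketch of the standard formulae, with Xa standing for the sign-adjusted active columns; this is my own illustration, not the package code.

mata:
// Sketch: the unit vector u that makes equal angles with every column
// of the (sign-adjusted) active covariate matrix Xa.
real colvector equiangular(real matrix Xa)
{
    real matrix Ginv
    real colvector w
    real scalar Aa

    Ginv = invsym(Xa'*Xa)                  // inverse Gram matrix
    Aa   = 1/sqrt(sum(Ginv))               // normalising constant
    w    = Aa*Ginv*J(cols(Xa), 1, 1)       // weights on the active columns
    return(Xa*w)                           // equiangular direction
}
end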


Least Angle Regression geometrically

Two covariates x1 and x2 and the space L(x1, x2) that is spanned by them.
Start at μ0 = 0; y2 is the projection of y onto L(x1, x2).
[Figure: the LARS path μ0 → μ1 in the plane spanned by x1 and x2, with y2 the projection of y onto L(x1, x2)]
Continued…

The current correlations only depend on the projection of y onto L(x1, x2), i.e. y2.
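This can be checked numerically; the following is a toy Mata sketch with invented values, purely for illustration.

mata:
// Sketch: the correlations X'(y - mu) are unchanged if y is replaced by
// its projection y2 onto the space spanned by the columns of X.
X  = (1, 0 \ 1, 1 \ 0, 1 \ 1, -1)     // toy covariates x1 and x2
y  = (2 \ 1 \ 0 \ 3)
mu = J(4, 1, 0)                        // start at mu0 = 0
y2 = X*invsym(X'*X)*X'*y               // projection of y onto L(x1, x2)
X'*(y - mu)                            // current correlations...
X'*(y2 - mu)                           // ...are identical using y2
end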


Programming similarities

Comparing the S-Plus and Mata code, the two look incredibly similar.


Programming similarities

There are some differences though:
Arrays of arrays… beta[[k]] = array
Indexing on the left-hand side… beta[positive] = beta0
Being able to "join" null matrices
Row and column vectors are not very strict in S-Plus
Being able to use the minus sign in indexing: beta[-positive]
"Local"-ness of Mata functions within Mata functions? "Local" is from the first call of Mata
Mata is not the easiest language to debug when you don't know what you are doing (thanks to Statalist/Kit for push-starting me).
Stata command

The lars command is very simple to use:
lars y <varlist>, a(lar)
lars y <varlist>, a(lasso)
lars y <varlist>, a(stagewise)
Not everything in the S-Plus package is implemented, because I didn't have all the data required to test all the code.




Graph output

[Figure: coefficient paths; Beta on the y-axis against Sum mod(beta) / Max sum mod(beta) on the x-axis, with one trace per covariate: age, sex, bmi, bp, s1, s2, s3, s4, s5, s6]


Conclusions

Mata could be a little easier to use.
Translating S-Plus code is pretty simple.
Least Angle Regression, the Lasso, and Forward Stagewise are all very attractive algorithms and certainly an improvement over Stepwise.
