Correlation Regression 15 16
• From the scatter diagram, we can observe that there exists a positive correlation between
the two variables.
Coefficient of Correlation
• It can be shown that $r$ takes values between $-1$ and $+1$. The sign of $r$ gives the direction of the
relation, i.e., whether the variables are positively or negatively correlated. The magnitude
of $r$, i.e., $|r|$, gives the strength of the relationship.
• The test statistic for testing $H_0: \rho = 0$ (no linear correlation in the population) is
$t = \dfrac{r\sqrt{n-2}}{\sqrt{1-r^2}}$, which follows a $t$-distribution with $n-2$ df.
Coefficient of Correlation (cont…)
• Example: Suppose that a study was conducted to investigate the relationship between working
memory (WM) and reading comprehension skill (RC) among healthy African-American adults. The
following scores were obtained from a sample of 10 subjects.
Subject     1   2   3   4   5   6   7   8   9  10
WM scores  21  38  14  29  45  33  38  53  31  40
RC scores  44  58  40  49  70  55  55  80  43  56
• Find the coefficient of correlation. Test whether there is a positive correlation between working
memory (WM) and reading comprehension skill (RC) at the 5% level of significance, and comment.
• Let us consider WM scores as variable $X$, and RC scores as variable $Y$.
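As a rough check on this example, the Python sketch below computes the correlation coefficient for the ten WM/RC pairs and the statistic $t = r\sqrt{n-2}/\sqrt{1-r^2}$. The helper name `pearson_r` is illustrative only, and the one-tailed 5% critical value of 1.860 (8 df) is an assumption taken from a standard t table, not from the slides.

```python
import math

wm = [21, 38, 14, 29, 45, 33, 38, 53, 31, 40]  # X: working memory scores
rc = [44, 58, 40, 49, 70, 55, 55, 80, 43, 56]  # Y: reading comprehension scores


def pearson_r(x, y):
    """Pearson correlation from sums of squares and cross-products."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


n = len(wm)
r = pearson_r(wm, rc)

# Test statistic for H0: rho = 0 against H1: rho > 0, with n - 2 df.
t_stat = r * math.sqrt(n - 2) / math.sqrt(1 - r ** 2)

T_CRIT = 1.860  # assumed: upper 5% point of t with 8 df, from a t table
reject_h0 = t_stat > T_CRIT

print(f"r = {r:.4f}, t = {t_stat:.2f}, reject H0: {reject_h0}")
```

For these data the sketch gives $r \approx 0.92$ and $t \approx 6.78$, well above the critical value, which supports a positive correlation between WM and RC at the 5% level.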
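Estimate of $\beta_1$ (slope): $b_1 = \dfrac{\sum (X - \bar{X})(Y - \bar{Y})}{\sum (X - \bar{X})^2}$
Estimate of $\beta_0$ (intercept): $b_0 = \bar{Y} - b_1 \bar{X}$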
• In the fitted line $\hat{Y} = b_0 + b_1 X$, $\hat{Y}$ (read "Y hat") is the estimated value of the $Y$ variable
for a selected $X$ value. $b_0$ is the estimated intercept and $b_1$ is the slope of the line, or the average
change in $\hat{Y}$ for each unit change (either increase or decrease) in the independent variable $X$.
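To make the intercept and slope concrete, here is a minimal sketch that fits a least-squares line to the WM/RC scores from the correlation example, using the standard formulas $b_1 = S_{XY}/S_{XX}$ and $b_0 = \bar{Y} - b_1\bar{X}$. Reusing that data set for regression, the function name `fit_line`, and the WM score of 30 used for the prediction are all choices made for illustration.

```python
wm = [21, 38, 14, 29, 45, 33, 38, 53, 31, 40]  # X: working memory scores
rc = [44, 58, 40, 49, 70, 55, 55, 80, 43, 56]  # Y: reading comprehension scores


def fit_line(x, y):
    """Return (b0, b1) of the least-squares line y_hat = b0 + b1 * x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    b1 = sxy / sxx               # slope: average change in y per unit of x
    b0 = mean_y - b1 * mean_x    # intercept: the line passes through the means
    return b0, b1


b0, b1 = fit_line(wm, rc)

# Estimated RC score for a hypothetical subject with a WM score of 30.
rc_hat_at_30 = b0 + b1 * 30

print(f"b0 = {b0:.3f}, b1 = {b1:.3f}, y_hat(30) = {rc_hat_at_30:.2f}")
```

Here the slope is roughly 1.01, so each additional WM point is associated with about one additional RC point on average.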
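• Multiple Linear Regression Model: The dependent variable ($Y$) can be modeled by the
values of $k$ independent variables ($X_1, X_2, \dots, X_k$) as
$Y = \beta_0 + \beta_1 X_1 + \beta_2 X_2 + \dots + \beta_k X_k + \varepsilon$, where $\beta_0, \beta_1, \dots, \beta_k$ are the
regression parameters and $\varepsilon$ is the random error term such that $\varepsilon \sim N(0, \sigma^2)$.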
• In the multiple linear regression model, it is assumed that there is no correlation between the
independent (explanatory) variables. When two of the explanatory variables in a model are
highly correlated (and could therefore be used to predict one another), we say that they are
collinear. This leads to a problem called multicollinearity.
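One simple way to screen for collinearity is to look at the correlation between each pair of explanatory variables. The sketch below does this for two made-up predictors, `x1` and `x2`; the data are hypothetical, and the cut-off of 0.8 for "highly correlated" is an assumed rule of thumb rather than a value given in these slides.

```python
import math

# Hypothetical predictors: x2 is close to 2 * x1, so they carry nearly the same information.
x1 = [1, 2, 3, 4, 5, 6]
x2 = [2.1, 3.9, 6.2, 7.8, 10.1, 12.0]


def pearson_r(x, y):
    """Pearson correlation between two equally long lists of numbers."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


HIGH_CORRELATION = 0.8  # assumed rule-of-thumb threshold

r12 = pearson_r(x1, x2)
collinear = abs(r12) > HIGH_CORRELATION

print(f"r(x1, x2) = {r12:.3f}, flagged as collinear: {collinear}")
```

With predictors this strongly related, the individual coefficients of `x1` and `x2` would be hard to separate, so one of the two would usually be dropped or the pair combined.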
Multiple Linear Regression – Interpreting the results
• Coefficient $\beta_j$ for a quantitative independent variable $X_j$: If we leave all other variables the
same (sometimes called “holding all other variables constant”), then a 1-unit increase in $X_j$
leads to a $|\beta_j|$-unit increase or decrease (based on the sign of $\beta_j$) in the average value of $Y$.
Equivalently, controlling for the other independent variables, the average value of $Y$ changes by
$\beta_j$ units for each 1-unit increase in $X_j$.
Index of goodness of fit: Coefficient of determination ($R^2$)
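• $R^2$ measures how good the fit of the regression line is. Note that, as in ANOVA, in regression
analysis Total SS = Regression SS + Error SS, and $R^2 = \dfrac{\text{Regression SS}}{\text{Total SS}}$. The sums of squares can be
computed as: Total SS $= \sum (Y - \bar{Y})^2$, Regression SS $= \sum (\hat{Y} - \bar{Y})^2$, and
Error SS (Residual SS) $=$ Total SS $-$ Regression SS.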
• Therefore, $R^2$ takes on values between 0 and 1. Values of this ratio closer to 1 imply a
better-fitting estimated regression line.
• For instance, $R^2 = 0.77$ implies that 77% of the total variation of $Y$ (dependent variable) is explained
by the regression line (or by the variation in all independent variables).
• Adjusted $R^2$: The adjusted $R^2$ is a modified version of $R^2$ that accounts for predictors that are not
significant in a regression model. In other words, the adjusted $R^2$ shows whether adding
additional predictors improves a regression model or not.
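The decomposition Total SS = Regression SS + Error SS can be sketched numerically using the WM and RC scores from the earlier example, treating WM as the single predictor ($k = 1$). The adjusted value uses the standard formula $1 - (1 - R^2)\dfrac{n-1}{n-k-1}$; the variable names are illustrative, and reusing this data set here is an assumption made for demonstration.

```python
wm = [21, 38, 14, 29, 45, 33, 38, 53, 31, 40]  # X: working memory scores
rc = [44, 58, 40, 49, 70, 55, 55, 80, 43, 56]  # Y: reading comprehension scores

n = len(wm)
k = 1  # number of independent variables in the model

mean_x, mean_y = sum(wm) / n, sum(rc) / n
sxx = sum((x - mean_x) ** 2 for x in wm)
sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(wm, rc))

# Least-squares line and its fitted values.
b1 = sxy / sxx
b0 = mean_y - b1 * mean_x
fitted = [b0 + b1 * x for x in wm]

total_ss = sum((y - mean_y) ** 2 for y in rc)
regression_ss = sum((f - mean_y) ** 2 for f in fitted)
error_ss = total_ss - regression_ss

r_squared = regression_ss / total_ss
adj_r_squared = 1 - (1 - r_squared) * (n - 1) / (n - k - 1)

print(f"Total SS = {total_ss:.1f}, Regression SS = {regression_ss:.1f}, "
      f"Error SS = {error_ss:.1f}")
print(f"R^2 = {r_squared:.4f}, adjusted R^2 = {adj_r_squared:.4f}")
```

For these data about 85% of the variation in RC is explained by WM, and the adjusted value is slightly lower because it penalizes the model for the predictor it uses.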
Index of goodness of fit (cont…)
Testing significance of $R^2$ ($H_0: R^2 = 0$ vs $H_1: R^2 > 0$)
• In the ANOVA table of the regression output, we look mainly at the Sig. column of the F statistic,
which gives the p-value for the F statistic. If this is greater than 0.05, the model as a whole
is not statistically significant at the 5% level and we need to stop our analysis here.
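The F statistic in that table is the ratio of the regression mean square to the error mean square. The sketch below computes it for the WM/RC example with one predictor. Since the Python standard library has no F distribution, it compares F with a tabulated critical value instead of computing a p-value; the value 5.32 for $F_{0.05}(1, 8)$ is an assumption taken from a standard F table, and rejecting when F exceeds it is equivalent to a p-value below 0.05.

```python
wm = [21, 38, 14, 29, 45, 33, 38, 53, 31, 40]  # X: working memory scores
rc = [44, 58, 40, 49, 70, 55, 55, 80, 43, 56]  # Y: reading comprehension scores

n = len(wm)
k = 1  # number of independent variables

mean_x, mean_y = sum(wm) / n, sum(rc) / n
sxx = sum((x - mean_x) ** 2 for x in wm)
sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(wm, rc))
total_ss = sum((y - mean_y) ** 2 for y in rc)

# For a single predictor, Regression SS simplifies to Sxy^2 / Sxx.
regression_ss = sxy ** 2 / sxx
error_ss = total_ss - regression_ss

ms_regression = regression_ss / k        # df = k
ms_error = error_ss / (n - k - 1)        # df = n - k - 1
f_stat = ms_regression / ms_error

F_CRIT = 5.32  # assumed: upper 5% point of F with (1, 8) df, from an F table
model_significant = f_stat > F_CRIT

print(f"F = {f_stat:.2f}, model significant at 5%: {model_significant}")
```

With these scores F is about 46, far above the critical value, so the analysis would continue to the individual coefficients.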
Test of hypothesis in regression models
Hypothesis for testing regression parameters
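• Step 1: $H_0: \beta_1 = 0$ versus $H_1: \beta_1 \neq 0$
• Step 2: Level of significance $\alpha = 0.05$
• Step 3: $t = \dfrac{b_1}{SE(b_1)}$ follows a $t$ distribution with $n-2$ df, where $SE(b_1)$ is the standard error of $b_1$
• Step 4: Here $\alpha = 0.05$; at 10 df the critical values are $-2.228$ and $2.228$. $H_0$ may be rejected if
$t < -2.228$ or $t > 2.228$.
• Step 5: The observed sample yields a value of $t$ that falls in the rejection region, so we may
reject $H_0$ and conclude that changes in advertising costs can change sales.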
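The five steps can be sketched in Python as follows. The advertising and sales figures behind the example are not reproduced in this text, so the sketch reuses the WM/RC scores from the correlation example instead. With $n = 10$ the test therefore has 8 df rather than 10, and the two-tailed 5% critical value of 2.306 is an assumption taken from a standard t table.

```python
import math

wm = [21, 38, 14, 29, 45, 33, 38, 53, 31, 40]  # X: working memory scores
rc = [44, 58, 40, 49, 70, 55, 55, 80, 43, 56]  # Y: reading comprehension scores

n = len(wm)
mean_x, mean_y = sum(wm) / n, sum(rc) / n
sxx = sum((x - mean_x) ** 2 for x in wm)
sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(wm, rc))
syy = sum((y - mean_y) ** 2 for y in rc)

# Step 1: H0: beta1 = 0 versus H1: beta1 != 0.
# Step 2: significance level.
alpha = 0.05

# Step 3: t = b1 / SE(b1), with n - 2 df.
b1 = sxy / sxx
error_ss = syy - b1 * sxy            # Error SS = Total SS - Regression SS
mse = error_ss / (n - 2)             # estimate of the error variance
se_b1 = math.sqrt(mse / sxx)
t_stat = b1 / se_b1

# Step 4: two-tailed rejection region at 8 df.
T_CRIT = 2.306  # assumed: upper 2.5% point of t with 8 df, from a t table

# Step 5: decision.
reject_h0 = abs(t_stat) > T_CRIT

print(f"b1 = {b1:.3f}, SE(b1) = {se_b1:.3f}, t = {t_stat:.2f}, "
      f"reject H0: {reject_h0}")
```

For a single predictor this t statistic equals the one used to test the correlation coefficient, so both tests lead to the same conclusion for these data.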
Thank You