5. Regression
The dashed line shows the path to follow to find the corresponding
predicted score. Read the number on the criterion axis at the point where
your line stops.
On the basis of the information gained from the test, you would thus
expect a score of 7.4 on the criterion variable.
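Reading a predicted score off the regression line is the same as plugging the test score into the line's equation. The sketch below illustrates this with made-up numbers; the data, the variable names, and the test score of 5.5 are assumptions for illustration and do not reproduce the 7.4 from the figure.

```python
import numpy as np

# Hypothetical data (not from the text): test scores and criterion scores.
test_scores = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
criterion = np.array([2.1, 3.9, 4.2, 5.8, 6.1, 7.9])

# Fit the least-squares regression line:
#   predicted criterion = intercept + slope * test score
slope, intercept = np.polyfit(test_scores, criterion, deg=1)


def predict(test_score):
    """Return the criterion score read off the fitted regression line."""
    return intercept + slope * test_score


# "Follow the dashed line": go up from the test score to the regression
# line, then across to the criterion axis.
predicted = predict(5.5)
print(f"predicted criterion score: {predicted:.2f}")
```

The predicted value is an expectation, not a guarantee: individual scores scatter around the line.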
◦ This chapter began with a discussion of a claim made in a grocery store tabloid
that poor diet causes marital problems. Actually, there was no specific
evidence that diet causes the problems, only that diet and marital difficulties are
associated.
◦ So other factors may influence this relationship.
Multivariate analysis
◦ When the analysis involves more than two variables, such as several predictors acting on
one dependent variable, multivariate analysis is required.
◦ E.g., predicting success in the first year of college from a linear combination of
SAT verbal and quantitative scores.
◦ Suppose we want to predict success in law school from three variables: undergraduate GPA,
rating by former professors, and age.
 ◦ This type of multivariate analysis is called multiple regression.
 ◦ The goal of the analysis is to find the linear combination of the three variables that provides the
best prediction of law school success.
◦ We find the correlation between the criterion (law school GPA) and some composite of the
predictors (undergraduate GPA plus professor rating plus age).
◦ Multiple regression is appropriate when the criterion variable is continuous (not nominal).
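As a rough illustration of the law school example, the sketch below fits a multiple regression by ordinary least squares. All data are simulated; the generating weights, sample size, and variable names are assumptions made up for this example, not values from the text.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200

# Simulated predictors (hypothetical distributions).
undergrad_gpa = rng.normal(3.2, 0.4, n)
professor_rating = rng.normal(5.0, 1.0, n)
age = rng.normal(24.0, 3.0, n)

# Simulated criterion: law school GPA (hypothetical generating model).
law_gpa = (0.5 + 0.6 * undergrad_gpa + 0.1 * professor_rating
           + 0.01 * age + rng.normal(0.0, 0.2, n))

# Design matrix with a column of ones for the intercept.
X = np.column_stack([np.ones(n), undergrad_gpa, professor_rating, age])

# Least squares finds the weights of the best linear combination.
coef, *_ = np.linalg.lstsq(X, law_gpa, rcond=None)

# The composite is the weighted sum of the predictors; the multiple
# correlation R is its correlation with the criterion.
composite = X @ coef
multiple_r = np.corrcoef(law_gpa, composite)[0, 1]

print("weights (intercept, GPA, rating, age):", np.round(coef, 3))
print("multiple R:", round(multiple_r, 3))
```

The multiple correlation is never smaller than the correlation of the criterion with any single predictor, which is why combining predictors can improve prediction.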
Factor Analysis
 ◦ Factor analysis is another multivariate technique.
◦ Factor analysis is used to study the interrelationships among a set of variables without reference to
a criterion.
◦ You might think of factor analysis as a data-reduction technique. When we have responses to a
large number of items or a large number of tests, we often want to reduce all this information to
more manageable chunks.
◦ In factor analysis, we first create a matrix that shows the correlation between every variable and
every other variable.
◦ Then we find the linear combinations, or principal components, of the variables that describe as
many of the interrelationships among the variables as possible.
◦ Once the linear combinations or principal components have been found, we can find the
correlation between the original items and the factors.
◦ These correlations are called factor loadings.
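The steps above (correlation matrix, principal components, factor loadings) can be sketched in a few lines. The six simulated items and the two underlying traits are assumptions for illustration only, and the sketch uses principal-components extraction as described here rather than any other factoring method.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 500

# Hypothetical data: two latent traits drive six observed items.
verbal = rng.normal(size=n)
quant = rng.normal(size=n)
noise = rng.normal(scale=0.5, size=(n, 6))
items = np.column_stack([verbal, verbal, verbal, quant, quant, quant]) + noise

# Step 1: correlation between every variable and every other variable.
R = np.corrcoef(items, rowvar=False)

# Step 2: principal components are the eigenvectors of R,
# ordered from largest to smallest eigenvalue.
eigvals, eigvecs = np.linalg.eigh(R)
order = np.argsort(eigvals)[::-1]
eigvals, eigvecs = eigvals[order], eigvecs[:, order]

# Step 3: factor loadings are the correlations between the original
# items and the components: eigenvector * sqrt(eigenvalue).
k = 2  # number of components kept (a choice made for this example)
loadings = eigvecs[:, :k] * np.sqrt(eigvals[:k])

print(np.round(loadings, 2))
```

Six items are summarized by two components here, which is the data-reduction idea in miniature.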
◦ By examining which variables load highly on each factor, we
can start interpreting the meanings of the factors. Factor analysts
often use methods that help them get a clearer picture of the meaning
of the components by transforming the variables in a way that
pushes the factor loadings toward the high or the low extreme.
◦ Because these transformational methods involve rotating the
axes in the space created by the factors, they are called
methods of rotation.
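One widely used rotation method is varimax, which picks the rotation that maximizes the variance of the squared loadings within each factor. The text does not name a specific method, so varimax is an assumption here. The sketch is limited to two factors and the raw (unnormalized) criterion, where the best angle has a closed form; the loading matrix is hypothetical.

```python
import numpy as np


def varimax_criterion(loadings):
    """Sum over factors of the variance of the squared loadings."""
    return float(np.sum(np.var(loadings**2, axis=0)))


def rotate(loadings, angle):
    """Rotate a two-factor loading matrix by `angle` radians."""
    c, s = np.cos(angle), np.sin(angle)
    return loadings @ np.array([[c, -s], [s, c]])


def varimax_two_factors(loadings):
    """Return (rotated loadings, angle) maximizing the raw varimax criterion."""
    p = loadings.shape[0]
    x, y = loadings[:, 0], loadings[:, 1]
    u, v = x**2 - y**2, 2 * x * y
    a, b = u.sum(), v.sum()
    c, d = np.sum(u**2 - v**2), 2 * np.sum(u * v)
    # Closed-form angle for the two-factor case; arctan2 selects the
    # quadrant that gives a maximum rather than a minimum.
    angle = np.arctan2(d - 2 * a * b / p, c - (a**2 - b**2) / p) / 4
    return rotate(loadings, angle), angle


# Hypothetical unrotated loadings: every item loads on both factors,
# which makes the factors hard to interpret.
unrotated = np.array([
    [0.78, 0.45],
    [0.69, 0.40],
    [0.74, 0.43],
    [0.45, -0.78],
    [0.40, -0.69],
    [0.43, -0.74],
])

rotated, angle = varimax_two_factors(unrotated)
print("rotation angle in degrees:", round(float(np.degrees(angle)), 1))
print(np.round(rotated, 2))
```

After rotation each item loads highly on one factor and close to zero on the other, while each item's communality (the sum of its squared loadings) is unchanged.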