Correlation, Regression Analysis in Civil Engineering
▪ Introduction
▪ Scatter diagrams
▪ Correlation analysis
o Pearson correlation coefficient with example
o Spearman rank correlation coefficient with example
o Kendall’s rank correlation coefficient with example
o Differences between Spearman and Kendall’s tau
▪ Regression Analysis
o Regression (curve fitting)
o Methods of regression
o Multiple regression model
▪ Some Statistical software Packages for regression analysis
▪ Conclusion
MEFGI - GTU 10/12/2018
CORRELATION AND REGRESSION – Introduction
A scatter diagram is a diagram that shows the values of two variables X and Y, along with
the way in which these two variables relate to each other.
[Figure: scatter diagram; x-axis: Temp. (x), °C]
CORRELATION
Correlation is a bivariate analysis that measures the strength of relationship
or association between two variables and the direction of the relationship.
Correlation coefficient:
Statistic showing the degree of relation between two variables
i. Pearson correlation
ii. Spearman correlation
iii. Kendall rank correlation
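$$r = \frac{\sum xy - \frac{\sum x \sum y}{n}}{\sqrt{\left(\sum x^2 - \frac{(\sum x)^2}{n}\right)\left(\sum y^2 - \frac{(\sum y)^2}{n}\right)}}$$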
[Figure: scale of r from -1 to +1, marked at -1, -0.75, -0.25, 0, 0.25, 0.75 and 1.
r = -1: perfect indirect correlation; r = 0: no relation; r = +1: perfect direct correlation.]
If r = 1, the correlation is perfect.
Example 1 - Pearson correlation
A sample of 6 concrete cubes was selected, and their age in days and
strength in N/mm2 were recorded as shown in the following table.
It is required to find the correlation between age and strength.
Serial No.   Age (days)   Strength (N/mm2)
1 7 12
2 6 8
3 8 12
4 5 10
5 6 11
6 9 13
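As an illustration, the sum-based Pearson formula can be evaluated directly on the table above. This is a minimal sketch rather than the original worked solution; the function name `pearson_r` and the variable names are chosen here for illustration only.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson correlation coefficient from the sums of x, y, xy, x^2 and y^2."""
    n = len(x)
    sum_x, sum_y = sum(x), sum(y)
    sum_xy = sum(a * b for a, b in zip(x, y))
    sum_x2 = sum(a * a for a in x)
    sum_y2 = sum(b * b for b in y)
    numerator = sum_xy - sum_x * sum_y / n
    denominator = sqrt((sum_x2 - sum_x ** 2 / n) * (sum_y2 - sum_y ** 2 / n))
    return numerator / denominator

# Data from the table: age in days and strength in N/mm2 for the 6 cubes
age = [7, 6, 8, 5, 6, 9]
strength = [12, 8, 12, 10, 11, 13]

r = pearson_r(age, strength)
print(round(r, 2))  # 0.76
```

For these six cubes the sketch gives r of about 0.76, which would indicate a fairly strong direct correlation between age and strength.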
Spearman rank correlation (example result):
$\sum d_i^2 = 64$
$r_s = -0.1$: a weak negative (indirect) correlation
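The Spearman coefficient quoted above can be checked with the usual no-ties formula, r_s = 1 - 6 * sum(d_i^2) / (n (n^2 - 1)). The sample size is not stated in this part of the text; the sketch below assumes n = 7, inferred from the 21 pairs (7 + 14) counted in the Kendall example, and that assumption should be verified against the original data. The function name `spearman_from_d2` is hypothetical.

```python
def spearman_from_d2(sum_d2, n):
    """Spearman's rank correlation from the sum of squared rank differences (no ties)."""
    return 1 - 6 * sum_d2 / (n * (n ** 2 - 1))

# sum of squared rank differences from the example; n = 7 is an ASSUMPTION (see lead-in)
rs = spearman_from_d2(64, 7)
print(round(rs, 2))  # -0.14
```

Under that assumption the result is about -0.14, consistent with the value of -0.1 quoted above when rounded to one decimal place.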
Kendall rank correlation coefficient, tau
• Kendall rank correlation is a non-parametric test that measures the
degree of concordance between 2 columns of ranked data.
• Kendall’s tau = (C – D) / (C + D)
C = number of concordant pairs
D = number of discordant pairs
For the example data:
tau = (C – D) / (C + D)
= (7 – 14) / (7 + 14) = -0.33 (weak negative relationship)
For comparison, Spearman gives r_s = -0.1.
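The pair counting behind Kendall's tau can be sketched as follows. The helper names `kendall_counts` and `kendall_tau` are illustrative, and the short rank lists are made-up values used only to show the counting; the final line reproduces the calculation above from C = 7 and D = 14.

```python
from itertools import combinations

def kendall_counts(x, y):
    """Count concordant and discordant pairs; tied pairs are counted as neither."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
    return concordant, discordant

def kendall_tau(c, d):
    """Kendall's tau from the concordant (c) and discordant (d) pair counts."""
    return (c - d) / (c + d)

# Hypothetical ranks (not from the source): only the pair (3, 4) is out of order
c, d = kendall_counts([1, 2, 3, 4], [1, 2, 4, 3])
print(c, d)  # 5 1

# Counts taken from the example in the text
tau_example = kendall_tau(7, 14)
print(round(tau_example, 2))  # -0.33
```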
Pearson vs Spearman rs vs Kendall’s tau
▪ Pearson: parametric statistic
▪ Spearman rs and Kendall’s tau: non-parametric statistics
▪ For example
1) Relationship between strength of concrete and number of
curing days
2) Relationship between strength of road subgrade with lime
content, ground temperature and delay in compaction
r = a + bt , r = 1090.26 – 0.534t
The method of least squares (MLS) can be used to fit the data in the following situations:
1. Relationship is linear: y = f(x) = a + bx
2. Relationship is a polynomial: f(x) = a + bx + cx^2
3. Relationship is transcendental: f(x) = a e^(bx)
4. Multiple linear regression
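For the linear case (situation 1), a least-squares fit can be sketched in a few lines. This is an illustrative sketch, not the original worked solution: the function name `fit_line` is hypothetical, and the data are the age and strength values from the Pearson example earlier in the text.

```python
def fit_line(x, y):
    """Least-squares estimates (a, b) for the straight line y = a + b*x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of cross-products and squares about the means
    s_xy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    s_xx = sum((xi - mean_x) ** 2 for xi in x)
    b = s_xy / s_xx            # slope
    a = mean_y - b * mean_x    # intercept
    return a, b

# Age (days) and strength (N/mm2) from the Pearson example
age = [7, 6, 8, 5, 6, 9]
strength = [12, 8, 12, 10, 11, 13]

a, b = fit_line(age, strength)
print(round(a, 2), round(b, 2))  # 4.69 0.92
```

With these data the fitted line is approximately strength = 4.69 + 0.92 * age. The polynomial and multiple-regression cases follow the same principle but require solving a larger system of normal equations.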
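For the linear model y = a + bx, the normal equations are:
$\sum y = na + b\sum x$ ..... eqn (1)
$\sum xy = a\sum x + b\sum x^2$ ..... eqn (2)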
For the quadratic model $y = a_1 + a_2 x + a_3 x^2$,
the normal equations are as follows:
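The equations themselves are missing from this part of the text. For a quadratic model fitted by least squares to n data points, the standard normal equations, obtained by setting the partial derivatives of the sum of squared residuals with respect to a_1, a_2 and a_3 to zero, take the form below. The notation is assumed to match the model above.

```latex
\begin{aligned}
\sum y     &= n\,a_1 + a_2 \sum x + a_3 \sum x^2 \\
\sum xy    &= a_1 \sum x + a_2 \sum x^2 + a_3 \sum x^3 \\
\sum x^2 y &= a_1 \sum x^2 + a_2 \sum x^3 + a_3 \sum x^4
\end{aligned}
```

Solving these three simultaneous equations gives the coefficients a_1, a_2 and a_3.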