Mas202 Finalproject-1
MAS202 |MK1616
Lecturer: Mr. Nguyễn Việt Anh
RS IN 5 COUNTRIES (1970-2015)
Assigned Job
Linear Regression
95% confidence intervals for the mean years in school of women (15-24):
CHINA 6.772778 to 8.5515694598
JAPAN 11.45991 to 13.119220466
VIETNAM 5.914793 to 7.5669460992
SOUTH KOREA 10.18013 to 11.983344623
NORTH KOREA 8.308508 to 10.222796754
With 95% confidence, the mean years in school of women in China falls in the interval from 6.773 to 8.552.
With 95% confidence, the mean years in school of women in Japan falls in the interval from 11.460 to 13.119.
With 95% confidence, the mean years in school of women in Vietnam falls in the interval from 5.915 to 7.567.
With 95% confidence, the mean years in school of women in South Korea falls in the interval from 10.180 to 11.983.
With 95% confidence, the mean years in school of women in North Korea falls in the interval from 8.309 to 10.223.
In each case the interval describes the mean, not individual people: intervals built this way miss the true mean about 5% of the time.
… period and was embargoed by the great powers for a long time, which led Vietnam's economy to grow more slowly than the others
… education due to having a leading economy in Asia.
CHINA
Mean 7.66217391304348
Standard Error 0.428857272166412
Median 7.58
Mode #N/A
Standard Deviation 2.05672722485758
Sample Variance 4.23012687747036
Range 6.57
Minimum 4.53
Maximum 11.1
Sum 176.23
Count 23
Confidence Level(95.0%) 0.889395546720707
JAPAN
Mean 12.2895652173913
Standard Error 0.400051122356089
Median 12.3
Mode #N/A
Standard Deviation 1.91857778353196
Sample Variance 3.68094071146243
Range 6.18
Minimum 9.12
Maximum 15.3
Sum 282.66
Count 23
Confidence Level(95.0%) 0.82965524843907
VIETNAM
Mean 6.74086956521739
Standard Error 0.398325503497521
Median 6.6
Mode #N/A
Standard Deviation 1.91030200621282
Sample Variance 3.64925375494074
Range 6.2
Minimum 3.9
Maximum 10.1
Sum 155.04
Count 23
Confidence Level(95.0%) 0.82607653396282
SOUTH KOREA
Mean 11.0817391304348
Standard Error 0.434744780963118
Median 11.1
Mode #N/A
Standard Deviation 2.0849627251386
Sample Variance 4.34706956521739
Range 6.66
Minimum 7.64
Maximum 14.3
Sum 254.88
Count 23
Confidence Level(95.0%) 0.901605492651244
NORTH KOREA
Mean 9.26565217391305
Standard Error 0.461525150838503
Median 9.23
Mode #N/A
Standard Deviation 2.21339686719295
Sample Variance 4.89912569169956
Range 7.11
Minimum 5.79
Maximum 12.9
Sum 213.11
Count 23
Confidence Level(95.0%) 0.957144580484314
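The descriptive statistics and the "Confidence Level(95.0%)" margin above can be reproduced with a few lines of Python. The sketch below is only an illustration for the China column. It assumes the t critical value for df = 22 reported in the t-test output of this report (2.07387306790403) instead of computing it, and the function name `mean_confidence_interval` is a hypothetical helper, not part of the original workbook.

```python
import math
import statistics

# Mean years in school of women (15-24) in China, 1970-2014, from the data table.
china_women = [4.53, 4.77, 5.01, 5.27, 5.53, 5.8, 6.08, 6.37, 6.66, 6.96, 7.27,
               7.58, 7.89, 8.2, 8.52, 8.84, 9.16, 9.48, 9.81, 10.1, 10.5, 10.8, 11.1]

# Two-tailed t critical value for df = 22 and alpha = 0.05.
# Assumption: copied from the t-test output in this report, not computed here.
T_CRIT_DF22 = 2.07387306790403


def mean_confidence_interval(values, t_crit):
    """Return mean, standard error, margin of error and the CI for the mean."""
    n = len(values)
    mean = statistics.mean(values)
    std_err = statistics.stdev(values) / math.sqrt(n)  # sample SD / sqrt(n)
    margin = t_crit * std_err
    return mean, std_err, margin, (mean - margin, mean + margin)


mean, std_err, margin, (low, high) = mean_confidence_interval(china_women, T_CRIT_DF22)
print(f"mean={mean:.4f} SE={std_err:.4f} margin={margin:.4f} CI=({low:.3f}, {high:.3f})")
```

The same function can be applied to the other four columns; only the data list changes, because every country has 23 observations and therefore the same critical value.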
MEAN YEARS IN SCHOOL OF PEOPLE (15-24) IN 5 EASTERN COUNTRIES (1970-2015): WOMEN
No. Year CHINA JAPAN VIETNAM SOUTH KOREA NORTH KOREA
1 1970 4.53 9.12 3.9 7.64 5.79
2 1972 4.77 9.42 4.11 7.95 6.07
3 1974 5.01 9.72 4.33 8.27 6.36
4 1976 5.27 10 4.56 8.58 6.66
5 1978 5.53 10.3 4.79 8.9 6.96
6 1980 5.8 10.6 5.03 9.22 7.27
7 1982 6.08 10.9 5.28 9.55 7.59
8 1984 6.37 11.2 5.53 9.87 7.91
9 1986 6.66 11.5 5.79 10.2 8.24
10 1988 6.96 11.8 6.05 10.5 8.56
11 1990 7.27 12 6.32 10.8 8.9
12 1992 7.58 12.3 6.6 11.1 9.23
13 1994 7.89 12.6 6.88 11.5 9.57
14 1996 8.2 12.8 7.17 11.8 9.9
15 1998 8.52 13.2 7.46 12.1 10.2
16 2000 8.84 13.4 7.77 12.4 10.6
17 2002 9.16 13.7 8.08 12.7 10.9
18 2004 9.48 14 8.4 12.9 11.3
19 2006 9.81 14.3 8.72 13.2 11.6
20 2008 10.1 14.6 9.05 13.5 11.9
21 2010 10.5 14.8 9.39 13.8 12.2
22 2012 10.8 15.1 9.73 14.1 12.5
23 2014 11.1 15.3 10.1 14.3 12.9
Anova: Single Factor
SUMMARY
Groups Count Sum Average Variance
CHINA 23 176.23 7.662174 4.230127
JAPAN 23 282.66 12.28957 3.680941
VIETNAM 23 155.04 6.74087 3.649254
SOUTH KOREA 23 254.88 11.08174 4.34707
NORTH KOREA 23 213.11 9.265652 4.899126
ANOVA
Source of Variation SS df MS F P-value F crit
Between Groups 489.5913 4 122.3978 29.41334 1.245E-16 2.454213
Within Groups 457.7434 110 4.161303
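As a check on the single-factor ANOVA table, the sums of squares and the F statistic can be rebuilt from the SUMMARY rows alone. The sketch below is an illustration under that assumption: it uses only the counts, sums and variances printed above, and the helper name `one_way_anova` is hypothetical.

```python
# (count, sum, variance) per group, copied from the SUMMARY table above.
summary = {
    "CHINA": (23, 176.23, 4.230127),
    "JAPAN": (23, 282.66, 3.680941),
    "VIETNAM": (23, 155.04, 3.649254),
    "SOUTH KOREA": (23, 254.88, 4.34707),
    "NORTH KOREA": (23, 213.11, 4.899126),
}

F_CRIT = 2.454213  # F crit reported in the ANOVA table above


def one_way_anova(groups):
    """Rebuild SS between, SS within and F from per-group summary statistics."""
    total_n = sum(n for n, _, _ in groups.values())
    grand_mean = sum(s for _, s, _ in groups.values()) / total_n
    ss_between = sum(n * (s / n - grand_mean) ** 2 for n, s, _ in groups.values())
    ss_within = sum((n - 1) * var for n, _, var in groups.values())
    df_between = len(groups) - 1
    df_within = total_n - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return ss_between, ss_within, df_between, df_within, f_stat


ss_between, ss_within, df_between, df_within, f_stat = one_way_anova(summary)
print(f"SSB={ss_between:.4f} SSW={ss_within:.4f} F={f_stat:.4f}")
print("Reject H0 (all means equal)" if f_stat > F_CRIT else "Fail to reject H0")
```

Because F is far above F crit (and the P-value is about 1.2E-16), the hypothesis that the five countries share the same mean is rejected.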
1994 - 2014
Count 11 11 11 11 11 55
Sum 104.4 153.8 92.75 142.3 123.57 616.82
Average 9.490909 13.98182 8.431818 12.93636 11.23364 11.21491
Variance 1.149929 0.847636 1.135896 0.874545 1.202445 5.311388
Total
Count 22 22 22 22 22
Sum 171.7 273.54 151.14 247.24 207.32
Average 7.804545 12.43364 6.87 11.23818 9.423636
Variance 3.943159 3.356091 3.421248 3.964358 4.531024
ANOVA
Source of Variation SS df MS F P-value F crit
Sample 303.4481 1 303.4481 306.34 3.284E-32 3.936143
Columns 471.0247 4 117.7562 118.8784 4.204E-37 2.462615
Interaction 1.029418 4 0.257355 0.259807 0.90303 2.462615
Within 99.05598 100 0.99056
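The F column of the two-factor table follows directly from the SS and df columns: each mean square is SS / df, and each F is that mean square divided by the Within mean square. The short sketch below only re-derives the printed values. The variable names are illustrative, and the critical values are the ones shown in the table.

```python
# (SS, df, F crit) for each source of variation, copied from the table above.
sources = {
    "Sample": (303.4481, 1, 3.936143),
    "Columns": (471.0247, 4, 2.462615),
    "Interaction": (1.029418, 4, 2.462615),
}
within_ss, within_df = 99.05598, 100

ms_within = within_ss / within_df

# F = (SS / df) / MS_within; an effect is significant when F exceeds F crit.
f_stats = {name: (ss / df) / ms_within for name, (ss, df, _) in sources.items()}
significant = {name: f_stats[name] > crit for name, (_, _, crit) in sources.items()}

for name in sources:
    print(f"{name}: F={f_stats[name]:.4f} significant={significant[name]}")
```

Read this way, the period (Sample) and the country (Columns) effects are significant, while the interaction between them is not (F is about 0.26, below F crit).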
Women in Japan
Mean 12.2895652173913
Variance 3.68094071146243
Observations 23
Hypothesized Mean 12
df 22
t Stat 0.723820534950431
t Critical two-tail 2.07387306790403
H0: µ = 12
H1: µ ≠ 12 => Two-tailed
-t Critical two-tail < t Stat < t Critical two-tail => Fail to reject H0 => Fail to reject claim
The data are consistent with an average of 12 years of schooling for Japanese women, roughly the same number of years of schooling as for men in this country. This suggests that both men and women in Japan are well educated.
Women in Vietnam (claim: the mean years of schooling is smaller than 8)
Mean 6.740869565
Variance 3.649253755
Observations 23
Hypothesized Mean 8
df 22
t Stat -3.161059043
P(T<=t) one-tail 0.002264246
t Critical one-tail 1.717144374
P(T<=t) two-tail 0.004528492
t Critical two-tail 2.073873068
H0: µ >= 8
H1: µ < 8 => Left-tailed
t Stat = -3.161059043
t Critical one-tail = 1.717144374
t Stat < -t Critical one-tail => Reject H0 => Fail to reject claim
Men in Japan
Mean 12.0060869565217
Variance 2.65969762845849
Observations 23
Hypothesized Mean 11
df 22
t Stat 2.95857990796336
P(T<=t) one-tail 0.00362883952497278
t Critical one-tail 1.71714437438024
P(T<=t) two-tail 0.00725767904994555
t Critical two-tail 2.07387306790403
H0: µ <= 11
H1: µ > 11 => Right-tailed
t Stat = 2.95857990796336
t Critical one-tail = 1.71714437438024
t Stat > t Critical one-tail => Reject H0 => Fail to reject claim
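The one-sample t statistics in this section can be recomputed from the mean, variance and number of observations. The sketch below does this for the "Women in Vietnam" test (H1: µ < 8). It is a minimal illustration: `one_sample_t` is a hypothetical helper name, and the critical value is taken from the Excel output above, not computed.

```python
import math


def one_sample_t(mean, variance, n, hypothesized_mean):
    """t statistic for H0: mu = hypothesized_mean, using the sample variance."""
    std_err = math.sqrt(variance / n)
    return (mean - hypothesized_mean) / std_err


# Values for "Women in Vietnam" copied from the t-test output above.
t_stat = one_sample_t(mean=6.740869565, variance=3.649253755, n=23, hypothesized_mean=8)

# One-tailed critical value for df = 22, alpha = 0.05 (from the output above).
t_crit_one_tail = 1.717144374

# Left-tailed test: reject H0 when t falls below -t_crit.
reject_h0 = t_stat < -t_crit_one_tail
print(f"t Stat = {t_stat:.6f}, reject H0: {reject_h0}")
```

For a right-tailed test such as the "Men in Japan" case, the decision rule flips to `t_stat > t_crit_one_tail`.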
MEAN YEARS IN SCHOOL OF PEOPLE (15-24) IN 5 EASTERN COUNTRIES (1970-2015)
CHINA JAPAN VIETNAM SOUTH KOREA NORTH KOREA
No. Year Men Women Men Women Men Women Men Women Men Women
1 1970 5.55 4.53 9.33 9.12 4.5 3.9 8.87 7.64 7.22 5.79
2 1972 5.77 4.77 9.58 9.42 4.69 4.11 9.12 7.95 7.48 6.07
3 1974 6 5.01 9.83 9.72 4.89 4.33 9.37 8.27 7.75 6.36
4 1976 6.23 5.27 10.1 10 5.1 4.56 9.62 8.58 8.02 6.66
5 1978 6.46 5.53 10.3 10.3 5.31 4.79 9.86 8.9 8.29 6.96
6 1980 6.7 5.8 10.6 10.6 5.52 5.03 10.1 9.22 8.56 7.27
7 1982 6.93 6.08 10.8 10.9 5.73 5.28 10.3 9.55 8.84 7.59
8 1984 7.18 6.37 11.1 11.2 5.96 5.53 10.6 9.87 9.11 7.91
9 1986 7.42 6.66 11.3 11.5 6.18 5.79 10.8 10.2 9.39 8.24
10 1988 7.67 6.96 11.5 11.8 6.41 6.05 11 10.5 9.67 8.56
11 1990 7.92 7.27 11.8 12 6.64 6.32 11.2 10.8 9.95 8.9
12 1992 8.17 7.58 12 12.3 6.87 6.6 11.5 11.1 10.2 9.23
13 1994 8.43 7.89 12.2 12.6 7.11 6.88 11.7 11.5 10.5 9.57
14 1996 8.68 8.2 12.5 12.8 7.35 7.17 11.9 11.8 10.8 9.9
15 1998 8.93 8.52 12.7 13.2 7.6 7.46 12.1 12.1 11.1 10.2
16 2000 9.19 8.84 13 13.4 7.85 7.77 12.3 12.4 11.3 10.6
17 2002 9.45 9.16 13.2 13.7 8.11 8.08 12.5 12.7 11.6 10.9
18 2004 9.71 9.48 13.5 14 8.37 8.4 12.7 12.9 11.9 11.3
19 2006 9.97 9.81 13.7 14.3 8.64 8.72 12.9 13.2 12.2 11.6
20 2008 10.2 10.1 13.9 14.6 8.92 9.05 13.2 13.5 12.4 11.9
21 2010 10.5 10.5 14.2 14.8 9.2 9.39 13.4 13.8 12.7 12.2
22 2012 10.8 10.8 14.4 15.1 9.48 9.73 13.6 14.1 12.9 12.5
23 2014 11.1 11.1 14.6 15.3 9.76 10.1 13.8 14.3 13.2 12.9
CLAIM 1: The number of mean years of schooling of Vietnamese women is more than 4 years less than that of Japanese women
Variable 1 (Women in Japan) Variable 2 (Women in Vietnam)
Mean 12.289565217 6.7408695652
Variance 3.6809407115 3.6492537549
Observations 23 23
Pooled Variance 3.6650972332
Hypothesized Mean Difference 4
df 44
t Stat 2.7432933386
P(T<=t) one-tail 0.0043834233
t Critical one-tail 1.6802299766
P(T<=t) two-tail 0.0087668467
t Critical two-tail 2.0153675744
H0: µ1 - µ2 <= 4
H1: µ1 - µ2 > 4 => Right-tailed
t Stat = 2.74329333856881
t Critical one-tail = 1.68022997657212
t Stat > t Critical one-tail => Reject H0 => Fail to reject claim
➪ We can see that the average number of years of schooling for women in Japan is more than 4 years larger than that of women in Vietnam, showing how large the social disparity between the two countries is.
CLAIM 2: The number of mean years of schooling of South Korea men is equal to that of North Korea men
Variable 1 (Men in South Korea) Variable 2 (Men in North Korea)
Mean 11.410434783 10.220869565
Variance 2.2726225296 3.4590264822
Observations 23 23
Pooled Variance 2.8658245059
Hypothesized Mean Difference 0
df 44
t Stat 2.3829360154
P(T<=t) one-tail 0.0107796546
t Critical one-tail 1.6802299766
P(T<=t) two-tail 0.0215593093
t Critical two-tail 2.0153675744
H0: µ1 - µ2 = 0
H1: µ1 - µ2 ≠ 0 => Two-tailed
t Stat = 2.38293601535201
t Critical two-tail = 2.01536757444376
t Stat > t Critical two-tail => Reject H0 => Reject claim
➪ We can see that although the two Koreas were once the same country, the political divide has produced certain differences between them. In this example, the average numbers of years of schooling for men in the two countries are not equal.
CLAIM 3: The number of mean years of schooling of China women is less than that of China men
Variable 1 (Women in China) Variable 2 (Men in China)
Mean 7.662173913 8.2156521739
Variance 4.2301268775 2.8988166008
Observations 23 23
Pooled Variance 3.5644717391
Hypothesized Mean Difference 0
df 44
t Stat -0.9941499613
P(T<=t) one-tail 0.1627929373
t Critical one-tail 1.6802299766
P(T<=t) two-tail 0.3255858747
t Critical two-tail 2.0153675744
H0: µ1 >= µ2
H1: µ1 < µ2 => Left-tailed
t Stat = -0.994149961270969
t Critical one-tail = 1.68022997657212
t Stat > -t Critical one-tail => Fail to reject H0 => Reject claim
➪ Although China has a strong ideology of respecting men and disrespecting women, the data do not show that Chinese women have had significantly fewer years of schooling than men in this country in recent decades.
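The pooled-variance t statistics for the three claims can be recomputed from the summary values alone. The sketch below is an illustration, not the workbook's own procedure: it assumes equal population variances (as the Excel "Two-Sample Assuming Equal Variances" output does), and `pooled_t` is a hypothetical helper name.

```python
import math


def pooled_t(mean1, var1, n1, mean2, var2, n2, hypothesized_diff=0.0):
    """Two-sample t statistic with pooled variance (equal variances assumed)."""
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * var1 + (n2 - 1) * var2) / df
    std_err = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    t_stat = (mean1 - mean2 - hypothesized_diff) / std_err
    return t_stat, pooled_var, df


# Claim 1: women in Japan (Variable 1) vs women in Vietnam (Variable 2),
# hypothesized mean difference 4; values copied from the output above.
t_stat, pooled_var, df = pooled_t(12.289565217, 3.6809407115, 23,
                                  6.7408695652, 3.6492537549, 23,
                                  hypothesized_diff=4)

t_crit_one_tail = 1.6802299766  # df = 44, alpha = 0.05, from the output above
reject_h0 = t_stat > t_crit_one_tail  # right-tailed test
print(f"t Stat = {t_stat:.6f}, pooled variance = {pooled_var:.6f}, df = {df}")
print("Reject H0" if reject_h0 else "Fail to reject H0")
```

Calling the same function with the Claim 2 or Claim 3 values and `hypothesized_diff=0` gives the other two t statistics; only the decision rule (two-tailed or left-tailed) changes.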
[Chart: China's total mean years of women per year from 1970-2015. Trendline: f(x) = 0.283644773363552 x + 4.329347826, R² = 0.998118057355104]
[Chart: trendline f(x) = 0.264754856625347 x + 9.178695652, R² = 0.998444960194022]
R square = 99.95% => About 99.95% of the variation in Y is explained by the variation in X => Strong correlation
1. The estimated regression line has the form …
2. βo = 9.1787, so in 1970 the mean years of …
3. β1 = 0.1414 reveals that after 1 year, the m… about 0.1414 years
Use the regression line for estimation of mean …
[Chart: trendline f(x) = 0.263052728954672 x + 3.65, R² = 0.99731077442249]
R square = 99.52% => About 99.52% of the variation in Y is explained by the variation in X => Strong correlation
1. The estimated regression line has the form …
2. βo = 3.65, so in 1970 the mean years of w…
3. β1 = 0.1405 reveals that after 1 year, the m… about 0.1405 years
Use the regression line for estimation of mean …
[Chart: South Korea's total mean years of women per year from 1970-2015. Trendline: f(x) = 0.28764107306383 x + 7.701956522, R² = 0.998307664061357]
R square = 99.9% => About 99.9% of the variation in Y is explained by the variation in X => Strong correlation
1. The estimated regression line has the form …
2. βo = 7.702, so in 1970 the mean years of …
3. β1 = 0.1536 reveals that after 1 year, the m… about 0.1536 years
Use the regression line for estimation of mean …
[Chart: North Korea's total mean years of women per year from 1970-2015]
R square = 99.95% => About 99.95% of the variation in Y is explained by the variation in X => Strong correlation
1. The estimated regression line has the form …
2. βo = 5.6766, so in 1970 the mean years o…
3. β1 = 0.1631 reveals that after 1 year, the m… about 0.1631 years
Use the regression line for estimation of mean …
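The coefficients βo and β1 quoted above come from an ordinary least-squares fit. The sketch below shows such a fit for the Vietnam women series. It assumes that X is the number of years since 1970 (0, 2, ..., 44), an assumption chosen because it reproduces the intercept 3.65 and slope 0.1405 of the regression output; `fit_line` is a hypothetical helper name.

```python
# Mean years in school of women (15-24) in Vietnam, 1970-2014, from the data table.
vietnam_women = [3.9, 4.11, 4.33, 4.56, 4.79, 5.03, 5.28, 5.53, 5.79, 6.05, 6.32,
                 6.6, 6.88, 7.17, 7.46, 7.77, 8.08, 8.4, 8.72, 9.05, 9.39, 9.73, 10.1]

years = list(range(1970, 2015, 2))       # 1970, 1972, ..., 2014 (23 values)
x = [year - 1970 for year in years]      # assumption: X = years since 1970


def fit_line(xs, ys):
    """Ordinary least squares for y = b0 + b1 * x; also returns R squared."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxx = sum((xi - x_bar) ** 2 for xi in xs)
    syy = sum((yi - y_bar) ** 2 for yi in ys)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    r_squared = sxy ** 2 / (sxx * syy)
    return intercept, slope, r_squared


b0, b1, r2 = fit_line(x, vietnam_women)
print(f"b0 = {b0:.4f}, b1 = {b1:.4f}, R^2 = {r2:.4f}")

# Estimate for a given year from the fitted line, e.g. 2000 (x = 30).
estimate_2000 = b0 + b1 * (2000 - 1970)
print(f"Estimated mean years in 2000: {estimate_2000:.2f}")
```

Under this assumption βo is the fitted value for 1970 and β1 is the average increase per calendar year, which matches the way the two coefficients are interpreted in the text.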
SUMMARY OUTPUT
ANOVA
df SS MS F Significance F
Regression 1 92.901012253 92.9010123 12059.17 1.809E-30
Residual 21 0.16177905138 0.00770376
Total 22 93.0627913043
Coefficients Standard Error t Stat P-value Lower 95%
Intercept 4.3293478261 0.03544077504 122.157256 1.939E-31 4.255645
X Variable 1 0.1514920949 0.0013795303 109.814257 1.809E-30 0.148623
Regression Statistics
Multiple R 0.9997435746
R Square 0.9994872149
Adjusted R Square 0.9994627966
Standard Error 0.0444681222
Observations 23
ANOVA
df SS MS F Significance F
Regression 1 80.9391699605 80.93917 40931.83 4.899E-36
Residual 21 0.0415256917 0.00197741
Total 22 80.9806956522
SUMMARY OUTPUT
Regression Statistics
Multiple R 0.9976193796
R Square 0.9952444266
Adjusted R Square 0.9950179708
Standard Error 0.1348357849
Observations 23
ANOVA
df SS MS F Significance F
Regression 1 79.9017881423 79.9017881 4394.871 7.035E-26
Residual 21 0.3817944664 0.01818069
Total 22 80.2835826087
Coefficients Standard Error t Stat P-value Lower 95%
Intercept 3.65 0.05444485974 67.040305 5.566E-26 3.536776
X Variable 1 0.1404940711 0.00211926329 66.2938257 7.035E-26 0.136087
SUMMARY OUTPUT
Regression Statistics
Multiple R 0.9994859706
R Square 0.9989722055
Adjusted R Square 0.9989232629
Standard Error 0.0684152851
Observations 23
ANOVA
df SS MS F Significance F
Regression 1 95.5372367589 95.5372368 20411.1 7.259E-33
Residual 21 0.09829367589 0.00468065
Total 22 95.6355304348
Coefficients Standard Error t Stat P-value Lower 95%
Intercept 7.7019565217 0.02762516348 278.802206 5.842E-39 7.644507
X Variable 1 0.1536264822 0.00107530803 142.867418 7.259E-33 0.15139
SUMMARY OUTPUT
ANOVA
df SS MS F Significance F
Regression 1 107.732644368 107.732644 47014.66 1.144E-36
Residual 21 0.0481208498 0.00229147
Total 22 107.780765217
Coefficients Standard Error t Stat P-value Lower 95%
Intercept 5.6766304348 0.01932897003 293.685097 1.961E-39 5.636434
X Variable 1 0.1631373518 0.00075237914 216.828649 1.144E-36 0.161573