20171130081511stat 250 Data Analysis
Your Name
STAT 250-0xx (your correct section)
Problem 1
n(1-p) = 256*(1-0.5) = 128 ≥ 10
Assumption 3: We can assume that at least 256*10 = 2560 games were played during
n=256
z = (p_hat - p) / sqrt(p(1-p)/n) = (0.5742 - 0.5) / sqrt(0.5(1-0.5)/256) = 2.38
v. Using the standard normal table, the p-value obtained was 0.0088.
vi. We reject the null hypothesis because the p-value is less than the significance level of 0.05.
vii. We can conclude that the home team wins more than 50% of games in the National
Football League.
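The calculation in part a) can be checked with a short script. This is a minimal sketch that assumes an upper-tailed test (Ha: p > 0.5), as the conclusion suggests; the function names are illustrative, and the values may differ from the table-based ones in the last digit because p_hat is rounded to 0.5742.

```python
import math

def one_proportion_z(p_hat, p0, n):
    """z statistic for a one-proportion z-test, using the null value p0 in the standard error."""
    se = math.sqrt(p0 * (1 - p0) / n)
    return (p_hat - p0) / se

def upper_tail_p(z):
    """P(Z > z) for a standard normal Z, via the complementary error function."""
    return 0.5 * math.erfc(z / math.sqrt(2))

# Values taken from part a); the upper-tailed alternative is an assumption.
z = one_proportion_z(0.5742, 0.5, 256)
p_value = upper_tail_p(z)
print(f"z = {z:.2f}, p-value = {p_value:.4f}")
print("reject H0" if p_value < 0.05 else "do not reject H0")
```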
b)
c)
I am surprised by the result: after simulating 1 test, the result was not to reject the
null hypothesis, which contradicts the result obtained in part a.
d)
After running 1000 tests with a sample size of 256, the null hypothesis is rejected in only
44 of the tests. The histogram of the z-statistic shows the different z-values obtained
across the 1000 tests. Similarly, the histogram of the p-values shows
e) The first red number is the 30th run, which shows that the null hypothesis is rejected in
the 30th test.
f) Yes, I am surprised because the null hypothesis is rejected in only 44 of the 1000
runs. Thus, simulating the test 1000 times suggests that the home team does not win more than 50%
When the sample comes from a true proportion of 0.6, running 1000 tests leads to
rejection of the null hypothesis, as opposed to running 1000 tests when we assumed the sample
came from a true population proportion of 0.5. The difference is caused by the change in
the true proportion.
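The simulation described in parts d) through f) can be approximated in code. This is a rough sketch, not the applet used in the assignment: it assumes an upper-tailed test at the 0.05 level, and the function name, seed, and defaults are illustrative choices. The rejection counts will not match the 44 reported above exactly, since they depend on the random draws.

```python
import math
import random

def count_rejections(true_p, n=256, runs=1000, p0=0.5, alpha=0.05, seed=0):
    """Simulate `runs` samples of size n and count how often H0: p = p0 is rejected."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    rejections = 0
    for _ in range(runs):
        # Each game is a home win with probability true_p.
        wins = sum(rng.random() < true_p for _ in range(n))
        z = (wins / n - p0) / math.sqrt(p0 * (1 - p0) / n)
        p_value = 0.5 * math.erfc(z / math.sqrt(2))  # upper-tail p-value
        if p_value < alpha:
            rejections += 1
    return rejections

under_null = count_rejections(true_p=0.5)  # samples drawn with p = 0.5
under_alt = count_rejections(true_p=0.6)   # samples drawn with p = 0.6
print(under_null, under_alt)
```

When the true proportion is 0.5, the rejection rate should sit near the significance level; when it is 0.6, most runs should reject.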
Problem 2
Standard error of the sample, SE = sqrt(p_hat(1-p_hat)/n) = sqrt(0.6545(1-0.6545)/55) = 0.0641
95% Confidence interval = p_hat ± Margin of error = 0.6545 ± 0.1257 = (0.5289, 0.7802)
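The interval can be reproduced with a few lines of code. This sketch assumes the usual critical value of 1.96 for 95% confidence; the function name is illustrative, and the endpoints may differ from those above in the fourth decimal place because p_hat is rounded.

```python
import math

def proportion_ci(p_hat, n, z_star=1.96):
    """Large-sample confidence interval for a proportion: p_hat ± z_star * SE."""
    se = math.sqrt(p_hat * (1 - p_hat) / n)
    margin = z_star * se
    return p_hat - margin, p_hat + margin

lower, upper = proportion_ci(0.6545, 55)
print(f"({lower:.4f}, {upper:.4f})")
```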
b)
i. Null Hypothesis, H0: p = 0.588
Alternative Hypothesis, Ha: p ≠ 0.588
n(1-p) = 55*(1-0.588) = 22.66 ≥ 10
Assumption 3: We can assume that there are at least 55*10 = 550 males between the ages of
20 and 39.
n=55
z = (p_hat - p) / sqrt(p(1-p)/n) = (0.6545 - 0.588) / sqrt(0.588(1-0.588)/55) = 1.0027
vi. We do not reject the null hypothesis because the p-value is greater than the significance level of 0.05.
vii. We can conclude that the percentage of males between the ages of 20 and 39 who
c) Both results confirm that the percentage of males between the ages of 20 and
39 who consume the recommended daily allowance of calcium has not changed. The
confidence interval contains the proportion of 0.588, which indicates that the
proportion is not significantly different from 0.588. This is similar to the result obtained using z-
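The agreement between the test and the interval in Problem 2 can be sketched in code. This assumes a two-sided alternative (p differs from 0.588) at the 0.05 level and a 95% interval with critical value 1.96; the variable names are illustrative.

```python
import math

p_hat, p0, n = 0.6545, 0.588, 55

# Test: the standard error uses the null value p0.
z = (p_hat - p0) / math.sqrt(p0 * (1 - p0) / n)
p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided p-value (assumed alternative)
reject_by_test = p_value < 0.05

# Interval: the standard error uses the sample proportion p_hat.
margin = 1.96 * math.sqrt(p_hat * (1 - p_hat) / n)
reject_by_ci = not (p_hat - margin <= p0 <= p_hat + margin)

print(f"z = {z:.4f}, reject by test: {reject_by_test}, reject by CI: {reject_by_ci}")
```

The two approaches use slightly different standard errors, so they are not guaranteed to agree in every borderline case, but here both lead to the same decision.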
Problem 3
b)
Where p is the proportion that the new newspaper will capture in order to be financially
viable.
ii. The significance level, denoted by alpha, is 0.02 as stated in the problem.
n(1-p) = 560*(1-0.14) = 481.6 ≥ 10
Assumption 3: We can assume that there are at least 560*10 = 5600 Toronto residents.
p_hat =0.14107143
p=0.14
n=560
Standard error of the sample, SE = sqrt(p(1-p)/n) = sqrt(0.14(1-0.14)/560) = 0.0147
vi. We do not reject the null hypothesis because the p-value is greater than the assumed
significance level of 0.02.
vii. The claim that the new newspaper would have to capture at least 14% of the Toronto market in
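The numbers for Problem 3 can be checked as follows. Because the direction of the alternative hypothesis is not shown above, this sketch computes both one-sided tail probabilities instead of assuming one; the variable names are illustrative.

```python
import math

p_hat, p0, n, alpha = 0.14107143, 0.14, 560, 0.02

se = math.sqrt(p0 * (1 - p0) / n)  # standard error under the null value p0
z = (p_hat - p0) / se

upper_tail = 0.5 * math.erfc(z / math.sqrt(2))  # P(Z > z)
lower_tail = 1 - upper_tail                     # P(Z < z)

print(f"SE = {se:.4f}, z = {z:.4f}")
print(f"upper tail = {upper_tail:.4f}, lower tail = {lower_tail:.4f}")
```

Since p_hat is very close to 0.14, the z statistic is near zero and neither tail probability falls below 0.02, which is consistent with not rejecting the null hypothesis.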
Problem 4
a) Histogram plot
The above histogram plot shows that the Fairfax home closing prices are skewed to the right.
b) In this scenario, the large-sample condition is not met because the sample size is 10, which
is less than the large-sample threshold of 25 for the central limit theorem. Also, the
distribution is skewed.
c) In this case, the large-sample condition of the central limit theorem is met because the sample size is 36, which
P(x_bar > $700,000) = P(Z > ($700,000 - $510,000) / ($145,000 / sqrt(36))) = P(Z > 7.86)
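The standardization in part c) can be verified numerically. This sketch reads $510,000 as the population mean and $145,000 as the population standard deviation, which is how they appear in the formula above; the variable names are illustrative.

```python
import math

mu = 510_000         # assumed population mean closing price
sigma = 145_000      # assumed population standard deviation
n = 36               # sample size
threshold = 700_000  # sample-mean value of interest

se = sigma / math.sqrt(n)  # standard deviation of the sample mean
z = (threshold - mu) / se
tail = 0.5 * math.erfc(z / math.sqrt(2))  # P(Z > z)

print(f"z = {z:.2f}, P(Z > z) = {tail:.2e}")
```

A z-value this large lies far beyond the range of a standard normal table, so the probability is effectively zero.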