Sta301 Lec40
Sta301 Lec40
Lecture No. 40
of the course on
Statistics and Probability
by
Miss Saleha Naghmi Habibullah
IN THE LAST LECTURE,
YOU LEARNT
s
x t / 2 n 1
n
x
where x is the sample mean
n
x x 2
s is the sample
n 1
standard deviation n = sample size
and t (/2,) is found by looking in the
t-table under the appropriate value of
against = n – 1;
/2 = 0.005 if we desire
99% confidence:
0.005 0.005
0.99
t 0.005 0 t 0.005
/2 = 0.025 if we desire
95% confidence:
0.025 0.025
0.95
t 0.025 0 t 0.025
/2 = 0.05 if we desire
90% confidence:
0.05 0.05
0.90
t 0.05 0 t 0.05
Next, we discuss hypothesis - testing
regarding the mean of a normally distributed
population for which 2 is unknown and the
sample size is small (n < 30).
This procedure is illustrated through the
following example:
EXAMPLE-1
Just as human height is approximately
normally distributed, we can expect the heights
of animals of any particular species to be
normally distributed.
Suppose that, for the past five years, a
zoologist has been involved in an extensive
research-project regarding the animals of one
particular species.
Based on his research-experience, the
zoologist believes that the average height of
the animals of this particular species is 66
centimeters.
He selects a random sample of ten
animals of this particular species, and, upon
measuring their heights, the following data
is obtained:
63, 63, 66, 67, 68, 69, 70, 70, 71, 71
In the light of these data, test the
hypothesis that the mean height of the
animals of this particular species is 66
centimeters.
SOLUTION
Hypothesis-Testing Procedure:
Now, X
i 1
i
678
X 67.8 cm
n 10
n n
2
And (Xi X ) 2
n Xi
1 i 1
2
s i 1
n 1
n 1 i 1
X i
2
n
1
46050 45968.4 9.0667
9
So,
s 9.0667 3.01 cm
x 0
t
s n
67.8 66
3.01 10
1.89
V) Critical Region:
Since this is a two-tailed test, hence the
critical region is given by
| t | > t0.025(9) = 2.262.
-2.262 0 2.262
REJECT ACCEPT REJECT
vi) Conclusion:
Since the computed value of t = 1.89
does not fall in the critical region, we
therefore
do not reject H0 and may conclude that the
mean height of the animals of this particular
species is 66 centimeters.
Next, we consider the construction of
the confidence interval for 1-2 in that
situation when we are drawing small samples
from two normally distributed populations
having unknown but equal variances:
We illustrate this concept with the help
of the following example:
EXAMPLE
1 1
X 1 X 2 t 2n1 n2 2 s p
n1 n2
Calculations:
sp
10 9 2
1.1503 0.4674
17
0.31
The confidence interval is
3.465 4.064
2.898 0.31 1 1
0.5990.411
10 9
i.e. the C.I. is :
1.010 .188
1 2
With 99% confidence, the record
company executive can conclude that the true
difference in population average length of play
is between –1.01 minutes and –.188 minute.
Zero is not in this interval, so she could
conclude that there is a significant difference
in the average length of play time between
semi-classical music and pop music songs’
recordings.
Examination of the sample results
indicates that pop music songs’ recordings
are longer.
The result and conclusion obtained
above can be used in the tactical and
strategic planning for programming,
marketing, and production of recordings.
Next, we discuss the application of the
t-distribution for testing the equality of two
population means:
EXAMPLE:
H0 : 1 - 2 > 1,
and
H 1 : 1 - 2 < 1
ii) We set the significance level at = 0.05.
iii) The test-statistic, if H0 is true, is
t
X 1 X 2 1 2
1 1
sp
n1 n2
x 80 . 92
x 1 6 . 74 ,
1 n 12
1
x 84 . 25
x 2 5 . 62 ,
2 n 15
2
And
x1
2
x1 x1
2 2
x1
n1
561.6402
80.92
2
12
561 .6402 545 .6705
15 .9697
Also
x2 2
x 2 x 2
2 2
x2
n2
478.9779
84.25
2
15
478.9779 473.2042
5.7737
x x 2 x x 2
1 1 2 2
Now s 2
p n n 2
1 2
5 .9697 5 .7737
12 15 2
= 0.8697,
so that
s 0 . 8697 0 . 93 ,
p
Hence, the computed value of our test
statistic comes out to be
6 . 74 5 . 62 1 0 . 12
t 0 . 33
1 1 0 . 36
0 . 93
12 15
v) Critical Region :
Since this is a left-tailed test, therefore the
critical region is given by
t < -t0.05(25)
i.e. t < -1.708
vi) Conclusion:
Since the computed value of t = 0.33
falls in the acceptance region, therefore we
accept H0.
We may conclude that the mean rubber
percentage of the Aberrants is at least 1
percent more than the mean rubber
percentage of Offtypes.
Next, we consider the application of
the
t-distribution in the case of paired
observations:
In testing hypotheses about two means,
until now we have used independent
samples, but there are many situations in
which the two samples are not independent.
This happen when the observation are found
in pairs such that the two observations of a
pair are related to each other.
Pairing occurs either naturally or by
design.
Natural pairing occurs whenever
measurement is taken on the same unit or
individual at two different times. For
example, suppose ten young recruits are
given a strenuous physical training
programme by the Army.
Their weights are recorded before they
begin and after they complete the training. The
two observations obtained for each recruit i.e.
the before-and-after measurement constitute
natural pairing.