LEC09
LEC09
LEC09
9
Test of significance – Basic concepts – null hypothesis – alternative hypothesis – level of
significance – Standard error and its importance – steps in testing
Test of Significance
Objective
To familiarize the students about the concept of testing of any hypothesis, the different
terminologies used in testing and application of different types of tests.
Sampling Distribution
By drawing all possible samples of same size from a population we can calculate the
statistic, for example, for all samples. Based on this we can construct a frequency distribution
and the probability distribution of . Such probability distribution of a statistic is known a
sampling distribution of that statistic. In practice, the sampling distributions can be obtained
theoretically from the properties of random samples.
Standard Error
As in the case of population distribution the characteristic of the sampling distributions
are also described by some measurements like mean & standard deviation. Since a statistic is a
random variable, the mean of the sampling distribution of a statistic is called the expected valued
of the statistic. The SD of the sampling distributions of the statistic is called standard error of the
Statistic. The square of the standard error is known as the variance of the statistic. It may be
noted that the standard deviation is for units whereas the standard error is for the statistic.
The relationship between type I & type II errors is that if one increases the other will decrease.
The probability of type I error is denoted by α. The probability of type II error is denoted by β.
The correct decision of rejecting the null hypothesis when it is false is known as the power of the
test. The probability of the power is given by 1-β.
Critical Region
The testing of statistical hypothesis involves the choice of a region on the sampling
distribution of statistic. If the statistic falls within this region, the null hypothesis is rejected:
otherwise it is accepted. This region is called critical region.
Let the null hypothesis be Ho: µ1 = µ2 and its alternative be H1: µ1 ≠ µ2. Suppose Ho is
true. Based on sample data it may be observed that statistic follows a normal
distribution given by
We know that 95% values of the statistic from repeated samples will fall in the range
±1.96 times SE . This is represented by a diagram.
Region of Region of
rejection rejection
Region of acceptance
-1.96 0 1.96
The border line value ±1.96 is the critical value or tabular value of Z. The area beyond
the critical values (shaded area) is known as critical region or region of rejection. The remaining
area is known as region of acceptance.
If the statistic falls in the critical region we reject the null hypothesis and, if it falls in the
region of acceptance we accept the null hypothesis.
In other words if the calculated value of a test statistic (Z, t, χ2 etc) is more than the
critical value in magnitude it is said to be significant and we reject Ho and otherwise we accept
Ho. The critical values for the t and are given in the form of readymade tables. Since the
criticval values are given in the form of table it is commonly referred as table value. The table
value depends on the level of significance and degrees of freedom.
Example: Z cal < Z tab -We accept the Ho and conclude that there is no significant difference
between the means
Test Statistic
The sampling distribution of a statistic like Z, t, & χ2 are known as test statistic.
Generally, in case of quantitative data
Note
The choice of the test statistic depends on the nature of the variable (ie) qualitative or
quantitative, the statistic involved (i.e) mean or variance and the sample size, (i.e) large or small.
Level of Significance
The probability that the statistic will fall in the critical region is . This α is
nothing but the probability of committing type I error. Technically the probability of committing
type I error is known as level of Significance.
One and two tailed test
The nature of the alternative hypothesis determines the position of the critical region. For
example, if H1 is µ1≠µ2 it does not show the direction and hence the critical region falls on
either end of the sampling distribution. If H1 is µ1 < µ2 or µ1 > µ2 the direction is known. In the
first case the critical region falls on the left of the distribution whereas in the second case it falls
on the right side.
One tailed test – When the critical region falls on one end of the sampling distribution, it is
called one tailed test.
Two tailed test – When the critical region falls on either end of the sampling distribution, it is
called two tailed test.
For example, consider the mean yield of new paddy variety (µ1) is compared with that of a
ruling variety (µ2). Unless the new variety is more promising that the ruling variety in terms of
yield we are not going to accept the new variety. In this case H1 : µ1 > µ2 for which one tailed
test is used. If both the varieties are new our interest will be to choose the best of the two. In this
case H1: µ1 ≠ µ2 for which we use two tailed test.
Degrees of freedom
The number of degrees of freedom is the number of observations that are free to vary
after certain restriction have been placed on the data. If there are n observations in the sample,
for each restriction imposed upon the original observation the number of degrees of freedom is
reduced by one.
The number of independent variables which make up the statistic is known as the degrees
of freedom and is denoted by (Nu)
Steps in testing of hypothesis
The process of testing a hypothesis involves following steps.
1. Formulation of null & alternative hypothesis.
2. Specification of level of significance.
3. Selection of test statistic and its computation.
4. Finding out the critical value from tables using the level of significance, sampling
distribution and its degrees of freedom.
5. Determination of the significance of the test statistic.
6. Decision about the null hypothesis based on the significance of the test statistic.
7. Writing the conclusion in such a way that it answers the question on hand.
Large sample theory
The sample size n is greater than 30 (n≥30) it is known as large sample. For large
samples the sampling distributions of statistic are normal(Z test). A study of sampling
distribution of statistic for large sample is known as large sample theory.
samples the sampling distributions are t, F and χ2 distribution. A study of sampling distributions
Test of Significance
The theory of test of significance consists of various test statistic. The theory had been developed
under two broad heading
1. Test of significance for large sample
Large sample test or Asymptotic test or Z test (n≥30)
2. Test of significance for small samples(n<30)
Small sample test or Exact test-t, F and χ2.
It may be noted that small sample tests can be used in case of large samples also.
Large sample test
Large sample test are
Ze = 1.65 at 5% level
2.33 at 1% level One tailed test
6. Inference
If the observed value of the test statistic Zo exceeds the table value Ze we reject the Null
Hypothesis Ho otherwise accept it.
4. Level of Significance
The level may be fixed at either 5% or 1%
5. Expected vale
The expected value is given by
Ze = 1.96 at 5% level
2.58 at 1% level Two tailed test
Ze = 1.65 at 5% level
2.33 at 1% level One tailed test
6. Inference
If the observed value of the test statistic Z exceeds the table value Ze we may reject the Null
Hypothesis Ho otherwise accept it.
2. Alternative Hypothesis(H1)
H1 : There is significance difference between the sample mean
ie., µ≠µo or µ>µo or µ<µo
3. Test statistic
4. Level of Significance
The level may be fixed at either 5% or 1%
5.Expected vale
The expected value is given by
Ze = 1.96 at 5% level
2.58 at 1% level Two tailed test
Ze = 1.65 at 5% level
2.33 at 1% level One tailed test
6. Inference
If the observed value of the test statistic Z exceeds the table value Ze we may reject the Null
Hypothesis Ho otherwise accept it.
Test for equality of two Means
Given two sets of sample data of large size n1 and n2 from variables. We may examine
whether the two samples come from the populations having the same mean. We may proceed as
follows
When ,
where
4. Level of Significance
The level may be fixed at either 5% or 1%
5. Expected vale
The expected value is given by
Ze = 1.96 at 5% level
2.58 at 1% level Two tailed test
Ze = 1.65 at 5% level
2.33 at 1% level One tailed test
6. Inference
If the observed value of the test statistic Z exceeds the table value Ze we may reject the
Null Hypothesis Ho otherwise accept it.
Questions
1. A hypothesis may be classified as
(a)Simple (b) Composite
(c)Null (d) All the above
3. Large sample test can be applied when the sample size exceeds 30.
Ans: True
4. If the calculated test statistic is greater than the critical value, the null hypothesis is rejected.
Ans: True
Ans: True
6. If the alternative hypothesis is µ1≠ µ2 then the test is known as one tailed test.
Ans: False