Intro of Hypothesis Testing
Intro of Hypothesis Testing
TESTING
THE statistical variables
Some criteria in the selection of variables
1. appropriateness
2. clarity
3. measurability
4. comparability
Two types of variables
1.Dependent variable
- a factor, property, characteristics or attribute that is
measured and made the object of analysis.
Descriptive Statistics
Inferential Statistics
Descriptive Statistics
> <
is greater than is less than
is above is below
is higher than is lower than
is longer than is shorter than
is bigger than is smaller than
is increased is decreased or
reduced from
Hypothesis Testing Common Phrases
≥
is greater than or equal to
is at least
is not less than
≤
is less than or equal to
is at most
is not more than
Hypothesis Testing Common Phrases
=
is equal to
is exactly the same as/ is the same as
has not changed from
≠
is not equal to
is different from
has changed from
is not the same as
Errors in decision making
Possible situations in testing a statistical hypothesis
Statistical
H0 is TRUE H0 is FALSE
Decision
Type II error
Accept H0 Correct Decision
(β)
Correct
Reject H0 Type I error (α)
Decision
Level of Significance
hypothesis)
Not Significant
Greater than 0.05
(Insufficient evidence against the null hypothesis)
Definition of Terms
Statistical hypothesis – an assertion or conjecture
concerning the population or more populations.
Null hypothesis – the hypothesis or assumption about the
population parameter we wish to test.
Alternative hypothesis – the conclusion we accept when
the data fail to support the null hypothesis.
Significance level - a value indicating the percentage of
sample values that is outside a certain limits, assuming the
null hypothesis is correct; i.e. the probability of rejecting
the null hypothesis.
Definition of Terms
Power of the test – the probability of rejecting the null
hypothesis when it is false, i.e. it is measure of how well
the hypothesis test is working.
Test statistic – a statistic used in deciding whether to
reject or to accept the null hypothesis.
Confidence level – the probability that the parameter
tested is within the specified values in the hypothesis.
Critical region or rejection region – part of the set of all
possible values of a sample statistic for which the
hypothesis to be tested is rejected.
Definition of Terms
Critical value – the last number observed in passing from
the acceptance region into the rejection region.
Dependent samples – samples drawn from two
populations in such a way that the elements were not
chosen independently of one another, in order to allow a
more precise analysis or to control for some extraneous
factors.
One-tailed test – a hypothesis test in which there is only
one rejection regions, i.e. we are concerned only with
whether the observed values deviates from the hypothesis
value in one direction.
Definition of Terms
Problem 1
The average baptismal cost includes 50 guests. A
random sample of 32 baptismal during the past year
in the National Capital Region had a mean of 53
guests and a standard deviation of 10. Is there
sufficient evidence at the 0.05 level of significance
that the average number of guests differs from the
national average?
Solution:
Given: sample mean = 53 sample sd = 10
population mean = 50 n = 32
Step 1. Ho : µ = 50
Ha : µ ≠ 50 (claim)
Step 2. α = 0.05
Step 3. The z critical value is ± 1. 960
(it is two-tailed test, since it does not mention about the
direction of the distribution).
Step 4. Computation
one sample z test value
z = sample mean – population mean
sample sd/
= 53 – 50 (
10
= 1.697
After 11 15 10 14 14 13 11 12
Answer
Step
1.
Ho: There is no significant difference in the performance
level of employees before and after the seminar-workshop.
()
14 11 -3 9
13 15 2 4
9 10 1 1
9 14 5 25
10 14 4 16
10 13 3 9
12 11 -1 1
7 12 5 25
ƩD = 16 Ʃ = 90
Continuation…
Determine
the mean of the differences.
= =2
Determine the standard deviation of the differences.
= = 2.878
Determine the test value
t = = = 1.966
Step 6. Decision Rule
Since the computed t value of 1.966 is less than the t
critical of 2. 365 at level of significance of 0.05, the
statistical decision is not to reject the null hypothesis.
Step 7. Conclusion.
Since we do not reject the Ho, we can conclude that
there is no significant difference in the performance level of
employees before and after the seminar-workshop.
Example 3
The operations manager of sales company wants to see
whether there is a significant difference in the ages of male
and female customers. He selects a sample of 35 samples for
each group. The ages are shown here. At α = 0.05, decide if
there is enough evidence to reject the claim of no difference
in the ages of the two groups.
Male Female
= 27.3 = 28
s = 2.4 s = 3.1
n = 35 n = 35
Answer
Step
1.
Ho: There is no significant difference between the average
ages of male and female customers. ()
Ha: There is a significant difference between the average
ages of male and female customers. ()
Step 2. The level of significance is α = 0.05.
Step 7. Conclusion.
There is no evidence of significant differences between
the averages ages of male and female customers.
Some Statistical Tools
Purpose of Test Parametric Nonparametric
(Random (Non-random
sampling) sampling)
Significant difference T –test (Independent Mann – Whitney U test
between 2 groups samples)
Paired samples Paired t – test Wilcoxon Signed -Rank
Test
Degree of association Pearson’s r (Pearson’s Spearman’s rho (spearman
between the two variables product-moment rank correlation)
correlation coefficient)
Difference between chi-square test Chi-square goodness-of-fit
categorical variables (Pearson’s chi-square test) test
Significant difference One – way Analysis of Kruskal Wallis Test
between 3 or more groups Variance (ANOVA)