Chi Square Distribution
Chi Square Distribution
Chi Square Distribution
119211078
Accounting IC
CHI SQUARE DISTRIBUTION
Information:
χ2 = Chi Square
fo = Frequency of observation
fh = Frequency expected
The sample size must be large enough to convince us that there is a similarity between
the theoretical distribution and the Chi Square sampling distribution.
Observations must be independent (unpaired). This means that one subject's answer has
no effect on the answers of other subjects or that one subject is only used once in the
analysis.
Chi Square test can only be used on descriptive data (frequency data or categorical data)
or continuous data that has been grouped into categories.
The number of expected frequencies must be equal to the number of observed
frequencies.
At degrees of freedom equal to 1, there cannot be an extremely small expected value.
In general, if the expected value lies in one cell is too small (<5), Chi Square should not
be used because it can cause an over estimate so that many hypotheses are rejected except with
the correction from Yates. If it is not big enough, then having an expectation value smaller than 5
will not have much effect on the desired result. In the Chi Square test with many categories, if
there is more than one expected value less than 5, then the expected values can be combined with
the consequence that the number of categories will decrease and the information obtained is also
reduced.
EXAMPLE PROBLEM AND SOLUTION CHI SQUARE
1. Researchers have conducted a survey of 1600 coffee drinkers asking how much coffee
they drink in order to confirm previous studies. Previous studies have indicated that 72%
of Americans drink coffee. The results of previous studies (left) and the survey (right) are
below. At α = 0.05, is there enough evidence to conclude that the distributions are the
same?
Answer:
1) The null hypothesis H0:the population frequencies are equal to the expected
frequencies (to be calculated below).
2) The alternative hypothesis, Ha: the null hypothesis is false.
3) α = 0.05.
4) The degrees of freedom: k − 1 = 4 − 1 = 3.
5) The test statistic can be calculated using a table: