Business Statistics 5
Business Statistics 5
Business Statistics 5
Ikram-E-Khuda
Agenda Items
• Types of hypothesis
• Statistical hypothesis errors
• Z test and t test
• Procedure of hypothesis testing
• One tailed and two tailed tests
• One sample z test and t test
• Independent samples t test
• Practice problems
• Working on SPSS
Hypothesis Testing
• The main purpose of statistics is to test hypothesis
The power of any test of statistical significance is defined as the probability that it will reject a false null
hypothesis. Statistical power is inversely related to beta or the probability of making a Type II error. In
short, power = 1 – β.
Statistical Hypothesis Errors
Statistical Hypothesis Errors
Example:
Two drugs are to be compared in a clinical trial for use in treatment of disease X. Drug A is cheaper than Drug
B. Efficacy is measured using a continuous variable, Y, and .H0: μ1=μ2.
Type I error—occurs if the two drugs are truly equally effective, but we conclude that Drug B is better. The
consequence is financial loss.
Type II error—occurs if Drug B is truly more effective, but we fail to reject the null hypothesis and conclude
there is no significant evidence that the two drugs vary in effectiveness. What is the consequence in this case?
Statistical Hypothesis Errors
Z Test and T Test
• So far we have seen that a hypothesis refers to a supposition (𝐻𝑜 )which is
to be accepted or rejected.
• For example we might know that the average weight for white babies
in the US is 3,410 grams and wish to compare the average birth
weight of a sample of black babies to this value.
One Tail and Two Tail Tests
Upper-tailed, Lower-tailed, Two-tailed Tests
The research or alternative hypothesis can take one of three forms. An investigator
might believe that the parameter has increased, decreased or changed. For example, an
investigator might hypothesize:
H1: μ > μ 0 , where μ0 is the comparator or null value (e.g., μ0 =191 in an example about
accepted average weight in men ) and an increase is hypothesized - this type of test is
called an upper-tailed test;
H1: μ < μ0 , where a decrease is hypothesized and this is called a lower-tailed test; or
The exact form of the research hypothesis depends on the investigator's belief about the
parameter of interest is set up by the investigator before any data are collected.
Rejection Region Approach to Hypothesis Testing
Important Terms
• Rejection Region: The null hypothesis is rejected if the test statistic lies within this region
• Critical value is the boundary of the rejection region. This could be either 𝑧 ∗ 𝑜𝑟 𝑡 ∗
• The critical region encompasses those values of the test statistic that lead to a rejection of the null hypothesis
• The significance level or α can have values e.g. 0.1, 0.05, 0.01 for 90%, 95% 99% confidence intervals respectively.
• A value of α implies that the null hypothesis is rejected α % of the time when it is in fact true.
• The p-value is the probability of obtaining the test statistic ( z or t value) given that the null hypothesis is true.
• P values summarize statistical significance
• A small p-value is an indication that the null hypothesis is false.
• To make null hypothesis false, p value should be lesser than α
• P values can be calculated manually but we will use software to find them
All these quantitative metrics are defined before the hypothesis test is performed.
Important Terms
The following figures illustrate the rejection regions defined by the decision rule for upper-,
lower- and two-tailed Z tests with α=0.05. Notice that the rejection regions are in the upper,
lower and both tails of the curves, respectively. The decision rules are written below each figure.
Procedure for Hypothesis Testing
The procedure for hypothesis testing is based on the ideas described above.
Specifically, we set up competing hypotheses, select a random sample from the
population of interest and compute summary statistics. We then determine whether
the sample data supports the null or alternative hypotheses. The procedure can be
broken down into the following five steps.
ഥ.
Step 2. Calculate the sample mean ; 𝒙
σ𝒏
𝒊=𝟏 𝒙𝒊
ഥ=
𝒙 𝒏
Procedure for Hypothesis Testing
• Step 3. Select the appropriate test statistic.
The test statistic is a single number that summarizes the
sample information.
An example of a test statistic is the z statistic which is
used when sample size is greater than 30 and population
standard deviation is known.
It is computed as follows:
ҧ 𝑜
𝑥−𝜇
𝑧=
𝜎/ 𝑛
The final conclusion is made by comparing the test statistic (which is a summary
of the information observed in the sample) to the decision rule. The final
conclusion will be either to reject the null hypothesis (because the sample data
are very unlikely if the null hypothesis is true) or not to reject the null hypothesis
(because the sample data are not very unlikely).
• The general form for a confidence interval around the mean, if 𝜎 is known, is
∗
𝜎
𝑥ഥ ± 𝑧
𝑛
• If the computed values in any of the above equations lie in the rejection
regions defined by the respective confidence intervals than null hypothesis
can be rejected otherwise not.
Summary of Comparison Means Statistical Tests
References
• http://www.probabilityformula.org/central-limit-theorem.html#
• https://www.geoib.com/confidence-level--margin-of-error.html
• https://onlinecourses.science.psu.edu/stat504/node/19/
• http://www.statisticshowto.com/probability-and-statistics/hypothesis-
testing/
• http://blog.minitab.com/blog/adventures-in-statistics-2/understanding-t-
tests-1-sample-2-sample-and-paired-t-tests
• https://keydifferences.com/difference-between-t-test-and-z-test.html
• https://keydifferences.com/difference-between-t-test-and-z-test.html
• http://www.six-sigma-material.com/Hypothesis-Testing.html