SME11e PPT ch11std
SME11e PPT ch11std
SME11e PPT ch11std
Andrew Stephenson
Georgia Gwinnett College
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Chapter 11
11.2
Introduction to Hypothesis Testing
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Introduction
11.3
• In chap. 10, we introduced estimation(추정) and showed how it is
used.
• Hypothesis testing(가설검정) is the second form of statistical
inference.
• The purpose of hypothesis testing is to determine whether
enough statistical evidences exists to enable us to conclude
that a belief or hypothesis about a parameter is supported by
the data.
• Hypothesis testing has greater applicability in business and
economics, as well as many other fields.
• This chapter will lay the foundation upon which the rest of the
book is based. It will help you to understand an inferential
statistics.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Statistical Inference
11.4
• In many ways the formal procedure for hypothesis testing is similar
to the scientific method.
• The scientist observes nature, formulates a theory, and then
tests this theory against observation.
• In our context scientist poses a hypothesis concerning one or
more population parameters that they equal specified values.
She then samples the population and compares her observations
with the hypothesis.
• If the observations disagree with the hypothesis, the scientist
rejects it.
• If not, the scientist concludes either that the hypothesis is true
or that the sample did not detect the difference between the
real and hypothesized values of the population parameter.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-1 Concepts of Hypothesis Testing
11.5
• To understand the concepts we’ll start with an example of
nonstatistical hypothesis testing.
• When a person is accused of a crime, he or she faces a trial(재판).
• When the prosecution(검사) presents its case, a jury(배심원) must
make a decision on the basis of the evidence presented. In
fact, the jury conducts a test of hypothesis.
• There are two hypotheses that are tested:
• The null hypothesis (귀무가설) 𝐻0 : The defendant is innocent
(피고는 무죄이다).
* 𝐻0 : 𝐻 − 𝑛𝑜𝑢𝑔ℎ𝑡
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-1 Concepts of Hypothesis Testing
11.6
• The jury does not know which hypothesis is true. The members
must make a decision on the basis of evidence presented by
both the prosecution and the defense.
• In the language of statistics convicting the defendant* is
equivalent to rejecting the null hypothesis in favor of the
alternative hypothesis.**
• That is, the jury is saying that there is enough evidence to
conclude that the defendant is guilty (i.e., there is enough
evidence to support the alternative hypothesis).
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-1 Concepts of Hypothesis Testing
11.7
• If the jury acquits the defendant*, it is stating that not rejecting the
null hypothesis**, which means that the jury decided that there
was not enough evidence to conclude that the defendant was
guilty.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-1 Concepts of Hypothesis Testing
11.8
• There are two possible errors:
• A Type I error (제1종 오류) occurs when we reject a true null
hypothesis. That is, a Type I error occurs when the jury
convicts an innocent person.
• A Type II error (제2종 오류) occurs when we don’t reject a false
null hypothesis. That occurs when a guilty defendant is
acquitted*.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-1 Concepts of Hypothesis Testing
11.9
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-1 Concepts of Hypothesis Testing
11.10
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-1 Concepts of Hypothesis Testing
11.11
• In our justice system, Type I errors are regarded as more serious.
We try to avoid convicting innocent people. We are more willing
to acquit guilty people.
• As a consequence, the system is set up so that the probability
of a Type I error is small.
• This is arranged by placing the burden of proof (입증책임) on the
prosecution(the prosecution must prove guilt.) and by having
judges instruct the jury to find the defendant guilty only if
there is “evidence beyond a reasonable doubt(합리적인 의심을
넘어서는 증거가 있을 때만).” (= “sufficient evidence”)
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-1 Concepts of Hypothesis Testing
11.12
• The consequence of this arrangement is that the probability of
acquitting guilty people is relatively large.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-1 Concepts of Hypothesis Testing
11.13
• The critical concepts in hypothesis testing follow.
1. There are two hypotheses, the null and the
alternative(research) hypotheses.
2. The procedure begins with the assumption that the null
hypothesis is true.
3. The goal is to determine whether there is enough evidence
to infer that the alternative hypothesis is true.
4. There are two possible decisions:
• Conclude that there is enough evidence to support the
alternative hypothesis.
• Conclude that there is not enough evidence to support
the alternative hypothesis.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-1 Concepts of Hypothesis Testing
11.14
5. Two possible errors can be made.
Type I error: Reject a true null hypothesis
Type II error: Do not reject a false null hypothesis.
𝑃 Type I error = 𝛼
𝑃 Type II error = 𝛽
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-1 Concepts of Hypothesis Testing
11.15
• In statistics we frequently test hypotheses about parameters.
• The hypotheses we test are generated by questions that
managers need to answer.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-1 Concepts of Hypothesis Testing
11.16
• Consider Example 10.1 (mean demand for computers during
assembly lead time) again.
• Rather than estimate the mean demand, our operations
manager wants to know whether the mean is different from 350
units, which may be the point at which the current inventory
policy needs to be altered.
• We can rephrase this request into a test of the hypothesis:
𝐻0 : 𝜇 = 350
• Thus, our alternative(research) hypothesis becomes:
𝐻1 : 𝜇 ≠ 350
⇒ Is there enough evidence to conclude that 𝜇 is not
equal to 350?
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-1 Concepts of Hypothesis Testing
11.17
• The testing procedure begins with the assumption that the null
hypothesis is true.
• Thus, until we have further statistical evidence, we will
assume:
𝐻0 : 𝜇 = 350 (assumed to be TRUE)
• The goal of the process is to determine whether there is enough
evidence to infer that the alternative hypothesis is true.
• That is, is there sufficient statistical information to determine if
this statement is true?
𝐻1 : 𝜇 ≠ 350
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-1 Concepts of Hypothesis Testing
11.18
• Now suppose that the current inventory policy is based on an
analysis that the actual mean demand during lead time is 350.
• After a vigorous advertising campaign, the manager
suspects that there has been an increase in demand and
thus an increase in mean demand during lead time.
• To test whether there is an evidence of increase, the
manager would specify the alternative hypothesis as
𝐻1 : 𝜇 > 350
• Because the manager knew that the mean was (and
maybe still is) 350, the null hypothesis would state
𝐻0 : 𝜇 = 350 (assumed to be TRUE)
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-1 Concepts of Hypothesis Testing
11.19
• Further suppose that the manager does not know the actual mean
demand during lead time, but the current inventory policy is
based on the assumption that the mean is less than or equal to
350.
• If the advertising campaign increases the mean to a
quantity larger than 350, the hypotheses become
𝐻0 : 𝜇 ≤ 350
𝐻1 : 𝜇 > 350
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-1 Concepts of Hypothesis Testing
11.20
• If there is enough evidence to conclude that the alternative
hypothesis is true when we assume that the mean is equal to 350,
we would certainly draw the same conclusion when we assume
that the mean value is less than 350.
• As a result, the null hypothesis will always state that the parameter
equals the value specified in the alternative hypothesis. (Use the
equal sign in the 𝐻0 .)
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-1 Concepts of Hypothesis Testing
11.21
• The hypotheses are often set up to reflect a manager’s decision
problem wherein the null hypothesis represents the status quo
(현재상황).
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-1 Concepts of Hypothesis Testing
11.22
• Once the null and alternative hypotheses are stated, the next step
is to randomly sample the population and calculate a test statistic
(검정통계량) (in this example, the sample mean).
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-1 Concepts of Hypothesis Testing
11.23
• For example, if we’re trying to decide whether the mean is not
equal to 350, a large value of 𝑋ത (say, 600) would provide
enough evidence.
• If 𝑋ത is close to 350 (say, 355) we would say that this does not
provide a great deal of evidence to infer that the population
mean is greater than 350.
• In the absence of sufficient evidence, we do not reject the
null hypothesis in favor of the alternative. (In then absence of
sufficient evidence of guilt, a jury finds the defendant not
guilty.)
• Sufficient evidence = “evidence beyond a reasonable doubt”
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-1 Concepts of Hypothesis Testing
11.24
• Two possible errors can be made in any test:
• A Type I error occurs when we reject a true null hypothesis
and
• A Type II error occurs when we don’t reject a false null
hypothesis.
• There are probabilities associated with each type of error:
𝑃 Type I error = 𝛼
𝑃 Type II error = 𝛽
• 𝛼 is called the significance level (유의수준).
• (1 − 𝛼) is called the confidence level (신뢰수준)
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-2 Testing the Population Mean with
Known 𝜎 11.25
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Example 11.1 Department Store’s New Billing
System 11.26
• The manager of a department store is thinking about establishing a new
billing system for the store's credit customers.
• She determines that the new system will be cost-effective only if the
mean monthly account is more than $170. A random sample of 400
monthly accounts is drawn, for which the sample mean is $178 (Data
Xm 11-01).
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Example 11.1 New Billing System
11.27
IDENTIFY
• The system will be cost effective if the mean account balance for all
customers is greater than $170.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Example 11.1 New Billing System
11.28
IDENTIFY
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Example 11.1 New Billing System(새로운 청구 제도)
COMPUTE 11.29
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11.2a Rejection region (기각역)
COMPUTE 11.30
• Definition 11.1 (Rejection Region)
• The rejection region is a range of values such that if the test
statistic falls into that range, we decide to reject the null
hypothesis in favor of the alternative hypothesis. (대립가설을
선호하여 귀무가설을 기각)
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11.2a Rejection region (기각역)
COMPUTE 11.31
• It seems reasonable to reject the null hypothesis in favor of the
alternative if the value of the sample mean is large relative to 170.
• If the calculated sample were 500, we would reject 𝐻0 .
• If values of 𝑥ҧ were 171, we would not reject it.
• Because Type I error is defined as rejecting a true null hypothesis,
𝛼 = 𝑃 Type 𝐼 error
= 𝑃(reject 𝐻0 given that 𝐻0 is true)
= 𝑃(Value of test statistic is in RR when 𝐻0 is true)
= 𝑃(𝑥ҧ > 𝑥ҧ𝐿 ).
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11.2a Rejection region (기각역)
COMPUTE 11.32
• Since the sampling distribution of 𝑥ҧ is normal or approximately
normal (Section 9-1), with mean 𝜇 and standard deviation 𝜎/ 𝑛,
we can standardize 𝑥ҧ and obtain the following probability:
ҧ
𝑥−𝜇 𝑥ҧ 𝐿 −𝜇 𝑥ҧ 𝐿 −𝜇
𝑃 𝑥ҧ > 𝑥ҧ𝐿 =𝑃 > =𝑃 𝑍> =𝛼
𝜎/ 𝑛 𝜎/ 𝑛 𝜎/ 𝑛
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11.2a Rejection region (기각역)
COMPUTE 11.33
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-2b Standardized Test Statistic
• An easier method is to use the standardized test statistic: 11.34
ҧ
𝑥−𝜇
𝑧=
𝜎/ 𝑛
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-2b Standardized Test Statistic
• When a null hypothesis is rejected, the test is said to be 11.35
statistically significant at whatever significance level the test was
conducted!!
• The test was significant at the 5% significance level. (5%
유의수준에서 통계적으로 유의하다.)
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Example 11.1 The Big Picture Again
11.36
Fig. 11.2
Sampling Distribution of 𝑋ത 𝑥ҧ 𝐿 = 175.34
for Example 11.1 ҧ 178
𝑥=
Z
Fig. 11.3
Sampling Distribution of Z for
Example 11.1
.05
H0: 𝜇 = 170
H1: 𝜇 > 170
Reject H0 in favor of
Z.05=1.645 z = 2.46
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-2c p-Value of a Test (검정의 p값)
11.37
• Definition 11.2 (The p-value of a test)
• The p-value of a test is the probability of observing a test statistic at
least as extreme as the one computed given that the null hypothesis
is true.
(검정의 p값은 귀무가설이 옳다는 가정하에서 계산되는 검정 통계량의 값보다 더 큰
검정 통계량의 값이 관측되는 확률이다.)
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
p-Value of a Test
11.38
p-value =.0069
z =2.46
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-2d Interpreting the p-value
11.39
• Remember that the sampling distribution allows us to make
probability statements about a sample statistic assuming
knowledge of the population parameter.
• Thus, the probability of observing a sample mean at least as
large as 178 from a population whose mean is 170 is .0069,
which is very small.
• In other words, we have just observed an unlikely event, an
event so unlikely that we seriously doubt the assumption that
began the process—that the null hypothesis is true.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-2d Interpreting the p-value
11.40
• Do not interpret that the p-value is the probability that the null
hypothesis is true.
• You cannot make a probability statement about a
parameter. It is not a random variable.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-2d Interpreting the p-value
11.41
Table 11.2 Test Statistics and p-values for Example 11.1
Test Statistic
ഥ−𝝁
𝒙 ഥ − 𝟏𝟕𝟎
𝒙
𝒛= =
Sample mean 𝒙
ഥ 𝝈/ 𝒏 𝟔𝟓/ 𝟒𝟎𝟎 𝒑-value
170 0 .5000
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Figure 11.5 p-Values for Example 11.1
11.42
z = 0.62; 𝒙
ഥ = 𝟏𝟕𝟐
z =1.856; 𝒙
ഥ = 𝟏𝟕𝟔
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-2d Interpreting the p-value
11.44
• This raises the question: How small does the p-value have to be to
infer that the alternative hypothesis is true?
• In general, the answer depends on a number of factors,
including the cost of making Type I and Type II errors. In
Example 11.1, a Type I error would occur if the manager
adopts the new billing system when it is not cost-effective.
• If the cost of this error is high, we attempt to minimize its
probability.
• In the rejection region method, we do so by setting the
significance level quite low—say 1%.
• Using p-value method, we would insist that the p-value be
quite small, providing sufficient evidence to infer that the
mean monthly account is greater than $170 before
proceeding with the new billing system.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-2e Describing the p-value (p-값의 해석)
11.45
• p-values can be translated using the following descriptive terms.
• If the p-value is less than 1%, we say that there is overwhelming
evidence to infer that the alternative hypothesis is true. We
also say that the test is highly significant*.
• If the p-value is between 1% and 5%, there is strong evidence
to infer that the alternative hypothesis is true. The result is
deemed to be significant**.
• If the p-value is between 5% and 10%, there is weak evidence
that supports the alternative hypothesis. We say that the result
is weakly significant***.
• If the p-value exceeds 10%, there is little to no evidence that
supports the alternative hypothesis.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-2e Describing the p-value
11.46
Figure 11.6 Describing p-values
Overwhelming Evidence
(Highly Significant)
Strong Evidence Weak Evidence No Evidence
(Strongly Significant)(Weakly Significant) (Not Statistically
Significant)
0 0.05 1.0
0.01 0.10
p=.0069
• We observe a p-value of .0069, hence there is overwhelming
evidence to support 𝐻1 : 𝜇 > 170.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-2f The p-value and Rejection Region
Method 11.47
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-2g Solving Manually and Using Excel
11.48
COMPUTE: Excel Workbook
• Instructions
• Type or import the data into one column. (Open Xm11-01.) In
any empty cell, calculate the sample mean
(=AVERAGE(A1:A401).)
• Open the Test Statistics in Excel Workbooks folder and click the
z-Test Mean tab. In Cell B3, type or copy the value of the
sample mean. In cells B4-B7, type the value of 𝜎(65), the value
of n(400), the value of 𝜇 under the null hypothesis(170), and the
value of 𝛼 .05 , respectively.
• The spreadsheet reports the value of the test statistic, 𝑧 = 2.46.
The p-value of the test is .0069. Excel reports this probability as
𝑃 𝑍 ≤ 𝑧 𝑜𝑛𝑒 𝑡𝑎𝑖𝑙 . (see appendix)
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Example 11.1
11.49
COMPUTE: Excel Workbook
z-Test of a Mean
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-2h Interpreting the Results of a Test
11.50
• In Example 11.1, we rejected the null hypothesis. Does this prove
that the alternative hypothesis is rue?
• The answer is no; because our conclusion is based on sample
data(and not on the entire population), we can never prove
anything by using statistical inference.
• Consequently, we summarize the test by stating that there is
enough statistical evidence to infer that the null hypothesis is
false and that the alternative hypothesis is true.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Conclusions of a Test of Hypothesis
11.51
• If we reject the null hypothesis, we conclude that there is
enough evidence to infer that the alternative hypothesis is true.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Chapter-Opening Example: SSA Envelope
Plan 11.52
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
SSA Envelope Plan
11.53
• She calculates that the improved cash flow from a 2-day
decrease in the payment period would pay for the costs of the
envelopes and stamps.
• Any further decrease in the payment period would generate
a profit.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
SSA Envelope Plan
11.54
IDENTIFY
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
SSA Envelope Plan
11.55
IDENTIFY
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
SSA Envelope Plan
COMPUTE: Manually
11.56
σ220
𝑖=1 𝑥𝑖 4,759 ҧ
𝑥−𝜇 21.63−22
𝑥ҧ = = = 21.63 and 𝑧 = = = −.91,
220 220 𝜎/ 𝑛 6/ 220
• Because the value of test statistic , 𝑧 = −.91, is not less than -1.28,
we do not reject the null hypothesis and we do not conclude that
the alternative hypothesis is true.
• There is insufficient evidence to infer that the mean is less than
22 days.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
SSA Envelope Plan
COMPUTE: Manually
11.57
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
SSA Envelope Plan
11.58
COMPUTE: Excel Workbook
z-Test of a Mean
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
SSA Envelope Plan
11.59
p-value=.1814
-1.28 -.91
Rejection region
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
SSA Envelope Plan
INTERPRET 11.60
• Conclusion:
• The value of the test statistic is -.91, and its p-value is .1814: this
figure does not allow us to reject the null hypothesis.
• We say that there is not enough evidence to infer that the
mean payment period is less than 22 days. In the absence of
evidence, we cannot infer that the plan would be profitable.
• A Type I error occurs when we conclude that the plan works
when it actually does not. The cost of this mistake Is not high.
• A Type II error occurs when we don’t adopt the SSA envelope
plan when it would reduce costs. The cost of this mistake can
be high.
• As a consequence, we would like to minimize the probability
of a Type II error. Thus, we chose a large value for the
probability of a Type I error; we set 𝛼 = .10.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-2i One– and Two–Tail Tests (단측 검정과 양측검정)
11.61
• The statistical tests conducted in department store
example(Example 11.1) is called one tail tests, because the
rejection region is located in only one tail of the sampling
distribution:
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-2i One– and Two–Tail Testing
11.62
• The SSA Envelope example is a left tail test because the
rejection region was located in the left tail of the sampling
distribution.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Right-Tail Testing
11.63
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Left-Tail Testing
11.64
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Two–Tail Testing
11.65
• Two tail testing is used when we want to test a research hypothesis
that a parameter is not equal (≠) to some value
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Example 11.2 Comparison of AT&T and
Its Competitor 11.66
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Example 11.2 Comparison of AT&T
and Its Competitor 11.67
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Example 11.2
11.68
IDENTIFY
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Example 11.2
11.69
COMPUTE
• The rejection region is set up so we can reject the null hypothesis
when the test statistic is large or when it is small.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Example 11.2
11.70
COMPUTE
z
-z.025 0 +z.025
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Example 11.2
COMPUTE 11.71
ҧ
𝑥−𝜇
• Using our standardized test statistic, 𝑧 = , we find that:
𝜎/ 𝑛
ҧ
𝑥−𝜇 17.55−17.09
𝑧= = = 1.19
𝜎/ 𝑛 3.87/ 100
• Since z = 1.19 is neither greater than 1.96, nor less than –1.96, we
cannot reject the null hypothesis in favor of 𝐻1 . That is “there is
insufficient evidence to infer that there is a difference between the
bills of AT&T and the competitor.”
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Two-Tail Test p-value
11.72
COMPUTE
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Example 11.2
11.73
Figure 11.8 Sampling Distribution for Example 11.2
• There is not enough evidence to infer that the mean long-
distance bill is different from AT&T’s mean of $17.09.
• Figure 11.8 depicts the sampling distribution for this example.
𝒑 − 𝒗𝒂𝒍𝒖𝒆
=. 𝟏𝟏𝟕𝟎
𝟐
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-2j When Do We Conduct One- and
Two-Tail Tests? 11.75
• We conduct a one-tail test that focuses on the right tail of the
sampling distribution whenever we want to know whether there is
enough evidence to infer that the mean is greater than the
quantity specified by the null hypothesis:
𝐻0 : 𝜇 = 𝜇0 vs. 𝐻1 : 𝜇 > 𝜇0
• The left tail test focuses on the left tail of the sampling distribution
to determine whether there is enough evidence to infer that the
mean is less than the value of the mean stated in the null
hypothesis:
𝐻0 : 𝜇 = 𝜇0 vs. 𝐻1 : 𝜇 < 𝜇0
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Summary of One- and Two-Tail Tests…
11.76
One-Tail Test Two-Tail Test One-Tail Test
(left tail) (right tail)
𝐻0 : 𝜇 = 𝜇0 𝐻0 : 𝜇 = 𝜇0 𝐻0 : 𝜇 = 𝜇0
𝐻1 : 𝜇 < 𝜇0 𝐻1 : 𝜇 ≠ 𝜇0 𝐻1 : 𝜇 > 𝜇0
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-2k Testing Hypothesis and Confidence
Interval Estimators 11.77
• The test statistic and the confidence interval estimator are both
derived from the sampling distribution.
• So we can use the confidence interval estimator to test
hypothesis.
• To illustrate, consider Example 11.2. The 95% confidence
interval estimate of the population mean is
𝜎 3.87
𝑥ҧ ± 𝑧𝛼/2 = 17.55 ± 1.96 = 17.55 ± .76
𝑛 100
LCL=16.79 and UCL=18.31
• We estimate that 𝜇 lies between 16.79 and 18.31. Because this
interval includes 17.09, we cannot conclude that there is
sufficient evidence to infer that the population mean differs
from 17.09.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-2k Testing Hypothesis and Confidence
Interval Estimators 11.78
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-2k Testing Hypothesis and Confidence
Interval Estimators 11.79
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-2k Testing Hypothesis and Confidence
Interval Estimators 11.80
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-2l Developing an Understanding of
Statistical Concepts 1 11.81
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-2m Developing an Understanding of
Statistical Concepts 2 11.82
• When we (or the computer) calculate the value of the test statistic
ҧ
𝑥−𝜇
𝑧= ,
𝜎/ 𝑛
we’re also measuring the difference between the sample statistic
𝑥ҧ and the hypothesized value of the parameter 𝜇 in terms of the
standard error 𝜎Τ 𝑛.
• In Example 11.2, the value of the test statistic was 𝑧 = 1.19. This
means that the sample mean was 1.19 standard error above the
hypothesized value of 𝜇.
• The standard normal probability table told us that this value is
not considered unlikely. As a result, we did not reject the null
hypothesis.
• The unit of measurement of the difference is the standard error.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-3 Probability of a Type II Error (β)
11.83
• To properly interpret the results of a test of hypothesis, you must
be able to specify an appropriate significance level or to judge
the p-value of a test.
• However, you must also understand the relationship between a
Type I and Type II errors.
• In this section, we describe how the probability of a Type II error is
computed and interpreted.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-3 Probability of a Type II Error (β)
11.84
• Recall Example 11.1…
𝐻0 : 𝜇 = 170
𝐻1 : 𝜇 > 170
, where we conducted the test using the sample mean as the
test statistic and we computed the rejection region (with 𝛼 = .05) as
𝑥ҧ > 175.34
• At a significance level of 5% we rejected H0 in favor of H1 since
our sample mean (178) was greater than the critical value of 𝑋ത
(175.34).
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-3 Probability of a Type II Error (β)
11.85
• A Type II error occurs when a false null hypothesis is not rejected.
• In example 11.1, this means that if 𝑋ത is less than 175.34 (our
critical value) we will not reject our null hypothesis, which
means that we will not install the new billing system.
• Thus, we can see that:
𝛽 = 𝑃(𝑋ത < 175.34 𝑔𝑖𝑣𝑒𝑛 𝑡ℎ𝑎𝑡 𝑡ℎ𝑒 𝑛𝑢𝑙𝑙 ℎ𝑦𝑝𝑜𝑡ℎ𝑒𝑠𝑖𝑠 𝑖𝑠 𝑓𝑎𝑙𝑠𝑒)
• The condition that the null hypothesis is false tells us only that
the mean is not equal to 170.
• If we want to compute 𝛽, we need to specify a value for 𝜇.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Example 11.1 (revisited)
11.86
• For example, suppose that if the mean account balance is $180
the new billing system will be so profitable that the manager
would hate to lose the opportunity to install it.
• She would like to determine the probability of not installing the
new system when it would produce large cost savings.
• Calculate the probability of not installing the new system
when 𝜇 is equal to 180:
𝛽 = 𝑃(𝑋ത < 175.34, 𝑔𝑖𝑣𝑒𝑛 𝑡ℎ𝑎𝑡 𝜇 = 180), thus
ത
𝑋−𝜇 175.34−180
𝛽=𝑃 < = 𝑃 𝑍 < −1.43 = .0764∎
𝜎/ 𝑛 65/ 400
• This tells us that when the mean account is actually $180, the
probability of incorrectly not rejecting the null hypothesis is .0764.
• Fig. 11.9 graphically depicts how the calculation was performed.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Example 11.1 (revisited)
11.87
Figure 11.9 Calculating 𝛽 𝑓𝑜𝑟 𝜇 = 180, 𝛼 = .05, 𝑎𝑛𝑑 𝑛 = 400
• Solving for 𝑥,
ഥ we find the rejection region in terms of
unstandardized test statistic
𝑥ҧ > 177.57
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Effects on β of Changing α
11.89
• Stage 2: Probability of a Type II error
•
𝛽 = 𝑃 𝑥ҧ < 177.57 𝜇 = 180
ҧ
𝑥−𝜇 177.57−180
=𝑃 <
𝜎/ 𝑛 65/ 400
= 𝑃 𝑧 < −.75 = .2266∎
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-3a Effects on β of Changing α
11.90
• Decreasing the significance level α, increases the value of β and vice versa.
• Consider this diagram again. Shifting the critical value line to the right (to
decrease α) will mean a larger area under the lower curve for β… (and vice
versa)
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-3b Judging the Test (검정의 판단)
11.91
• A statistical test of hypothesis is effectively defined by the significance level
(α) and the sample size (n), both of which are selected by the statistics
practitioner.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-3b Judging the Test
11.92
• For example, suppose we increased n from a sample size of 400
account balances to 1,000 in Example 11.1.
𝑥ҧ > 173.38
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-3b Judging the Test
11.93
Stage 2: Probability of a Type II error
𝛽 = 𝑃 𝑥ҧ < 173.38 𝜇 = 180
ҧ
𝑥−𝜇 173.38−180
=𝑃 <
𝜎Τ 𝑛 65Τ1,000
= 𝑃 𝑧 < −3.22
= 0 (𝑎𝑝𝑝𝑟𝑜𝑥𝑖𝑚𝑎𝑡𝑒𝑙𝑦)
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
Fig. 11.9 and 11.11 Compare β at n=400 and n=1,000
… 𝜎Τ 𝑛
11.94
n=400 175.35
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-3c Developing an Understanding of
Statistical Concepts 11.95
• And hence, we make better decisions in the long run. This finding
lies at the heart of applied statistical analysis and reinforces the
book's first sentence, "Statistics is a way to get information from
data."
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-3c Developing an Understanding of
Statistical Concepts 11.96
• Throughout this book we introduce a variety of applications in
finance, marketing, operations management, human resources
management, and economics.
• In all such applications the statistics practitioner must make a
decision, which involves converting data into information. The
more information, the better the decision.
• Without such information, decisions must be based on
guesswork, instinct, and luck. A famous statistician, W. Edwards
Deming* said it best: "Without data you're just another person
with an opinion."
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-3d Power of a Test (검정력)
11.97
• Another way of expressing how well a test performs is to report its
power: the probability of its leading us to reject the null hypothesis
when it is false. Thus, the power of a test is 1 − 𝛽.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-3e Operating Characteristic Curve
(검사특성곡선)
11.98
• To compute the probability of a Type II error, we must specify the
significance level(𝛼), the sample size (𝑛), and an alternative value
of the population mean.
• One way to keep track of all these components is to draw the
operating characteristic(OC)curve, which plots the value of 𝛽
versus the values of 𝜇.
• We used the Excel function NORMDIST to compute the
probability of a Type II error in Example 11.1 for 𝜇 =
170, 171, … , 184, with n=400.
• Figure 11.12 depicts this curve*.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-3e Operating Characteristic Curve
(검사특성곡선)
11.99
• Excel Function Instruction
• With 𝜎 = 65 and 𝑛 = 400, the standard error of the mean is
𝜎 65
𝜎𝑥ҧ = = = 3.25
𝑛 400
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-3e Operating Characteristic Curve
(검사특성곡선)
11.100
• Notice as the alternative value of 𝜇 increases the value of 𝛽
decreases.
• In other words, it becomes easier to distinguish between 𝜇 =170
and other values of 𝜇 when 𝜇 is farther from 170.
• When 𝜇=170, 𝛽 = 1 − 𝛼.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-3e Operating Characteristic Curve
(검사특성곡선)
11.101
• The OC curve can also be useful in selecting a sample size.
• Figure 11.13 shows the OC curve for Example 11.1 with 𝑛 =
100, 400, 1,000, 𝑎𝑛𝑑 2,000.
• An examination of this chart sheds some light on the effect
increasing the sample size has on how well the test performs at
different values of 𝜇.
• Smaller sample sizes will work well to distinguish between 170
and values of larger than 180.
• However, to distinguish between 170 and smaller values of
requires larger sample sizes.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-3e Operating Characteristic Curve
(검사특성곡선)
11.102
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-3f Determining the Alternative Hypotheses to
Define Type I and Type II Errors 11.103
• We’ve already discussed how the alternative hypotheses is
determined?
• In Example 11.1, we wanted to know whether there was
sufficient statistical evidence to infer that the new billing system
would be cost-effective—that is, whether the mean monthly
account is greater than $170.
• In general, you will find that the question can be posed in two
ways.
• In Example 11.1, we asked whether there was evidence to
conclude that the new system would be cost-effective.
• Another way of investigating the issue is to determine whether
there is sufficient evidence to infer that the new system would
not be cost-effective.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-3f Determining the Alternative Hypotheses to
Define Type I and Type II Errors 11.104
• In a criminal trial, the burden of proof falls on the prosecution to
prove that the defendant is guilty.
• In other countries with less emphasis on individual rights, the
defendant is required to prove his or her innocence.
• This is so because we consider the conviction of an innocent
defendant to be the greater error. (committing a Type I error is a
more serious one.)
• Thus the test is set up with the null and alternative hypotheses as
described in Section 11-1.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-3f Determining the Alternative Hypotheses to
Define Type I and Type II Errors 11.105
• In a statistical test where we are responsible for both asking and
answering a question, we must ask the question so that we
directly control the error that is more costly.
• As you have already seen, we control the probability of a Type
I error by specifying its value(the significance level, 𝛼).
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-3f Determining the Alternative Hypotheses to
Define Type I and Type II Errors 11.106
• In Example 11.1, there are two possible errors:
(1) conclude that the billing system is cost-effective when it isn’t;
(2) conclude that the system is not cost-effective when it is.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-3f Determining the Alternative Hypotheses to
Define Type I and Type II Errors 11.107
• Suppose we believe that the cost of installing a system that is not
cost-effective is higher than the potential loss of not installing an
effective system.
• The error we wish to avoid is the erroneous conclusion that the
system is cost-effective. We define this as a Type I error. The
burden of proof is placed on the system to deliver sufficient
statistical evidence that the mean account is greater than
$170. The null and alternative hypotheses are as formulated
previously.
𝐻0 : 𝜇 = 170 vs 𝐻1 : 𝜇 > 170
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-3f Determining the Alternative Hypotheses to
Define Type I and Type II Errors 11.108
• If we believe that the potential loss of not installing the new system
when it would be cost-effective is larger cost, we would place the
burden of proof on the manager to infer that the mean monthly
account is less than $170.
• Consequently, the hypotheses would be
𝐻0 : 𝜇 = 170 vs 𝐻1 : 𝜇 < 170
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-4 The Road Ahead
11.109
• We had two principal goals to accomplish in chaps. 10 and 11.
• First, we wanted to present the concepts of estimation and
hypothesis testing.
• Second, we wanted to show how to produce confidence
interval estimates and conduct tests of hypotheses.
• The importance of both goals should not be underestimated.
• Sections 10-2 and 11-2 set the pattern for the ways in which
statistical techniques are applied.
• If you understand how to produce and use confidence interval
estimates and how to conduct and interpret hypothesis tests,
then you are well on your way to the ultimate goal of being
competent at analyzing, interpreting, and presenting data.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-4 The Road Ahead
11.110
• In the chapters that follow, we plan to present about three dozen
different statistical techniques that can be (and frequently are)
employed by statistics practitioners.
• To calculate the value of test statistics or confidence interval
estimates requires nothing more than the ability to add,
subtract, multiply, divide and compute square roots.
• If you intend to use the computer, all you need to know are
the commands.
• The key to applying statistics is knowing which formula to
calculate or which set of commands to issue.
• The real challenge of the subject lies in being able to define
the problem and identify which statistical method is the most
appropriate one to use. (ICI approach: Identify-Compute-
Interpret)
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-4 The Road Ahead
11.111
• A number of factors determine which statistical method
should be used, but two are especially important: the type of
data and the purpose of the statistical inference.
• Type of data:
• Nominal data: categories such as marital status,
occupation and gender; count the number of times each
category is observed.
• Ordinal data: obtained from questions whose answers
represent a rating or a ranking system
• Interval data: real numbers, such as those representing
income, age, height, weight, and volume; computations of
means and variances is permissible.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-4a Problem Objectives
11.112
• The second key factor in determining the statistical technique
is the purpose of doing the work. Every statistical method has
some specific objective. We address five such objectives in this
book.
1. Describe a population:
• Population’s incomes: mean and variance…
• The brand of computer: the proportion of the
population
2. Compare two populations
3. Compare two or more populations
4. Analyze the relationship between two variables
5. Analyze the relationship among two or more variables
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-4a Problem Objectives
11.113
Table 11.3 Guide to Statistical Inference Showing Where
Each Technique Is Introduced
Data Type
Problem Objective Nominal Ordinal Interval
Describe a population §12-3, 15-2 Not covered § 12-1, 12-2
Compare two populations §13-5, 15-2 §19-1,19-2 § 13-1, 13-3, 13-4,
19-1, 19-2
Compare two or more §15-2 §19-3 Chap.14, §19-3
populations
Analyze the relationship §15-2 §19-4 Chap.16
between two variables
Analyze the relationship Not covered Not covered Chaps.17,18
among two or more
variables
§: Section
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-4b Derivations
11.114
• Though this book is about statistical applications, it might be
helpful to have some understanding about the process that
produces the formulas.
• Factors such as the problem objective and the type of data
determine the parameter to be estimated and tested.
• For each parameter, statisticians have determined which statistic
to use. That statistic has a sampling distribution that can usually
be expressed as a formula.
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11-4b Derivations
11.115
• For example, if the parameter of interest is the population mean
𝜇, then its best estimator is the sample mean 𝑋. ത (Each parameter
has a “best” estimator.)
• Assuming that the population standard deviation 𝜎 is known,
the sampling distribution of 𝑋ത is normal (or approximately so) with
mean 𝜇 and standard deviation 𝜎Τ 𝑛. The sampling distribution
can be described by the formula,
ത
𝑋−𝜇
𝑍=
𝜎Τ 𝑛
• This
formula also describes the test statistic for 𝜇 with known 𝜎.
the sampling distribution is often the formula for the test statistic.
• With a little algebra the confidence interval estimator can be
derived from the sampling distribution.
𝜎
𝑥ҧ ± 𝑧𝛼/2
𝑛
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.
11.116
End of Chapter 11
© 2018 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in
a license distributed with a certain product or service or otherwise on a password-protected website for classroom use.