Estimation and CI
Estimation and CI
• Response Rates
• Source of data
• Sample size and sample quality
• “Random”
Estimation
• Point Estimation
• Interval estimation
– Sampling Error
– Sampling Distribution
– Confidence Intervals
Interval Estimation
Parameters Statistics
Random No Yes
variable?
Calculated No Yes
Sampling Error
Population mean
Central Limit Theorem
ෝ
𝒑
Population Proportion CI
• CI for population proportion:
pˆ (1 pˆ ) Margin
pˆ z of Error
n
• What are usual values of z? Standard Error
Confidence Error Z or
Level Probability
.9 .10 1.645 Standard Deviation
.95 .05 1.96 of the Sampling Distribution
.99 .01 2.58
An ideal point estimator will have no bias and low variability. Since variability is
almost always present when calculating statistics from different samples, we must
extend our thinking about estimating parameters to include an acknowledgement
that repeated sampling could yield different results.
The Idea of a Confidence Interval
The big idea : The sampling distribution of x tells us how close to m the
sample mean x is likely to be. All confidence intervals we construct will
have a form similar to this :
estimate ± margin of error
The confidence level does not tell us the chance that a particular
confidence interval captures the population parameter.
240.79 ± 2 × 5 = x ± 2s x
This leads to a more general formula for confidence intervals:
statistic ± (critical value) • (standard deviation of statistic)
Constructing Confidence Intervals
Calculating a Confidence Interval
The confidence interval for estimating a population parameter has
the form
statistic ± (critical value) • (standard deviation of statistic)
where the statistic we use is the point estimator for the parameter.
Our method of calculation assumes that the data come from an SRS
of size n from the population of interest.
95%
99%
-2.58sem u +2.58sem
mu
Effects of Sample Size
Process for Constructing Confidence
Intervals
• Compute the sample statistic (e.g. a mean)
• Compute the standard error of the mean
• Make a decision about level of confidence that is
desired (usually 95% or 99%)
• Find tabled value for 95% or 99% confidence
interval
• Multiply standard error of the mean by the tabled
value
• Form interval by adding and subtracting
calculated value to and from the mean
Types
• One-Sided Confidence Intervals vs. Two-Sided
Confidence Intervals
• The concept of one-sided and two-sided
confidence intervals is fairly straightforward.
• A two-sided confidence interval brackets the
population parameter of interest from above and
below.
• A one-sided confidence interval brackets the
population parameter of interest from either
above or below, which establishes an upper or
lower window in which the parameter exists.
• How to Calculate a Confidence Interval
• Let’s imagine a group of researchers that are interested in determining
whether or not the oranges grown on a particular farm are large enough
to be sold to a prospective grocery chain.
• Step #1: Find the number of samples (n).
• The researchers randomly select 46 oranges from trees on the
farm. Therefore, n = 46.
• Step #2: Calculate the mean (x) of the the samples.
• The researchers then calculate of a mean weight of 86 grams from their
sample. Therefore, x = 86.
• Step #3: Calculate the standard deviation (s).
• It’s best to use the standard deviation of the entire population, however,
in many cases researchers will not have access to this information. If this
is the case, the researchers should use the standard deviation of the
sample that they have established.
• For our example, let’s say that the researchers have resorted to
calculating the standard deviation from their sample. They receive a
standard deviation of 6.2 grams. Therefore, s = 6.2.
• Step #4: Decide the confidence interval that will be used.
• 95 percent and 99 percent confidence intervals are the most
common choices in typical market research studies.
• In our example, let’s say the researchers have elected to use a
confidence interval of 95 percent.
• Step #5: Find the Z value for the selected confidence interval.
• The researchers would then utilize the following table to
determine their Z value:
• Confidence Interval Z
• 80% 1.282
• 85% 1.440
• 90% 1.645
• 95% 1.960
• 99% 2.576
• 99.5% 2.807
• 99.9% 3.291
• Since they have decided to use a 95 percent confidence interval,
the researchers determine that Z = 1.960.
• Step #6: Calculate the following formula.
• Next, the researchers would need to plug their known values into
the formula.
• Continuing with our example, this formula would appear as follows:
• 86 ± 1.960 (6.2/6.782)
• When calculated, this formula gives the researchers the result of 86
± 1.79 as their confidence interval.
• Step #7: Draw a conclusion.
• The researchers have now determined that the true mean of the
greater population of oranges is likely (with 95 percent confidence)
between 84.21 grams and 87.79 grams.
Example
DCOVA
DCOVA
Confidence
Intervals
Population Population
Mean Proportion
σ Known σ Unknown
Confidence Interval for μ
(σ Unknown) DCOVA
d.f. = n - 1
Degrees of Freedom (df)
DCOVA
DCOVA
0.1651 0.3349
Interpretation
DCOVA
Solution: DCOVA
73 19 16 64 28 28 31 90 60 56 31 56 22 18 45 48 17 17 17 91 92 63 50 51 69 16 17
73 19 16 64 28 28 31 90 60 56 31 56 22 18 45 48 17 17 17 91 92 63 50 51 69 16 17
The initial bootstrap resample omits some values (18, 45, 50, 63, and 91) that appear in
the initial sample above. Note that the value of 73 appears twice even though it appears
only once in the initial sample.
16 16 16 17 17 17 17 17 19 22 28 31 31 51 56 56 60 60 64 64 64 64 69 73 73 90 92
The Ordered Array of Sample Means for 100 Resamples
Fifth Smallest
DCOVA