Sampling Distribution
Sampling Distribution
POINT ESTIMATE
&
SAMPLING
DISTRIBUTION
Dr. Navneet Bhatt
NUMERICAL SUMMARIES
POINT ESTIMATION
POINT ESTIMATION
SAMPLING DISTRIBUTION
SAMPLING DISTRIBUTION
SAMPLING DISTRIBUTION
POINT ESTIMATOR
SOME DEFINITIONS
12
SOME DEFINITIONS
13
SOME DEFINITIONS
and variance
14
16
EXAMPLE
19
EXAMPLE
20
QUESTION
The amount of time that a customer spends waiting at an airport check-in counter
is a random variable with mean 8.2 minutes and standard deviation 1.5
minutes. Suppose that a random sample of n = 49 customers is observed. Find
the probability that the average time waiting in line for these customers is
(a) Less than 10 minutes
(b) Between 5 and 10 minutes
(c) Less than 6 minutes
21
EXAMPLE
SAMPLING DISTRIBUTION OF THE DIFFERENCE BETWEEN TWO
MEANS
The effective life of a component used in jet-turbine aircraft engine is a random
variable with mean 5000 and SD 40 hours and is close to a normal distribution.
The engine manufacturer introduces an improvement into the Manufacturing
process for this component that changes the parameters to 5050 and 30.
Random samples of size 16 and 25 are selected. What is the probability that
the difference in the two sample means 𝑋 − 𝑋 is at least 25 hours?
23
24
EXAMPLE
The television picture tubes of manufacturer A have a mean lifetime of 6.5 years and a
standard deviation of 0.9 year, while those of manufacturer B have a mean lifetime of 6.0
years and a standard deviation of 0.8 year. What is the probability that a random sample of
36 tubes from manufacturer A will have a mean lifetime that is at least 1 year more than
the mean lifetime of a sample of 49 tubes from manufacturer B?
25
CONFIDENCE
INTERVAL
27
CONFIDENCE INTERVAL
28
CONFIDENCE INTERVAL
The 95% Confidence Interval (we will show how to calculate it later)
is:
175 cm 6.2 cm
This says the true mean of ALL men (if we could measure their
heights) is likely to be between 168.8cm and 181.2cm. But it might
not be!
29
CONFIDENCE INTERVAL
• The "95%" says that 95% of experiments like we just did will include
the true mean, but 5% won't.
• So there is a 1-in-20 chance (5%) that our Confidence Interval
does NOT include the true mean.
175 cm 6.2 cm
30
31
Step 2: Decide what Confidence level you want. 90%, 95% and 99%
are common choices. Then find the “𝑧" value for that Confidence
Interval here:
Confidence level Z
80% 1.282
85% 1.440
90% 1.645
95% 1.960
99% 2.576
For 95% the 𝑧 value is 1.960
99.5% 2.807
99.9% 3.291
32
33
So, we have
34
35
36
where the end-points 𝑙 and 𝑢 are computed from the sample data.
• There is a probability of 1 − 𝛼 of selecting a sample for which the CI
will contain the true value of 𝜇.
• The endpoints or bounds 𝑙 and 𝑢 are called lower- and upper-
confidence limits, and 1 − 𝛼 is called the confidence coefficient.
37
38
39
Interpretation: Based on the sample data, a range of highly plausible values for mean impact energy for A238
steel at 60°C is
63.84 J ≤ 𝜇 ≤ 65.08 J
40
41
The same data for impact testing from Example 1 are used to
construct a lower, one-sided 95% confidence interval for the mean
impact energy.
Answer: 𝑧𝛼 = 1.64, 𝑛 = 10, 𝜎 = 1, and 𝑥̅ = 64.46.
A 100(1 − α)% lower-confidence bound for 𝜇 is
43
EXAMPLE
THE DISTRIBUTION
k is the number of
degrees of freedom.
If the sample size is large enough, say n ≥ 30, the distribution of T does not differ considerably from the
standard normal. However, for n < 30, it is useful to deal with the exact distribution of T.
46
THE DISTRIBUTION
47
THE DISTRIBUTION
48
Percentage points of the t distribution.
Upper-tail probability p
It is customary to let tα represent the t-
value above which we find an area equal
to α.
Upper-tail probability p
t = −t
0.95 0.05
t = −t
0.99
49
0.01
EXAMPLE
EXAMPLE
Find k such that P(k < T < −1.761) = 0.045 for a random sample of size 15 selected from a normal
̅
distribution and ⁄
.
t-0.05
From Table, we note that 1.761 corresponds to t0.05 when v = 14. Therefore, −t0.05 = −1.761. Since k in the
original probability statement is to the left of −t0.05 = −1.761, let k = −tα . Then, from Figure, we have
0.045 = 0.05 − α, or α = 0.005.
Hence, from Table with v = 14,
k = −t0.005 = −2.977 and P(−2.977 < T < −1.761) = 0.045.
51
52
Interpretation: The CI is fairly wide because there is a lot of variability in the measurements.
A larger sample size would have led to a shorter interval. 53
EXAMPLE
Acme Corporation manufactures light bulbs. The CEO claims that an average Acme light bulb
lasts 300 days. A researcher randomly selects 15 bulbs for testing. The sampled bulbs last an
average of 290 days, with a standard deviation of 50 days. If the CEO's claim were true, what is
the probability that 15 randomly selected bulbs would have an average life of no more than
290 days?
54