Sampling Distribution and Estimation

The document discusses sampling distribution and estimation. It defines key terms like population, sample, parameters and statistics. It describes how to construct a sampling distribution by taking samples from a population and calculating statistics. It provides examples to show that the mean of sample means equals the population mean, and that the standard deviation of a sampling distribution is less than the population standard deviation.

Uploaded by

Prince

Sampling Distribution and Estimation

Prepared by:
B. S. Parajuli
| | Population | Sample |
|---|---|---|
| Definition | Collection of items under study | Part or portion of the population chosen for study |
| Characteristic | Parameter | Statistic |
| Size | $N$ | $n$ |
| Mean | $\mu$ | $\bar{x}$ |
| Standard deviation (SD) | $\sigma$ | $s$ |
| Variance | $\sigma^2$ | $s^2$ |
| Correlation coefficient | $\rho$ (rho) | $r$ |
| Regression coefficient | $\beta$ | $b$ |
| Proportion | $P$ | $p$ |

Population proportion: $P = \dfrac{a}{N}$, where $a$ is the number of items in the population having some characteristic and $N$ is the population size.
Sample proportion: $p = \dfrac{x}{n}$, where $x$ is the number of items in the sample having some characteristic and $n$ is the sample size.
Some Important Theorems on Simple Random Sampling
• The sample mean is an unbiased estimate of the population mean:
$E(\bar{y}) = \bar{Y} = \mu$
• In simple random sampling without replacement (SRSWOR), the variance of the sample mean is given by
$V(\bar{y}) = \dfrac{\sigma^2}{n} \cdot \dfrac{N-n}{N-1}$, where $N$ is the population size (finite population)
• In simple random sampling with replacement (SRSWR), the variance of the sample mean is given by
$V(\bar{y}) = \dfrac{\sigma^2}{n}$
Sampling Distribution

A population parameter is always a constant, whereas a sample statistic is a random variable. Because every random variable must possess a probability distribution, every sample statistic has one. The probability distribution of a sample statistic is called its sampling distribution.
• Definition: “The distribution of all possible values that can be assumed by some statistic, computed from samples of the same size randomly drawn from the same population, is called the sampling distribution of that statistic” (Daniel, 2010).
Construction of Sampling Distribution
• A sampling distribution may be constructed empirically when sampling from a discrete finite population.
• To construct a sampling distribution we proceed as follows:
i. From a finite population of size N, draw all possible samples of size n.
ii. Compute the statistic of interest for each sample.
iii. List in one column the distinct observed values of the statistic, and in another column the frequency of occurrence of each distinct observed value.
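The three steps above can be sketched in a few lines of Python. This is only an illustration: the function name `sampling_distribution` and the population `[1, 2, 3, 4]` are hypothetical, and sampling is assumed to be without replacement.

```python
from collections import Counter
from itertools import combinations
from statistics import mean


def sampling_distribution(population, n, statistic=mean):
    """Frequency table of `statistic` over all samples of size n (without replacement)."""
    # Steps i and ii: enumerate every possible sample and compute the statistic.
    values = [statistic(sample) for sample in combinations(population, n)]
    # Step iii: distinct values of the statistic with their frequencies.
    return Counter(values)


# Hypothetical population, chosen only to illustrate the procedure.
table = sampling_distribution([1, 2, 3, 4], n=2)
for value, frequency in sorted(table.items()):
    print(value, frequency)
```

Passing a different function as `statistic` (for example `statistics.median`) gives the sampling distribution of that statistic instead.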
Example 1
• A population consists of the 5 numbers 2, 3, 6, 8 and 11. Consider all samples of size 2 that can be drawn without replacement from this population.
(a) Find the mean and variance of the population.
(b) Show that the mean of the sample means is equal to the population mean.
(c) Find the variance of the sampling distribution of means and verify it with the formula $V(\bar{y}) = \dfrac{\sigma^2}{n} \cdot \dfrac{N-n}{N-1}$.
(d) Also show that the standard deviation of the sampling distribution of means is less than the population standard deviation.
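Solution:
Population size (N) = 5
Population: 2, 3, 6, 8 and 11
Sample size (n) = 2
(a) Calculation of population mean and population variance

| Population value ($Y$) | $Y - \bar{Y} = Y - 6$ | $(Y - \bar{Y})^2$ |
|---|---|---|
| 2 | -4 | 16 |
| 3 | -3 | 9 |
| 6 | 0 | 0 |
| 8 | 2 | 4 |
| 11 | 5 | 25 |
| $\sum Y = 30$ | $\sum (Y - \bar{Y}) = 0$ | $\sum (Y - \bar{Y})^2 = 54$ |

Population mean: $\mu = \bar{Y} = \dfrac{\sum Y}{N} = \dfrac{30}{5} = 6$
Population variance: $\sigma^2 = \dfrac{\sum (Y - \bar{Y})^2}{N} = \dfrac{54}{5} = 10.8$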
(b) Number of possible samples of size 2 that can be drawn from the population without replacement = ${}^{N}C_{n} = {}^{5}C_{2} = 10$.
The possible samples are: (2,3), (2,6), (2,8), (2,11), (3,6), (3,8), (3,11), (6,8), (6,11), (8,11)
Calculation of mean and variance of the sampling distribution of sample means

| Sample number | Sample values ($y$) | Sample mean ($\bar{y}$) | $\bar{y} - \bar{\bar{y}} = \bar{y} - 6$ | $(\bar{y} - \bar{\bar{y}})^2$ |
|---|---|---|---|---|
| 1 | (2,3) | 2.5 | -3.5 | 12.25 |
| 2 | (2,6) | 4 | -2 | 4 |
| 3 | (2,8) | 5 | -1 | 1 |
| 4 | (2,11) | 6.5 | 0.5 | 0.25 |
| 5 | (3,6) | 4.5 | -1.5 | 2.25 |
| 6 | (3,8) | 5.5 | -0.5 | 0.25 |
| 7 | (3,11) | 7 | 1 | 1 |
| 8 | (6,8) | 7 | 1 | 1 |
| 9 | (6,11) | 8.5 | 2.5 | 6.25 |
| 10 | (8,11) | 9.5 | 3.5 | 12.25 |
| ${}^{N}C_{n} = 10$ | | $\sum \bar{y} = 60$ | $\sum (\bar{y} - \bar{\bar{y}}) = 0$ | $\sum (\bar{y} - \bar{\bar{y}})^2 = 40.5$ |

Now, the mean of the sample means is $\bar{\bar{y}} = \dfrac{\sum \bar{y}}{{}^{N}C_{n}} = \dfrac{60}{10} = 6$
Population mean $\mu = 6$
Therefore, the mean of the sample means is equal to the population mean, i.e. $\bar{\bar{y}} = \mu = 6$.
$E(\bar{y}) = \mu$, i.e. the sample mean is an unbiased estimate of the population mean.
(c) Variance of the sample mean: $V(\bar{y}) = \sigma_{\bar{y}}^2 = \dfrac{\sum (\bar{y} - \bar{\bar{y}})^2}{{}^{N}C_{n}} = \dfrac{40.5}{10} = 4.05$
Variance with the formula: $V(\bar{y}) = \dfrac{\sigma^2}{n} \cdot \dfrac{N-n}{N-1} = \dfrac{10.8}{2} \cdot \dfrac{5-2}{5-1} = 4.05$
Hence verified.
(d) The standard deviation of the sampling distribution of the sample mean is
$\sigma_{\bar{y}}$ = standard error of the mean = $S.E.(\bar{y}) = \sqrt{V(\bar{y})} = \sqrt{4.05} = 2.01$
Population standard deviation: $\sigma = \sqrt{10.8} = 3.29$
Here $\sigma_{\bar{y}} < \sigma$; hence the standard deviation of the sampling distribution of the sample mean is smaller than the population standard deviation.
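The arithmetic of Example 1 can be checked numerically. The sketch below assumes only Python's standard library; the variable names are illustrative.

```python
from itertools import combinations
from math import sqrt
from statistics import mean, pvariance

population = [2, 3, 6, 8, 11]
N, n = len(population), 2

mu = mean(population)            # population mean
sigma2 = pvariance(population)   # population variance (divisor N)

# All samples of size n without replacement, and their means.
sample_means = [mean(s) for s in combinations(population, n)]
mean_of_means = mean(sample_means)
var_of_means = pvariance(sample_means)

# Variance of the sample mean from the SRSWOR formula.
formula = (sigma2 / n) * (N - n) / (N - 1)

print("population mean and variance:", mu, sigma2)
print("mean of sample means:", mean_of_means)
print("variance of sample means:", var_of_means, "formula:", formula)
print("S.E. of mean:", sqrt(var_of_means), "population s.d.:", sqrt(sigma2))
```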
Example 2
• A population consists of the 4 numbers 1, 2, 5 and 8. Consider all samples of size 2 that can be drawn with replacement from this population.
(a) Find the mean and variance of the population.
(b) Show that the mean of the sample means is equal to the population mean.
(c) Find the variance of the sampling distribution of means and verify it with the formula $V(\bar{y}) = \dfrac{\sigma^2}{n}$.
(d) Also show that the standard deviation of the sampling distribution of means is less than the population standard deviation.
Solution:
Population size (N) = 4
Population: 1, 2, 5 and 8
Sample size (n) = 2
(a) Calculation of population mean and population variance

| Population value ($Y$) | $Y - \bar{Y} = Y - 4$ | $(Y - \bar{Y})^2$ |
|---|---|---|
| 1 | | |
| 2 | | |
| 5 | | |
| 8 | | |
| $\sum Y =$ | $\sum (Y - \bar{Y}) = 0$ | $\sum (Y - \bar{Y})^2 = 30$ |

Population mean: $\mu = \bar{Y} = \dfrac{\sum Y}{N} = \ldots$
Population variance: $\sigma^2 = \dfrac{\sum (Y - \bar{Y})^2}{N} = \ldots$
(b) Number of possible samples of size 2 that can be drawn from the population with replacement = $N^n = 4^2 = 16$
The possible samples are:
(1,1), (1,2), (1,5), (1,8), (2,1), (2,2), (2,5), (2,8),
(5,1), (5,2), (5,5), (5,8), (8,1), (8,2), (8,5), (8,8)
Calculation of mean and variance of the sampling distribution of sample means:

| Sample number | Sample values ($y$) | Sample mean ($\bar{y}$) | $\bar{y} - \bar{\bar{y}} = \bar{y} - 4$ | $(\bar{y} - \bar{\bar{y}})^2$ |
|---|---|---|---|---|
| 1 | | | | |
| … | | | | |
| 16 | | | | |
| | | $\sum \bar{y} =$ | $\sum (\bar{y} - \bar{\bar{y}}) = 0$ | $\sum (\bar{y} - \bar{\bar{y}})^2 = 60$ |

The remaining steps are the same as in Example 1.
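The blank entries of Example 2 are left for the reader to fill in by hand. As a check, the following sketch enumerates the 16 ordered samples drawn with replacement; it assumes only Python's standard library, and the variable names are illustrative.

```python
from itertools import product
from statistics import mean, pvariance

population = [1, 2, 5, 8]
n = 2

mu = mean(population)
sigma2 = pvariance(population)   # population variance (divisor N)

# Sampling with replacement: every ordered pair, including repeats such as (1, 1).
samples = list(product(population, repeat=n))
sample_means = [mean(s) for s in samples]

mean_of_means = mean(sample_means)
var_of_means = pvariance(sample_means)

print("number of samples:", len(samples))
print("population mean and variance:", mu, sigma2)
print("mean of sample means:", mean_of_means)
print("variance of sample means:", var_of_means, "sigma^2 / n:", sigma2 / n)
```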
Conclusion
• The population data are uniformly distributed, and the sample means are symmetrically distributed.
• The population mean and the mean of the sample means are equal.
• The sample mean is an unbiased estimator of the population mean.
• The variance of the sampling distribution of the mean is not equal to the population variance.
• The spread of the sample means is smaller than the spread of the population values.
• The shape of the sampling distribution of the sample means tends to be bell-shaped and approximates the normal distribution, even when the population is not normally distributed, provided that the sample size is reasonably large.
The Central Limit Theorem
• The central limit theorem states: “When the size of the sample increases and becomes sufficiently large, the sampling distribution of the mean ($\bar{X}$) will be approximately normally distributed with mean $\mu$ and variance $\dfrac{\sigma^2}{n}$.”
• If $X_i$ ($i = 1, 2, \ldots, n$) are independent random variables such that $E(X_i) = \mu_i$ and $\mathrm{Var}(X_i) = \sigma_i^2$, then it can be proved that, under certain very general conditions, the random variable $S_n = X_1 + X_2 + \cdots + X_n$ is asymptotically normal with mean $\mu = \sum_{i=1}^{n} \mu_i$ and variance $\sigma^2 = \sum_{i=1}^{n} \sigma_i^2$, i.e. S.D. $\sigma = \sqrt{\sum_{i=1}^{n} \sigma_i^2}$.
• A mathematical formulation of the central limit theorem is that the distribution of $\dfrac{\text{Statistic} - \text{Parameter}}{\text{Standard error}} = \dfrac{\bar{X} - \mu}{\sigma / \sqrt{n}}$ approaches a normal distribution with mean 0 (zero) and variance 1 (one) as $n \to \infty$.
• Note: The central limit theorem allows us to sample from non-normally distributed populations with a guarantee of approximately the same results as would be obtained if the populations were normally distributed, provided that we take a large sample.
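A small simulation can illustrate the theorem. This is a sketch under stated assumptions: the skewed population (exponential with mean 1 and variance 1), the sample size `n = 50`, the number of replicates and the seed are all arbitrary choices made for illustration.

```python
import random
from statistics import mean, pvariance

random.seed(0)        # fixed seed so the run is repeatable

n = 50                # sample size (assumed "sufficiently large")
replicates = 2000     # number of independent samples drawn

# Exponential population with rate 1: mean 1, variance 1, strongly skewed.
sample_means = [
    mean([random.expovariate(1.0) for _ in range(n)])
    for _ in range(replicates)
]

# The theorem suggests mean(sample_means) is close to mu = 1
# and the variance of the sample means is close to sigma^2 / n = 1 / 50.
print("mean of sample means:", mean(sample_means))
print("variance of sample means:", pvariance(sample_means), "vs", 1 / n)
```

Plotting a histogram of `sample_means` would show the bell shape, even though the individual observations are far from normal.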
Standard Error
• The standard deviation of the sampling distribution of a sample statistic is known as the standard error, abbreviated S.E.
• It is a statistical term that measures the accuracy with which a sample represents a population.
• The standard error measures chance deviation, not an error or mistake.

Uses of standard error:

• To work out the limits within which the population mean would lie.
• To determine whether or not a sample is drawn from a known population, when the population mean is known.
• To determine, from the standard error of the difference between the means of two samples, whether the difference is real and statistically significant, or only apparent and insignificant (due to chance).
• To calculate the size of the sample.
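| Statistic | Standard error (S.E.) | Conditions |
|---|---|---|
| Mean | $S.E.(\bar{X}) = \dfrac{\sigma}{\sqrt{n}}$ | For an infinite population (or if the sample is drawn with replacement) |
| Mean | $S.E.(\bar{X}) = \dfrac{\sigma}{\sqrt{n}} \cdot \sqrt{\dfrac{N-n}{N-1}}$ | For a finite population |
| Mean | $S.E.(\bar{X}) = \dfrac{s}{\sqrt{n}}$ | When $\sigma$ is unknown and the population size is infinite |
| Mean | $S.E.(\bar{X}) = \dfrac{s}{\sqrt{n}} \cdot \sqrt{\dfrac{N-n}{N-1}}$ | For a finite population (sample drawn without replacement) |
| Proportion | $S.E.(p) = \sqrt{\dfrac{PQ}{n}}$ | For an infinite population, where $P + Q = 1$ |
| Proportion | $S.E.(p) = \sqrt{\dfrac{pq}{n}}$ | For an infinite population when $P$ and $Q$ are unknown, where $p + q = 1$ |
| Proportion | $S.E.(p) = \sqrt{\dfrac{PQ}{n}} \cdot \sqrt{\dfrac{N-n}{N-1}}$ | For a finite population ($N$ is given) |
| Proportion | $S.E.(p) = \sqrt{\dfrac{pq}{n}} \cdot \sqrt{\dfrac{N-n}{N-1}}$ | When $P$ and $Q$ are unknown but $N$ is given |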
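The standard errors tabulated above differ only in whether the finite population correction is applied. A minimal sketch follows; the function names `fpc`, `se_mean` and `se_proportion` are hypothetical helpers, and the numbers passed to them are illustrative.

```python
from math import sqrt


def fpc(N, n):
    """Finite population correction sqrt((N - n) / (N - 1))."""
    return sqrt((N - n) / (N - 1))


def se_mean(sd, n, N=None):
    """S.E. of the mean; pass N for a finite population sampled without replacement."""
    se = sd / sqrt(n)
    return se if N is None else se * fpc(N, n)


def se_proportion(p, n, N=None):
    """S.E. of a proportion, with q = 1 - p; pass N for a finite population."""
    se = sqrt(p * (1 - p) / n)
    return se if N is None else se * fpc(N, n)


print(se_mean(10, 400))                 # infinite population
print(se_mean(1.368, 60, N=540))        # finite population
print(se_proportion(0.2, 400))          # infinite population
```

`sd` may be either $\sigma$ or, when $\sigma$ is unknown, the sample value $s$.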
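| Statistic | Standard error (S.E.) | Conditions |
|---|---|---|
| Difference of means | $S.E.(\bar{X}_1 - \bar{X}_2) = \sqrt{\dfrac{\sigma_1^2}{n_1} + \dfrac{\sigma_2^2}{n_2}}$ | If $\sigma_1$ and $\sigma_2$ are known |
| Difference of means | $S.E.(\bar{X}_1 - \bar{X}_2) = \sqrt{\sigma^2 \left(\dfrac{1}{n_1} + \dfrac{1}{n_2}\right)}$ | If the two samples are drawn from the same population ($\sigma_1 = \sigma_2 = \sigma$) |
| Difference of means | $S.E.(\bar{X}_1 - \bar{X}_2) = \sqrt{S^2 \left(\dfrac{1}{n_1} + \dfrac{1}{n_2}\right)}$ | If $\sigma_1$ and $\sigma_2$ are not known, use the combined variance $S^2 = \dfrac{n_1 s_1^2 + n_2 s_2^2}{n_1 + n_2 - 2} = \dfrac{\sum (X_1 - \bar{X}_1)^2 + \sum (X_2 - \bar{X}_2)^2}{n_1 + n_2 - 2}$, where $s_i^2 = \sum (X_i - \bar{X}_i)^2 / n_i$ |
| Difference of proportions | $S.E.(p_1 - p_2) = \sqrt{\dfrac{P_1 Q_1}{n_1} + \dfrac{P_2 Q_2}{n_2}}$ | If the two population proportions are known |
| Difference of proportions | $S.E.(p_1 - p_2) = \sqrt{\hat{P} \hat{Q} \left(\dfrac{1}{n_1} + \dfrac{1}{n_2}\right)}$ | If the two population proportions are equal ($P_1 = P_2 = P$); combined proportion $\hat{P} = \dfrac{n_1 p_1 + n_2 p_2}{n_1 + n_2} = \dfrac{X_1 + X_2}{n_1 + n_2}$, $\hat{Q} = 1 - \hat{P}$ |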
A marketing manager in an organization needs to estimate the likely market share the company can achieve in the marketplace.

A quality assurance manager may be interested in estimating the proportion of defective finished products before shipment to the customer.

The manager of a credit department needs to estimate the average collection period for collecting dues from customers.
Estimation
• Estimation: The statistical technique of estimating unknown population parameters from the corresponding sample statistics is known as estimation.
• Estimator: A function (or algebraic expression) that uses sample information to estimate a population parameter is known as an estimator.
Suppose the sample mean ($\bar{x}$) is used to estimate the population mean ($\mu$). Here the population mean ($\mu$) is the parameter to be estimated, $\bar{x} = \sum X / n$ is an estimator, which is a function (or formula) of the sample values, and the numerical value of $\bar{x}$ for a particular sample is an estimate of the population parameter ($\mu$).
• Estimate: A specific numerical value of an estimator is called an estimate.
• Point estimate: A point estimate is a single numerical value used to estimate the corresponding population parameter. For example: the general manager of Nepal Telecom estimates that the number of Namaste mobile subscribers next year will be 85,00,000.
• Interval estimate: An interval estimate consists of two numerical values defining a range of values that, with a specified degree of confidence, most likely includes the parameter being estimated. For example: the general manager of Nepal Telecom estimates that the number of Namaste mobile subscribers next year will be between 80,00,000 and 90,00,000.
Criteria of a good estimator
• Unbiasedness: An estimator is said to be unbiased if the expected value of the sample statistic is equal to the population parameter. An estimator $T$ of the parameter $\theta$ is said to be an unbiased estimator of $\theta$ if $E(T) = \theta$. For example: since the expected value of the sample mean is equal to the population mean, i.e. $E(\bar{X}) = \mu$, the sample mean is an unbiased estimator of the population mean. Similarly, $E(p) = P$. Hence the sample mean and the sample proportion are unbiased estimators.
• Note: (a) If $E(\bar{X}) - \mu \neq 0$, the estimator is called biased.
(b) If $E(\bar{X}) - \mu > 0$, it is called positively biased.
(c) If $E(\bar{X}) - \mu < 0$, it is called negatively biased.
• Consistency: A statistic is considered a consistent estimator of the population parameter if, as the sample size increases, its value comes closer to the population parameter. Thus a consistent estimator is more reliable with a large sample. An estimator $T$ calculated from a sample is said to be a consistent estimator of a parameter $\theta$ if $T \to \theta$ as $n \to \infty$. Example: the sample mean comes closer to the population mean as the sample size increases.
• Efficiency: Efficiency refers to the size of the standard error of the sample statistic. The estimator with the smaller variance is considered the more efficient estimator. Let $t_1$ and $t_2$ be two consistent estimators of the parameter $\theta$ such that $\mathrm{Var}(t_1) < \mathrm{Var}(t_2)$ for all $n$; then $t_1$ is said to be more efficient than $t_2$, with efficiency $E = \dfrac{\mathrm{Var}(t_1)}{\mathrm{Var}(t_2)}$. Example: the standard error of the mean is smaller than the standard errors of the median and the mode, so the mean is an efficient estimator of the population mean.
• Sufficiency: An estimator is said to be a sufficient estimator if it uses all the information about the population parameter contained in the sample. For example: the sample mean ($\bar{X}$) is a sufficient estimator of the population mean ($\mu$) because it uses all the information given in the sample, whereas the median uses the information of only two classes and the mode uses the information of only three classes. Therefore the mean is a more sufficient estimator than the median and the mode.
• Confidence interval: The interval within which the unknown value of a population parameter is expected to lie is known as the confidence interval. The lower limit $X_L$ and the upper limit $X_U$ of the interval are called the confidence limits (or fiducial limits).

[Figure: sampling distribution centred at $\theta$ on an axis from $-\infty$ to $+\infty$; the central area $(1 - \alpha)$ between $X_L$ and $X_U$ is the acceptance region, with an area of $\alpha/2$ in each tail.]
• Confidence level: The probability that we associate with an interval estimate is called the confidence level. This probability indicates how confident we are that the interval estimate will include the population parameter. It is denoted by $1 - \alpha$. Example: a 95% confidence level indicates that there is a 95% probability that the interval estimate will include the population parameter, and a 5% risk that the parameter lies outside the confidence limits.

• Level of significance: The maximum probability of error (risk) that we are willing to tolerate in decision making based on sample evidence is called the level of significance. It is denoted by $\alpha$ (alpha).

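| Confidence level $(1 - \alpha)$ | 50% | 90% | 95% | 96% | 97% | 98% | 99% | 99.73% |
|---|---|---|---|---|---|---|---|---|
| Level of significance / risk ($\alpha$) | 50% | 10% | 5% | 4% | 3% | 2% | 1% | 0.27% |
| Value of Z (two-tailed) | 0.6745 | 1.645 | 1.96 | 2.05 | 2.17 | 2.33 | 2.58 | 3 |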
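The Z values in the table can be reproduced from the standard normal distribution. The sketch below uses `statistics.NormalDist` from Python's standard library; the helper name `z_two_tailed` is hypothetical.

```python
from statistics import NormalDist


def z_two_tailed(confidence):
    """Two-tailed critical value Z for a confidence level such as 0.95."""
    alpha = 1 - confidence
    # Leave alpha / 2 in each tail, so look up the (1 - alpha / 2) quantile.
    return NormalDist().inv_cdf(1 - alpha / 2)


for level in (0.50, 0.90, 0.95, 0.96, 0.97, 0.98, 0.99, 0.9973):
    print(f"{level:.2%}  Z = {z_two_tailed(level):.4f}")
```

The computed values agree with the table after rounding.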
Confidence Interval for Large Samples (n > 30)
Confidence interval estimate of the population mean from a large sample:
$C.I.(\mu) = \bar{X} \pm Z_\alpha \cdot S.E.(\bar{X})$
$= \bar{X} \pm Z_\alpha \cdot \dfrac{\sigma}{\sqrt{n}}$ [When $\sigma$ is unknown, we use $\hat{\sigma} = s$ for large samples]
$= \bar{X} \pm Z_\alpha \cdot \dfrac{\sigma}{\sqrt{n}} \cdot \sqrt{\dfrac{N-n}{N-1}}$ [In the case of simple random sampling without replacement from a finite population of size N]

Confidence interval estimate of the population proportion from a large sample:

$C.I.(P) = p \pm Z_\alpha \cdot S.E.(p)$
$= p \pm Z_\alpha \cdot \sqrt{\dfrac{PQ}{n}}$ [When P is unknown, we use $\hat{P} = p$ for large samples]
$= p \pm Z_\alpha \cdot \sqrt{\dfrac{PQ}{n}} \cdot \sqrt{\dfrac{N-n}{N-1}}$ [In the case of simple random sampling without replacement from a finite population of size N]
$= p \pm Z_\alpha \cdot \sqrt{\dfrac{pq}{n}} \cdot \sqrt{\dfrac{N-n}{N-1}}$ [Used when P and Q are unknown but N is finite]
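Example: A sample of 400 students taking the entrance exam for B.Sc. CSIT revealed an average score of 56 and a sample standard deviation of 10. Construct 98% and 99% confidence intervals for the population mean.
Solution:
With the usual notation, n = 400, $\bar{X} = 56$, $s = 10$
For the 98% confidence level: 1 − α = 98%, α = 2%, $Z_\alpha = 2.33$
Hence the 98% confidence limits for the population mean are given by
$\bar{X} \pm Z_\alpha \cdot S.E.(\bar{X})$
$= \bar{X} \pm Z_\alpha \cdot \dfrac{s}{\sqrt{n}}$
$= 56 \pm 2.33 \times \dfrac{10}{\sqrt{400}}$
= 56 ± 1.165
= (56 − 1.165, 56 + 1.165) = (54.835, 57.165)
For the 99% confidence level: 1 − α = 99%, α = 1%, $Z_\alpha = 2.58$
$56 \pm 2.58 \times \dfrac{10}{\sqrt{400}}$ = 56 ± 1.29 = (54.71, 57.29)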
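The interval for this example can be reproduced with a short helper. This is a sketch: the function name `ci_mean` is hypothetical, and the Z values 2.33 and 2.58 are the rounded table values rather than exact quantiles.

```python
from math import sqrt


def ci_mean(x_bar, s, n, z):
    """Large-sample confidence interval x_bar ± z * s / sqrt(n)."""
    margin = z * s / sqrt(n)
    return x_bar - margin, x_bar + margin


# n = 400, mean 56, s = 10, with Z taken from the table of critical values.
for label, z in (("98%", 2.33), ("99%", 2.58)):
    low, high = ci_mean(56, 10, 400, z)
    print(f"{label}: ({low:.3f}, {high:.3f})")
```

A higher confidence level uses a larger Z and therefore gives a wider interval.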
Example: From a population of 540, a sample of 60 individuals is taken. From this sample the mean is found to be 6.2 and the standard deviation 1.368.
(i) Find the standard error of the mean.
(ii) Construct a 96% confidence interval for the mean.
Solution:
With the usual notation, N = 540, n = 60, $\bar{X} = 6.2$, $s = 1.368$
(i) Standard error of the mean: $S.E.(\bar{X}) = \dfrac{s}{\sqrt{n}} \cdot \sqrt{\dfrac{N-n}{N-1}}$ [for a large sample, $\hat{\sigma} = s$]
$= \dfrac{1.368}{\sqrt{60}} \cdot \sqrt{\dfrac{540-60}{540-1}}$
= 0.17
(ii) For the 96% confidence level, 1 − α = 96%, α = 4%
$Z_\alpha = 2.05$
Hence the 96% confidence interval for the mean is given by
$\bar{X} \pm Z_\alpha \cdot S.E.(\bar{X})$
= 6.2 ± 2.05 × 0.17
= 6.2 ± 0.3485
= (5.85, 6.55)
Example: In a laboratory experiment testing whether a material is in good condition, a sample of 400 units was drawn. When they were tested, 80 were good. Find the 95% confidence limits for the percentage of good units.
Solution:
Sample size (n) = 400, number of good units (x) = 80
Sample proportion: $p = \dfrac{x}{n} = \dfrac{80}{400} = 0.2$, q = 1 − p = 1 − 0.2 = 0.8
For 95% confidence limits, (1 − α) = 95%, α = 5%, $Z_\alpha = 1.96$
The 95% confidence interval for the population proportion (P) is given by
$p \pm Z_\alpha \cdot S.E.(p)$
$= p \pm 1.96 \times \sqrt{\dfrac{pq}{n}}$
$= 0.2 \pm 1.96 \times \sqrt{\dfrac{0.2 \times 0.8}{400}}$
= 0.2 ± 0.039 = (0.161, 0.239), i.e. between 16.1% and 23.9% of the units are good.
Example: A factory produces 5000 CDs daily. From a sample of 500 CDs, 2% were found to be of substandard quality. Estimate, at the 95% confidence level, the percentage of CDs that can reasonably be expected to be spoiled in the daily production.
Solution:
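The slides leave this solution blank. One possible way to work it is sketched below, assuming that the daily production of 5000 is treated as a finite population sampled without replacement, so that the finite population formula for S.E.(p) applies, and that Z = 1.96 is taken from the table.

```python
from math import sqrt

N, n = 5000, 500      # daily production and sample size, from the problem
p = 0.02              # sample proportion of substandard CDs
q = 1 - p
z = 1.96              # table value for the 95% confidence level

# S.E.(p) with the finite population correction (an assumption, see the lead-in).
se = sqrt(p * q / n) * sqrt((N - n) / (N - 1))
low, high = p - z * se, p + z * se

print(f"S.E.(p) = {se:.4f}")
print(f"95% C.I. for P: ({low:.2%}, {high:.2%})")
print(f"expected spoiled CDs per day: {N * low:.0f} to {N * high:.0f}")
```

Dropping the correction factor (treating the population as infinite) would give a slightly wider interval.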
Confidence Interval for Small Samples (n ≤ 30)

• Confidence intervals using ‘t’: The general procedure for constructing confidence intervals is:
• estimator ± (reliability coefficient) × (standard error of the estimate)
• The reliability coefficient is obtained from the table of the t-distribution rather than from the table of the standard normal distribution. When sampling is from a normal distribution whose standard deviation, σ, is unknown, the confidence interval is given by:
• $C.I.(\mu) = \bar{X} \pm t_{\alpha, n-1} \cdot \dfrac{s}{\sqrt{n}}$ [For the unbiased estimator / when actual data are given]
• $C.I.(\mu) = \bar{X} \pm t_{\alpha, n-1} \cdot \dfrac{s}{\sqrt{n-1}}$ [For the biased estimator / when actual data are not given]
• Degrees of freedom: The number of independent observations in a sample is called the degrees of freedom. It is defined as the difference between the total number of items and the total number of constraints. The t-distribution used here has (n − 1) degrees of freedom.
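Computation of $S^2$ for Numerical Problems:

• $S^2 = \dfrac{1}{n-1} \sum (x - \bar{x})^2$
• $S^2 = \dfrac{1}{n-1} \left[ \sum x^2 - \dfrac{(\sum x)^2}{n} \right]$
• $S^2 = \dfrac{1}{n-1} \left[ \sum d^2 - \dfrac{(\sum d)^2}{n} \right]$, where $d = x - A$ and $\bar{x} = A + \dfrac{\sum d}{n}$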
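Extract from the t-table:

| Level of significance for one-tailed test | 0.10 | 0.05 | 0.025 | 0.01 | 0.005 | 0.0005 |
|---|---|---|---|---|---|---|
| Level of significance for two-tailed test | 0.20 | 0.10 | 0.05 | 0.02 | 0.01 | 0.001 |
| d.f. = 1 | 3.078 | 6.314 | 12.706 | 31.821 | 63.657 | 636.619 |
| … | | | | | | |
| d.f. = 24 | | | 2.064 | | | |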
Example: A random sample of size 25 showed a mean of 172.50 cm with a standard deviation of 15.40 cm. Determine the 95% confidence interval for the mean of the population.
Solution:
95% confidence interval for the population mean:
n = 25, $\bar{X} = 172.50$, s = 15.40
1 − α = 0.95, α = 0.05, d.f. = n − 1 = 25 − 1 = 24
$t_{\alpha, n-1} = t_{0.05, 24} = 2.064$
The 95% C.I. for the population mean ($\mu$) is
$\bar{X} \pm t_{\alpha, n-1} \cdot \dfrac{s}{\sqrt{n-1}}$
$= 172.50 \pm 2.064 \times \dfrac{15.40}{\sqrt{25-1}}$
= 172.50 ± 6.49
= (172.50 − 6.49, 172.50 + 6.49)
= (166.01, 178.99)
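The margin of error in this example can be recomputed as follows. The sketch takes the critical value 2.064 from the t-table rather than computing it, and, as in the worked solution, treats the reported standard deviation as the biased (divisor n) value, hence the $\sqrt{n-1}$.

```python
from math import sqrt

n, x_bar, s = 25, 172.50, 15.40
t_crit = 2.064                       # t-table value: 24 d.f., two-tailed 5%

# Summary data only, so s is treated as the biased s.d. and n - 1 is used.
margin = t_crit * s / sqrt(n - 1)
low, high = x_bar - margin, x_bar + margin

print(f"margin of error = {margin:.2f}")
print(f"95% C.I. = ({low:.2f}, {high:.2f})")
```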
• A machine produces metal rods used in an automobile suspension system. A random sample of 6 rods is selected and the diameter of each is measured. The measured data (in mm) are as follows, and the sample is assumed to be drawn from a normally distributed population:
8.24 8.26 8.20 8.28 8.21 8.23

Find the 95% two-sided confidence interval for the mean rod diameter and interpret the result with reference to the given problem.
Solution:
Calculate the sample s.d. (s) from the given data and use the formula
$\bar{X} \pm t_{\alpha, n-1} \cdot \dfrac{s}{\sqrt{n}}$, where d.f. = n − 1 = 6 − 1 = 5

| $X$ | $X^2$ |
|---|---|
| 8.24 | 67.8976 |
| 8.26 | 68.2276 |
| 8.20 | 67.24 |
| 8.28 | 68.5584 |
| 8.21 | 67.4041 |
| 8.23 | 67.7329 |
| $\sum X = 49.42$ | $\sum X^2 = 407.0606$ |
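The slides stop at the table of sums. The remaining computation can be sketched as below; the critical value `t_crit = 2.571` is an assumption taken from a standard t-table (5 d.f., two-tailed 5%) and does not appear in the extract shown in these slides.

```python
from math import sqrt
from statistics import mean, stdev

diameters = [8.24, 8.26, 8.20, 8.28, 8.21, 8.23]   # mm, from the problem
n = len(diameters)

x_bar = mean(diameters)
s = stdev(diameters)      # sample s.d. with divisor n - 1 (actual data are given)
t_crit = 2.571            # assumed: standard t-table value for 5 d.f., two-tailed 5%

margin = t_crit * s / sqrt(n)
low, high = x_bar - margin, x_bar + margin

print(f"mean = {x_bar:.4f} mm, s = {s:.4f} mm")
print(f"95% C.I. = ({low:.3f}, {high:.3f}) mm")
```

Under these assumptions, the interval can be read as the range within which the true mean rod diameter is expected to lie with 95% confidence.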
Determination of Sample Size

➢ Sample size is the number of units taken from the population for the study.
➢ The larger the sample size, the smaller the chance (sampling) error, and vice versa.
➢ A sample represents the population completely only when the sample size equals the population size.
➢ A sample of any size can be taken, but it should always properly represent the population.
Sample Size for Estimating a Population Mean
Sample size: $n = \left( \dfrac{Z_\alpha \cdot \sigma}{e} \right)^2$, for an infinite population
where n = sample size
$\sigma$ = population standard deviation
E, e or d = permissible (allowable) error, which is the difference between the sample mean and the population mean
$Z_\alpha$ = significant value or critical value of Z corresponding to the α level of significance
Sample size: $n = \dfrac{(Z_\alpha \cdot \sigma)^2}{E^2 + \dfrac{(Z_\alpha \cdot \sigma)^2}{N}} = \dfrac{n_0}{1 + \dfrac{n_0}{N}}$, for a finite population, where $n_0 = \left( \dfrac{Z_\alpha \cdot \sigma}{E} \right)^2$ is the sample size for an infinite population

Note: In sample size determination, if the confidence level is not given, take 95%; for almost certainty, take $Z_\alpha = 3$.
Example: A manufacturing concern wants to estimate the average amount of purchase of its product in a month by its customers. If the standard deviation is Rs. 10, find the sample size if the maximum error is not to exceed Rs. 3 with a probability of 0.99.
Solution:
Standard deviation ($\sigma$) = Rs. 10, permissible error (e) = Rs. 3
Confidence level (1 − α) = 0.99 = 99%, α = 1%
Significant value ($Z_\alpha$) = 2.58
Sample size: $n = \left( \dfrac{Z_\alpha \cdot \sigma}{e} \right)^2 = \dfrac{(2.58)^2 \cdot (10)^2}{3^2} = 73.96 \approx 74$
Hence the required sample size is 74.
Example: A health officer wishes to estimate the mean hemoglobin level in a defined community. Preliminary information is that the mean is about 150 mg/dl with a standard deviation of 30 mg/dl. If a sampling error of up to 5 mg/dl in the estimate is to be tolerated, how many people should be included in the study at the 95% confidence level? If the community to be sampled has 1000 people, what should the sample size be?
Solution:
Mean hemoglobin level ($\bar{x}$) = 150 mg/dl, standard deviation ($\sigma$) = 30 mg/dl, allowable error (e) = 5 mg/dl,
Confidence level (1 − α) = 95%, population size (N) = 1000
Sample size: $n = \left( \dfrac{Z_\alpha \cdot \sigma}{e} \right)^2 = \dfrac{(1.96)^2 \cdot (30)^2}{5^2} = 138.30 \approx 138$
Hence the minimum sample size is 138.

When the population size is given, i.e. N = 1000,

$n = \dfrac{n_0}{1 + \dfrac{n_0}{N}} = \dfrac{138}{1 + \dfrac{138}{1000}} = 121.27 \approx 121$, where $n_0 = 138$ is the sample size found above.
Hence the minimum required sample size is 121 when the population size is 1000.
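The two sample-size formulas for a mean can be combined in one helper. This is a sketch with a hypothetical function name, `sample_size_mean`; unlike the worked solution it does not round the infinite-population value before applying the finite population adjustment, so the finite-population figure differs slightly from the hand calculation.

```python
def sample_size_mean(z, sigma, e, N=None):
    """(z * sigma / e)^2, optionally adjusted for a finite population of size N."""
    n0 = (z * sigma / e) ** 2          # infinite population
    if N is None:
        return n0
    return n0 / (1 + n0 / N)           # finite population adjustment


# Values from the hemoglobin example: Z = 1.96, sigma = 30, e = 5.
n_infinite = sample_size_mean(1.96, 30, 5)
n_finite = sample_size_mean(1.96, 30, 5, N=1000)

print(f"infinite population: n = {n_infinite:.2f}")
print(f"N = 1000:            n = {n_finite:.2f}")
```

The result is then rounded to a whole number of respondents.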
Sample Size for Estimating a Population Proportion
Sample size: $n = \left( \dfrac{Z_\alpha}{e} \right)^2 \cdot PQ = \dfrac{Z_\alpha^2 PQ}{e^2}$, for an infinite population
where E, d or e = permissible (allowable) error
$Z_\alpha$ = significant value or critical value of Z corresponding to the α level of significance
P = population proportion (if not given, we use $\hat{P} = p$)
p = sample proportion
P + Q = 1, p + q = 1
• $n = \dfrac{n_0}{1 + \dfrac{n_0}{N}}$, for a finite population, where $n_0 = \left( \dfrac{Z_\alpha}{e} \right)^2 \cdot PQ$ and N = population size
If the prevalence rate (P or p) is unknown, or no previous study is available, we can use P or p = 0.5 (50%).
Example: It is desired to estimate the proportion of children watching television on Saturday mornings in order to develop a promotional strategy for electronic games. We want to be 95% confident that our estimate will be within ±2% of the true population proportion.
(i) What sample size should we take if a previous survey showed that 40% of children watched television on Saturday mornings?
(ii) What would the sample size be for the same degree of confidence and the same maximum allowable error if no such previous survey had been taken?
Solution:
(i) Confidence level (1 − α) = 95%, α = 5%, $Z_\alpha = 1.96$
P = 0.40, Q = 1 − P = 1 − 0.40 = 0.60, error (e) = 0.02
Now we know that
sample size: $n = \dfrac{Z_\alpha^2 PQ}{e^2} = \dfrac{(1.96)^2 \times 0.4 \times 0.6}{(0.02)^2} = 2304.96 \approx 2305$
(ii) Since no previous study has been taken, we assume P = 0.5, so Q = 1 − 0.5 = 0.5
Now,
sample size: $n = \dfrac{Z_\alpha^2 PQ}{e^2} = \dfrac{(1.96)^2 \times 0.5 \times 0.5}{(0.02)^2} = 2401$
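Both parts of this example use the same formula with a different P. A minimal sketch, with a hypothetical helper name `sample_size_proportion`:

```python
def sample_size_proportion(z, P, e):
    """Sample size z^2 * P * Q / e^2 for an infinite population, with Q = 1 - P."""
    return z ** 2 * P * (1 - P) / e ** 2


# (i) previous survey gives P = 0.40; (ii) no prior information, so P = 0.50.
for P in (0.40, 0.50):
    n = sample_size_proportion(1.96, P, 0.02)
    print(f"P = {P:.2f}: n = {n:.2f}")
```

Since the product PQ is largest at P = 0.5, that choice gives the largest (most conservative) sample size.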
Question:
If the population proportion of success is 0.65 and n= 100, what will be
the value of sampling error when acceptance region is 0.95?
• A study of 1000 computer engineers conducted by their professional organization reported that 300 stated that their firm's greatest concern was to uplift the professional quality of work. In order to conduct a follow-up study to estimate the population proportion of computer engineers with this concern to within ±0.01 at the 99% confidence level, how many computer engineers would need to be surveyed?
Solution:
From the previous survey, the proportion of computer engineers who want to uplift the professional quality of work is $P = \dfrac{300}{1000}$, so $Q = 1 - P$.
Then use the formula
sample size: $n = \dfrac{Z_\alpha^2 PQ}{e^2}$
Question: A sample of 64 students appearing in an examination yields an error of 5 with a standard deviation of 4. Find the risk. If the sample size is increased to 144, how will the risk be affected, the standard deviation and error remaining the same?
Solution:
Sample size (n) = 64, error (E) = 5, sample s.d. (s) = 4, risk ($\alpha$) = ?
We have $n = \left( \dfrac{Z_\alpha \cdot \sigma}{e} \right)^2 = \left( \dfrac{Z_\alpha \cdot s}{e} \right)^2$
or, $64 = \left( \dfrac{Z_\alpha \cdot 4}{5} \right)^2$, or $Z_\alpha^2 = 100$, ∴ $Z_\alpha = \pm 10$
Now, P(−10 < Z < 10) = 1 (approximately)
and P(−10 < Z < 10) = 1 − α, ∴ 1 = 1 − α, so risk (α) = 0
[Figure: standard normal curve with the area 1 − α lying between −10 and +10.]

If n is increased to 144, we have $n = \left( \dfrac{Z_\alpha \cdot \sigma}{e} \right)^2 = \left( \dfrac{Z_\alpha \cdot s}{e} \right)^2$
or, $144 = \left( \dfrac{Z_\alpha \cdot 4}{5} \right)^2$, or $Z_\alpha^2 = 225$, ∴ $Z_\alpha = \pm 15$
Now, P(−15 < Z < 15) = 1 (approximately)
and P(−15 < Z < 15) = 1 − α, ∴ 1 = 1 − α, so risk (α) = 0
Hence, increasing the sample size from 64 to 144 does not affect the risk.
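The tail probabilities involved here can be computed directly. The sketch below, with a hypothetical helper name `risk`, uses the identity P(|Z| > z) = erfc(z / √2) for the standard normal distribution, which stays accurate for very large z.

```python
from math import erfc, sqrt


def risk(n, e, s):
    """Return (Z, alpha) implied by n = (Z * s / e)^2, with alpha two-tailed."""
    z = e * sqrt(n) / s
    alpha = erfc(z / sqrt(2))     # P(|Z| > z) for a standard normal variable
    return z, alpha


for n in (64, 144):
    z, alpha = risk(n, e=5, s=4)
    print(f"n = {n}: Z = {z:.0f}, risk = {alpha:.3e}")
```

The risk is not exactly zero, but it is so small in both cases that it is zero to any practical precision, which is consistent with the conclusion above.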
THANKYOU

54 pages
(BS 5760-23 - 1997) - Reliability of Systems, Equipment and Components. Guide To Life Cycle Costing
No ratings yet
(BS 5760-23 - 1997) - Reliability of Systems, Equipment and Components. Guide To Life Cycle Costing
22 pages
09 Domain Analysis Testing Examples - Done
No ratings yet
09 Domain Analysis Testing Examples - Done
6 pages
FluoFit Manual
No ratings yet
FluoFit Manual
48 pages
Multivariate GARCH Models: Software Choice and Estimation Issues
No ratings yet
Multivariate GARCH Models: Software Choice and Estimation Issues
21 pages
Discuss The Importance of Sampling When Carrying Out Educational Research
No ratings yet
Discuss The Importance of Sampling When Carrying Out Educational Research
7 pages
Mil STD 472
No ratings yet
Mil STD 472
176 pages

A population parameter is always a constant, whereas a sample

Sample number Sample values Sample mean (x̅)

Use of standard error:

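Mean: S.E. = $\dfrac{\sigma}{\sqrt{n}}$. For infinite population (if sample is drawn with replacement)

Mean: S.E. = $\dfrac{\sigma}{\sqrt{n}}\sqrt{\dfrac{N-n}{N-1}}$. For finite population

Mean: S.E. = $\dfrac{s}{\sqrt{n}}\sqrt{\dfrac{N-n}{N-1}}$. For finite population (sample is drawn without replacement)

Proportion: S.E. = $\sqrt{\dfrac{PQ}{n}}$. For infinite population, where P + Q = 1

Proportion: S.E. = $\sqrt{\dfrac{PQ}{n}}\sqrt{\dfrac{N-n}{N-1}}$. For finite population (N is given)

Proportion: S.E. = $\sqrt{\dfrac{pq}{n}}\sqrt{\dfrac{N-n}{N-1}}$. When P and Q are unknown but N is given

Difference of means: S.E. = $\sqrt{\dfrac{\sigma_1^2}{n_1} + \dfrac{\sigma_2^2}{n_2}}$. If $\sigma_1$ and $\sigma_2$ are known

Difference of means: S.E. = $\sqrt{S^2\left(\dfrac{1}{n_1} + \dfrac{1}{n_2}\right)}$. If $\sigma_1$ and $\sigma_2$ are not known, then the combined variance $S^2$ is used

Difference of proportions: S.E. = $\sqrt{\dfrac{P_1 Q_1}{n_1} + \dfrac{P_2 Q_2}{n_2}}$. If two population proportions are known

Difference of proportions: S.E. = $\sqrt{\hat{P}\hat{Q}\left(\dfrac{1}{n_1} + \dfrac{1}{n_2}\right)}$, where $\hat{Q} = 1 - \hat{P}$. If two population proportions are equal

Combined proportion: $\hat{P} = \dfrac{n_1 p_1 + n_2 p_2}{n_1 + n_2}$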
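As a rough illustration of the single-sample rows, the sketch below computes the standard error of a mean and of a proportion, with and without the finite population correction. It assumes the usual correction factor sqrt((N - n)/(N - 1)). The function names and the numbers in the usage lines are hypothetical and do not come from the slides.

```python
import math


def se_mean(sd, n, N=None):
    """Standard error of the sample mean.

    sd is the population SD (sigma), or the sample SD (s) when sigma is unknown.
    If the population size N is given, the finite population correction
    sqrt((N - n) / (N - 1)) is applied (assumed form of the correction).
    """
    se = sd / math.sqrt(n)
    if N is not None:
        se *= math.sqrt((N - n) / (N - 1))
    return se


def se_proportion(p, n, N=None):
    """Standard error of a proportion, with q = 1 - p."""
    q = 1 - p
    se = math.sqrt(p * q / n)
    if N is not None:
        se *= math.sqrt((N - n) / (N - 1))
    return se


# Hypothetical values, chosen only for illustration.
print(se_mean(10, 25))          # infinite population
print(se_mean(10, 25, N=100))   # finite population, N = 100
print(se_proportion(0.5, 100))  # infinite population
```

The correction factor is below 1 whenever n > 1, so the finite-population standard error is never larger than the infinite-population one.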

Quality Assurance Manager may be interested in

Manager of the credit department needs to estimate the

• Level of significance: The maximum size of probability assigned to tolerate in

Confidence Interval estimate of population Proportion from large sample:
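A minimal sketch of the usual large-sample interval, p ± z·sqrt(pq/n), is given below. It assumes the normal approximation and an infinite population. The function name and the counts (60 successes out of 200) are hypothetical and are not taken from the slides.

```python
import math
from statistics import NormalDist


def proportion_ci(x, n, confidence=0.95):
    """Large-sample confidence interval for a population proportion.

    x is the number of sample items having the characteristic and n is the
    sample size, so p = x / n and q = 1 - p.
    """
    p = x / n
    q = 1 - p
    # Two-tailed critical value of the standard normal distribution.
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    margin = z * math.sqrt(p * q / n)
    return p - margin, p + margin


# Hypothetical sample: 60 of 200 items have the characteristic.
low, high = proportion_ci(60, 200)
print(round(low, 4), round(high, 4))
```

When the population size N is given, the standard error inside the margin would also be multiplied by the finite population correction.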

• Confidence intervals using ‘t’: The general procedure for constructing

Level of significance for one-tailed test

When population size (N) is given, i.e. N = 1000
