IE101 Reviewer
IE101 Reviewer
union (∪)
intersection (∩)
complement (‘)
Variance
Standard 2
Deviation 𝜎 = √𝜎
Variance
Standard 2
Deviation 𝜎 = √𝜎
Discrete Probability Distribution
When to use a Distribution
Discrete Uniform → each outcomes have equal probability of
Distribution occurrence (equally likely outcomes)
→ n repeated trials
→ each trial has two possible outcomes
Binomial (success/failure)
Distribution → probability of success is same on all
trials
→ trials are independent
Negative → same w/ BD but experiment continues
Binomial (trials are performed) until a total of k
Distribution successes
Geometric → same w/ NBD but experiment stops
Distribution after the 1st success (k=1)
Multinomial → same with binomial but can have
Distribution multiple outcomes instead of two only
→ consists of finite population N
→ Each individual can be characterized as a
success/failure, and there are k
Hypergeometric successes in the population.
Distribution → sample of n individuals is selected from
N without replacement in such a way
that each subset of size n is equally likely
to be chosen
Multivariate → N items can be partitioned into cells
Hypergeometric → random variable has probability of being
Distribution selected from subtypes
a = lower limit
Parameters
b = upper limit
1
Probability Distribution Function 𝑃(𝑥) =
𝑏−𝑎+1
𝑥−𝑎+1
Cumulative Distribution Function 𝐹(𝑥) =
𝑏−𝑎+1
𝑎+𝑏
Mean E(X)
2
(𝑏−𝑎+1)2 −1
Variance
12
Binomial Distribution
n = number of attempts
p = probability of success per attempt
Parameters
q = (1-p) = probability of failure per attempt
x = number of successful attempts among n
Probability
Distribution 𝑃(𝑥) = 𝑛∁𝑥 × 𝑝 𝑥 × 𝑞 𝑛−𝑥
Function
Cumulative
Distribution 𝐹(𝑥) = ∑𝑥0 𝑛∁𝑥 × 𝑝 𝑥 × 𝑞 𝑛−𝑥
Function
Mean E(X) 𝑛𝑝
Variance 𝑛𝑝𝑞
Negative Binomial Distribution
k = number of successful attempts needed
p = probability of success per attempt
Parameters q = (1-p) = probability of failure per attempt
x = number of attempts, kth success should occur
here
Probability
Distribution 𝑃(𝑥) = (𝑥−1)∁(𝑘−1) × 𝑝 𝑘 × 𝑞 𝑥−𝑘
Function
𝑘
Mean E(X) 𝑝
𝑘(1−𝑝)
Variance
𝑝2
Geometric Distributions
p = probability of success per attempt
q = (1-p) = probability of failure per attempt
Parameters
x = number of attempts, 1st success should occur
here
Probability
Distribution 𝑃(𝑥) = 𝑝1 × 𝑞 𝑥−1
Function
1
Mean E(X) 𝑝
(1−𝑝)
Variance
𝑝2
Multinomial Distribution
Probability
Distribution
Function
Parameters
Hypergeometric Distribution
k = number of successes in the population
n = number of sample objects from the population
Parameters N = size of the population
x = number of K typed obj among the sampled n
obj
Probability 𝑁−
(𝑘 )( −𝑘 )
Distribution 𝑃(𝑥) = 𝑥 𝑛 𝑥
𝑁
( )
Function 𝑛
𝐾
Mean E(X) 𝑛
𝑁
𝐾 (𝑁−𝐾) 𝑁−𝑛
Variance 𝑛
𝑁 𝑁 𝑁−1
Poisson Distribution
Continuous Probability Distribution
Continuous Uniform Distribution
a = lower limit
Parameters
b = upper limit
1
Probability Distribution Function 𝑃(𝑥) =
𝑏−𝑎
𝑥−𝑎
Cumulative Distribution Function 𝐹(𝑥) =
𝑏−𝑎
𝑎+𝑏
Mean E(X)
2
(𝑏−𝑎)2
Variance
12
𝑨 𝑩 𝑪
( )( )( )
𝑷(𝑿, 𝒀) = 𝑿 𝒀 𝟐 − 𝑿 −𝒀
𝑨+𝑩+𝑪
( )
𝟐
Sampling Distribution
group that includes all the cases (individuals, objects,
Population or groups) in which the researcher is interested.
Size=N.
Sample A relatively small subset from a population. Size=n.
Each individual is chosen entirely by chance and
Random each member of the population has a known, but
sampling possibly non-equal, chance of being included in the
sample.
process of selecting a sample that allows individual
Simple in the defined population to have an equal and
Random independent chance of being selected for the
Sampling sample. It can be done using a computer, calculator,
or a table of random numbers
every Kth member (K is a ration obtained by
dividing the population size by the desired sample
Systematic
size) in the total population is chosen for inclusion
Random
in the sample after the first member of the sample is
sampling
selected at random from among the first K
members of the population.
There may often be factors which divide up the
Stratified
population into subpopulations (groups / strata) and
Random
we may expect the measurement of interest to vary
Sampling
among the different sub-populations.
entire population is divided into groups, or clusters,
Cluster and a random sample of these clusters are selected.
Sampling All observations in the selected clusters are included
in the sample.
characteristics of a population or, more specifically,
a
Parameters
target population. Parameters may also be termed
population values.
Central
Limit
Theorem
point
specific numerical value estimate of a parameter
estimate
difference between the expected value of the
bias estimator and the value of the parameter being
estimated.
standard error of an estimator is its approximated
standard deviation
standard
error
Sampling Distribution
Confidence Level - probability that the value of a parameter falls
within a specified range of values. It is percentage equivalent to the
decimal value of 1–α.
Confidence Intervals for the Mean (σ Known or n≥30)
Confidence Level
90% = 1.645
95% = 1.96
99% = 2.58
Maximum difference
between the point estimate
Maximum error of
of a parameter and the
estimate
actual value of the
parameter
Minimum size
needed for an
Interval Estimate
Minimum size
needed for an
Interval Estimate
Confidence Intervals for Variances and Standard Deviations
Variance
Standard Deviation
Prediction Intervals
σ2 Known
σ2 Unknown
Tolerance Intervals
𝑥̅ ± 𝑘𝑠
Hypothesis Testing
process that uses sample statistics to test a claim
hypothesis test
about the value of a population parameter
statistical verbal statement, or claim, about a population
hypothesis parameter
states that there is no difference between groups
null hypothesis
or no relationship between variables.
alternative states what you expect the data to show, based
hypothesis on your research on the topic.
type I error Rejection of the null hypothesis when it is true
Non-rejection of the null hypothesis when it is
type II error
false
Statistics Tests
t-test for sample mean
(σ Unknown or n<30)