Topic 4 - Prob Distributions
Topic 4 - Prob Distributions
Topic 4 - Prob Distributions
Joint
probabilities
The marginal probabilities for gender and brand
preference are calculated by adding the joint
probabilities across the rows and columns
◦ E.g., the event F, (respondent is female) is comprised of the
outcomes O1, O2, and O3, and therefore P(F) = P(F and B1) +
P(F and B2) + P(F and B3) = 0.37
Marginal
probabilities
Calculations of marginal probabilities leads to the
following probability rule:
P(Ai) = P(Ai and B1) + P(Ai and B2) + … + P(Ai and Bn)
Events F and M are mutually exclusive, as are events B1, B2, and B3
since a respondent may be only male or female and prefer exactly
one of the three brands. We can use Rule 3 to find, for example,
P(B1 or B2) = 0.34 + 0.23 = 0.57.
Events F and B1, however, are not mutually exclusive because a
respondent can be both female and prefer brand 1. Therefore, using
Rule 4, we have P(F or B1) = P(F) + P(B1) – P(F and B1) = 0.37 +
0.34 – 0.09 = 0.62.
Conditional probability is the probability of
occurrence of one event A, given that another
event B is known to be true or has already
occurred.
Suppose we know a respondent is male. What is the probability that
he prefers Brand 1?
Using cross-tabulation: Of 63 males, 25 prefer Brand 1, so the
probability of preferring Brand 1 given that a respondent is male =
25/63
Using joint probability table: divide the joint probability 0.25 (the
probability that the respondent is male and prefers brand 1) by the
marginal probability 0.63 (the probability that the respondent is male).
Apple Purchase History
The PivotTable shows the count of the
type of second purchase given that
each product was purchased first.
Probability of purchasing an
iPad given that a customer already
purchased an iMac = 2/13
The conditional probability of an event A given
that event B is known to have occurred is
P(B1|M) = 0.397
E[X] = p
Var[X] = p(1 − p)
The Bernoulli distribution can be used to model whether
an individual responds positively (x = 1), or negatively
(x = 0) to a telemarketing promotion.
For example, if you estimate that 20% of customers
contacted will make a purchase, the probability
distribution that describes whether or not a particular
individual makes a purchase is Bernoulli with p = 0.2
Models n independent replications of a Bernoulli experiment, each
with a probability p of success.
◦ X represents the number of successes in these n experiments
Probability mass function:
Excel function:
=BINOM.DIST(number_s, trials, probability_s, cumulative)
If cumulative is set to TRUE, then this function will provide
cumulative probabilities; otherwise the default is FALSE, and it
provides values of the probability mass function, f(x).
The probability that exactly 3 of 10 individuals will make
a purchase is P(x = 3): =BINOM.DIST(3,10,0.2,TRUE) =
0.20133
The probability that 3 or fewer of 10 individuals will make
a purchase is P(x ≤ 3): =BINOM.DIST(3,10,0.2,FALSE)
= 0.87913
The binomial distribution is symmetric when
p = 0.5; positively skewed when p < 0.5,
and negatively skewed when p > 0.5.
Example of
negatively-skewed
distribution
Models the number of occurrences in some unit of
measure (often time or distance).
There is no limit on the number of occurrences.
The average number of occurrence per unit is a constant
denoted as λ.
Probability mass function:
Mean = = 1/l
Excel function:
◦ =EXPON.DIST(x, lambda, cumulative)
If the number of events occurring
during an interval of time has a
Poisson distribution, then the
time between events is
exponentially distributed.
The mean time to failure of a critical engine component is µ = 8,000
hours. What is the probability of failing before 5000 hours?
P(X < x) =EXPON.DIST(x, lambda, cumulative)
λ = 1/8000
P(X < 5000) =EXPON.DIST(5000, 1/8000, TRUE)
= 0.4647
Triangular Distribution
Lognormal Distribution
Beta Distribution
A random number is one that is uniformly
distributed between 0 to 1.
Excel function: =RAND( )
Probability distribution Intervals for random sampling
◦ It is not clear what the distribution might be. It does not appear to
be exponential, but it might be lognormal or another distribution.
A better approach that simply visually examining a
histogram and summary statistics is to analytically fit the
data to the best type of probability distribution.
Three statistics measure goodness of fit:
◦ Chi-square (need at least 50 data points)
◦ Kolmogorov-Smirnov (works well for small samples)
◦ Anderson-Darling (puts more weight on the differences between
the tails of the distributions)
Analytic Solver Platform has the capability of fitting a
probability distribution to data.
1. Highlight the data
Analytic Solver Platform >
Tools > Fit
2. Fit Options dialog
Type: Continuous
Test: Kolmorgov-Smirnov
Click Fit button
The best-fitting distribution is called an Erlang
distribution.