Session 4-6
Session 4-6
Session 4-6
Probability Distributions
• Earlier, we understood about the frequency distribution to summarize the variation in the
observed data.
• A probability distributions can be thought as a theoretical frequency distribution, describing
how outcomes are expected to vary in a random experiment.
• It is a listing of probabilities of all possible outcomes that could result if the experiment were
done.
Example: You have a fair coin (equal probability of heads or tails). You will toss it two times.
Formulate a probability distribution of the number of heads that could possibly come
when we toss the coin twice.
Probability Distribution
Example: Possible outcomes from two tosses of a fair coin
First Toss Second Toss Number of Heads on Probability of the Four
two Tosses Possible Outcomes
H H 2 0.5
H T 1 0.5
T H 1 0.5
T T 0 0.5
Total = 1.00
NOTE: The outcomes in the table is not actual outcomes of tossing a fair coin but it is
theoretical outcome. That means it presents the way in which we expect two toss coin
experiment to behave over time.
Probability Distribution
Example: Probability Distribution of the possible number of heads from two tosses of a fair coin
Number of Tosses Probability of Outcome
Heads H P(H)
0 (T, T)
1 (T, H) + (H, T) 0.50
2 (H, H)
NOTE: The outcomes in the table is not actual outcomes of tossing a fair coin but it is
theoretical outcome. That means it presents the way in which we expect two toss coin
experiment to behave over time.
Frequency Distribution vs Probability
Distribution
• Frequency Distribution is listing of all the frequencies of all the outcomes of an experiment that
actually occurred when the experiment was done.
• Probability distribution is listing of the probabilities of all the possible outcomes that could result
if the experiment was done.
Note: Probability distribution can be based on theoretical consideration (toss of a fair dice) or
depending on the subjective judgement of the likelihood of certain outcomes based on experience.
Defining Random Variables
A variable is random, if it takes on different values as a result of the outcomes of a random
experiment.
Note:
• The probability that the random variable takes a given value can be computed using the rules
governing probability.
• For example, the probability that means we get 1 head when we tossed the coin for two
times is 0.50. Symbolically, it is denoted as P(X=1) = 0.50.
Types of Random Variables
• A random variable can be either discrete or continuous:
A discrete random variable can assume a countable number of values.
Example: Number of steps to the top of the Eiffel Tower*
A continuous random variable can assume any value along a given interval of a number line.
Example: The time a tourist stays at the top once s/he gets there
*Believe it or not, the answer ranges from 1,652 to 1,789. See Great Buildings
Types of Random Variables
Discrete random variables
• Number of sales
• Number of calls
• People in line
• Mistakes per page
In a Continuous Probability Distribution, the variable under consideration is allowed to take any
value within a given range, all values cannot be listed.
Weight of the students in your class
Note: Continuous distribution are very convenient ways to represent discrete distribution that have
many possible outcomes, all very close to each other.
Discrete Probability Distributions
• The discrete probability distribution (probability distribution of a discrete random
variable X) is a list of each possible value of X together with the probability that X takes in
one trial of the experiment.
• The probabilities in the probability distribution of a random variable X must satisfy the
following two conditions:
Each probability P(x) must be between 0 and 1:
1 ≥ p(x) ≥ 0
The sum of all the possible probabilities is 1:
p(x) = 1
10
Discrete Probability Distributions
Example: Probability Distribution of the Possible number of heads from two tosses of a fair coin
0 (T, T)
1 (T, H) + (H, T) 0.50
2 (H, H)
Expected Values of Discrete Random Variables
12
•Construct a probability distribution based
on the following frequency distribution.
• Draw a graph of the hypothetical Outcome 102 105 108 111 114 117
probability distribution.
• Compute the expected value of the Frequency 10 20 45 15 20 15
outcome.
Outcome Frequency P(Outcome)
(1) (2) (3) (1) (3)
If Walters purchases stock whenever the expected rate of return exceeds 10 percent, will he purchase the stock, according to
these data?
Return P(Return)
(1) (2) (1) (2)
0 0.20 0.00
10 0.25 2.50
15 0.30 4.50
25 0.15 3.75
50 0.10 5.00
Bob will purchase the stock because the expected return of $15.75 is greater than 10 percent of
the $150 purchase price.
Taxonomy of Probability Distributions
Discrete probability distributions
•Binomial distribution
•Multinomial distribution
•Poisson distribution
•Hypergeometric distribution
19
Binomial Distribution
• In many situations, an experiment has only two outcomes: success and failure.
• Such outcome is called dichotomous outcome.
• An experiment when consists of repeated trials, each with dichotomous outcome is called
Bernoulli process. Each trial in it is called a Bernoulli trial.
• The binomial distribution describes discrete probability distribution, resulting from a
Bernoulli process experiment.
• Conditions/Requirements of Bernoulli process:
Each trial has only two possible (dichotomous) outcomes.
Probability of the outcome of any trial remains fixed over time.
Trials are statistically independent.
Binomial Distribution
Example: Tossing a fair coin 3 times
Each trial has only two possible
(dichotomous) outcomes. Outcomes are Heads or Tails
Probability of the outcome of any trial
remains fixed over time. Coin is fair
Trials are statistically independent. A head on flip i doesn’t change P(H)
of flip i + 1
21
Binomial Distribution
Example: Firing bullets to hit a target.
• Suppose, in a Bernoulli process, we define a random variable X number of successes in
trials.
• Such a random variable obeys the binomial probability distribution as follows:
The experiment consists of n trials.
Each trial results in one of two mutually exclusive outcomes, "success” and “failure”.
The probability of a success on a single trial is equal to . The value of remains constant
throughout the experiment.
The trials are independent.
22
Binomial Distribution
• Binomial distribution is the discrete probability distribution of the number of successes in
a sequence of n independent dichotomous experiments, each of which yields success with
probability p.
• Binomial distribution is frequently used to model the number of successes in a sample of
size n drawn with replacement from a population of size N.
• Parameters of binomial distribution:
n (number of trials)
p (probability of success)
q = 1-p (probability of failure)
r number of successes desired
• The function for computing the probability of r successes in n trials using the binomial
formula is given by
for r = 0, 1, 2, …., n
Binomial Distribution Example
Example: Tossing a coin
X = number of heads a success
p = 0.5, the probability that we get head when we toss a coin
n = 2, number of times the coin is tossed.
Thus,
24
Suppose 40% of the class is female.
Question What is the probability that 6 of the first 10
students walking in will be female?
25
Binomial Distribution Problem
Problem: Suppose 40% of the class is female.
What is the probability that 6 of the first 10 students walking in will be female?
Solution:
26
Binomial Distribution
• The mean (expected value), variance, and standard deviation of a Binomial Distribution
is as follows:
Mean
Variance
Standard Deviation
27
Binomial Distribution Example
Example: Consider a packaging machine that produces 20 percent defective packages. Take a
random sample of 10 packages and find the mean and standard deviation of the process.
Solution:
28
The latest nationwide political poll indicates that for Americans who are randomly selected, the probability that they are conservative is
0.55, the probability that they are liberal is 0.30, and the probability that they are middle-of-the-road is 0.15. Assuming that
these probabilities are accurate, answer the following questions pertaining to a randomly chosen group of 10 Americans. (Do
not use Appendix Table 3.)
(a) What is the probability that four are liberal?
(b) What is the probability that none are conservative?
(c) What is the probability that two are middle-of-the-road?
(d) What is the probability that at least eight are liberal?
(a) n 10, p 0.30, P(r 4) (0.30)4(0.70)6 0.2001
The probability distribution of the Poisson random variable , representing the number of outcomes
occurring in per unit time is
where mean number of occurrences in the given unit of time, area, volume, etc. and
• Poisson as an approximation of the binomial distribution: when n >= 20 and p <= 0.05
such that their product, is finite.
The Poisson Distribution Example
Example: Suppose in a given stream there are an average of 3 striped trout per 100 yards.
What is the probability of seeing 5 striped trout in the next 100 yards, assuming a Poisson
distribution?
34
The Poisson Distribution Example
Example: Suppose in a given stream there are an average of 3 striped trout per 100 yards.
• What is the probability of seeing 5 striped trout in the next 100 yards, assuming a Poisson
distribution?
• How about in the next 50 yards, assuming a Poisson distribution?
Solution: Since the distance is only half as long, λ is only half as large.
35
Given λ 4.2, for a Poisson distribution, find
(a) P(x 2).
(b) P(x 5).
(c) P(x 8).
(a) P(x 2) P(x 0) + P(x 1) + P(x 2)
(c)
Continuous Probability Distributions
38
The Normal Distribution…
The normal distribution is the most important of all probability distributions. The
probability density function of a normal random variable is given by:
– The mathematical equation for the probability distribution of the normal variable depends
upon the two parameters and , its mean and standard deviation.
The station manager has just opened the station for business and notes that there
is exactly 1,100 gallons of regular gasoline in storage.
The next delivery is scheduled later today at the close of business. The manager
would like to know the probability that he will have enough regular gasoline to
satisfy today’s demands.
X 1,100 1,000
P(X < 1,100) = P = P(Z < 1.00)
100
Consequently, we need only one table, the standard normal probability table.
That is, this table lists cumulative probabilities P(Z < z) for values of z ranging
from −3.09 to +3.09
For example, we find the probability that Z is greater than 1.80 by determining
the probability that Z is less than 1.80 and subtracting that value from 1.
P(X < 0)
P(X < 0) =
= .1587