Mod 2 Stats
Mod 2 Stats
● Vaiance
● Standard deviation
Graphical representation
of PDF
● Pdf of r.v. X shown graphically as histogram
Bernoulli Distribution
● A r.v. X taking value in {0,1} (0,1) is said to follow the Bernoulli distribution, denoted
by X Ber (p)
● If it describes the binary outcome 0 (failure) or 1 (success), with probability of
success p or P(X=1)=p
● The probability distribution function (pdf) is P( X=x ; p)= p x ¿
- Where x=0 or 1
● The bernoulli distribution has one parameter
- P: probability of success p ϵ (0 , 1) (ie. 0 ≤ p ≤ 1 ¿
- E(X)=p; Var(X)=p(1-p)
- Apparently, bernoulli distribution is a special case of
binomial distribution where the number of trials n=1
random variable: a r.v. that takes discrete values. Discrete r.v. typically counts - Eg.
number of kids in a household, number of successes in n trials
● Continuous random variable: a r.v. that takes values on (part of) the real line. - Eg.
height of soldiers, inflation rates, waiting time in a queue
→ Scores add up to 1
Probability
density
function
● continuous probability distribution for X is defined via the means of probability density
function (pdf) which assigns a positive value to possible outcomes of X such that the
density is integrated to 1 (this means that the area under the curve is 1). The
probability that X lies between two numbers is the area under pdf function between
those numbers
Inequaltiies implications
●
2
x distribution
● x 2 distribution is a continuous distribution characterised by one parameter dof v. It is
asymmetric and takes only positive values (because variance is always positiv)
2
x
● PDF of x 2 for different dof v. Note that the mean of the r.v. (n−1) s is v=n-1
σ 2x
Lecture 6: point and Interval Estimates
Confidence Interval
Statistical inference
● Given different samples, we have different sample statistics, so different point
estimates
● Sampling error is the discrepancy between sample statistics and (unobserved or
2
unmeasurable) population statistics (eg. difference between X and μ x or s and σ 2x )
x
Confidence interval
● Based on sampling distributions of a sample
statistic, we can construct an interval (an
upper and lower limit) called confidence
interval, to pin down a possible range into
which the true but unknown population
parameter may fall
● Based on sampling distribution, we construct confidence intervals for:
- μ x when the population standard deviation σ x is known; μ x when σ x is
unknown)
2
- σx
- σx