lec1
lec1
Estimation theory.
1.1 Introduction
Let us consider a set X (probability space) which is the set of possible values that
some random variables (random object) may take. Usually X will be a subset of ,
for example {0, 1}, [0, 1], [0, ∞), , etc.
I. Parametric Statistics.
We will start by considering a family of distributions on X :
• { θ , θ ∈ Θ}, indexed by parameter θ. Here, Θ is a set of possible parameters
from the above family, for some θ0 ∈ Θ, and suppose that θ0 is unknown. In this
setting we will study the following questions.
1. Estimation Theory.
Based on the observations X1 , · · · , Xn we would like to estimate unknown pa-
rameter θ0 , i.e. find θ̂ = θ̂(X1 , · · · , Xn ) such that θ̂ approximates θ0 . In this
case we also want to understand how well θ̂ approximates θ0 .
1
LECTURE 1. ESTIMATION THEORY. 2
2. Hypothesis Testing.
Decide which of the hypotheses about θ0 are likely or unlikely. Typical hypothe-
ses:
• θ0 = θ1 ? for some particular θn ?
• θ0 ≷ θ1
• θ0 6= θ1
Example: In a simple yes/no vote (or two candidate vote) our variable (vote)
can take two values, i.e. we can take the space X = {0, 1}. Then the distribution is
described by
(1) = p, (0) = 1 − p
for some parameter p ∈ Θ = [0, 1]. The true parameter p0 is unknown. If we conduct
a poll by picking n people randomly and if X1 , · · · , Xn are their votes then:
1.Estimation theory. What is a natural estimate of p0 ?
#(10 s among X1 , · · · , Xn )
p̂ = ∼ p0
n
How close is p̂ to p0 ?
2. Hypothesis testing. How likely or unlikely are the following:
1
• Hypothesis 1: p0 > 2
1
• Hypothesis 2: p0 < 2
II. Non-parametric Statistics
In the second part of the class the questions that we will study will be somewhat
different. We will still assume that the observations X = (X1 , · · · , Xn ) have unknown
distribution , but we won’t assume that comes from a certain parametric family
{ θ , θ ∈ Θ}. Examples of questions that may be asked in this case are the following:
θ, θ ∈ Θ}?
• Is =
0?