
Bayesian Inference

by Hoai Nam Nguyen


September 9, 2017

The setting is the same. Given a population that follows a distribution $P$, where $P$ contains one or more unknown parameters, we want to construct an estimator for each of them. In this course, I consider the simple case, where there is only one unknown parameter $\theta$. To do this, we proceed by collecting an i.i.d. sample $X_1, \dots, X_n \sim P$.

Similar to Maximum Likelihood Estimation, we first find the likelihood function $L(\theta)$:

$$L(\theta) = f_{X_1,\dots,X_n}(x_1,\dots,x_n \mid \theta)$$

In Bayesian inference, we treat the parameter $\theta$ as a random variable. That is, $\theta$ follows a probability distribution with pdf $\pi(\theta)$. We call $\pi(\theta)$ the prior distribution of $\theta$.

By Bayes's formula, we have

$$\pi(\theta \mid x_1,\dots,x_n) = \frac{f_{X_1,\dots,X_n}(x_1,\dots,x_n \mid \theta)\,\pi(\theta)}{f_{X_1,\dots,X_n}(x_1,\dots,x_n)} \propto f_{X_1,\dots,X_n}(x_1,\dots,x_n \mid \theta)\,\pi(\theta)$$

where $\pi(\theta \mid x_1,\dots,x_n)$ is the pdf of $\theta$ given the sample data. This is called the posterior distribution of $\theta$.

Let me clarify the last step further. The symbol $\propto$ means "proportional to". Since the left-hand side is the distribution of $\theta$ conditional on the sample data $\{x_1,\dots,x_n\}$, all the $x_i$ are assumed to be known, and the denominator $f_{X_1,\dots,X_n}(x_1,\dots,x_n) = \int f_{X_1,\dots,X_n}(x_1,\dots,x_n \mid \theta)\,\pi(\theta)\,d\theta$ does not involve $\theta$; it is, therefore, no more than a constant.

In this setting, we are given the population distribution $P$ and the prior distribution $\pi(\theta)$. We have to find the posterior distribution $\pi(\theta \mid x_1,\dots,x_n)$. We then use the posterior mean $E[\theta \mid x_1,\dots,x_n]$ to estimate the unknown parameter $\theta$. That is,

$$\hat{\theta} = E[\theta \mid x_1,\dots,x_n]$$

NOTE: when calculating $\pi(\theta \mid x_1,\dots,x_n)$, always use proportionality by removing constants, because this will simplify the calculation a lot.
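This whole recipe can be checked numerically. The sketch below is my own illustration, not part of the original notes: it evaluates an unnormalised posterior on a grid (so the constant we drop genuinely does not matter), normalises it, and takes the grid mean. The Bernoulli likelihood, uniform prior, and sample are all assumed purely for concreteness.

```python
import numpy as np

# Grid of candidate parameter values on (0, 1)
theta = np.linspace(0.001, 0.999, 1000)
dtheta = theta[1] - theta[0]

# Hypothetical observed Bernoulli sample (assumed for illustration)
x = np.array([1, 0, 1, 1, 0, 1, 1, 1])
n, s = len(x), x.sum()

# Unnormalised posterior: likelihood times prior, with constants dropped
unnorm_post = theta**s * (1 - theta)**(n - s)   # Uniform(0, 1) prior = 1

# Normalise on the grid, then take the posterior mean
post = unnorm_post / (unnorm_post.sum() * dtheta)
theta_hat = (theta * post).sum() * dtheta

print(theta_hat)   # approximately (s + 1) / (n + 2) = 0.7
```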

Example 1

The population distribution is $\mathrm{Bernoulli}(p)$, where $p \sim \mathrm{Uniform}(0,1)$. Use Bayesian inference to construct an estimator $\hat{p}$.

The likelihood function is given by:

$$L(p) = \prod_{i=1}^{n} f_{X_i}(x_i \mid p) = \prod_{i=1}^{n} p^{x_i}(1-p)^{1-x_i} = p^{\sum x_i}(1-p)^{n-\sum x_i}$$

The pdf of the prior distribution is $\pi(p) = 1$, for $0 < p < 1$.

Therefore, the posterior distribution is given by:

$$\pi(p \mid x_1,\dots,x_n) \propto f_{X_1,\dots,X_n}(x_1,\dots,x_n \mid p)\,\pi(p) = p^{\sum x_i}(1-p)^{n-\sum x_i}, \quad \text{for } 0 < p < 1$$

Recall the pdf of $\mathrm{Beta}(\alpha, \beta)$:

$$f_X(x) = \frac{\Gamma(\alpha+\beta)}{\Gamma(\alpha)\Gamma(\beta)}\, x^{\alpha-1}(1-x)^{\beta-1}, \quad \text{for } 0 < x < 1$$

By comparing exponents ($\alpha - 1 = \sum x_i$ and $\beta - 1 = n - \sum x_i$), we can see that the posterior distribution of $p$ is $\mathrm{Beta}\!\left(\sum x_i + 1,\; n - \sum x_i + 1\right)$.


We know that the expectation of $\mathrm{Beta}(\alpha, \beta)$ is $\frac{\alpha}{\alpha+\beta}$. Therefore, the posterior mean is given by:

$$E[p \mid x_1,\dots,x_n] = \frac{\sum x_i + 1}{n+2}$$

Thus, $\hat{p} = \frac{\sum X_i + 1}{n+2}$ is the Bayesian estimator for $p$.

Note that we used proportionality when calculating the posterior distribution. By comparing with the pdf of $\mathrm{Beta}(\alpha, \beta)$, we can easily recover the missing constant:

$$c = \frac{\Gamma(\alpha+\beta)}{\Gamma(\alpha)\Gamma(\beta)} = \frac{\Gamma(n+2)}{\Gamma\!\left(\sum x_i + 1\right)\Gamma\!\left(n - \sum x_i + 1\right)}$$
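As a quick check on Example 1, here is a small sketch of mine (the sample is assumed): it computes $\hat{p}$ from the closed form and compares it with the mean of the $\mathrm{Beta}(\sum x_i + 1,\, n - \sum x_i + 1)$ posterior as computed by scipy.

```python
import numpy as np
from scipy import stats

# Hypothetical Bernoulli sample (assumed for illustration)
x = np.array([1, 1, 0, 1, 0, 1, 1, 1, 0, 1])
n, s = len(x), x.sum()

# Closed-form Bayesian estimator under the Uniform(0, 1) prior
p_hat = (s + 1) / (n + 2)

# The same quantity via the Beta(s + 1, n - s + 1) posterior
posterior = stats.beta(s + 1, n - s + 1)

print(p_hat, posterior.mean())   # both equal (s + 1) / (n + 2)
```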

Example 2

Same as Example 1, except that $p \sim \mathrm{Beta}(a, b)$, where both $a$ and $b$ are given constants.

The likelihood function stays unchanged:

$$L(p) = p^{\sum x_i}(1-p)^{n-\sum x_i}$$

The pdf of the prior distribution is given by:

$$\pi(p) = \frac{\Gamma(a+b)}{\Gamma(a)\Gamma(b)}\, p^{a-1}(1-p)^{b-1}, \quad \text{for } 0 < p < 1$$

Therefore, the pdf of the posterior distribution is given by:

$$\pi(p \mid x_1,\dots,x_n) \propto f_{X_1,\dots,X_n}(x_1,\dots,x_n \mid p)\,\pi(p) \propto p^{\sum x_i}(1-p)^{n-\sum x_i} \cdot p^{a-1}(1-p)^{b-1} = p^{\sum x_i + a - 1}(1-p)^{n-\sum x_i + b - 1}, \quad \text{for } 0 < p < 1$$

We recognise this as $\mathrm{Beta}\!\left(\sum x_i + a,\; n - \sum x_i + b\right)$.
The posterior mean is $E[p \mid x_1,\dots,x_n] = \frac{\sum x_i + a}{n+a+b}$. The Bayesian estimator for $p$ is given by:

$$\hat{p} = \frac{\sum X_i + a}{n+a+b}$$

Again, you can recover the normalising constant in the pdf of the posterior distribution:

$$c = \frac{\Gamma(n+a+b)}{\Gamma\!\left(\sum x_i + a\right)\Gamma\!\left(n - \sum x_i + b\right)}$$
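A useful way to read this result: the $\mathrm{Beta}(a, b)$ prior behaves like $a$ pseudo-successes and $b$ pseudo-failures added to the data, so the posterior mean sits between the sample mean and the prior mean. A small sketch of mine (sample and hyperparameters assumed) makes the shrinkage visible.

```python
import numpy as np

# Hypothetical data and prior hyperparameters (assumed for illustration)
x = np.array([1, 1, 1, 0, 1])
a, b = 2.0, 2.0                   # Beta(2, 2) prior, prior mean 0.5
n, s = len(x), x.sum()

sample_mean = s / n               # MLE: 0.8
prior_mean = a / (a + b)          # 0.5
p_hat = (s + a) / (n + a + b)     # Bayesian estimator: 6/9 = 0.667

# The posterior mean lies between the prior mean and the sample mean
print(prior_mean, p_hat, sample_mean)
```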

Example 3

The population distribution is $N(\theta, \sigma^2)$, where $\theta$ is unknown and $\sigma^2$ is known. The parameter $\theta$ follows a prior distribution $N(\mu, \tau^2)$, where both $\mu$ and $\tau^2$ are given constants. Use Bayesian inference to construct an estimator $\hat{\theta}$.

The likelihood function is given by:

$$L(\theta) = \prod_{i=1}^{n} f_{X_i}(x_i \mid \theta) = \prod_{i=1}^{n} \frac{1}{\sqrt{2\pi\sigma^2}} \exp\!\left(-\frac{(x_i-\theta)^2}{2\sigma^2}\right) \propto \prod_{i=1}^{n} \exp\!\left(-\frac{(x_i-\theta)^2}{2\sigma^2}\right), \ \text{because } \sigma^2 \text{ is known}$$

Also, the pdf of the prior distribution is given by:

$$\pi(\theta) = \frac{1}{\sqrt{2\pi\tau^2}} \exp\!\left(-\frac{(\theta-\mu)^2}{2\tau^2}\right) \propto \exp\!\left(-\frac{(\theta-\mu)^2}{2\tau^2}\right), \ \text{because } \tau^2 \text{ is known}$$
Then, calculate the pdf of the posterior distribution:

$$\pi(\theta \mid x_1,\dots,x_n) \propto \prod_{i=1}^{n} \exp\!\left(-\frac{(x_i-\theta)^2}{2\sigma^2}\right) \cdot \exp\!\left(-\frac{(\theta-\mu)^2}{2\tau^2}\right)$$

$$= \exp\!\left(-\frac{1}{2\sigma^2}\sum_{i=1}^{n}(x_i-\theta)^2\right) \exp\!\left(-\frac{1}{2\tau^2}(\theta^2 - 2\theta\mu + \mu^2)\right)$$

$$\propto \exp\!\left(-\frac{1}{2\sigma^2}\sum_{i=1}^{n}(x_i-\theta)^2\right) \exp\!\left(-\frac{1}{2\tau^2}(\theta^2 - 2\theta\mu)\right), \ \text{by removing } \exp\!\left(-\frac{\mu^2}{2\tau^2}\right)$$

$$= \exp\!\left(-\frac{1}{2\sigma^2}\sum_{i=1}^{n}(x_i^2 - 2\theta x_i + \theta^2)\right) \exp\!\left(-\frac{1}{2\tau^2}(\theta^2 - 2\theta\mu)\right)$$

$$\propto \exp\!\left(-\frac{1}{2\sigma^2}\sum_{i=1}^{n}(-2\theta x_i + \theta^2)\right) \exp\!\left(-\frac{1}{2\tau^2}(\theta^2 - 2\theta\mu)\right), \ \text{by removing } \exp\!\left(-\frac{1}{2\sigma^2}\sum_{i=1}^{n} x_i^2\right)$$

$$= \exp\!\left(\frac{\theta}{\sigma^2}\sum_{i=1}^{n} x_i - \frac{n\theta^2}{2\sigma^2}\right) \exp\!\left(-\frac{1}{2\tau^2}(\theta^2 - 2\theta\mu)\right)$$

$$= \exp\!\left[-\frac{1}{2}\left(\frac{n}{\sigma^2} + \frac{1}{\tau^2}\right)\theta^2 + \left(\frac{1}{\sigma^2}\sum_{i=1}^{n} x_i + \frac{\mu}{\tau^2}\right)\theta\right]$$

$$= \exp(A\theta^2 + B\theta), \quad \text{where } A = -\frac{1}{2}\left(\frac{n}{\sigma^2} + \frac{1}{\tau^2}\right) < 0 \ \text{ and } \ B = \frac{1}{\sigma^2}\sum_{i=1}^{n} x_i + \frac{\mu}{\tau^2}$$

Completing the square in $\theta$:

$$\exp(A\theta^2 + B\theta) = \exp\!\left(\frac{\theta^2 + (B/A)\theta}{1/A}\right) \propto \exp\!\left(\frac{\theta^2 + (B/A)\theta + B^2/(4A^2)}{1/A}\right) = \exp\!\left(\frac{\left(\theta + B/(2A)\right)^2}{1/A}\right)$$

Comparing with the pdf of a Normal distribution, which is proportional to $\exp\!\left(-\frac{(\theta - m)^2}{2s^2}\right)$, we deduce that the posterior distribution of $\theta$ is given by:

$$\theta \mid x_1,\dots,x_n \sim N\!\left(-\frac{B}{2A},\; -\frac{1}{2A}\right)$$

Clearly, $E[\theta \mid x_1,\dots,x_n] = -\frac{B}{2A}$. Therefore, $\hat{\theta} = -\frac{B}{2A}$ is the Bayesian estimator for $\theta$.
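Written out, $\hat{\theta} = -\frac{B}{2A} = \dfrac{\sum x_i/\sigma^2 + \mu/\tau^2}{n/\sigma^2 + 1/\tau^2}$, a precision-weighted average of the data and the prior mean. The sketch below, with an assumed sample and assumed hyperparameters, computes the estimator both ways.

```python
import numpy as np

# Hypothetical data and known constants (assumed for illustration)
x = np.array([4.8, 5.3, 5.1, 4.6, 5.2])
sigma2 = 1.0            # known population variance
mu, tau2 = 4.0, 2.0     # prior mean and prior variance
n = len(x)

# A and B exactly as defined in the derivation above
A = -0.5 * (n / sigma2 + 1 / tau2)
B = x.sum() / sigma2 + mu / tau2

theta_hat = -B / (2 * A)

# Equivalent precision-weighted-average form
theta_hat2 = (x.sum() / sigma2 + mu / tau2) / (n / sigma2 + 1 / tau2)

print(theta_hat, theta_hat2)   # identical values
```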

Example 4

Consider the following types of treatment:

Treatment 1: 100% of the patients are cured (3 out of 3)

Treatment 2: 95% of the patients are cured (19 out of 20)

Treatment 3: 90% of the patients are cured (90,000 out of 100,000)

Which one is the best?

Treatment 1 cured 100% of the patients, but the sample was so small that we should cast doubt on the result. On the other hand, Treatment 3's sample size was very reassuring, but the cure rate was a bit lower.

Let $p$ be the probability that a patient is cured. Then, the probability that a patient is not cured is $1 - p$.

Therefore, the population follows $\mathrm{Bernoulli}(p)$, where $p$ is an unknown parameter.

In Example 1, we found that $\hat{p} = \frac{\sum x_i + 1}{n+2}$ provided an estimate for $p$.
Treatment 1: $\hat{p} = \frac{3+1}{3+2} = \frac{4}{5} = 0.8$

Treatment 2: $\hat{p} = \frac{19+1}{20+2} = \frac{20}{22} \approx 0.909$

Treatment 3: $\hat{p} = \frac{90000+1}{100000+2} = \frac{90001}{100002} \approx 0.9$

We can see that $\hat{p}$ for Treatment 2 is the highest. Therefore, we predict that Treatment 2 is the best one. Treatment 1, despite curing everyone in the sample, is predicted to be the worst due to its small sample size.
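These three estimates are easy to reproduce with the formula from Example 1; nothing below is assumed beyond the counts given above.

```python
# Bayesian estimate from Example 1: (cured + 1) / (n + 2)
treatments = {"Treatment 1": (3, 3),
              "Treatment 2": (19, 20),
              "Treatment 3": (90_000, 100_000)}

for name, (cured, n) in treatments.items():
    p_hat = (cured + 1) / (n + 2)
    print(f"{name}: p_hat = {p_hat:.4f}")

# Treatment 2 comes out highest, matching the conclusion above
```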
