Lecture 31: Bayesian Logistic Regression
The instructor of this course owns the copyright of all the course materials. This lecture
material was distributed only to the students attending the course MTH511a: “Statistical
Simulation and Data Analysis” of IIT Kanpur, and should not be distributed in print or
through electronic media without the consent of the instructor. Students can make their own
copies of the course materials for their use.
A popular Bayesian model is Bayesian logistic regression. In this lecture we
present the model and use it to analyze the Titanic dataset from the exam.
Let $x_i \in \mathbb{R}^p$ be the vector of covariates for the $i$th observation and $\beta \in \mathbb{R}^p$ be the corresponding vector of regression coefficients. Suppose response $y_i$ is a realization of $Y_i$ with
$$Y_i \mid x_i, \beta \sim \text{Bern}(p_i) \quad \text{where} \quad p_i = \frac{\exp(x_i^T \beta)}{1 + \exp(x_i^T \beta)}.$$
Since this is a Bayesian model, we also assume that $\beta$ has the following prior distribution:
$$\beta \sim N_p(0, I_p).$$
Our goal is to find the posterior distribution and report the posterior mean and credible
intervals of $\beta$. In order to do this, the first thing we do is write down the posterior
distribution:
$$\pi(\beta \mid y) \propto \pi(\beta) \prod_{i=1}^{n} f(y_i \mid \beta) \propto e^{-\beta^T \beta / 2} \prod_{i=1}^{n} p_i^{y_i} (1 - p_i)^{1 - y_i}.$$
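For the implementation it is convenient to work on the log scale. Taking logs and using $1 - p_i = 1/(1 + e^{x_i^T \beta})$, the log posterior is, up to an additive constant,
$$\log \pi(\beta \mid y) = -\frac{\beta^T \beta}{2} + \sum_{i=1}^{n}\left\{ y_i\, x_i^T \beta - \log\left(1 + e^{x_i^T \beta}\right)\right\} = -\frac{\beta^T \beta}{2} - \sum_{i=1}^{n}\log\left(1 + e^{-x_i^T \beta}\right) - \sum_{i=1}^{n}(1 - y_i)\, x_i^T \beta\,,$$
where the second form is the one implemented in the logf function below.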
The posterior is $p$-dimensional, so here we need to sample from a $p$-dimensional distribution. Our proposal distribution will be
$$q(\beta^* \mid \beta, h) = \prod_{k=1}^{p} q(\beta_k^* \mid \beta_k).$$
So each component is given its own proposal value, independent of the others. We will
use normal distributions for all components, with different step sizes $h_1, \dots, h_p$. So we propose from
$$N_p\left(\beta_t,\;
\begin{bmatrix}
h_1 & 0 & \cdots & 0 \\
0 & h_2 & & 0 \\
\vdots & & \ddots & \vdots \\
0 & 0 & \cdots & h_p
\end{bmatrix}\right).$$
Note that this is a symmetric proposal, so the MH ratio simplifies. Since we already
know the MLE of the logistic regression model, we can start from the MLE solution!
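Since the proposal covariance is diagonal, drawing $\beta^*$ amounts to $p$ independent normal draws, which in R is a single vectorized rnorm call. A minimal sketch (h here is the hypothetical vector of step variances, so the standard deviations are sqrt(h); in the function below the argument prop.sd plays the role of sqrt(h)):

# one proposal draw from N_p(beta, diag(h));
# rnorm recycles the vector of standard deviations component-wise
prop <- rnorm(p, mean = beta, sd = sqrt(h))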
###########################################
## Bayesian logistic regression
## with MH implementation
###########################################
# log posterior (unnormalized), using the rearranged form above
logf <- function(beta)
{
  one.minus.yx <- (1 - y)*X
  -sum(beta^2)/2 - sum(log(1 + exp(-X%*%beta))) - sum(one.minus.yx%*%beta)
}

# MH sampler with componentwise normal proposals.
# (The function header and initialization here are a reconstruction:
# the signature matches the calls below, and the chain starts at the
# MLE as suggested in the text. Assumes X contains its own intercept.)
bayes_logit_mh <- function(y, X, N, prop.sd)
{
  p <- dim(X)[2]
  beta.mat <- matrix(0, nrow = N, ncol = p)
  # start the chain at the MLE
  beta <- as.numeric(coef(glm(y ~ X - 1, family = binomial)))
  beta.mat[1, ] <- beta
  accept <- 0
  for(i in 2:N)
  {
    # draw from the symmetric (normal) proposal density
    prop <- rnorm(p, mean = beta, sd = prop.sd)
    # log MH ratio; the proposal terms cancel by symmetry
    log.rat <- logf(prop) - logf(beta)
    if(log(runif(1)) < log.rat)
    {
      beta <- prop
      accept <- accept + 1
    }
    beta.mat[i, ] <- beta
  }
  print(paste("Acceptance Prob = ", accept/N))
  return(beta.mat)
}
When we run the above function, it automatically prints the acceptance probability.
We now load the dataset.
titanic <- read.csv("https://dvats.github.io/assets/titanic.csv")
y <- titanic[,1]
X <- as.matrix(titanic[, -1])
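Since the sampler starts at the MLE, we can also compute it once for reference. A minimal sketch, assuming the first column of X in the csv is already an intercept column (so no intercept is added in the formula):

# MLE of the logistic regression coefficients, for later comparison
mle <- coef(glm(y ~ X - 1, family = binomial))
mle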
First we will try to find a proposal variance $h$ that works reasonably well to give about
23% acceptance. We will do this by running the sampler for short runs ($10^3$ steps). To
start, we set all the proposal variances to be the same.
## acceptance is too low. we want 23%
## so decrease proposal variance
chain <- bayes_logit_mh(y = y, X = X, N = 1e3, prop.sd = .35)
#[1] "Acceptance Prob = 0"
Notice that our first short runs did not work well since the acceptance rate was too low.
This means we were proposing large jumps, and it would be better to take smaller jumps
to increase the acceptance rate. When we reduced the proposal sd to .0065, we got a decent
acceptance rate. Now we run the same sampler for longer ($10^5$ steps) and print diagnostics.
# will now run the chain much longer for 10^5
# takes a few seconds
chain <- bayes_logit_mh(y = y, X = X, N = 1e5, prop.sd = .0065)
# trace plots of each component
plot.ts(chain)
par(mfrow = c(2,3))
# all ACF plots
for(i in 1:dim(chain)[2])
{
acf(chain[,i], main = paste("ACF of Comp ", i))
}
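The figures below also show estimated marginal posterior densities. A minimal sketch of how such plots can be produced from the chain:

par(mfrow = c(2,3))
# kernel density estimate of each marginal posterior
for(i in 1:dim(chain)[2])
{
  plot(density(chain[,i]), main = paste("Density of Comp ", i))
}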
[Figure: trace plots of the six components (Series 1–6) over the $10^5$ iterations.]
[Figure: ACF plots of components 1–6, lags 0 to 50.]
[Figure: estimated marginal posterior density plots of the six components.]
What we see is that although components 3 and 6 are well estimated with sample
size $10^5$, the other four are mixing very poorly. This is because we chose the same
proposal variance for each component, which is not ideal here. We will now give each
component a different proposal variance and run the sampler again for $10^5$ steps.
# We saw above that some components are OK, but four components are
# moving very slowly. This is because we were using the same proposal
# variance for each component, which is not adequate here.
# Below we use a different proposal variance for each component.
# (The sds here are illustrative placeholders, larger for the four
# slowly mixing components; tune them to get roughly 23% acceptance.)
chain <- bayes_logit_mh(y = y, X = X, N = 1e5,
                        prop.sd = c(.05, .05, .0065, .05, .05, .0065))
# trace plots of each component
plot.ts(chain)
par(mfrow = c(2,3))
# all ACF plots
for(i in 1:dim(chain)[2])
{
  acf(chain[,i], main = paste("ACF of Comp ", i))
}
[Figure: trace plots of the six components (Series 1–6) for the run with componentwise proposal variances.]
[Figure: ACF plots of components 1–6, lags 0 to 50, for the second run.]
[Figure: estimated marginal posterior density plots of the six components for the second run.]
The estimated density plots, ACFs, and trace plots are much better!
Thus, we see that MCMC, although powerful, can be difficult to tune. However, once
you make it work, it works reasonably well.
2 Questions to think about
• Try to implement MCMC for the Bayesian regression model.
• Obtain the posterior mean and quantiles for the example implemented above; a
sketch follows below. How do the final estimates compare to the MLE estimates?
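As a starting point for the second question, a minimal sketch using the chain from the long run (again assuming X carries its own intercept column):

# posterior means and 95% credible intervals from the MCMC output
colMeans(chain)
apply(chain, 2, quantile, probs = c(0.025, 0.975))

# MLE estimates for comparison
coef(glm(y ~ X - 1, family = binomial))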