R2WinBUGS - Logistic Regression with Simulated Data

Question

R2WinBUGS - Logistic Regression with Simulated Data

I'm just wondering if anyone has R code that uses the R2WinBUGS package to run logistic regression — ideally with simulated data to generate “truth” and two continuous co-variations.

Thanks.

Christian

PS:

Potential code to generate artificial data (one-dimensional case) and run winbugs through r2winbugs (it still does not work).

library(MASS) library(R2WinBUGS) setwd("d:/BayesianLogisticRegression") n.site <- 150 X1<- sort(runif(n = n.site, min = -1, max =1)) xb <- 0.0 + 3.0*X1 occ.prob <- 1/(1+exp(-xb)) plot(X1, occ.prob,xlab="X1",ylab="occ.prob") true.presence <- rbinom(n = n.site, size = 1, prob = occ.prob) plot(X1, true.presence,xlab="X1",ylab="true.presence") # combine data as data frame and save data <- data.frame(X1, true.presence) write.matrix(data, file = "data.txt", sep = "\t") sink("model.txt") cat(" model { # Priors alpha ~ dnorm(0,0.01) beta ~ dnorm(0,0.01) # Likelihood for (i in 1:n) { C[i] ~ dbin(p[i], N) # Note p before N logit(p[i]) <- alpha + beta *X1[i] } } ",fill=TRUE) sink() # Bundle data win.data <- list(mass = X1, n = length(X1)) # Inits function inits <- function(){ list(alpha=rlnorm(1), beta=rlnorm(1))} # Parameters to estimate params <- c("alpha", "beta") # MCMC settings nc <- 3 #Number of Chains ni <- 1200 #Number of draws from posterior nb <- 200 #Number of draws to discard as burn-in nt <- 2 Thinning rate # Start Gibbs sampling out <- bugs(data=win.data, inits=inits, parameters.to.save=params, model.file="model.txt", n.thin=nt, n.chains=nc, n.burnin=nb, n.iter=ni, debug = TRUE)

+6

r winbugs14 r2winbugs

cs0815 Nov 24 '11 at 17:30

source share

1 answer

Tms · Accepted Answer · 2011-11-28T18:39:14+0000

Your script is exactly how to do this. It almost works, it just needs one simple change to make it work:

 win.data <- list(X1 = X1, n = length(X1), C = true.presence, N = 1)

Determines what data is transferred to WinBugs. The variable C must be filled with the value true.presence, N must be 1 according to the data that you generated - note that this is a special case of the binomial distribution for N = 1, which is called Bernoulli - a simple "flip of coins."

Here is the result:

 > print(out, dig = 3) Inference for Bugs model at "model.txt", fit using WinBUGS, 3 chains, each with 1200 iterations (first 200 discarded), n.thin = 2 n.sims = 1500 iterations saved mean sd 2.5% 25% 50% 75% 97.5% Rhat n.eff alpha -0.040 0.221 -0.465 -0.187 -0.037 0.114 0.390 1.001 1500 beta 3.177 0.478 2.302 2.845 3.159 3.481 4.165 1.000 1500 deviance 136.438 2.059 134.500 135.000 135.800 137.200 141.852 1.001 1500

as you can see, the parameters correspond to the parameters used to generate the data. Also, if you compare with the sentinel’s solution, you’ll see that it matches.

EDIT : but a typical logistic (~ binomial) regression would measure some calculations with some upper value N [i], and this would make it possible to distinguish N [i] for different observations. For example, indicate the proportion of minors for the entire population (N). This will require just adding the index to N in your model:

 C[i] ~ dbin(p[i], N[i])

Data generation will look something like this:

 N = rpois(n = n.site, lambda = 50) juveniles <- rbinom(n = n.site, size = N, prob = occ.prob) win.data <- list(X1 = X1, n = length(X1), C = juveniles, N = N)

(end of editing)

For more examples from population ecology, see Mark Carey's books (An Introduction to WinBUGS for the Ecologist, and Especially Bayesian Population Analysis Using WinBUGS: A Hierarchical Perspective, which is a great book).

The full script I used is the adjusted script from you is listed here (comparison with the frequent solution at the end):

 #library(MASS) library(R2WinBUGS) #setwd("d:/BayesianLogisticRegression") n.site <- 150 X1<- sort(runif(n = n.site, min = -1, max =1)) xb <- 0.0 + 3.0*X1 occ.prob <- 1/(1+exp(-xb)) # inverse logit plot(X1, occ.prob,xlab="X1",ylab="occ.prob") true.presence <- rbinom(n = n.site, size = 1, prob = occ.prob) plot(X1, true.presence,xlab="X1",ylab="true.presence") # combine data as data frame and save data <- data.frame(X1, true.presence) write.matrix(data, file = "data.txt", sep = "\t") sink("tmp_bugs/model.txt") cat(" model { # Priors alpha ~ dnorm(0,0.01) beta ~ dnorm(0,0.01) # Likelihood for (i in 1:n) { C[i] ~ dbin(p[i], N) # Note p before N logit(p[i]) <- alpha + beta *X1[i] } } ",fill=TRUE) sink() # Bundle data win.data <- list(X1 = X1, n = length(X1), C = true.presence, N = 1) # Inits function inits <- function(){ list(alpha=rlnorm(1), beta=rlnorm(1))} # Parameters to estimate params <- c("alpha", "beta") # MCMC settings nc <- 3 #Number of Chains ni <- 1200 #Number of draws from posterior nb <- 200 #Number of draws to discard as burn-in nt <- 2 #Thinning rate # Start Gibbs sampling out <- bugs(data=win.data, inits=inits, parameters.to.save=params, model.file="model.txt", n.thin=nt, n.chains=nc, n.burnin=nb, n.iter=ni, working.directory = paste(getwd(), "/tmp_bugs/", sep = ""), debug = TRUE) print(out, dig = 3) # Frequentist approach for comparison m1 = glm(true.presence ~ X1, family = binomial) summary(m1) # normally, you should do it this way, but the above also works: #m2 = glm(cbind(true.presence, 1 - true.presence) ~ X1, family = binomial)

R2WinBUGS - Logistic Regression with Simulated Data

More articles: