Local level model set
up
The following exercise is inspired in Gamerman, Reis and Salazar
(2006) Comparison of sampling schemes for dynamic linear models.
International Statistical Review, 74, 203-214.
The purpose of the exercise is to compare two MCMC schemes to sample
states and fixed parameters in the first order dynamic linear model,
also known as the local level model that we have already discussed in
class. More specifically, the local level model assumes that the time
series data, \(\mbox{data}=\{y_1,\ldots,y_n\}\), can be
described as follows, for \(t=1,\ldots,n\) \[\begin{eqnarray*}
y_t &=& \beta_t + \nu_t \qquad \nu_t \sim N(0,V)\\
\beta_t &=& \beta_{t-1} + \omega_t \qquad \omega_t \sim N(0,W),
\end{eqnarray*}\] with \(\beta_0 \sim
N(m_0,C_0)\) and where \(W/V\)
is commonly known as the signal to noise ratio (SNR). Here \(m_0\) and \(C_0\) are known hyperparameters.
The model is completed with prior distributions for the error
variances, \(V\) and \(W\). Here, we will assume that \[\begin{eqnarray*}
V &\sim& IG(a_\nu,b_\nu)\\
W &\sim& IG(a_\omega,b_\omega),
\end{eqnarray*}\] \(a_\nu,b_\nu,a_\omega,b_\omega\) known
hyperparameters.
Simulating one
dataset
For simplication, we created the data simulation function
sample.data.
sample.data = function(n,W,V,beta0){
sdW = sqrt(W)
sdV = sqrt(V)
beta = rep(0,n)
y = rep(0,n)
beta[1] = rnorm(1,beta0,sdW)
y[1] = rnorm(1,beta[1],sdV)
for (t in 2:n){
beta[t] = rnorm(1,beta[t-1],sdW)
y[t] = rnorm(1,beta[t],sdV)
}
return(y)
}
We simulated \(n=100\) observations,
starting with \(\beta_0=0\) and fixing
\(V=1.0\) and \(W=0.25\) so the SNR is 0.25.
set.seed(17389)
n = 100
V = 1.0
W = 0.25
beta0 = 0.0
y = sample.data(n,W,V,beta0)
ts.plot(y)

Block-move
The following routine mcmc.joint samples \((\beta_1,\ldots,\beta_n)\), \(V\) and \(W\) iteratively from their full conditional
posterior distributions, ie. \[\begin{eqnarray*}
p(\beta_1,\ldots,\beta_n|V,W,\mbox{data})\\
p(V|\beta_1,\ldots,\beta_n,W,\mbox{data})\\
p(W|\beta_1,\ldots,\beta_n,V,\mbox{data}),
\end{eqnarray*}\] and returns an \(M
\times (n+2)\) matrix with \(M\)
draws of the \((n+2)\)-dimensional
vector \((\beta_1,\ldots,\beta_n,V,W)\).
The full conditional distributions of \(V\) and \(W\) are both Inverse-Gamma distributions,
while the full conditional distribution of \((\beta_1,\ldots,\beta_n)\) is \(n\)-variate normal distribution and its
joint sampling is facilitated by the kalman recursions, both filtering
and smoothing.
Below, \(a_1=m_0\) and \(R_1=C_0+W\), while \(M_0\) and \(M\) are MCMC draws, with the first \(M_0\) draws discarded (burn-in or warm-up)
and the following \(M\) draws stores
for posterior inference. Initial values for \(V\) and \(W\) are the two last arguments of the
function mcmc.joint.
mcmc.joint = function(y,a1,R1,av,bv,aw,bw,M0,M,V,W){
n = length(y)
V.draw = V
W.draw = W
niter = M0+M
draws = matrix(0,niter,n+2)
for (iter in 1:niter){
beta.draw = ffbs(y,rep(1,n),0.0,V.draw,1.0,0.0,W.draw,a1,R1,nd=1)
par2v = bv+sum((y-beta.draw)^2)/2
par2w = bw+sum((beta.draw[2:n]-beta.draw[1:(n-1)])^2)/2
V.draw = 1/rgamma(1,av+n/2,par2v)
W.draw = 1/rgamma(1,aw+(n-1)/2,par2w)
draws[iter,] = c(beta.draw,V.draw,W.draw)
}
return(draws[(M0+1):niter,])
}
Sampling from \(p(\beta_1,\ldots,\beta_n|V,W,\mbox{data})\)
The following routine ffbs was actually written to
be able to sample the states jointly via the forward filtering backward
sampling (FFBS) scheme for a more general class of normal dynamic linear
model (NDLM):
\[\begin{eqnarray*}
y_t &=& F_t^T \theta_t + \nu_t \qquad \nu_t \sim N(0,V)\\
\theta_t &=& \gamma + G \theta_{t-1} + \omega_t \qquad \omega_t
\sim N(0,W),
\end{eqnarray*}\] where \(F_t\)
is a vector of observed components, with \(\theta_t\) a state vector of the same
dimensions. If \(F_t\) are the usual
definition of regressors in linear regression models, then \(\theta_t\) are the time-varying regression
coefficients. For simple local level model, we have that \(F_t=1, \ \ \forall t\), \(\gamma=0\), \(G=1\) and \(\theta_t=\beta_t\). Another special case,
that we have studied in this class, is the AR(1) plus noise model: \[\begin{eqnarray*}
y_t &=& \theta_t + \nu_t \qquad\qquad \nu_t \sim N(0,V)\\
\theta_t &=& \gamma + \phi \theta_{t-1} + \omega_t \qquad
\omega_t \sim N(0,W).
\end{eqnarray*}\]
We use the function ffbs to draw \(\beta_1,\ldots,\beta_n\) in the MCMC
function mcmc.joint.
ffbs = function(y,Ft,alpha,V,G,gamma,W,a1,R1,nd=1){
n = length(y)
if (length(Ft)==1){Ft=rep(Ft,n)}
if (length(alpha)==1){alpha=rep(alpha,n)}
if (length(V)==1){V=rep(V,n)}
a = rep(0,n)
R = rep(0,n)
m = rep(0,n)
C = rep(0,n)
B = rep(0,n-1)
H = rep(0,n-1)
# time t=1
a[1] = a1
R[1] = R1
f = alpha[1]+Ft[1]*a[1]
Q = R[1]*Ft[1]**2+V[1]
A = R[1]*Ft[1]/Q
m[1] = a[1]+A*(y[1]-f)
C[1] = R[1]-Q*A**2
# forward filtering
for (t in 2:n){
a[t] = gamma + G*m[t-1]
R[t] = C[t-1]*G**2 + W
f = alpha[t]+Ft[t]*a[t]
Q = R[t]*Ft[t]**2+V[t]
A = R[t]*Ft[t]/Q
m[t] = a[t]+A*(y[t]-f)
C[t] = R[t]-Q*A**2
B[t-1] = C[t-1]*G/R[t]
H[t-1] = sqrt(C[t-1]-R[t]*B[t-1]**2)
}
# backward sampling
theta = matrix(0,nd,n)
theta[,n] = rnorm(nd,m[n],sqrt(C[n]))
for (t in (n-1):1)
theta[,t] = rnorm(nd,m[t]+B[t]*(theta[,t+1]-a[t+1]),H[t])
if (nd==1){
theta[1,]
}
else{
theta
}
}
Posterior
inference
# Prior for beta(0) ~ N(m0,C0)
# Prior for beta(1) ~ N(a1,R1), a1=m0,R1=C0+W
a1 = 0
R1 = 10
av = 2.01
bv = 1.01
aw = 2.01
bw = 1.01*W
m0 = 0
C0 = 10-W
# MCMC initual values
V0 = 1
W0 = 0.5
M0 = 10000
M = 10000
set.seed(65432)
draws1 = mcmc.joint(y,a1,R1,av,bv,aw,bw,M0,M,V0,W0)
# states MCMC autocorrelations and posterior quantiles
q1 = t(apply(draws1[,1:n],2,quantile,c(0.05,0.5,0.95)))
lagmax = 30
acfs = array(0,c(2,n,lagmax))
for (i in 1:n)
acfs[1,i,] = acf(draws1[,i],plot=FALSE,lag.max=lagmax)$acf[2:(lagmax+1)]
Posterior inference
for \(V\) and \(W\)
par(mfrow=c(2,3),mar = c(2, 2, 1, 1))
ts.plot(draws1[,n+1],ylab="",xlab="Iteration",main="V")
acf(draws1[,n+1],main="")
hist(draws1[,n+1],prob=TRUE,main="",xlab="")
ts.plot(draws1[,n+2],ylab="",xlab="Iteration",main="W")
acf(draws1[,n+2],main="")
hist(draws1[,n+2],prob=TRUE,main="",xlab="")

MCMC
autocorrelation for states
par(mfrow=c(1,1),mar=c(5, 4, 4, 2))
ts.plot(t(acfs[1,,]),xlab="Lag",ylab="ACF",main="",ylim=range(acfs))

Posterior quantiles
for states
par(mfrow=c(1,1))
ts.plot(q1,col=c(3,2,3),ylim=range(y,q1),ylab="States")
points(y)
title("5th, 50th and 95th percentiles")

Single-move
The routine mcmc.single samples \(\beta_1,\ldots,\beta_n\), \(V\) and \(W\) iteratively from their univariate full
conditional posterior distributions, ie. \[\begin{eqnarray*}
p(V|\beta_1,\ldots,\beta_n,W,\mbox{data})\\
p(W|\beta_1,\ldots,\beta_n,V,\mbox{data}),
\end{eqnarray*}\] and \[
p(\beta_t|\beta_{-t},V,W,\mbox{data}) \qquad t=1,\ldots,n,
\] where \(\beta_{-t}\) is the
vector \((\beta_1,\ldots,\beta_n)\)
excluding the \(\beta_t\) state.
mcmc.single returns an \(M
\times (n+2)\) matrix with \(M\)
draws of the \((n+2)\)-dimensional
vector \((\beta_1,\ldots,\beta_n,V,W)\).
mcmc.single = function(y,a1,R1,av,bv,aw,bw,M0,M,beta,V,W){
n = length(y)
beta.draw = beta
V.draw = V
W.draw = W
niter = M0+M
draws = matrix(0,niter,n+2)
for (iter in 1:niter){
beta.draw = single.sampling(y,a1,R1,av,bv,aw,bw,M0,M,beta.draw,V.draw,W.draw)
par2v = bv+sum((y-beta.draw)^2)/2
par2w = bw+sum((beta.draw[2:n]-beta.draw[1:(n-1)])^2)/2
V.draw = 1/rgamma(1,av+n/2,par2v)
W.draw = 1/rgamma(1,aw+(n-1)/2,par2w)
draws[iter,] = c(beta.draw,V.draw,W.draw)
}
return(draws[(M0+1):niter,])
}
Sampling from \(p(\beta_t|\beta_{-t},V,W,\mbox{data})\)
The routine single.sampling samples, for \(t=1,\ldots,n\), from the univariate full
conditional distributons \[
p(\beta_t|\beta_{-t},V,W,\mbox{data}).
\]
single.sampling = function(y,a1,R1,av,bv,aw,bw,M0,M,beta,V,W){
V1 = 1/(1/R1+1/W+1/V)
b1 = V1*(a1/R1+beta[2]/W+y[1]/V)
beta[1] = rnorm(1,b1,sqrt(V1))
for (t in 2:(n-1)){
Vt = 1/(2/W+1/V)
bt = Vt*((beta[t-1]+beta[t+1])/W+y[t]/V)
beta[t] = rnorm(1,bt,sqrt(Vt))
}
Vn = 1/(1/W+1/V)
bn = Vn*(beta[n-1]/W+y[n]/V)
beta[n] = rnorm(1,bn,sqrt(Vn))
return(beta)
}
Posterior
inference
set.seed(65432)
beta0v = y
draws2 = mcmc.single(y,a1,R1,av,bv,aw,bw,M0,M,beta0v,V0,W0)
q2 = t(apply(draws2[,1:n],2,quantile,c(0.05,0.5,0.95)))
for (i in 1:n)
acfs[2,i,] = acf(draws2[,i],plot=FALSE,lag.max=lagmax)$acf[2:(lagmax+1)]
Posterior inference
for \(V\) and \(W\)
par(mfrow=c(2,3),mar = c(2, 2, 1, 1))
ts.plot(draws2[,n+1],ylab="",xlab="Iteration",main="V")
acf(draws2[,n+1],main="")
hist(draws2[,n+1],prob=TRUE,main="",xlab="")
ts.plot(draws2[,n+2],ylab="",xlab="Iteration",main="W")
acf(draws2[,n+2],main="")
hist(draws2[,n+2],prob=TRUE,main="",xlab="")

MCMC
autocorrelation for states
par(mfrow=c(1,1),mar=c(5, 4, 4, 2))
ts.plot(t(acfs[2,,]),xlab="Lag",ylab="ACF",main="",ylim=range(acfs))

Posterior quantiles
for states
par(mfrow=c(1,1))
ts.plot(q2,col=c(3,2,3),ylim=range(y,q2),ylab="States")
points(y)
title("5th, 50th and 95th percentiles")

Comparing MCMC
schemes
Comparing ACFs
par(mfrow=c(1,1),mar=c(5, 4, 4, 2))
ts.plot(t(acfs[2,,]),xlab="Lag",ylab="ACF",main="",ylim=range(acfs))
for (i in 1:n)
lines(acfs[1,i,],col=2)
legend("topright",legend=c("Single-move","Block-move"),col=1:2,lwd=2,bty="n")

Comparing state
posterior quantiles
par(mfrow=c(1,1),mar=c(5, 4, 4, 2))
ts.plot(q2)
lines(q1[,1],col=2)
lines(q1[,2],col=2)
lines(q1[,3],col=2)
legend("top",legend=c("Single-move","Block-move"),col=1:2,lwd=2,bty="n")

Comparing marginal
posterior for \(V\) and \(W\)
par(mfrow=c(1,2),mar = c(2, 2, 1, 1))
plot(density(draws2[,n+1]),main="V")
lines(density(draws1[,n+1]),col=2)
plot(density(draws2[,n+2]),main="W")
lines(density(draws1[,n+2]),col=2)
legend("topright",legend=c("Single-move","Block-move"),col=1:2,lwd=2,bty="n")

Replications
We replicate the above exercise for 20 sets of time series data and
run both MCMC schemes. We assume that for \(n=100\) and $W {0.01,0.25,1.0}.
set.seed(65432123)
nrep = 20
n = 100
V = 1.0
M0 = 1000
M = 1000
beta0 = 0.0
V0 = 1.0
W0 = 0.5
a1 = 0.0
R1 = 10.0
av = 2.01
bv = 1.01
aw = 2.01
m0 = 0.0
ws = c(0.01,0.25,1.0)
nws = length(ws)
neffs = array(0,c(nrep,nws,2))
for (rep in 1:nrep){
for (i in 1:3){
y = sample.data(n,ws[i],V,beta0)
beta0v = y
C0 = 10-W
bw = 1.01*W
draws1 = mcmc.joint(y,a1,R1,av,bv,aw,bw,M0,M,V0,W0)
acf1 = acf(draws1[,n+1]/draws1[,n+2],lag.max=31,plot=FALSE)
neff1 = M/(1+2*sum(acf1$acf[2:31]))
draws2 = mcmc.single(y,a1,R1,av,bv,aw,bw,M0,M,beta0v,V0,W0)
acf2 = acf(draws2[,n+1]/draws2[,n+2],lag.max=31,plot=FALSE)
neff2 = M/(1+2*sum(acf2$acf[2:31]))
neffs[rep,i,] = trunc(c(neff1,neff2))
}
}
Effective sample size
(ESS)
comb = c("(100,0.01)","(100,0.25)","(100,1)")
par(mfrow=c(1,1),mar=c(5, 4, 4, 2))
boxplot(cbind(neffs[,,1],neffs[,,2]),outline=FALSE,names=c(comb,comb),
xlab=paste("(n,W) for V=",V,sep=""))
abline(v=3.5)
legend("topleft",legend=c("Block-move"),bty="n")
legend("topright",legend=c("Single-move"),bty="n")
title(paste("MCMC posterior draws = ",M,sep=""))

Relative ESS
par(mfrow=c(1,1),mar=c(5, 4, 4, 2))
boxplot(neffs[,,2]/neffs[,,1],outline=FALSE,names=comb,ylab="Relative ESS",
xlab=paste("(n,W) for V=",V,sep=""))
abline(h=1,lty=2)
title("Ratio single-move to block-move")

---
title: "First order DLM"
subtitle: "Single-move vs block-move"
author: "Hedibert F. Lopes"
date: "`r Sys.Date()`"
output:
  html_document:
    theme: paper
    highlight: pygments
    toc: true
    toc_depth: 3
    toc_collapsed: true
#    toc_float: true
    code_download: true
    number_sections: true
---
  

# Local level model set up

The following exercise is inspired in Gamerman, Reis and Salazar (2006) Comparison of sampling schemes for dynamic linear models. *International Statistical Review*, 74, 203-214.

The purpose of the exercise is to compare two MCMC schemes to sample states and fixed parameters in the first order dynamic linear model, also known as the local level model that we have already discussed in class.  More specifically, the local level model assumes that the time series data, $\mbox{data}=\{y_1,\ldots,y_n\}$, can be described as follows, for $t=1,\ldots,n$
\begin{eqnarray*}
y_t &=& \beta_t + \nu_t \qquad \nu_t \sim N(0,V)\\
\beta_t &=& \beta_{t-1} + \omega_t \qquad \omega_t \sim N(0,W),
\end{eqnarray*}
with $\beta_0 \sim N(m_0,C_0)$ and where $W/V$ is commonly known as the signal to noise ratio (SNR).  Here $m_0$ and $C_0$ are known hyperparameters.

The model is completed with prior distributions for the error variances, $V$ and $W$.  Here, we will assume that
\begin{eqnarray*}
V &\sim& IG(a_\nu,b_\nu)\\
W &\sim& IG(a_\omega,b_\omega),
\end{eqnarray*}
$a_\nu,b_\nu,a_\omega,b_\omega$ known hyperparameters.


# Simulating one dataset

For simplication, we created the data simulation function **sample.data**.

```{r}
sample.data = function(n,W,V,beta0){
  sdW     = sqrt(W)
  sdV     = sqrt(V)
  beta    = rep(0,n)
  y       = rep(0,n)
  beta[1] = rnorm(1,beta0,sdW)
  y[1]    = rnorm(1,beta[1],sdV)
  for (t in 2:n){
    beta[t] = rnorm(1,beta[t-1],sdW)  
    y[t]    = rnorm(1,beta[t],sdV)
  }
  return(y)
}
```

We simulated $n=100$ observations, starting with $\beta_0=0$ and fixing $V=1.0$ and $W=0.25$ so the SNR is 0.25.

```{r fig.align='center', fig.width=7, fig.height=5}
set.seed(17389)
n     = 100
V     = 1.0
W     = 0.25
beta0 = 0.0
y     = sample.data(n,W,V,beta0)
ts.plot(y)
```

# Block-move

The following routine **mcmc.joint** samples $(\beta_1,\ldots,\beta_n)$, $V$ and $W$ iteratively from their full conditional posterior distributions, ie.
\begin{eqnarray*}
p(\beta_1,\ldots,\beta_n|V,W,\mbox{data})\\
p(V|\beta_1,\ldots,\beta_n,W,\mbox{data})\\
p(W|\beta_1,\ldots,\beta_n,V,\mbox{data}),
\end{eqnarray*}
and returns an $M \times (n+2)$ matrix with $M$ draws of the $(n+2)$-dimensional vector $(\beta_1,\ldots,\beta_n,V,W)$.

The full conditional distributions of $V$ and $W$ are both Inverse-Gamma distributions, while the full conditional distribution of
$(\beta_1,\ldots,\beta_n)$ is $n$-variate normal distribution and its joint sampling is facilitated by the kalman recursions, both filtering and smoothing. 

Below, $a_1=m_0$ and $R_1=C_0+W$, while $M_0$ and $M$ are MCMC draws, with the first $M_0$ draws discarded (burn-in or warm-up) and the following $M$ draws stores for posterior inference.  Initial values for $V$ and $W$ are the two last arguments of the function **mcmc.joint**. 

```{r}
mcmc.joint = function(y,a1,R1,av,bv,aw,bw,M0,M,V,W){
  n      = length(y)
  V.draw = V
  W.draw = W
  niter  = M0+M
  draws  = matrix(0,niter,n+2)
  for (iter in 1:niter){
    beta.draw = ffbs(y,rep(1,n),0.0,V.draw,1.0,0.0,W.draw,a1,R1,nd=1)
    par2v     = bv+sum((y-beta.draw)^2)/2
    par2w     = bw+sum((beta.draw[2:n]-beta.draw[1:(n-1)])^2)/2
    V.draw    = 1/rgamma(1,av+n/2,par2v)
    W.draw    = 1/rgamma(1,aw+(n-1)/2,par2w)
    draws[iter,] = c(beta.draw,V.draw,W.draw)
  }
  return(draws[(M0+1):niter,])
}
```

## Sampling from $p(\beta_1,\ldots,\beta_n|V,W,\mbox{data})$

The following routine **ffbs** was actually written to be able to sample the states jointly via the forward filtering backward sampling (FFBS) scheme for a more general class of normal dynamic linear model (NDLM):

\begin{eqnarray*}
y_t &=& F_t^T \theta_t + \nu_t \qquad \nu_t \sim N(0,V)\\
\theta_t &=& \gamma + G \theta_{t-1} + \omega_t \qquad \omega_t \sim N(0,W),
\end{eqnarray*}
where $F_t$ is a vector of observed components, with $\theta_t$ a state vector of the same dimensions.  If $F_t$ are the usual definition of regressors in linear regression models, then $\theta_t$ are the time-varying regression coefficients.  For simple local level model, 
we have that $F_t=1, \ \ \forall t$, $\gamma=0$, $G=1$ and $\theta_t=\beta_t$.  Another special case, that we have studied in this class, is the AR(1) plus noise model:
\begin{eqnarray*}
y_t &=& \theta_t + \nu_t \qquad\qquad \nu_t \sim N(0,V)\\
\theta_t &=& \gamma + \phi \theta_{t-1} + \omega_t \qquad \omega_t \sim N(0,W).
\end{eqnarray*}

We use the function **ffbs** to draw $\beta_1,\ldots,\beta_n$ in the MCMC function **mcmc.joint**.

```{r}
ffbs = function(y,Ft,alpha,V,G,gamma,W,a1,R1,nd=1){
  n = length(y)
  if (length(Ft)==1){Ft=rep(Ft,n)}
  if (length(alpha)==1){alpha=rep(alpha,n)}
  if (length(V)==1){V=rep(V,n)}
  a = rep(0,n)
  R = rep(0,n)
  m = rep(0,n)
  C = rep(0,n)
  B = rep(0,n-1)
  H = rep(0,n-1)
  # time t=1
  a[1] = a1
  R[1] = R1
  f    = alpha[1]+Ft[1]*a[1]
  Q    = R[1]*Ft[1]**2+V[1]
  A    = R[1]*Ft[1]/Q
  m[1] = a[1]+A*(y[1]-f)
  C[1] = R[1]-Q*A**2
  # forward filtering
  for (t in 2:n){
    a[t] = gamma + G*m[t-1]
    R[t] = C[t-1]*G**2 + W
    f    = alpha[t]+Ft[t]*a[t]
    Q    = R[t]*Ft[t]**2+V[t]
    A    = R[t]*Ft[t]/Q
    m[t] = a[t]+A*(y[t]-f)
    C[t] = R[t]-Q*A**2
    B[t-1] = C[t-1]*G/R[t]
    H[t-1] = sqrt(C[t-1]-R[t]*B[t-1]**2)
  }
  # backward sampling
  theta = matrix(0,nd,n)
  theta[,n] = rnorm(nd,m[n],sqrt(C[n]))
  for (t in (n-1):1)
    theta[,t] = rnorm(nd,m[t]+B[t]*(theta[,t+1]-a[t+1]),H[t])
  if (nd==1){
    theta[1,]
  }
  else{
    theta
  }
}
```                          

## Posterior inference

```{r}
# Prior for beta(0) ~ N(m0,C0)
# Prior for beta(1) ~ N(a1,R1), a1=m0,R1=C0+W
a1     = 0
R1     = 10
av     = 2.01
bv     = 1.01
aw     = 2.01
bw     = 1.01*W    
m0     = 0
C0     = 10-W

# MCMC initual values
V0     = 1
W0     = 0.5
M0     = 10000
M      = 10000

set.seed(65432)
draws1 = mcmc.joint(y,a1,R1,av,bv,aw,bw,M0,M,V0,W0)

# states MCMC autocorrelations and posterior quantiles
q1 = t(apply(draws1[,1:n],2,quantile,c(0.05,0.5,0.95)))
lagmax = 30
acfs = array(0,c(2,n,lagmax))
for (i in 1:n)
  acfs[1,i,] = acf(draws1[,i],plot=FALSE,lag.max=lagmax)$acf[2:(lagmax+1)]
```


### Posterior inference for $V$ and $W$
```{r fig.align='center', fig.width=7, fig.height=5}
par(mfrow=c(2,3),mar = c(2, 2, 1, 1))
ts.plot(draws1[,n+1],ylab="",xlab="Iteration",main="V")
acf(draws1[,n+1],main="")
hist(draws1[,n+1],prob=TRUE,main="",xlab="")
ts.plot(draws1[,n+2],ylab="",xlab="Iteration",main="W")
acf(draws1[,n+2],main="")
hist(draws1[,n+2],prob=TRUE,main="",xlab="")
```

### MCMC autocorrelation for states
```{r fig.align='center', fig.width=7, fig.height=5}
par(mfrow=c(1,1),mar=c(5, 4, 4, 2))
ts.plot(t(acfs[1,,]),xlab="Lag",ylab="ACF",main="",ylim=range(acfs))
```

### Posterior quantiles for states
```{r fig.align='center', fig.width=7, fig.height=5}
par(mfrow=c(1,1))
ts.plot(q1,col=c(3,2,3),ylim=range(y,q1),ylab="States")
points(y)
title("5th, 50th and 95th percentiles")
```




# Single-move

The routine **mcmc.single** samples $\beta_1,\ldots,\beta_n$, $V$ and $W$ iteratively from their univariate full conditional posterior distributions, ie.
\begin{eqnarray*}
p(V|\beta_1,\ldots,\beta_n,W,\mbox{data})\\
p(W|\beta_1,\ldots,\beta_n,V,\mbox{data}),
\end{eqnarray*}
and
$$
p(\beta_t|\beta_{-t},V,W,\mbox{data}) \qquad t=1,\ldots,n,
$$
where $\beta_{-t}$ is the vector $(\beta_1,\ldots,\beta_n)$ excluding the $\beta_t$ state.

**mcmc.single** returns an $M \times (n+2)$ matrix with $M$ draws of the $(n+2)$-dimensional vector $(\beta_1,\ldots,\beta_n,V,W)$.

```{r}
mcmc.single = function(y,a1,R1,av,bv,aw,bw,M0,M,beta,V,W){
  n         = length(y)
  beta.draw = beta
  V.draw    = V
  W.draw    = W
  niter     = M0+M
  draws     = matrix(0,niter,n+2)
  for (iter in 1:niter){
    beta.draw = single.sampling(y,a1,R1,av,bv,aw,bw,M0,M,beta.draw,V.draw,W.draw)
    par2v     = bv+sum((y-beta.draw)^2)/2
    par2w     = bw+sum((beta.draw[2:n]-beta.draw[1:(n-1)])^2)/2
    V.draw = 1/rgamma(1,av+n/2,par2v)
    W.draw = 1/rgamma(1,aw+(n-1)/2,par2w)
    draws[iter,] = c(beta.draw,V.draw,W.draw)
  }
  return(draws[(M0+1):niter,])
}
```

## Sampling from $p(\beta_t|\beta_{-t},V,W,\mbox{data})$

The routine **single.sampling** samples, for $t=1,\ldots,n$, from the univariate full conditional distributons
$$
p(\beta_t|\beta_{-t},V,W,\mbox{data}).
$$

```{r}
single.sampling = function(y,a1,R1,av,bv,aw,bw,M0,M,beta,V,W){
  V1 = 1/(1/R1+1/W+1/V)
  b1 = V1*(a1/R1+beta[2]/W+y[1]/V)
  beta[1] = rnorm(1,b1,sqrt(V1))
  for (t in 2:(n-1)){
    Vt = 1/(2/W+1/V)         
    bt = Vt*((beta[t-1]+beta[t+1])/W+y[t]/V)
    beta[t] = rnorm(1,bt,sqrt(Vt))
  }  
  Vn = 1/(1/W+1/V)
  bn = Vn*(beta[n-1]/W+y[n]/V)
  beta[n] = rnorm(1,bn,sqrt(Vn))
  return(beta)
}
```

## Posterior inference
```{r}
set.seed(65432)
beta0v = y
draws2 = mcmc.single(y,a1,R1,av,bv,aw,bw,M0,M,beta0v,V0,W0)
q2     = t(apply(draws2[,1:n],2,quantile,c(0.05,0.5,0.95)))
for (i in 1:n)
  acfs[2,i,] = acf(draws2[,i],plot=FALSE,lag.max=lagmax)$acf[2:(lagmax+1)]
```

### Posterior inference for $V$ and $W$

```{r fig.align='center', fig.width=7, fig.height=5}
par(mfrow=c(2,3),mar = c(2, 2, 1, 1))
ts.plot(draws2[,n+1],ylab="",xlab="Iteration",main="V")
acf(draws2[,n+1],main="")
hist(draws2[,n+1],prob=TRUE,main="",xlab="")
ts.plot(draws2[,n+2],ylab="",xlab="Iteration",main="W")
acf(draws2[,n+2],main="")
hist(draws2[,n+2],prob=TRUE,main="",xlab="")
```

### MCMC autocorrelation for states
```{r fig.align='center', fig.width=7, fig.height=5}
par(mfrow=c(1,1),mar=c(5, 4, 4, 2))
ts.plot(t(acfs[2,,]),xlab="Lag",ylab="ACF",main="",ylim=range(acfs))
```

### Posterior quantiles for states
```{r fig.align='center', fig.width=7, fig.height=5}
par(mfrow=c(1,1))
ts.plot(q2,col=c(3,2,3),ylim=range(y,q2),ylab="States")
points(y)
title("5th, 50th and 95th percentiles")
```

# Comparing MCMC schemes

## Comparing ACFs

```{r fig.align='center', fig.width=7, fig.height=5}
par(mfrow=c(1,1),mar=c(5, 4, 4, 2))
ts.plot(t(acfs[2,,]),xlab="Lag",ylab="ACF",main="",ylim=range(acfs))
for (i in 1:n)
  lines(acfs[1,i,],col=2)
legend("topright",legend=c("Single-move","Block-move"),col=1:2,lwd=2,bty="n")
```

## Comparing state posterior quantiles

```{r fig.align='center', fig.width=6, fig.height=5}
par(mfrow=c(1,1),mar=c(5, 4, 4, 2))
ts.plot(q2)
lines(q1[,1],col=2)
lines(q1[,2],col=2)
lines(q1[,3],col=2)
legend("top",legend=c("Single-move","Block-move"),col=1:2,lwd=2,bty="n")
```

## Comparing marginal posterior for $V$ and $W$

```{r fig.align='center', fig.width=7, fig.height=4}
par(mfrow=c(1,2),mar = c(2, 2, 1, 1))
plot(density(draws2[,n+1]),main="V")
lines(density(draws1[,n+1]),col=2)
plot(density(draws2[,n+2]),main="W")
lines(density(draws1[,n+2]),col=2)
legend("topright",legend=c("Single-move","Block-move"),col=1:2,lwd=2,bty="n")
```


# Replications

We replicate the above exercise for 20 sets of time series data and run both MCMC schemes.  We assume that for $n=100$ and $W \in \{0.01,0.25,1.0\}.

```{r}
set.seed(65432123)
nrep  = 20
n     = 100
V     = 1.0
M0    = 1000
M     = 1000
beta0 = 0.0
V0    = 1.0
W0    = 0.5
a1    = 0.0
R1    = 10.0
av    = 2.01
bv    = 1.01
aw    = 2.01
m0    = 0.0
ws    = c(0.01,0.25,1.0)
nws   = length(ws)
neffs = array(0,c(nrep,nws,2))
for (rep in 1:nrep){
  for (i in 1:3){
    y      = sample.data(n,ws[i],V,beta0)
    beta0v = y
    C0     = 10-W
    bw     = 1.01*W    
    draws1 = mcmc.joint(y,a1,R1,av,bv,aw,bw,M0,M,V0,W0)
    acf1  = acf(draws1[,n+1]/draws1[,n+2],lag.max=31,plot=FALSE)
    neff1 = M/(1+2*sum(acf1$acf[2:31]))
    draws2 = mcmc.single(y,a1,R1,av,bv,aw,bw,M0,M,beta0v,V0,W0)
    acf2  = acf(draws2[,n+1]/draws2[,n+2],lag.max=31,plot=FALSE)
    neff2 = M/(1+2*sum(acf2$acf[2:31]))
    neffs[rep,i,] = trunc(c(neff1,neff2))
  }
}
```

## Effective sample size (ESS)

```{r fig.align='center', fig.width=8, fig.height=5}
comb = c("(100,0.01)","(100,0.25)","(100,1)")
par(mfrow=c(1,1),mar=c(5, 4, 4, 2))
boxplot(cbind(neffs[,,1],neffs[,,2]),outline=FALSE,names=c(comb,comb),
        xlab=paste("(n,W) for V=",V,sep=""))
abline(v=3.5)
legend("topleft",legend=c("Block-move"),bty="n")
legend("topright",legend=c("Single-move"),bty="n")
title(paste("MCMC posterior draws = ",M,sep=""))
```

## Relative ESS
```{r fig.align='center', fig.width=7, fig.height=5}
par(mfrow=c(1,1),mar=c(5, 4, 4, 2))
boxplot(neffs[,,2]/neffs[,,1],outline=FALSE,names=comb,ylab="Relative ESS",
        xlab=paste("(n,W) for V=",V,sep=""))
abline(h=1,lty=2)
title("Ratio single-move to block-move")
```