1 Oct, 2015

Stochastic Processes

Definition

Consider a probability space $(\Omega, \mathcal F, P)$, where $\Omega$ is the sample space of the experiment, an index set $T$, and a state space $S$. A stochastic process (SP) is a collection $\mathcal X = \\{ X(\omega,t): \omega \in \Omega, t\in T \\}$ such that:

For any $n$ and any set of index points $t_i \in T, i = 1,…,n, (X_{t_1} , …, X_{t_n})$ is an $n$-dimensional random variable (random vector) defined on the probability space $(\Omega, \mathcal F, P)$ and taking values in $S^n \equiv S \times … \times S$.
- Hence, for each fixed $t_i \in T$, $X_{t_i}(\cdot) \equiv X(\cdot, t_i) : (\Omega, \mathcal F, P) \rightarrow S$ is a random variable (or is it a random function?).
For any fixed $\omega \in \Omega$, $X_\omega(\cdot) \equiv X(\omega, \cdot): T \rightarrow S$ is a function defined on $T$ and taking values in $S$, referred to as a sample (or sample path or realization) of the stochastic process $\mathcal X$.

What does this mean?

This means that the stochastic process $\mathcal X$ can be viewed as

a collection of random functions $\{ X_\omega: \omega \in \Omega \}$, by fixing $t$
a collection of random variables $\{ X_t: t \in T \}$, by fixing $\omega$

Note that $\omega$ is not a parameter. It is instead an element from the sample space $\Omega$. The $\omega$ is the input to the function $X$ while $t$ is the realization index (or in a time series problem, time).

The Gaussian Process (GP) and Dirichlet Process (DP) are therefore SP’s.

R Code: Sampling from the Posterior Distribution

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
  color.btwn <- function(x,ylo,yhi,from,to,col.area="grey") {
  x <- c(x,rev(x))
  y <- c(yhi,rev(ylo))

  polygon(c(x[x>=from & x<= to]),
          c(y[x>=from & x<=to]),
          col=col.area,border=F)
}

rdir <- function(N,a) {
  k <- length(a)
  x <- matrix(rgamma(k*N, a, 1), N, k, byrow=T)
  rowsums <- x %*% rep(1,k)
  x / as.vector(rowsums)
}

# Using the dirichlet distribution from gamma's
dp <- function(N=1,pG,a,xlim=c(0,1),n=1000) {
  x <- seq(xlim[1],xlim[2],length=n)
  x <- sort(c(-1e1000,x))
  dG0 <- pG(x[-1]) - pG(x[-length(x)])
  out <- rdir(N,a*dG0)
  G <- t(apply(out,1,cumsum))
  list("G"=G, "x"= x[-1])
}

### Plotting Function
dp.post <- function(X,...) {
  plot(0,xlim=range(X$x),ylim=c(0,1),cex=0,...)
  for (i in 1:nrow(X$G)) {
    lines(X$x,X$G[i,],type="l",col=rgb(.4,.4,.4,.1))
  }
}

# Posterior Simulation ################
# y_i | G ~ G
# G ~ DP(G_0,a)
n1 <- 1*5 
n2 <- 2*5
n <- n1+n2
y <- c(rgamma(n1,4,2),rgamma(n2,2,3))
pT <- function(x) (n1*pgamma(x,4,2) + n2*pgamma(x,2,3)) / n

# Gibbs
B <- 1000 # Big number for MCMC
k <- 100 # Number of obs for each DP draw, should be LARGE
alpha <- 10 # Later, put a prior. Large => More faith in prior.
pG <- function(x) pnorm(x,2,2)
pG1 <- function(x) {
  s <- sapply(x,function(w) sum(w >= y))
  (alpha * pG(x) +  s) / (alpha + n)
}
# Draw from G | y
G <- dp(N=B,pG1,a=alpha+n,xlim=c(-10,10),n=k)
EG <- apply(G$G,2,mean)

#pdf("~/temp/pdf/dp_post.pdf")
#svg("../../../../assets/ams241/03/plots/dp_post.svg")
  #dp.post(G,xlab='y',ylab="Fn(y)")
  plot(0,cex=.001,ylim=c(0,1),xlim=c(-10,10),main=paste("alpha =",alpha),
       ylab="Fn(y)",xlab="y")
  lines(ecdf(y),cex=.5)
  curve(pG,add=T,col="red",lwd=3)
  lines(G$x,EG,col=rgb(0,0,1,.3),lwd=10)
  curve(pT,add=T,col="green",lwd=3)
  legend("bottomright",
         legend=c('Truth','Data','G0','Posterior:\n 95% C.I.','E[G|y]',''),
         col=c('green','black','red','grey','blue',rgb(0,0,0,0)),lwd=3)
  qG <- apply(G$G,2,function(x) quantile(x,c(.025,.5,.975)))
  #apply(qG,1,function(x) lines(G$x,x))
  glo <- qG[1,]
  ghi <- qG[3,]
  color.btwn(G$x,glo,ghi,-100,100,col.area=rgb(.2,.2,.2,.5))
#dev.off()

#source("dp_posterior.R")

Sampling from the DP posterior

Ideas for Final Project:

HDP, LDA. Unbounded topic models for 4 Gospels, BOM
- https://en.wikipedia.org/wiki/Hierarchical_Dirichlet_process
- https://en.wikipedia.org/wiki/Latent_Dirichlet_allocation
- https://annalyzin.wordpress.com/2015/06/04/automated-biography/

Readings on Measure Theory

Measure Theory for Dummies - Gupta (5 pages)
Measure Theory and Probability - Grigoryan (122 pages)