Page 187 - Applied Probability

P. 187

9. Descent Graph Methods
172
Conversely, if an irreducible Markov chain satisﬁes Kolmogorov’s crite-
rion, then the chain is reversible. This fact can be demonstrated by explic-
itly constructing the equilibrium distribution and showing that it satisﬁes
detailed balance. The idea behind the construction is to choose some arbi-
trary reference state i and to pretend that π i is given. If j is another state,
let i → i 1 → ··· → i m → j be any path leading from i to j. Then the
formula
p
p ii 1 i 1 i 2 ··· p i m j
π j = π i (9.4)
p
p ji m i m i m−1 ··· p i 1 i
deﬁnes π j . A straightforward application of Kolmogorov’s criterion (9.3)
shows that the deﬁnition (9.4) does not depend on the particular path
chosen from i to j. To validate detailed balance, suppose that k is adjacent
to j. Then i → i 1 → ··· → i m → j → k furnishes a path from i to k
through j. It follows from (9.4) that π k = π j p jk /p kj , which is obviously
equivalent to detailed balance. In general, the value of π i is not known
beforehand. Setting π i = 1 produces the equilibrium distribution up to a
normalizing constant.
Example 9.2.1 Two Diﬀerent Markov Chains on a DNA Strand
As explained in Appendix A, a DNA strand is constructed from the four
bases A (adenine), G (guanine), C (cytosine), and T (thymine). The strand
has a directionality so that we can imagine starting at its 5 end and walking

toward its 3 end. As one proceeds along the strand, the bases encountered
are not independent. To a ﬁrst approximation [37], the successive bases
conform to a Markov chain with transition matrix
A C G T
 
A .32 .18 .23 .27
C  .37 .23 .05 .35 
P =   .
G  .30 .21 .25 .24 
T .23 .19 .25 .33

It is easy to check that the equilibrium distribution of this aperiodic chain
is π =(.30,.20,.20,.30).
Bishop et al. [2] use the above chain to construct a more complicated
Markov chain capturing the random distances between restriction sites.
Restriction enzymes recognize certain speciﬁc sequences of bases along
a DNA strand and cut the DNA at these restriction sites. For instance, the
enzyme AluI recognizes the sequence AGCT. To investigate the random
distance between restriction sites for AluI, it is helpful to construct a chain
with states A, C, G, T, AG, AGC, and AGCT. The ﬁrst four states are
interpreted as in the chain above. AG is the state where the current DNA
base is G and the previous base is A. Here part of the desired restriction
site pattern has been achieved. The state AGC is even further along on

182 183 184 185 186 187 188 189 190 191 192