Page 258 - Applied Probability

11. Radiation Hybrid Mapping
Taking into account the defining condition for an obligate break leads to
the recurrence relation

\[
p_{k+1}(i,j) = \sum_{l \sim i} p_k(l,j)\, t_{c,k}(l,i)
             + \sum_{l \not\sim i} p_k(l,j-1)\, t_{c,k}(l,i)
\]

for all $0 \le i \le c$, where $l \sim i$ indicates that $l$ and $i$ are
simultaneously in either the set $\{0\}$ or the set $\{1,\ldots,c\}$, where
$l \not\sim i$ indicates that they are not, and where the transition
probabilities $t_{c,k}(l,i)$ are defined in (11.13). As already noted in the
haploid case, when the final locus $k = m$ is reached, the probabilities
$p_m(i,j)$ can be summed on $i$ to produce the distribution of the number of
obligate breaks.
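
The recurrence can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names are hypothetical, the transition probabilities of (11.13) are not computed here but passed in as one (c+1) x (c+1) matrix per pair of adjacent loci, and the starting values are assumed to place all mass on zero obligate breaks at the first locus.

```python
from typing import List, Sequence


def same_class(l: int, i: int) -> bool:
    """Return True when l ~ i: both are 0, or both lie in {1, ..., c}."""
    return (l == 0) == (i == 0)


def obligate_break_distribution(
    initial: Sequence[float],
    transitions: Sequence[Sequence[Sequence[float]]],
) -> List[float]:
    """Distribution of the number of obligate breaks along m loci.

    initial[i]        -- assumed probability of i copies at locus 1
                         (with zero obligate breaks so far)
    transitions[k][l][i] -- caller-supplied transition probability from
                         l copies to i copies between adjacent loci
    Returns q with q[j] = sum over i of p_m(i, j).
    """
    c = len(initial) - 1
    m = len(transitions) + 1  # number of loci; at most m - 1 breaks

    # p[i][j] holds p_k(i, j) for the current locus k
    p = [[0.0] * m for _ in range(c + 1)]
    for i in range(c + 1):
        p[i][0] = initial[i]

    for t in transitions:
        new = [[0.0] * m for _ in range(c + 1)]
        for i in range(c + 1):
            for l in range(c + 1):
                for j in range(m):
                    if same_class(l, i):
                        # no new obligate break between the two loci
                        new[i][j] += p[l][j] * t[l][i]
                    elif j >= 1:
                        # moving between {0} and {1..c} adds one break
                        new[i][j] += p[l][j - 1] * t[l][i]
        p = new

    # sum on i at the final locus
    return [sum(p[i][j] for i in range(c + 1)) for j in range(m)]
```

For example, with c = 1, two loci, `initial = [0.5, 0.5]`, and the single made-up transition matrix `[[0.9, 0.1], [0.2, 0.8]]`, the function returns probabilities of approximately 0.85 for zero obligate breaks and 0.15 for one.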
11.9 Bayesian Methods
Bayesian methods offer an attractive alternative to maximum likelihood
methods. To implement a Bayesian analysis of locus ordering, two technical
hurdles must be overcome. First, an appropriate prior must be chosen. Second,
once this choice is made, efficient numerical schemes for estimating
parameters and posterior probabilities must be constructed.
It is more convenient to put a prior on the distances between the adjacent
loci of an order than on the breakage probabilities determined by these
distances. In designing a prior for interlocus distances, we can assume with
impunity that the intensity of the breakage process satisfies $\lambda = 1$.
It is also reasonable to assume that the $m$ loci to be mapped are sampled
uniformly from a chromosome interval of known physical length. This length
may be difficult to estimate in base pairs. Furthermore, physical distances
measured in base pairs are less relevant than physical distances measured
in expected number of breaks (Rays). We can circumvent the calibration
problem of converting from one measure of physical distance to the other
by using the results of a maximum likelihood analysis. Suppose that under
the best maximum likelihood order, we estimate a total of $b$ expected
breaks between the first and last loci. With $m$ uniformly distributed loci,
adjacent pairs of loci should be separated by an average distance of
$\frac{b}{m-1}$. This quantity should also approximate the average distance
from the left end of the interval to the first locus and from the right end
of the interval to the last locus. These considerations suggest that
$d = b + \frac{2b}{m-1} = \frac{(m+1)b}{m-1}$ would be a reasonable expected
number of breaks to assign to the prior interval. In practice, this value of
$d$ may be too confining, and it is probably prudent to inflate it somewhat.
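
As a small numerical illustration of this choice of d, the sketch below computes the prior interval length from b and m. The function name and the multiplicative `inflation` parameter are hypothetical; the text recommends inflating d but does not say by how much or in what way, so a simple scale factor is assumed here.

```python
def prior_interval_length(b: float, m: int, inflation: float = 1.0) -> float:
    """Expected number of breaks d to assign to the prior interval.

    b         -- estimated expected breaks between the first and last loci
    m         -- number of loci (at least 2)
    inflation -- hypothetical scale factor >= 1 used to widen the interval
    """
    if m < 2:
        raise ValueError("need at least two loci")
    if inflation < 1.0:
        raise ValueError("inflation factor should be at least 1")
    spacing = b / (m - 1)      # average distance between adjacent loci
    d = b + 2.0 * spacing      # add one average spacing at each end
    return inflation * d       # equals inflation * (m + 1) * b / (m - 1)
```

With b = 5 expected breaks and m = 11 loci, the average spacing is 0.5 and `prior_interval_length(5.0, 11)` gives d = 6.0; an assumed inflation factor of 1.5 widens this to 9.0.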
Given a prior interval of length $d$, let $\delta_i$ be the distance
separating the adjacent loci $i$ and $i+1$ under a given order. To calculate
the joint distribution of the vector of distances
$(\delta_1,\ldots,\delta_{m-1})$, expand this vector to include the distance
$\delta_0$ separating the left end of the interval from the first locus.
These spacings are related to the positions $t_1,\ldots,t_m$ of the loci on
