Page 258 - Applied Probability
P. 258

11. Radiation Hybrid Mapping
                                                                                            245
                              Taking into account the defining condition for an obligate break leads to
                              the recurrence relation


                                                      p k (l, j)t c,k (l, i)+
                                                                        p k (l, j − 1)t c,k (l, i)
                                     p k+1 (i, j)=
                                                   l∼i
                                                                     l ∼i
                              for all 0 ≤ i ≤ c, where l ∼ i indicates that l and i are simultaneously in ei-
                              ther the set {0} or the set {1,... ,c}, and where the transition probabilities
                              t c,k (l, i) are defined in (11.13). As already noted in the haploid case, when
                              the final locus k = m is reached, the probabilities p m(i, j) can be summed
                              on i to produce the distribution of the number of obligate breaks.
                              11.9 Bayesian Methods
                              Bayesian methods offer an attractive alternative to maximum likelihood
                              methods. To implement a Bayesian analysis of locus ordering, two technical
                              hurdles must be overcome. First, an appropriate prior must be chosen. Once
                              this choice is made, efficient numerical schemes for estimating parameters
                              and posterior probabilities must be constructed.
                                It is more convenient to put a prior on the distances between the adja-
                              cent loci of an order than on the breakage probabilities determined by these
                              distances. In designing a prior for interlocus distances, we can assume with
                              impunity that the intensity of the breakage process satisfies λ =1.It is
                              also reasonable to assume that the m loci to be mapped are sampled uni-
                              formly from a chromosome interval of known physical length. This length
                              may be difficult to estimate in base pairs. Furthermore, physical distances
                              measured in base pairs are less relevant than physical distances measured
                              in expected number of breaks (Rays). We can circumvent the calibration
                              problem of converting from one measure of physical distance to the other
                              by using the results of a maximum likelihood analysis. Suppose that un-
                              der the best maximum likelihood order, we estimate a total of b expected
                              breaks between the first and last loci. With m uniformly distributed loci,
                              adjacent pairs of loci should be separated by an average distance of  b  .
                                                                                           m−1
                              This quantity should also approximate the average distance from the left
                              end of the interval to the first locus and from the right end of the interval
                                                                                 (m+1)b
                              to the last locus. These considerations suggest that d =  would be
                                                                                  m−1
                              a reasonable expected number of breaks to assign to the prior interval. In
                              practice, this value of d may be too confining, and it is probably prudent
                              to inflate it somewhat.
                                Given a prior interval of length d,let δ i be the distance separating the
                              adjacent loci i and i + 1 under a given order. To calculate the joint dis-
                              tribution of the vector of distances (δ 1 ,...,δ m−1 ), expand this vector to
                              include the distance δ 0 separating the left end of the interval from the first
                              locus. These spacings are related to the positions t 1 ,...,t m of the loci on
   253   254   255   256   257   258   259   260   261   262   263