Page 246 - Applied Probability
P. 246

11. Radiation Hybrid Mapping
                              the hybrid clone; a “0” indicates that the locus was absent; and a “?” in-
                              dicates that the locus was untyped in the clone or gave conflicting typing
                              results.
                              11.3 Minimum Obligate Breaks Criterion                        233

                              Computing the minimum number of obligate breaks per order allows com-
                              parisons of different orders [3, 4, 5, 23]. If the order of the loci along the
                              chromosome is the same as the scoring order, then the clone described in
                              the last section requires three obligate breaks. These breaks occur when-
                              ever a run of 0’s is broken by a 1 or vice versa; untyped loci are ignored
                              in this accounting. The minimum number of obligate breaks for each clone
                              can be summed over all clones to give a grand sum for a given order. This
                              grand sum serves as a criterion for comparing orders. The minimum breaks
                              criterion can be minimized over orders by a stepwise algorithm [5] or by
                              standard combinatorial optimization techniques such as branch-and-bound
                              [19] and simulated annealing [17].
                                The advantage of the minimum breaks criterion is that it depends on al-
                              most no assumptions about how breaks occur and fragments are retained.
                              Given a common retention rate, this criterion is also strongly statistically
                              consistent. Following Barrett [1] and Speed et al. [20], let us demonstrate
                              this fact. Consider m loci taken in their natural order 1,... ,m along a
                              chromosome, and imagine an infinite number of independent, fully-typed
                              radiation hybrid clones at these loci. Let B i (σ) be the random number of
                              obligate breaks occurring in the ith clone when the loci are ordered accord-
                              ing to the permutation σ. In general, a permutation can be represented as
                              an m-vector (σ(1),...,σ(m)). Ambiguity about the left-to-right orienta-
                              tion of the loci can be avoided by confining our attention to permutations
                              σ with σ(1) <σ(m). The correct order is given by the identity permutation
                              id.
                                Given n clones, the best order is identified by the permutation giving
                                                         n
                              the smallest sum S n (σ)=     B i (σ). Consistency requires that S n (id)
                                                         i=1
                              be the smallest sum for n large enough. Now the law of large numbers
                                                   1
                              indicates that lim n→∞ S n (σ)= E[B 1 (σ)] with probability 1. Thus to
                                                   n
                              demonstrate consistency, it suffices to show that the expected number of
                              breaks E[B 1 (id)] under id is strictly smaller than the expected number of
                              breaks E[B 1 (σ)] under any other permutation σ.
                                To compute E[B 1 (id)], note that the interval separating loci i and i +1
                              manifests an obligate break if and only if there is a break between the two
                              loci and one locus is retained while the other locus is lost. This event occurs
                              with probability 2r(1 − r)θ i,i+1 , where r is the retention probability and
   241   242   243   244   245   246   247   248   249   250   251