Page 254 - Applied Probability
P. 254

Preliminary estimates of the parameters r and θ can be derived from the
                              empirically observed values of Pr(X 1 =0,X 2 = 0) and Pr(X 1 =1,X 2 = 1).
                              In fact, the equation
                                    Pr(X 1 =1,X 2 =1) − Pr(X 1 =0,X 2 = 0)=1 − 2(1 − r)
                              can be solved to give            11. Radiation Hybrid Mapping c  241
                                                                                         1

                                               1 − Pr(X 1 =1,X 2 = 1)+Pr(X 1 =0,X 2 =0)  c
                                    r  =1 −                                              . (11.10)
                                                                  2
                              Once r is known, θ is determined from the first equation in (11.9) as
                                                                              1
                                                     1 − r − [Pr(X 1 =0,X 2 = 0)]  c
                                              θ  =                             .         (11.11)
                                                              r(1 − r)
                              Thus, the map (θ, r) → (Pr(X 1 =1,X 2 =1), Pr(X 1 =0,X 2 = 0)) is one
                              to one. Its range is not the entire set {(s, t): s ≥ 0,t ≥ 0,s + t ≤ 1} since
                              one can demonstrate that any image point of the map must in addition
                              satisfy the inequality

                                                Pr(X 1 =1,X 2 =0) 2
                                             ≤ Pr(X 1 =1,X 2 =1) Pr(X 1 =0,X 2 =0).      (11.12)
                              See Problem 8 for elaboration.
                                The observed values of Pr(X 1 =1,X 2 = 1) and Pr(X 1 =0,X 2 =0)
                              are maximum likelihood estimates for the simplified model in which the
                              only constraints on the four probabilities displayed in (11.9) are nonnega-
                              tivity, the symmetry condition Pr(X 1 =1,X 2 =0)=Pr(X 1 =0,X 2 = 1),
                              and the requirement that the four probabilities sum to 1. This simplified
                              model has in effect two parameters, which we can identify with the prob-
                              abilities Pr(X 1 =1,X 2 = 1) and Pr(X 1 =0,X 2 = 0) and estimate by
                              their empirical values. These values are maximum likelihood estimates un-
                              der the simplified model. If these estimates satisfy inequality (11.12), then
                              they furnish maximum likelihood estimates of the radiation hybrid model
                              as well. Since maximum likelihood estimates are preserved under repara-
                              meterization, the maximum likelihood estimates of r and θ can then be
                              computed by substituting estimated values for theoretical values in (11.10)
                              and (11.11).
                                Under the polyploid model with many loci, likelihood calculation is hin-
                              dered by the fact that likelihoods no longer factor. Nonetheless, it is possible
                              to design a fast algorithm for likelihood calculation based on the theory of
                              hidden Markov chains [18]. In the current context, there exists a Markov
                              chain whose current state is the number of markers present in a clone at the
                              current locus. As the chain progresses from one locus to the next, starting
                              at the leftmost locus and ending at the rightmost locus, it counts the num-
                              ber of markers at each locus in the clone. These numbers are hidden from
   249   250   251   252   253   254   255   256   257   258   259