Page 249 - Applied Probability

P. 249

11. Radiation Hybrid Mapping
236
probabilities under this order must be substituted in the above calculations.
The next section addresses maximum likelihood estimation.
11.4 Maximum Likelihood Methods
The disadvantage of the minimum obligate breaks criterion is that it pro-
vides neither estimates of physical distances between loci nor comparisons
of likelihoods for competing orders. Maximum likelihood obviously reme-
dies the latter two defects, but does so at the expense of introducing some
of the explicit assumptions mentioned earlier. We will now brieﬂy discuss
how likelihoods are computed and maximized for a given order. Diﬀerent
orders can be compared on the basis of their maximum likelihoods.
Because diﬀerent clones are independent, it suﬃces to demonstrate how
to compute the likelihood of a single clone. Let X =(X 1 ,... ,X m ) be the
observation vector for a clone potentially typed at m loci. The component
X i is deﬁned as 0, 1, or ?, depending on what is observed at the ith locus.
We can gain a feel for how to compute the likelihood of X by considering
two simple cases. If m = 1 and X 1 = ?, then X 1 follows the Bernoulli
distribution
i
Pr(X 1 = i)= r (1 − r) 1−i (11.4)
for retention or nonretention. When m = 2 and both loci are typed, the
likelihood must reﬂect breakage as well as retention. If θ is the probability
of at least one break between the two loci, then
Pr(X 1 =0,X 2 =0) = (1 − r)(1 − θr)
Pr(X 1 =1,X 2 = 0) = Pr(X 1 =0,X 2 =1)
=(1 − r)θr (11.5)
Pr(X 1 =1,X 2 =1) = 1 − 2(1 − r)θr − (1 − r)(1 − θr)
=(1 − θ + θr)r.
Note that we parameterize in terms of the breakage probability θ between
the two loci rather than the physical distance δ between them. Besides the
obvious analytical simpliﬁcation entailed in using θ, only the product λδ
can be estimated anyway. The parameters λ and δ cannot be separately
identiﬁed.
As noted earlier, the probability of an obligate break between the two
loci is 2r(1 − r)θ, in agreement with the calculated value
Pr(X 1 = X 2 ) = Pr(X 1 =1,X 2 = 0)+Pr(X 1 =0,X 2 =1)
from (11.5). It is natural to estimate r and Pr(X 1 = X 2 ) by their empirical
values. Given these estimates, one can then estimate θ via the identity
Pr(X 1 = X 2 )
θ = .
2r(1 − r)

244 245 246 247 248 249 250 251 252 253 254