Page 123 - Applied Probability

6. Applications of Identity Coefficients 107

$G_j$ match in state. Although substituting i.b.d. for identity by state might
be attractive in this definition, the alternative statistic with i.b.d. matches
counted would be considerably more difficult to evaluate. In any event, if
person $i$ has observed genotype $M_i = a_k/a_l$ and person $j$ has genotype
$M_j = a_m/a_n$, then definition (6.7) reduces to

$$
Z_{ij} = \frac{1}{4} 1_{\{a_k = a_m\}} f(p_k) + \frac{1}{4} 1_{\{a_k = a_n\}} f(p_k)
       + \frac{1}{4} 1_{\{a_l = a_m\}} f(p_l) + \frac{1}{4} 1_{\{a_l = a_n\}} f(p_l).
$$
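
As a minimal sketch of the reduced form of definition (6.7), the following Python function averages the weighted identity-by-state matches over the four gene pairs, one gene drawn from each person. The allele labels, the allele frequencies, and the weight $f(p) = 1/\sqrt{p}$ are illustrative assumptions and are not taken from the text.

```python
from math import sqrt


def pairwise_ibs_statistic(genotype_i, genotype_j, freq, f):
    """Return Z_ij for two observed marker genotypes.

    genotype_i is the pair (a_k, a_l) and genotype_j is the pair (a_m, a_n).
    Each of the four gene pairs contributes f(p)/4 when the two alleles
    match in state, where p is the frequency of the matching allele.
    """
    total = 0.0
    for a in genotype_i:        # a_k, then a_l
        for b in genotype_j:    # a_m, then a_n
            if a == b:
                total += f(freq[a]) / 4.0
    return total


# Hypothetical three-allele marker with made-up frequencies (assumption).
freq = {"A1": 0.5, "A2": 0.3, "A3": 0.2}


def inverse_sqrt_weight(p):
    # Example weight function; the choice of f is an assumption here.
    return 1.0 / sqrt(p)


# Person i is A1/A2 and person j is A2/A2: two of the four pairs match on A2.
z = pairwise_ibs_statistic(("A1", "A2"), ("A2", "A2"), freq, inverse_sqrt_weight)
```

With these inputs only the allele A2 of person $i$ matches, and it matches both genes of person $j$, so the statistic equals $f(0.3)/2$.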

From the pairwise statistics $Z_{ij}$, we form an overall statistic $Z = \sum_{\{i,j\}} Z_{ij}$
by summing over all affected pairs $\{i, j\}$ typed in the pedigree. In most
applications we take $i \ne j$, but the contrary procedure of comparing an
affected person to himself can be useful for inbred affecteds if the disease
is thought to be caused by recessively acting genes.
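
The summation over affected pairs can be sketched as follows. The function names, the `include_self` flag, and the sample genotypes are hypothetical. The flag simply switches between pairs with $i \ne j$ and pairs that also compare each person to himself.

```python
from itertools import combinations, combinations_with_replacement


def z_pair(g_i, g_j, freq, f):
    """Z_ij: f-weighted matches in state over the four gene pairs, each weighted 1/4."""
    return sum(f(freq[a]) for a in g_i for b in g_j if a == b) / 4.0


def z_overall(genotypes, freq, f, include_self=False):
    """Sum Z_ij over unordered pairs {i, j} of typed affected people.

    genotypes maps a person label to a pair of alleles. With
    include_self=True the pairs {i, i} are added as well.
    """
    people = sorted(genotypes)
    if include_self:
        pairs = combinations_with_replacement(people, 2)
    else:
        pairs = combinations(people, 2)
    return sum(z_pair(genotypes[i], genotypes[j], freq, f) for i, j in pairs)


# Made-up frequencies and genotypes for three typed affecteds (assumptions).
freq = {"A1": 0.5, "A2": 0.3, "A3": 0.2}
genotypes = {"p1": ("A1", "A2"), "p2": ("A2", "A2"), "p3": ("A1", "A3")}


def unit_weight(p):
    # Simplest possible weight, used only to keep the arithmetic transparent.
    return 1.0


z_distinct = z_overall(genotypes, freq, unit_weight)
z_with_self = z_overall(genotypes, freq, unit_weight, include_self=True)
```

Under the unit weight, the three distinct pairs contribute 0.5, 0.25, and 0, and the self-comparisons add 0.5, 1.0, and 0.5.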
Since the mean and variance of $Z$ obviously are

$$
E(Z) = \sum_{\{i,j\}} E(Z_{ij})
$$

$$
\mathrm{Var}(Z) = \sum_{\{i,j\}} \sum_{\{k,l\}} \mathrm{Cov}(Z_{ij}, Z_{kl}),
$$

it suffices to calculate $E(Z_{ij})$ and $\mathrm{Cov}(Z_{ij}, Z_{kl})$. If we condition on whether
the two sampled genes $G_i$ and $G_j$ are i.b.d., then it follows that
$$
\begin{aligned}
E(Z_{ij}) &= E[1_{\{G_i = G_j\}} f(p_{G_i})] \\
          &= \Phi_{ij} \sum_k f(p_k) p_k + (1 - \Phi_{ij}) \sum_k f(p_k) p_k^2.
\end{aligned}
$$
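
The expectation formula is easy to evaluate numerically. The sketch below assumes the kinship coefficient $\Phi_{ij}$ is already known and uses made-up allele frequencies. The function name and the two weight functions are illustrative choices.

```python
def expected_z_pair(kinship, freq, f):
    """E(Z_ij) = Phi * sum_k f(p_k) p_k + (1 - Phi) * sum_k f(p_k) p_k^2.

    The first sum covers the case where the two sampled genes are i.b.d.
    The second covers the case where they are not i.b.d. and therefore
    match in state only by chance.
    """
    ibd_term = sum(f(p) * p for p in freq.values())
    non_ibd_term = sum(f(p) * p * p for p in freq.values())
    return kinship * ibd_term + (1.0 - kinship) * non_ibd_term


# Hypothetical three-allele marker (assumption).
freq = {"A1": 0.5, "A2": 0.3, "A3": 0.2}

# Non-inbred full sibs have kinship coefficient 1/4.
e_sibs_unit = expected_z_pair(0.25, freq, lambda p: 1.0)
e_sibs_inverse = expected_z_pair(0.25, freq, lambda p: 1.0 / p)
```

With $f(p) = 1$ the i.b.d. term is 1 and the non-i.b.d. term is $\sum_k p_k^2 = 0.38$, giving $0.25 + 0.75 \times 0.38 = 0.535$. With $f(p) = 1/p$ the two sums become the number of alleles and 1, giving $0.25 \times 3 + 0.75 \times 1 = 1.5$.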
The covariance $\mathrm{Cov}(Z_{ij}, Z_{kl}) = E(Z_{ij} Z_{kl}) - E(Z_{ij}) E(Z_{kl})$ can be computed
by first noting that $1_{\{G_i = G_j\}} f(p_{G_i})$ depends only on the observed
marker genotypes $M_i$ and $M_j$ and that $1_{\{G_k = G_l\}} f(p_{G_k})$ depends only on
the observed marker genotypes $M_k$ and $M_l$. These two facts imply that

$$
\begin{aligned}
E(Z_{ij} Z_{kl})
&= E[E(1_{\{G_i = G_j\}} f(p_{G_i}) \mid M_i, M_j) \, E(1_{\{G_k = G_l\}} f(p_{G_k}) \mid M_k, M_l)] \\
&= E[E(1_{\{G_i = G_j\}} f(p_{G_i}) 1_{\{G_k = G_l\}} f(p_{G_k}) \mid M_i, M_j, M_k, M_l)] \\
&= E[1_{\{G_i = G_j\}} f(p_{G_i}) 1_{\{G_k = G_l\}} f(p_{G_k})].
\end{aligned}
$$
To evaluate the last expectation, we condition on how the four sampled
genes $G_i$, $G_j$, $G_k$, and $G_l$ are partitioned under identity by descent.
Consider again the condensed identity states of Figure 5.3. In each state,
imagine genes $G_i$ and $G_j$ appearing on the top row in no particular order
and genes $G_k$ and $G_l$ appearing on the bottom row in no particular order.
