Page 122 - Applied Probability
P. 122

6. Applications of Identity Coefficients
                              106
                              6.5 An Affecteds-Only Method of Linkage Analysis
                              Genetic epidemiologists are now actively attempting to map some of the
                              genes contributing to common diseases. This task is complicated by the
                              poorly understood inheritance patterns for many of these diseases. While
                              major genes certainly contribute to some common diseases such as breast
                              cancer and Alzheimer disease, the classical monogenic patterns of inheri-
                              tance typically do not fit pedigree and population data. One strategy for
                              identifying disease predisposing genes is to restrict mapping studies to pedi-
                              grees showing multiple affecteds with early age of onset. Such pedigrees are
                              more apt to segregate major genes than pedigrees with isolated affecteds
                              showing late onset. Even this enrichment strategy does not guarantee a
                              single Mendelian pattern of inheritance in the ascertained pedigrees.
                                In the absence of a well-defined disease inheritance model, it is still prof-
                              itable to pursue linkage analysis by robust methods. Robust linkage meth-
                              ods are predicated on the observation that a marker allele will track a
                              closely linked disease allele as both descend from a founder through a pedi-
                              gree. Only recombination can separate a pair of such alleles present in a
                              pedigree founder. Thus, marker genes can be used as surrogates for disease
                              genes. Robust linkage tests seek to assess the amount of marker allele shar-
                              ing among affecteds. Excess sharing is taken as evidence that the marker
                              locus is closely linked to a disease predisposing locus. The marker locus
                              may be a candidate locus for the disease. In this case it is perhaps better
                              to speak of association between the marker and the disease.
                                Our immediate goal is to examine one robust linkage statistic and to
                              compute the mean and variance of this statistic using kinship coefficients
                              [12]. These computations are valid under the null hypothesis of independent
                              segregation of the marker locus and the disease. Beyond this independence
                              assumption, nothing specific is assumed about disease causation.
                                Consider a pedigree and two affected individuals i and j in that pedigree
                              who are typed at a given marker locus. We assume that the marker locus
                              is in Hardy-Weinberg equilibrium and that its alleles are codominant with
                              the kth allele having population frequency p k . At the heart of our robust
                              statistic is the pairwise statistic Z ij assessing the marker sharing between
                              i and j. It is desirable for Z ij to give greater weight to shared rare alleles
                              than to shared common alleles. This weighting is
                                accomplished via a weighting function f(p) of the population frequency
                                                                                            √
                              p of the shared allele. Typical choices for f(p) are f(p)= 1, f(p)=1/ p,
                              and f(p)= 1/p.Now let M i and M j be the observed marker genotypes of
                              i and j. Imagine drawing one marker gene G i at random from i and one
                              marker gene G j at random from j.
                                The statistic Z ij is defined as the conditional expectation
                                              Z ij  =E(1 {G i =G j } f(p G i ) | M i ,M j ),  (6.7)
                              where the indicator function 1 {G i =G j } is 1 when the sampled genes G i and
   117   118   119   120   121   122   123   124   125   126   127