Page 142 - Applied Probability
P. 142

7. Computation of Mendelian Likelihoods
                              126
                              Example 7.6.2 Gamete Competition Model
                                When a new locus is investigated, one of the first statistical tasks is
                              to check whether its proposed alleles and genotypes conform to Mendelian
                              segregation ratios. The typical way of doing this is to subdivide all available
                              nuclear families into different mating types. For any given pair of parental
                              genotypes, there are from one to four possible offspring genotypes. The
                              numbers of offspring observed in the various genotypic categories are used
                                                                                      2
                                                         2
                              to compute an approximate χ statistic. These approximate χ statistics
                                                                                         2
                              are then added over the various mating types to give a grand χ . This
                                                                                      2
                              classical procedure suffers from the fact that the component χ statistics
                              often lack adequate numbers for large sample theory to apply. If the alleles
                              at the proposed locus involve dominance, then defining mating types is also
                              problematic.
                                An alternative procedure is to assign each allele A i a segregation pa-
                              rameter τ i . These parameters can be estimated by maximum likelihood
                              from the available pedigree data. The parameters enter the likelihood cal-
                              culations at the level of gamete transmission probabilities. For two dif-
                              ferent alleles A i and A j , take Pr(A i | A i /A j )= τ i /(τ i + τ j ) as suggested
                              in the Bradley-Terry model for ranking sports teams in the same league.
                              [3, 12, 14, 20, 29, 30]. For a homozygous parental genotype A i /A i , take
                              Pr(A i | A i /A i ) = 1. Under the hypothesis of Mendelian segregation, all
                              the τ’s are equal. Because multiplying the τ’s by the same constant pre-
                              serves segregation ratios, one should impose a constraint such as τ i = 1 for
                              one i. Using a likelihood ratio statistic, one can then test whether all other
                              τ i =1.
                                As a simple numerical example, consider the four alleles 1+, 1−, 2+,
                              and 2− of the PGM1 marker locus on chromosome 1. The PGM1 data of
                              Lewis et al. [23] lists 93 people in 5 pedigrees. For these data, the maximum
                              likelihood estimates are ˆ τ 1+ =1, ˆ τ 1− = .79, ˆ τ 2+ = .84, and ˆ τ 2− =1.26.
                                                                                     2
                              The likelihood ratio statistic is approximately distributed as a χ with three
                              degrees of freedom. The observed value of this statistic,

                                              2 × [(−114.70) − (−115.64)] =  1.88,
                              suggests that the alleles of the PGM1 locus do conform to Mendelian in-
                              heritance. Note that this analysis safely ignores the issue of ascertainment.
                                This gamete competition model also forms the basis of a attractive
                              parameteric generalization of the transmission/disequilibrium test (TDT).
                              In the absence of severity data, the most natural implementation uses
                              Mendelian segregation ratios for transmission to unaffected children and
                              Bradley-Terry segregation ratios for transmission to affected children. This
                              tactic permits detection of distorted transmission to affecteds. The model
                              also accommodates quantitative as well as qualitative outcomes, allows for
                              covariates, and makes effective use of full pedigree data. To use quantita-
                              tive outcomes and covariates, it is convenient to reparameterize by writing
   137   138   139   140   141   142   143   144   145   146   147