Page 88 - Applied Probability
P. 88

71
                                                      4. Hypothesis Testing and Categorical Data
                              of Britain at the time of the Clarke et al. study [7], this is a reasonable pro-
                              cedure. However, in racially mixed societies like that of the United States,
                              associations can result from population stratification rather than direct
                              causation of alleles at a candidate locus or linkage disequilibrium between
                              the alleles at a marker locus and deleterious alleles at a nearby disease-
                              predisposing locus. Thus, if a disease is concentrated in one racial or ethnic
                              group, then that group’s allele frequencies at a marker will predominate in
                              the affecteds regardless of whether or not the marker is linked to a disease
                              locus. If normal controls are not matched by ethnicity to affecteds, then
                              transmission association can be easily confused with ethnic association.
                                The transmission/disequilibrium test neatly circumvents these mislead-
                              ing ethnic associations by exploiting the internal controls provided by par-
                              ents [13, 39, 41]. If marker data are collected on the parents of an affected
                              as well as on the affected himself, then one can determine for a codominant
                              marker which of the maternal and paternal alleles are passed to the affected
                              and which are not. The only ambiguity arises when both parents and the
                              child share the same heterozygous genotype. Even in this case one can still
                              count the number of alleles of each type passed to the affected. In the trans-
                              mission/disequilibrium test, the marker alleles potentially contributed by
                              heterozygous parents to sampled affecteds are arranged in a 2 × m con-
                              tingency table, with one row counting parental alleles passed to affecteds
                              and the other row counting parental alleles not passed to affecteds. The m
                              columns correspond to the m different alleles seen among the parents. It
                              seems reasonable in this scheme to exclude contributions from homozygous
                              parents because these tell us nothing about transmission distortion.
                                In analyzing contingency table data of this sort, we should explicitly
                              condition on the parental genotypes. This eliminates ethnic association.
                              Once we have done this, there is no harm in counting alleles transmitted
                              to affected siblings or to related, affected individuals scattered throughout
                              an extended pedigree. The two inviolable rules to observe are that both
                              parents of an affected must be typed and that marker typing should done
                              in one part of a family without regard to the outcomes of marker typing in
                              another part of the family.
                                The transmission/disequilibrium test for two alleles permits exact cal-
                              culation of p-values [39]. In generalizing the test to multiple alleles, this
                              convenience is sacrificed, but one can approach the problem of calculating
                              approximate p-values by standard permutation techniques [37, 20, 25]. The
                              question of an appropriate test statistic also becomes murky unless we con-
                              sider rather simple, and probably unrealistic, alternative hypotheses. We
                              will suggest two statistics that are intuitively reasonable. Both are based on
                              computing a standardized residual for each cell of the 2×m table. Let c ij be
                              the count appearing in row i and column j of the table. If h j heterozygous
                              parents carry allele j, then under the null hypothesis of Mendelian trans-
                              mission, c ij is binomially distributed with h j trials and success probability
   83   84   85   86   87   88   89   90   91   92   93