Page 237 - Applied Probability
P. 237

10. Molecular Phylogeny
                                                                                            223
                                  TABLE 10.4. Maximum Loglikelihoods of Various Hemoglobin Models
                                 Model
                                            Rate
                                                       Penalty
                                                                                  Maximum
                                                                     Spatial
                                 Type
                                                                                    -1918.6
                                              1
                                                          0
                                                                        0
                                Nucleotide  Classes  Parameters    Parameters   Loglikelihood
                                 Codon        1           0             0           -1889.8
                                 Codon        1           1             0           -1853.2
                                 Codon        1           4             0           -1847.4
                                 Codon        2           4             2           -1825.4
                              10.11 Problems
                                 1. Compute the number of unrooted evolutionary trees possible for n
                                   contemporary taxa. (Hint: How does this relate to the number of
                                   rooted trees?)
                                 2. In the notation of Section 10.2, let S n = T 2 + ··· + T n . Prove the
                                   inequalities

                                                   n − 2                       1
                                           T n 1+          ≤ S n ≤ T n 1+
                                                    2n 2                     n − 1
                                   for all n ≥ 2.
                                 3. Consider four contemporary taxa numbered 1, 2, 3, and 4. A total of
                                   n shared DNA sites are sequenced for each taxon. Let N wxyz be the
                                   number of sites at which taxon 1 has base w, taxon 2 base x, and so
                                   forth. If we denote the three possible unrooted trees by E, F, and G,
                                   then we can define three statistics

                                                         =
                                                    N E                   N rrss
                                                             r∈{A,G,C,T } s=r


                                                    N F  =                N rsrs
                                                             r∈{A,G,C,T } s=r


                                                         =
                                                    N G                   N rssr
                                                             r∈{A,G,C,T } s=r

                                   for discriminating among the unrooted trees. Show that maximum
                                   parsimony selects the unrooted tree E, F, or G with largest statis-
                                   tic N E , N F,or N G . Draw the unrooted tree corresponding to each
                                   statistic.
                                 4. Let u and v be column vectors with the same number of components.
                                   Applying the definition of the matrix exponential (10.7), show that
                                                                             t
                                                         , I + suv t      if v u =0
                                               e suv t  =      e sv u −1  t
                                                                 t
                                                           I +       uv   otherwise.
                                                                v t u
   232   233   234   235   236   237   238   239   240   241   242